BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000545
(1432 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Vitis vinifera]
Length = 1442
Score = 2437 bits (6316), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1166/1458 (79%), Positives = 1283/1458 (87%), Gaps = 42/1458 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TAAN++E+Y+VRVQE+ S+ES+ S ETKR +M GIS A+LELVC YRLHGNVE++ +L
Sbjct: 61 TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121 SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+ SG SAR+ESS+VI+LRD
Sbjct: 181 PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241 LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301 AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+ SVLTS I IGN
Sbjct: 361 SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
SLFFLGSRLGDSLLVQFT S+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421 SLFFLGSRLGDSLLVQFT-----SILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475
Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
GEELSLYGSA N+TE++QKTFSF+VRDS +N+GPLKDF+YGLRINAD ATGI+KQSNYE
Sbjct: 476 GEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYE 535
Query: 541 LV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
LV ELPGCKGIWTVYHK++RGHNADS++MA DDE
Sbjct: 536 LVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDE 595
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
YHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ RGARIL
Sbjct: 596 YHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARIL 655
Query: 635 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 694
DG++MTQDL SE+STVLSVSIADPYVLL MSDG+I+LLVGDPSTCTVS+
Sbjct: 656 DGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSI 705
Query: 695 QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 754
PA ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG DQGDIY
Sbjct: 706 NIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYC 765
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
VV YESG LEIFDVPNFNCVF+VDKF+SG H+VDT + E +D++ ++ +SEE QG
Sbjct: 766 VVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQG 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
RKEN H++KVVELAMQRWS HSRPFLF ILTDGTILCY AYL+EGPE+T K+++ VS
Sbjct: 826 RKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQ 885
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
SLS+SNVSASRLRNLRF R PLD YTREE G R+T+FKNI G QG FLSGSRP
Sbjct: 886 NSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPL 945
Query: 935 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+YDNYWP
Sbjct: 946 WFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWP 1005
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
VQK IPLK TPHQ+TYFAEKNLYPLIVSVPVLKPLN VLS L+DQE GHQ++N NLSS +
Sbjct: 1006 VQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDE 1064
Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1114
LHR+Y+V+E+EVR+LEP+++G PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI
Sbjct: 1065 LHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1124
Query: 1115 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1174
GTAYVQGEDVAARGRVLLFS G+N DN QNLV+E+YSKELKGAISA+ASLQGHLLIASGP
Sbjct: 1125 GTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGP 1184
Query: 1175 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1234
KIILHKWTGTELNG+AF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGAQLNLLA
Sbjct: 1185 KIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLA 1244
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1294
KDFGSLDCFATEFLIDGSTLSL+VSD+QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH
Sbjct: 1245 KDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1304
Query: 1295 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1354
VTKFLRLQML SSDRT A GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK
Sbjct: 1305 VTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1364
Query: 1355 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
LVD+VPHVAGLNPRSFRQF SNGKAHRPGPD+IVDCELL HYEMLP EEQLEIA Q GTT
Sbjct: 1365 LVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTT 1424
Query: 1415 RSQILSNLNDLALGTSFL 1432
R QILSNLNDL+LGTSFL
Sbjct: 1425 RMQILSNLNDLSLGTSFL 1442
>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
Length = 1448
Score = 2431 bits (6301), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1166/1464 (79%), Positives = 1283/1464 (87%), Gaps = 48/1464 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TAAN++E+Y+VRVQE+ S+ES+ S ETKR +M GIS A+LELVC YRLHGNVE++ +L
Sbjct: 61 TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121 SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+ SG SAR+ESS+VI+LRD
Sbjct: 181 PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241 LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301 AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+ SVLTS I IGN
Sbjct: 361 SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
SLFFLGSRLGDSLLVQFT S+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421 SLFFLGSRLGDSLLVQFT-----SILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475
Query: 481 GEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
GEELSLYGSA N+TE++Q KTFSF+VRDS +N+GPLKDF+YGLRINAD ATGI+
Sbjct: 476 GEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIA 535
Query: 535 KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
KQSNYELV ELPGCKGIWTVYHK++RGHNADS++M
Sbjct: 536 KQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKM 595
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
A DDEYHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+
Sbjct: 596 ATKDDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYA 655
Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
RGARILDG++MTQDL SE+STVLSVSIADPYVLL MSDG+I+LLVGDPS
Sbjct: 656 RGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPS 705
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
TCTVS+ PA ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG D
Sbjct: 706 TCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQD 765
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
QGDIY VV YESG LEIFDVPNFNCVF+VDKF+SG H+VDT + E +D++ ++ +SE
Sbjct: 766 QGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSE 825
Query: 809 EGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSD 868
E QGRKEN H++KVVELAMQRWS HSRPFLF ILTDGTILCY AYL+EGPE+T K++
Sbjct: 826 EEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTE 885
Query: 869 DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFL 928
+ VS SLS+SNVSASRLRNLRF R PLD YTREE G R+T+FKNI G QG FL
Sbjct: 886 EAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFL 945
Query: 929 SGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
SGSRP W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+
Sbjct: 946 SGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSS 1005
Query: 989 YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
YDNYWPVQK IPLK TPHQ+TYFAEKNLYPLIVSVPVLKPLN VLS L+DQE GHQ++N
Sbjct: 1006 YDNYWPVQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLEND 1064
Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1108
NLSS +LHR+Y+V+E+EVR+LEP+++G PWQTRATIPMQSSENALTVRVVTLFNTTTKEN
Sbjct: 1065 NLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1124
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
ETLLAIGTAYVQGEDVAARGRVLLFS G+N DN QNLV+E+YSKELKGAISA+ASLQGHL
Sbjct: 1125 ETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHL 1184
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
LIASGPKIILHKWTGTELNG+AF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGA
Sbjct: 1185 LIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGA 1244
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
QLNLLAKDFGSLDCFATEFLIDGSTLSL+VSD+QKNIQIFYYAPKMSESWKGQKLLSRAE
Sbjct: 1245 QLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAE 1304
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1348
FHVGAHVTKFLRLQML SSDRT A GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL
Sbjct: 1305 FHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1364
Query: 1349 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
QSLQKKLVD+VPHVAGLNPRSFRQF SNGKAHRPGPD+IVDCELL HYEMLP EEQLEIA
Sbjct: 1365 QSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIA 1424
Query: 1409 HQTGTTRSQILSNLNDLALGTSFL 1432
Q GTTR QILSNLNDL+LGTSFL
Sbjct: 1425 QQIGTTRMQILSNLNDLSLGTSFL 1448
>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
Length = 1461
Score = 2420 bits (6272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1175/1462 (80%), Positives = 1296/1462 (88%), Gaps = 31/1462 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELP-SKRGIGPVPNLV 59
MS+AAYKM+HWPTGI +C SG+ITHSRAD+VPQIP IQT+ LDSE P SKRGIGP+PNL+
Sbjct: 1 MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60
Query: 60 VTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119
VTA +V+E+YVVRVQE+GS+ES++S ETKR LMDG+S ASLELVCHYRLHGNVES+ +L
Sbjct: 61 VTAGSVLEVYVVRVQEDGSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVVL 120
Query: 120 SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179
G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEWLHLKRGRESFAR
Sbjct: 121 PTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAR 180
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GPL+KVDPQGRCGG+LVY +QMIIL+A+Q SGLVGD+D SGG SAR++SS+VINLR
Sbjct: 181 GPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLR 240
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
D+DMKHVKDFIF+H YIEPV+VILHERELTWAGRVSWKHHTCMISALSISTTLKQ LIW
Sbjct: 241 DMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIW 300
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
S +NLPHDAYKLLAVP PIGGVLV+ ANTIHYHS+SA+ ALALNNYAVS+DSSQELPR+S
Sbjct: 301 SVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRAS 360
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419
FSVELDA A WL NDVALLS K G+L+LL++VYDGRVVQRLDLSK+ SVLTSDITTIG
Sbjct: 361 FSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIG 420
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
NSLFFLGSRLGDSLLVQFT G G S++SSGLKEE G+IE D PS KRL+RS+SD LQDMV
Sbjct: 421 NSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDMV 480
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
+GEELSLYGS +NNTESAQK+FSFAVRDSL+N+GPLKDFSYGLR N DASATGI+KQSNY
Sbjct: 481 SGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNY 540
Query: 540 EL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 573
+L V+LPGC+GIWTVYHK++RGHN D S+MAA D
Sbjct: 541 DLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAAD 600
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
EYHAYLIIS+EARTMVLETADLL+EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI
Sbjct: 601 EYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 660
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
LDGS+MTQDLS G SNSES GSE++TV SVSIADPYVL+ M+DGSIRLL+GD STC VS
Sbjct: 661 LDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMVS 720
Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG---ADGGPLDQG 750
+ TP+A E+S++ VS+CTLYHDKGPEPWLRK STDAWLSTGV EAIDG ADGGP DQG
Sbjct: 721 INTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQG 780
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
DIY +VCYESGALEIFDVPNFN VF+VDKFVSG+TH+ D Y+RE KDS+ + N SEE
Sbjct: 781 DIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEEV 840
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
G GRKEN H+MK VELAMQRWS HHSRPFLF +LTDGTILCY AYLFE P+ TSK++D
Sbjct: 841 AGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTEDS 900
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
VS + + ++SASRLRNLRF R PLD+Y +EET CQRITIF NISGHQGFFL G
Sbjct: 901 VSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLLG 960
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
SRP W MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQLPS S YD
Sbjct: 961 SRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNYD 1020
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
NYWPVQK IPLK TPHQ+TYF EKNLYPLIVSVPV KP+NQVLS L+DQEVGHQI+NHNL
Sbjct: 1021 NYWPVQK-IPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQVLSSLVDQEVGHQIENHNL 1079
Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET 1110
SS +L +TY+VEE+EVRILE + GGPWQT+ATIPMQSSENALTVRVVTLFN TTKENET
Sbjct: 1080 SSDELLQTYSVEEFEVRILESENGGGPWQTKATIPMQSSENALTVRVVTLFNATTKENET 1139
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
LLAIGTAYVQGEDVAARGRVLLFS ++ +N Q LV+EVYSKELKGAISALASLQGHLLI
Sbjct: 1140 LLAIGTAYVQGEDVAARGRVLLFSVVKSTENSQVLVSEVYSKELKGAISALASLQGHLLI 1199
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
ASGPKIILHKWTGTELNG+AFYDAPPLYV S+NIVKNFILLGDIHKSIYFLSWKEQGAQL
Sbjct: 1200 ASGPKIILHKWTGTELNGVAFYDAPPLYVASMNIVKNFILLGDIHKSIYFLSWKEQGAQL 1259
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1290
+LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM ESWKGQKLLSRAEFH
Sbjct: 1260 SLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFH 1319
Query: 1291 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1350
VGAH+TKF+RL ML+TSSDR+GAAPG DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS
Sbjct: 1320 VGAHITKFIRLSMLSTSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1379
Query: 1351 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
LQ+KLVD+VPHVAGLNPRSFRQF S+GK HRPGP+SIVDCELLSH+EMLPLEEQLEIA Q
Sbjct: 1380 LQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQQ 1439
Query: 1411 TGTTRSQILSNLNDLALGTSFL 1432
GTTR+QILSNLNDL+LGTSFL
Sbjct: 1440 VGTTRAQILSNLNDLSLGTSFL 1461
>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1447
Score = 2387 bits (6187), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1148/1460 (78%), Positives = 1290/1460 (88%), Gaps = 41/1460 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
MSFAAYKMM PTGI NC +GF+THSR+D+VP +Q ++LD+E PS+ +G +PNL
Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAEWPSRPRHHVGSLPNL 56
Query: 59 VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAI 118
VVTAANV+E+Y VR+QE+ + K + +++R L+DGI+ ASLELVCHYRLHGNVE++A+
Sbjct: 57 VVTAANVLEVYAVRLQED--QPPKAAADSRRGALLDGIAGASLELVCHYRLHGNVETMAV 114
Query: 119 LSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA 178
LS GG D SRRRDSI+L F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE FA
Sbjct: 115 LSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 174
Query: 179 RGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
RGP+VKVDPQGRCGGVL+Y LQMIILKA+Q GSGLVG++D GS G +ARIESS++INL
Sbjct: 175 RGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINL 234
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
RDLDM+HVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI
Sbjct: 235 RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 294
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
WSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALN+YAV+LDSSQE+PRS
Sbjct: 295 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRS 354
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
SF+VELDAA+ATWL +DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+ SVL+S ITTI
Sbjct: 355 SFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 414
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
GNSLFFL SRLGDS+LVQF+CGSG SMLSS LKEE GDIEADAPS KRLRRS SDALQDM
Sbjct: 415 GNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQDM 473
Query: 479 VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN 538
V+GEELSLYGSA N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQSN
Sbjct: 474 VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 533
Query: 539 YEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
YEL VELPGCKGIWTVYHKS+R HNADSS+MA D
Sbjct: 534 YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDD 593
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
DEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGR RVIQV+ERGAR
Sbjct: 594 DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653
Query: 633 ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
ILDGS+MTQD+SFG SN ESGS S+++ LSVSIADP+VLL MSDGSIRLL+GDPSTCT+
Sbjct: 654 ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCTI 713
Query: 693 SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
SV +PA+ ESSK VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE IDG DG D GDI
Sbjct: 714 SVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGDI 773
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
Y VVC+++G LEIFDVPNFNCVF+V+ F+SG++H+VD M+E LKDS+ +
Sbjct: 774 YCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVIN 830
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QGRKENI MKVVELAMQRWS HSRPFLF IL+DGTILCY AYL+E P++TSK +D S
Sbjct: 831 QGRKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSAS 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
S+ +S+ + SRLRNLRF R PLDAY RE+T +G PCQ+ITIFKNI ++GFFLSGSR
Sbjct: 891 AGGSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSR 950
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W MV RERLRVHPQLCDGSIVAFTVLHNVNCN G IYVTSQG+LKICQLPSGS YD+Y
Sbjct: 951 PAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSY 1010
Query: 993 WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1052
WPVQK IPLKATPHQ+TYFAEKNLYPLIVS PVLKPLNQV+S L+DQ++ HQ ++ N++
Sbjct: 1011 WPVQK-IPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVIS-LVDQDINHQNESQNMNP 1068
Query: 1053 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1112
+ +R Y ++E+EVRI+EP+++GGPWQT+ATIPMQSSENALTVR+VTL NTT+KENETLL
Sbjct: 1069 DEQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLVNTTSKENETLL 1128
Query: 1113 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
AIGTAYVQGEDVAARGR+LLFS G+N DNPQ LV+EVYSKELKGAISALASLQGHLLIAS
Sbjct: 1129 AIGTAYVQGEDVAARGRILLFSLGKNTDNPQTLVSEVYSKELKGAISALASLQGHLLIAS 1188
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
GPKIILHKW GTELNGIAF+DAPPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+L
Sbjct: 1189 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1248
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1292
LAKDFGSLDCFATEFLIDGSTLSL+VSD+ +NIQIFYYAPKMSESWKGQKLLSRAEFHVG
Sbjct: 1249 LAKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1308
Query: 1293 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1352
AHVTKFLRLQML+T SDR GA PGSDKTNRFALLFGTLDGSIGCIAPLDE+TFRRLQSLQ
Sbjct: 1309 AHVTKFLRLQMLST-SDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQ 1367
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+KLVD+VPHVAGLNPR+FR F SNGKAHRPGPDSIVDCELL HYEMLPLEEQLEIAHQ G
Sbjct: 1368 RKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVG 1427
Query: 1413 TTRSQILSNLNDLALGTSFL 1432
TTRSQILSNL+DL+LGTSFL
Sbjct: 1428 TTRSQILSNLSDLSLGTSFL 1447
>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1449
Score = 2362 bits (6122), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1147/1461 (78%), Positives = 1292/1461 (88%), Gaps = 41/1461 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDS-ELPSK--RGIGPVPN 57
MSFAAYKMM PTGI NC +GF+THSR+D+VP +Q ++LD+ E PS+ +GP+PN
Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAAEWPSRPRHHVGPLPN 56
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVVTAANV+E+Y VR+QE+ + S +++R L+DGI+ ASLEL CHYRLHGNVE++A
Sbjct: 57 LVVTAANVLEVYAVRLQED-QQPKDASDDSRRGTLLDGIAGASLELECHYRLHGNVETMA 115
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+LS GG D SR+RDSIIL F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE F
Sbjct: 116 VLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 175
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
ARGP+VK+DPQGRCGGVL+Y LQMIILKA+Q GSGLVGD+D FGS G +ARIESS++IN
Sbjct: 176 ARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMIN 235
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDLDM+HVKDF FV+GYIEPVMVILHERELTWAGRVSW HHTCMISALSISTTLKQHPL
Sbjct: 236 LRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPL 295
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+GANTIHYHSQSASCALALNNYAV+LDSSQE+PR
Sbjct: 296 IWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPR 355
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
SSF+VELDAA+ATWL +DVALLSTKTG+L+LL +VYDGRVVQRLDLSK+ SVL+S ITT
Sbjct: 356 SSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITT 415
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
IGNSLFFL SRLGDS+LVQF+CGSG SM+SS LKEE GDIE DAPS KRLRRS SDALQD
Sbjct: 416 IGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQD 474
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
MV+GEELSLYGSA+N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQS
Sbjct: 475 MVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 534
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKS+R HNADSS+MA
Sbjct: 535 NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 594
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
DDEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGRRRVIQV+ERGA
Sbjct: 595 DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGA 654
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+MTQD+SFG SNSESGS SE++ LSVSIADP+VLL MSDGSIRLL+GDPSTCT
Sbjct: 655 RILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 714
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
+SV +PA+ ESSK VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG DG D GD
Sbjct: 715 ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGD 774
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVC+++G LEIFD+PNFNCVF+V+ F+SG++H+VD M+E LKDS+ +
Sbjct: 775 IYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVV 831
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
QGRK+NI +MKVVELAMQRWS HSRPFLF IL+DGTILCY AYL+E P+ TSK +D
Sbjct: 832 NQGRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSA 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S S+ +S+ + SRLRNLRF R PLDAY RE+T +G+PCQ+ITIFKNI +QGFFLSGS
Sbjct: 892 SAGGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP W MV RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG+LKICQLPSGS YD+
Sbjct: 952 RPAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDS 1011
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
YWPVQK IPLKATPHQ+TYFAEKNLYPLIVS PVLKPLNQV+S L+DQ+ HQ ++ N++
Sbjct: 1012 YWPVQK-IPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVIS-LVDQDFNHQNESQNMN 1069
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
+ +R Y ++E+EVRI+EP+++GGPWQT+ATIPMQSSENALTVR+VTL NTT+KENETL
Sbjct: 1070 PDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETL 1129
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
LAIGTAYVQGEDVAARGR+LLFS G+ DNPQ LV+EVYSKELKGAISALASLQGHLLIA
Sbjct: 1130 LAIGTAYVQGEDVAARGRILLFSLGKITDNPQTLVSEVYSKELKGAISALASLQGHLLIA 1189
Query: 1172 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1231
SGPKIILHKW GTELNGIAF+DAPPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+
Sbjct: 1190 SGPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLS 1249
Query: 1232 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1291
LLAKDFGSLDCFATEFLIDGSTLSL+VSD+ +NIQIFYYAPKMSESWKGQKLLSRAEFHV
Sbjct: 1250 LLAKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHV 1309
Query: 1292 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
GAHVTKFLRLQML+T SDR G+ PGSDKTNRFALLFGTLDGSIGCIAPLDE+TFRRLQSL
Sbjct: 1310 GAHVTKFLRLQMLST-SDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSL 1368
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
Q+KLVD+VPHVAGLNPR+FR F SNGKAHRPGPDSIVDCELL HYEMLPLEEQLEIA+Q
Sbjct: 1369 QRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQI 1428
Query: 1412 GTTRSQILSNLNDLALGTSFL 1432
GTTRSQILSNL+DL+LGTSFL
Sbjct: 1429 GTTRSQILSNLSDLSLGTSFL 1449
>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
Length = 1455
Score = 2355 bits (6103), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1150/1467 (78%), Positives = 1271/1467 (86%), Gaps = 47/1467 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKR----GIGPVP 56
MS+AAYKMMHWPT I C SGF+THSR++ +P + T++LDS+ PS+R GIGP P
Sbjct: 1 MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + NV+E+YVVRVQEEG++ +SGE KR +MDG++ ASLELVCHYRLHGNVES+
Sbjct: 60 NLIVASGNVLELYVVRVQEEGAR---SSGELKRGGVMDGVAGASLELVCHYRLHGNVESM 116
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+LS G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE P+W HLKRGRES
Sbjct: 117 GVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRES 176
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
FARGPLVKVDPQGRCGGVLVY LQMIILKA+Q GS LV DED FGSG SA I SS++I
Sbjct: 177 FARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYII 236
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
NLRDLDMKHVKDFIFVH YIEPV+V+LHERELTWAGRV WKHHTCMISALSISTTLKQ
Sbjct: 237 NLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPT 296
Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
LIWS NLPHDAYKLLAVPSPIGGVLV+G NTIHYHS+SASCALALN+YA S+DSSQELP
Sbjct: 297 LIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELP 356
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
R++FSVELDAA+ATWL DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+ SVLTSDIT
Sbjct: 357 RATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDIT 416
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
T+GNS FFLGSRLGDSLLVQFT G G+SMLS GLKEE GDIE D PS KRL+ SSSDALQ
Sbjct: 417 TLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDALQ 476
Query: 477 DMVNGEELSLYGSASNNTESAQ-----KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
DMV+GEELSLY SA NN ES+Q KTFSF VRDSL+N+GPLKDF+YGLRINADA+AT
Sbjct: 477 DMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANAT 536
Query: 532 GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
GISKQSNYEL VELPGCKGIWTVYHK++R H+ DS
Sbjct: 537 GISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARIHSVDS 596
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
+MA+ DDEYHAYLIIS+EARTMVLETAD LTEVTESVDYFVQGRTIAAGNLFGRRRV+Q
Sbjct: 597 LKMAS-DDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVVQ 655
Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
VFERGARILDGS+MTQDLSFG SNSE+G SE+STV+ VSI DPYVL+ M+DGSI++LVG
Sbjct: 656 VFERGARILDGSFMTQDLSFGGSNSETGR-SESSTVMHVSIVDPYVLVRMADGSIQILVG 714
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
DPS CTVSV TP+A +SS K VS+CTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGAD G
Sbjct: 715 DPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADSG 774
Query: 746 PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
+QGDIY VVCYE+GALEIFDVPNFN VF VDKFVSG+TH++DT E KD +
Sbjct: 775 AHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDM---MKG 831
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
EE G GRKE+ +MKVVEL M RWS HSRPFLF ILTDGTILCY AYLFEGP+ TS
Sbjct: 832 VKEEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTS 891
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
K +D VS S+ S +SASRLRNLRF R PLD YTREET CQRIT FKNISG+QG
Sbjct: 892 KLEDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQG 951
Query: 926 FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
FFLSGSRP W MVFRERLRVHPQLCDGSIVAFTVLH VNCNHG IYVTSQG LKIC L S
Sbjct: 952 FFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSS 1011
Query: 986 GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1045
S+YDNYWPVQK IPLK TPHQ+TYFAE+NLYPLIVSVPV KP+NQVLS L+DQEVGHQI
Sbjct: 1012 VSSYDNYWPVQK-IPLKGTPHQVTYFAERNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQI 1070
Query: 1046 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1105
+NHNLSS ++HRTY+V+E+EVRILEP + GPWQ +ATIPMQ+SENALTVR+V+LFNT+T
Sbjct: 1071 ENHNLSSEEIHRTYSVDEFEVRILEP--SNGPWQVKATIPMQTSENALTVRMVSLFNTST 1128
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
KENETLLA+GTAYVQGEDVAARGR+LLFS +N +N Q LV+EVYSKELKGAISALASLQ
Sbjct: 1129 KENETLLAVGTAYVQGEDVAARGRILLFSVVKNPENSQILVSEVYSKELKGAISALASLQ 1188
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
GHLLIASGPKIILHKWTGTEL G+AF DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE
Sbjct: 1189 GHLLIASGPKIILHKWTGTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1248
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
QGAQL+LLAKDF SLDCF+TEFLIDGSTLSLVVSDEQKN+QIFYYAPKMSESWKGQKLLS
Sbjct: 1249 QGAQLSLLAKDFASLDCFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLS 1308
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1345
RAEFHVGA VTKF+RLQML+ S DR+GAAP SDKTNRFALLFGTLDGSIGCIAPLDELTF
Sbjct: 1309 RAEFHVGALVTKFMRLQMLSPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELTF 1368
Query: 1346 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1405
RRLQSLQKKLVD+VPHVAGLNP+SFRQF S+GKAHRPGP+SIVDCE+LS+YEM+PLEEQ+
Sbjct: 1369 RRLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQV 1428
Query: 1406 EIAHQTGTTRSQILSNLNDLALGTSFL 1432
EIA Q GTTR+QILSNLNDL LGTSFL
Sbjct: 1429 EIAQQIGTTRAQILSNLNDLTLGTSFL 1455
>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
Length = 1444
Score = 2251 bits (5833), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1096/1461 (75%), Positives = 1254/1461 (85%), Gaps = 46/1461 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQ-TEELDSELPS-KRGIGPVPNL 58
MSFAA+KMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAFKMMHWPTGVENCASGYITHSLSDSTLQIPIVSGDDDMEAEWPNHKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TA N++E+Y+VR QEEG ++E + KR +MDG+S SLELVCHYRLHGNVES+A
Sbjct: 61 VITAGNILEVYIVRAQEEGNTQELRIPKLVKRGGVMDGVSGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILKASQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKASQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI+TTLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINTTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ +DVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRL R SSD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRL-RISSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIIS+EARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISVEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG NSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGAPNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SKK +S+CTL+HDKGPEPWLRK STDAWLS+GVGEA+D ADGGP DQGD
Sbjct: 720 VSISSPSVLEGSKKKISACTLFHDKGPEPWLRKASTDAWLSSGVGEAVDSADGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY V+CYESGALEIFDVP FNCVF+VDKF SGR H+ D + E E E+N +SE+
Sbjct: 780 IYCVLCYESGALEIFDVPGFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSED-N 834
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
R E I + KVVEL+MQRWS H+RPFLFA+L DGTILCY AYLFEG ++T K+++ V
Sbjct: 835 ASSRNEEIKNTKVVELSMQRWSGPHTRPFLFAVLADGTILCYHAYLFEGVDST-KAENSV 893
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S+ ++++ +S+LRNL+F R P D TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 894 SSENPAALNSSGSSKLRNLKFLRIPFDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 953
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVTSQ +LKICQLPS S YDN
Sbjct: 954 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIYDN 1013
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
YWPVQK IPLKATPHQ+TY+AEKNLYPLIVS PV KP+NQVLS L+DQE G QIDNHNLS
Sbjct: 1014 YWPVQK-IPLKATPHQVTYYAEKNLYPLIVSYPVSKPINQVLSSLVDQEAGQQIDNHNLS 1072
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
S DL RTYTVEE+E++ILEP+R+GGPW+T+ATIPMQSSE+ALTVRVVTL N +T ENETL
Sbjct: 1073 SDDLQRTYTVEEFEIQILEPERSGGPWETKATIPMQSSEHALTVRVVTLLNASTGENETL 1132
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
LA+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+
Sbjct: 1133 LAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLIS 1192
Query: 1172 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1231
SGPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK FILLGD+HKSIYFLSWKEQG+QL+
Sbjct: 1193 SGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKTFILLGDVHKSIYFLSWKEQGSQLS 1252
Query: 1232 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1291
LLAKDFGSLDCFATEFLIDG+TLSL VSDEQKNIQ+FYYAPKM+ESWKGQKLLSRAEFHV
Sbjct: 1253 LLAKDFGSLDCFATEFLIDGNTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHV 1312
Query: 1292 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
G+HVTKFLRLQM+ + G+DKTNRFALLFGTLDGS GCIAPLDE+TFRRLQSL
Sbjct: 1313 GSHVTKFLRLQMVTS---------GADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQSL 1363
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
QKKLVD+VPHVAGLNP SFRQF ++GKA R GPDSI+DCELL HYEMLPLEEQLE+AHQ
Sbjct: 1364 QKKLVDAVPHVAGLNPHSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQI 1423
Query: 1412 GTTRSQILSNLNDLALGTSFL 1432
GTTRS IL NL +L++GTSFL
Sbjct: 1424 GTTRSVILLNLVELSVGTSFL 1444
>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 1504
Score = 2248 bits (5824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1119/1510 (74%), Positives = 1248/1510 (82%), Gaps = 84/1510 (5%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFAAY+MMHWPTGI NC S +ITHSRAD+VP + +++LDS+ +R IGPVPNLVV
Sbjct: 1 MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVT-SHSDDLDSDWHPRRDIGPVPNLVV 59
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TA NV+E+YVVRV EEG +ESK+SGE +R +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60 TAGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILS 119
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LD+KHVKDF+FVHGYIEPVMVILHE+ELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWS
Sbjct: 240 LDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWS 299
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A NLPHDAYKLLAVPSPIGGVLV+ AN+IHY+SQSASC LALNNYAVS DSSQ++PRS+F
Sbjct: 300 ASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNF 359
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
+VELDAA+ATWL NDVALLSTKTG+L+LL +VYDGRVVQRLDLSK+ SVLTS I +IGN
Sbjct: 360 NVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGN 419
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-------------------------- 454
SLFFLGSRLGDSLLVQF+CG G+S L+S LK+E
Sbjct: 420 SLFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEITYYTQNLQKEMVPPTLPSALVHESKP 479
Query: 455 ----GDIEADAPS----------------------TKRLRRSSSDALQDMVNGEELSLYG 488
G IE + + R+ R V G+ELSLYG
Sbjct: 480 TQAKGTIELNNNNLCVENDIVDVVEVDITNMTILGENRIARRDETLTDTQVGGDELSLYG 539
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------ 542
SA+NNTESAQK FSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNYELV
Sbjct: 540 SAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHG 599
Query: 543 --------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
ELPGCKGIWTVYHK++RG ADSSRM DDEYHAYLIIS
Sbjct: 600 KNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIIS 659
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
LEARTMVL T +LLTEVTESVDYFV GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQD
Sbjct: 660 LEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQD 719
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
L+ + +ESG+ SE TVLS SI+DPYVLL M+DGSIRLLVGD S+C+VSV PAA S
Sbjct: 720 LNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVSAPAAFGS 779
Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
SKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG DQGDIY V CY++G
Sbjct: 780 SKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDNGD 839
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
LEIFDVPNF VF VDKFVSG++H+VD + + K SE + NS +E GR E+ +M
Sbjct: 840 LEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRNESSQNM 897
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
KV+E+AMQRWS HSRPFLF ILTDGTILCY AYLFE ++ SK DD VS S+S SN+
Sbjct: 898 KVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNSVSSSNM 957
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
S+SRLRNLRF R PLD RE+ P+G R++IFKNISG+QG FL GSRP W MVFRER
Sbjct: 958 SSSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWFMVFRER 1017
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLK 1002
LRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQKV PLK
Sbjct: 1018 LRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKV-PLK 1076
Query: 1003 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1062
TPHQ+TYF EKNLYP+I+S PV KPLNQVLS ++DQ+VGH ++NHNLS+ +L +TY+VE
Sbjct: 1077 GTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQQTYSVE 1135
Query: 1063 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1122
E+E+RILEP+++GGPWQTRATI M SSENALT+RVVTL NTTTKENETLLA+GTAYVQGE
Sbjct: 1136 EFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAYVQGE 1195
Query: 1123 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1182
DVAARGRVLLFS G++ADN Q LV+EVYSKELKGAISALASLQGHLLIASGPKIILHKWT
Sbjct: 1196 DVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1255
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
G ELNGIAFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL+LLAKDFGSLDC
Sbjct: 1256 GAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDC 1315
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1302
+ATEFLIDGSTLSL VSD+QKNIQIFYYAPK +ESWKGQKLLSRAEFHVGAHVTKFLRLQ
Sbjct: 1316 YATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQ 1375
Query: 1303 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHV 1362
ML+TSSD+ + SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL D+VPHV
Sbjct: 1376 MLSTSSDK-ACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHV 1434
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
GLNPRSFRQFHSNGK HR GPDSIVDCELL HYEMLPLEEQL+IAHQ GTTRSQILSNL
Sbjct: 1435 GGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNL 1494
Query: 1423 NDLALGTSFL 1432
NDL+LGTSFL
Sbjct: 1495 NDLSLGTSFL 1504
>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
kDa subunit
gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
Length = 1442
Score = 2245 bits (5817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1092/1461 (74%), Positives = 1254/1461 (85%), Gaps = 48/1461 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +SD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGGP DQGD
Sbjct: 720 VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N +SE+ T
Sbjct: 780 IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836 S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S+ ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 892 SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
YWPVQK IPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+S
Sbjct: 1012 YWPVQK-IPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMS 1070
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
S DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETL
Sbjct: 1071 SDDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETL 1130
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
LA+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+
Sbjct: 1131 LAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLIS 1190
Query: 1172 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1231
SGPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QL+
Sbjct: 1191 SGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLS 1250
Query: 1232 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1291
LLAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSRAEFHV
Sbjct: 1251 LLAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHV 1310
Query: 1292 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
GAHV+KFLRLQM+++ G+DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSL
Sbjct: 1311 GAHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSL 1361
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
QKKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE+AHQ
Sbjct: 1362 QKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQI 1421
Query: 1412 GTTRSQILSNLNDLALGTSFL 1432
GTTR IL +L DL++GTSFL
Sbjct: 1422 GTTRYSILKDLVDLSVGTSFL 1442
>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
subunit [Arabidopsis thaliana]
Length = 1442
Score = 2243 bits (5812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1091/1461 (74%), Positives = 1253/1461 (85%), Gaps = 48/1461 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +SD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS +N++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPDNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGGP DQGD
Sbjct: 720 VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N +SE+ T
Sbjct: 780 IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836 S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 892 SPENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
YWPVQK IPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+S
Sbjct: 1012 YWPVQK-IPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMS 1070
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
S DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETL
Sbjct: 1071 SDDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETL 1130
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
LA+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+
Sbjct: 1131 LAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLIS 1190
Query: 1172 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1231
SGPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QL+
Sbjct: 1191 SGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLS 1250
Query: 1232 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1291
LLAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSRAEFHV
Sbjct: 1251 LLAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHV 1310
Query: 1292 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
GAHV+KFLRLQM+++ G+DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSL
Sbjct: 1311 GAHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSL 1361
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
QKKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE+AHQ
Sbjct: 1362 QKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQI 1421
Query: 1412 GTTRSQILSNLNDLALGTSFL 1432
GTTR IL +L DL++GTSFL
Sbjct: 1422 GTTRYSILKDLVDLSVGTSFL 1442
>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
thaliana]
Length = 1448
Score = 2239 bits (5802), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1092/1467 (74%), Positives = 1254/1467 (85%), Gaps = 54/1467 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +SD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
+ EELSL+GS NN++SAQ K+FSFAVRDSLVN+GP+KDF+YGLRINADA+AT
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQVTSSVLKSFSFAVRDSLVNVGPVKDFAYGLRINADANAT 539
Query: 532 GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
G+SKQSNYEL VELPGCKGIWTVYHKSSRGHNADS
Sbjct: 540 GVSKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADS 599
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
S+MAA +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQ
Sbjct: 600 SKMAADEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQ 659
Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
VFE GARILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVG
Sbjct: 660 VFEHGARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVG 719
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
DPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGG
Sbjct: 720 DPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGG 779
Query: 746 PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
P DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N
Sbjct: 780 PQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNK 835
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
+SE+ T + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T
Sbjct: 836 NSEDNTS---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST- 891
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
K+++ +S+ ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQG
Sbjct: 892 KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQG 951
Query: 926 FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
FFLSGSRP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS
Sbjct: 952 FFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPS 1011
Query: 986 GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1045
S YDNYWPVQK IPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+
Sbjct: 1012 ASIYDNYWPVQK-IPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQL 1070
Query: 1046 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1105
DNHN+SS DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T
Sbjct: 1071 DNHNMSSDDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNAST 1130
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
ENETLLA+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+Q
Sbjct: 1131 GENETLLAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQ 1190
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
GHLLI+SGPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKE
Sbjct: 1191 GHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKE 1250
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
QG+QL+LLAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLS
Sbjct: 1251 QGSQLSLLAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLS 1310
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1345
RAEFHVGAHV+KFLRLQM+++ G+DK NRFALLFGTLDGS GCIAPLDE+TF
Sbjct: 1311 RAEFHVGAHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTF 1361
Query: 1346 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1405
RRLQSLQKKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQL
Sbjct: 1362 RRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQL 1421
Query: 1406 EIAHQTGTTRSQILSNLNDLALGTSFL 1432
E+AHQ GTTR IL +L DL++GTSFL
Sbjct: 1422 ELAHQIGTTRYSILKDLVDLSVGTSFL 1448
>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
Length = 1441
Score = 1872 bits (4850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 945/1473 (64%), Positives = 1123/1473 (76%), Gaps = 73/1473 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ + S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
ICQLPS YD+YWPVQKV PL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQ
Sbjct: 998 ICQLPSAYNYDSYWPVQKV-PLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQ 1056
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ ENALTVR+VT
Sbjct: 1057 ESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVT 1116
Query: 1100 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAIS 1159
L NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+S
Sbjct: 1117 LHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVS 1175
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+ASLQGHLLIASGPKI L+KWTG EL +AFYDA PL+VVSLNIVKNF+L GDIHKSIY
Sbjct: 1176 AVASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIY 1234
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
FLSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD KN+QIFYYAPKM ESWK
Sbjct: 1235 FLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWK 1294
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
GQKLLSRAEFHVGAH+TKFLRLQML T S+KTNRFALLFG LDG IGCIAP
Sbjct: 1295 GQKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTNRFALLFGNLDGGIGCIAP 1348
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL+HYEML
Sbjct: 1349 IDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLAHYEML 1408
Query: 1400 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1409 SLDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441
>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
Length = 1441
Score = 1869 bits (4841), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 944/1473 (64%), Positives = 1121/1473 (76%), Gaps = 73/1473 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ + S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
ICQLPS YD+YWPVQKV PL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQ
Sbjct: 998 ICQLPSAYNYDSYWPVQKV-PLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQ 1056
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ ENALTVR+VT
Sbjct: 1057 ESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVT 1116
Query: 1100 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAIS 1159
L NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+S
Sbjct: 1117 LHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVS 1175
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+ASLQGHLLIASGPKI L+KWTG EL +AFYDA PL+VVSLNIVKNF+L GDIHKSIY
Sbjct: 1176 AVASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIY 1234
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
FLSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD KN+QIFYYAPKM ESWK
Sbjct: 1235 FLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWK 1294
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
GQKLLSRAEFHVGAH+TKFLRLQML T S+KTNRFALLFG LDG IGCIAP
Sbjct: 1295 GQKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTNRFALLFGNLDGGIGCIAP 1348
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL YEML
Sbjct: 1349 IDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLCSYEML 1408
Query: 1400 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1409 SLDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441
>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 1-like [Brachypodium distachyon]
Length = 1442
Score = 1861 bits (4820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 946/1472 (64%), Positives = 1128/1472 (76%), Gaps = 70/1472 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--ELDSELPSK----RGIGP 54
MS+AAYKMMHWPTGI +C +GFITH +D E D L + + +GP
Sbjct: 1 MSYAAYKMMHWPTGIDHCAAGFITHCPSDAAAFCSAAAASGPEGDVGLVAAARHPKRLGP 60
Query: 55 VPNLVVTAANVIEIYVVRVQEEGS------KESKNSGETKRRVLMDGISAASLELVCHYR 108
PNLVV AANV+E+Y VR + + S +SG + DGIS A LELVCHYR
Sbjct: 61 TPNLVVAAANVLEVYAVRADAAAADGAGGAQPSSSSG-----AVFDGISGARLELVCHYR 115
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
LHGN+ES+AILS G A+N RRDSI LAF DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 LHGNIESMAILSDG-AEN--RRDSIALAFRDAKITCLEFDDAIHGLRTSSMHCFEGPEWQ 172
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
HLKRGRESFA GP++K DP GRCG LVYGLQMIILK++Q G LVG+++ + +
Sbjct: 173 HLKRGRESFAWGPVIKSDPLGRCGAALVYGLQMIILKSAQVGQSLVGEDEPTRALSSAAV 232
Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
RIESS++I+LR LD HVKDF FVHGYIEPV+VILHERE TWAGR+S KHHTCMISA SI
Sbjct: 233 RIESSYLIDLRALDTNHVKDFTFVHGYIEPVLVILHEREPTWAGRISSKHHTCMISAFSI 292
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
S TLKQHP+IWSA N+PHDAY++L+VP PI GVLV+ AN+IHYHSQS SC+LALNN+A
Sbjct: 293 SMTLKQHPMIWSAANIPHDAYQILSVPPPISGVLVICANSIHYHSQSTSCSLALNNFASQ 352
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
D S E+ + +F VELDAA ATWL ND+ + S KTG+++LLTVVYDGR VQ+LDL K+
Sbjct: 353 PDGSPEIHKVNFHVELDAAKATWLSNDIVMFSAKTGEMLLLTVVYDGRTVQKLDLMKSKA 412
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SV++S +TTIG+S FFLGSR+GDSLLVQF+CG TS++ E DIE D P +KRL+
Sbjct: 413 SVISSGVTTIGSSFFFLGSRVGDSLLVQFSCGVPTSVIPDIADERSADIEGDLPFSKRLK 472
Query: 469 RSSSDALQDMVNGEELSLYGSA-SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
R SD LQD+ + EELS + N+ ESAQK S+ VRD+LVN+GPLKDFSYGLR+NAD
Sbjct: 473 RVPSDILQDVTSVEELSFQNNMLPNSLESAQK-ISYVVRDALVNVGPLKDFSYGLRVNAD 531
Query: 528 ASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGH 561
+ATG +KQSNYEL VELP C+GIWTVY+KSSRGH
Sbjct: 532 PNATGNAKQSNYELVCCSGHGKNGALSVLQQSIRPDLITEVELPSCRGIWTVYYKSSRGH 591
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
+ D+EYHAYLIISLE+RTMVLET D L EVTE+VDY+VQG TI AGNLFGRR
Sbjct: 592 TTE-------DNEYHAYLIISLESRTMVLETGDDLGEVTETVDYYVQGATITAGNLFGRR 644
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST-VLSVSIADPYVLLGMSDGSI 680
RVIQV+ GAR+LDGS+MTQ+L+F +SES S V S SIADPYVLL M DG+I
Sbjct: 645 RVIQVYATGARVLDGSFMTQELNFTALSSESSSSGSEPLGVASASIADPYVLLKMVDGTI 704
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LLVGD STC +S+ P+ + S + +S+CTLYHD+GPEPWLRKT DAWLS+GV A+D
Sbjct: 705 QLLVGDHSTCALSINAPSTLTSRGERISACTLYHDRGPEPWLRKTRGDAWLSSGVTVAVD 764
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
+ DQ DIY ++CYESG LEIF+VP+F VF+V F SG + +VD + + +DS
Sbjct: 765 VSGSSSQDQSDIYCIICYESGKLEIFEVPSFRQVFSVGSFFSGESLLVDAFAQGFTEDSA 824
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
+E +KE +++++VELAM RWS SRPFLF +L DGT+LCYQAY +EG
Sbjct: 825 ---EGRQDETKVSLKKEVANNIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYQAYCYEG 881
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
E+ K +S S+ + N S SRL+NLRF R +D +RE+ A RITIF N+
Sbjct: 882 LESNIKGTS-LSPDGSVDLGNASDSRLKNLRFHRVSVDITSREDISSLAR-PRITIFNNV 939
Query: 921 SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
G++G FLSG+RP W MV R+R RVHPQLCDG I AFTVLHNVNC+HG IYVTSQG LKI
Sbjct: 940 GGYEGLFLSGTRPVWVMVCRQRFRVHPQLCDGPIEAFTVLHNVNCSHGLIYVTSQGFLKI 999
Query: 981 CQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
CQLPS YDNYWPVQK IPL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS++ DQE
Sbjct: 1000 CQLPSAYNYDNYWPVQK-IPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSIMADQE 1058
Query: 1041 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
+ H +DN S+ DL +TYTVEE+EVR+LE ++ GG W+TR+TIPMQS ENALTVR+VTL
Sbjct: 1059 MIHHMDNDASSADDLQKTYTVEEFEVRVLELEKPGGRWETRSTIPMQSFENALTVRIVTL 1118
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1160
NTTTKENETL+AIGTAYVQGEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1119 HNTTTKENETLMAIGTAYVQGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSA 1177
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
+ASLQGHL+IASGPKI L+KW G+EL +AFYDA PL+VVSLNIVKNF+L GDIHKS+YF
Sbjct: 1178 VASLQGHLVIASGPKITLNKWNGSELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSVYF 1236
Query: 1221 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
LSWKEQG+QL LLAKDFGSLDCFATEFLIDGSTLSLVVSD KN+QIFYYAPKM ESWKG
Sbjct: 1237 LSWKEQGSQLTLLAKDFGSLDCFATEFLIDGSTLSLVVSDSDKNLQIFYYAPKMVESWKG 1296
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
QKLLSRAE HVGAH+TKFLRLQML G A S+KTNRFALLFGTLDGSIGCIAP+
Sbjct: 1297 QKLLSRAELHVGAHMTKFLRLQMLPAQ----GLA--SEKTNRFALLFGTLDGSIGCIAPV 1350
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
DELTFRRLQSLQ+KLVD+V HV GLNPRSFRQF SNGKAHRPGPD+I+D ELL++YE+L
Sbjct: 1351 DELTFRRLQSLQRKLVDAVSHVCGLNPRSFRQFKSNGKAHRPGPDNIIDFELLTYYEILS 1410
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
LEEQL++A Q GTTR+QILSN +D++LGTSFL
Sbjct: 1411 LEEQLDMAQQIGTTRAQILSNFSDISLGTSFL 1442
>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
Length = 1503
Score = 1793 bits (4644), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 926/1529 (60%), Positives = 1114/1529 (72%), Gaps = 123/1529 (8%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
RIESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VRIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATW ND+ + S+K G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWFSNDIVMFSSKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ G+ S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYGASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWLRKT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLRKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
ICQLPS YDNYWPVQKV PL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQ
Sbjct: 998 ICQLPSAYNYDNYWPVQKV-PLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQ 1056
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ ENALTVR+VT
Sbjct: 1057 ESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVT 1116
Query: 1100 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAIS 1159
L NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+S
Sbjct: 1117 LHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFMK-SENSQNLVTEVYSKESKGAVS 1175
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+ASLQGHLLIASGPKI L+KWTG EL +AFYDA PL+VVSLNIVKNF+L GDIHKSIY
Sbjct: 1176 AVASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIY 1234
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI--FYYAPKMSE- 1276
FLSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD KN+Q+ F + +
Sbjct: 1235 FLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQVKNFVLFGDIHKS 1294
Query: 1277 ----SWKGQ----KLLSRAEFHVGAHVTKFL----RLQMLATSSDRTGA----AP----- 1315
SWK Q LL++ + T+FL L ++A+ SD+ AP
Sbjct: 1295 IYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVES 1354
Query: 1316 --------------------------------GSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
S+KTNRFALLFG LDG IGCIAP+DEL
Sbjct: 1355 WKGQKLLSRAEFHVGAHITKFLRLQMLPTQGLSSEKTNRFALLFGNLDGGIGCIAPIDEL 1414
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
TFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL+HYEML L+E
Sbjct: 1415 TFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLAHYEMLSLDE 1474
Query: 1404 QLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
QL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1475 QLDVAQQIGTTRSQILSNFSDISLGTSFL 1503
>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1452
Score = 1581 bits (4093), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 827/1497 (55%), Positives = 1035/1497 (69%), Gaps = 110/1497 (7%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADY-VPQIPLIQTEELDSELPSKRGIGPVPNLV 59
MS+AA+KM+H PTG+ NC + ++THS + IPL ++L + G G PNLV
Sbjct: 1 MSYAAFKMVHCPTGVDNCVAAYVTHSAGETDSDSIPLP-----GADLIASGGSGFPPNLV 55
Query: 60 VTAANVIEIYVVRVQE------EGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
+T ANV+E++ VR+ E GS N T R LM G+S LEL CHYRLHGNV
Sbjct: 56 ITKANVLEVFHVRLLEGDDSAANGSNGVGNPETTPRGGLMAGLSYVKLELACHYRLHGNV 115
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
ESL +LS A+ + RD+IIL F DAKISVLEFDDS HGLRI S+H FE PEW +LKRG
Sbjct: 116 ESLGVLSYRHAEGRKGRDAIILTFRDAKISVLEFDDSTHGLRIGSLHYFEGPEWQYLKRG 175
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RE FA GP V+ DP GRC GVL+Y Q+++LKA+Q G GL ++++ GG A + +S
Sbjct: 176 REQFASGPSVRADPVGRCAGVLIYNSQLVLLKAAQVGYGLGDEDESLIMGGKLCAHVATS 235
Query: 234 HVINLRDLDMKHVKDFIFVHG--------------YIEPVMVILHERELTWAGRVSWKHH 279
++++LRDLDMKH+KDF+F+HG YIEPV+V+LHE++ TWAGRV+ + H
Sbjct: 236 YIVSLRDLDMKHIKDFVFLHGKLLFLIQYIFAFSSYIEPVLVVLHEKDPTWAGRVAVRRH 295
Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCA 339
TC I+ALSI+TTLKQHP IWSA NLP+DAYKLLAVP+PIGGVLV AN++HYHSQS SCA
Sbjct: 296 TCAITALSINTTLKQHPHIWSATNLPYDAYKLLAVPAPIGGVLVFCANSLHYHSQSGSCA 355
Query: 340 LALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
L LN +AV+ + S E PRS SVELD AHATW+ N+VAL+STK G L+ L +VY+GR VQ
Sbjct: 356 LGLNEFAVAPEGSAEYPRSKMSVELDCAHATWVANEVALISTKNGMLLFLNLVYEGRSVQ 415
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
RL+L+K+ SVLTS + TIG + FFLGSRL DSLLVQ T GS + SS + GDIEA
Sbjct: 416 RLELTKSKASVLTSCMCTIGENFFFLGSRLADSLLVQHTLGSASGRTSSLM----GDIEA 471
Query: 460 D--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVRDSLVNIGPLK 516
D AP+ KRL+R S+ + + EE+SLY S ++ S +KTF+F VRDSLVNI PL+
Sbjct: 472 DLSAPAAKRLKREPSEEEEGVSA-EEMSLYYSTPTASDISQKKTFTFTVRDSLVNICPLR 530
Query: 517 DFSYGLRINADASATGISKQSNYEL--------------------------VELPGCKGI 550
DF+YGLR NAD SATG+ KQSNYEL V LPGC GI
Sbjct: 531 DFAYGLRSNADQSATGLGKQSNYELVACSGHGKNGSLSVLHQSIRPDLINKVALPGCSGI 590
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTVYHK+ R + + + DDE+HAYLIISLE+RTMVLET D L EVTE+V+Y+ +G
Sbjct: 591 WTVYHKTDRDDSNEFDFGTSEDDEFHAYLIISLESRTMVLETGDTLGEVTENVEYYTEGN 650
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS-ENSTVLSVSIADP 669
TIAAGNLFGRR V+QV++ G R+LDG+ M Q+L S E+ S N+ V+ IADP
Sbjct: 651 TIAAGNLFGRRFVVQVYQNGLRLLDGAKMLQELLITNSELENNSSEVANNLVIEAVIADP 710
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA 729
Y+LL M+DGS++L+VGD +S+ P + +++ TLY DKGP WLR+T ++
Sbjct: 711 YMLLKMTDGSLQLVVGDVENTKLSIPQPQGFGITTDAITAFTLYQDKGPHQWLRRTCSEM 770
Query: 730 ------WLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSG 783
W ST DQG +Y +VC SG EI+++P CV+ VD F G
Sbjct: 771 NSDRSQWSSTS-------------DQGYVYCIVCRISGRFEIYELPRMVCVYAVDNFNHG 817
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQGR---KENIHSMKVVELAMQRWSAHHSRPF 840
+ + D + E +S + +EE G ++ S+ V ++ + W RPF
Sbjct: 818 MSVLWDQKVLERRANSNAALKEGAEEDKAPGDALLRDAGLSLHVSQICFESWGEKFGRPF 877
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
L A L+DGT+LCY A+ ++ E++ + R + S SRL +LRF+R P+D
Sbjct: 878 LLATLSDGTMLCYHAFSYDANESSDALE-----FRETATSLKDLSRLTHLRFARIPIDWV 932
Query: 901 TREETPHGAPC---QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
+ +E GA + FKN+ G F++G RP W MV R RLR HPQ CDG+I+ F
Sbjct: 933 SGQED--GAKVLYETKFCSFKNVGSFPGVFVTGLRPTWLMVCRGRLRPHPQFCDGAILGF 990
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
T LHNVNC HGFIY+T+QG LKICQLPS YDN WPVQK IPL+ TPHQITY ++ NLY
Sbjct: 991 TPLHNVNCAHGFIYITAQGQLKICQLPSLLFYDNDWPVQK-IPLRGTPHQITYHSDVNLY 1049
Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAG 1075
LI+S PV +P +QVL GH D +S+ D R T E+YEVRI+EP + G
Sbjct: 1050 ALIISTPVSRPTSQVL-----MGDGHPFDQQQENSIGEDGQRLVTSEDYEVRIIEPAQPG 1104
Query: 1076 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
G W+ +A I M +ENALTVR+V++ N TT + +TLLAIGT+YVQGEDVAA+GR++L S
Sbjct: 1105 GNWEAKAAIKMHLTENALTVRIVSIKNITTDQTQTLLAIGTSYVQGEDVAAKGRIILVSV 1164
Query: 1136 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
G++ +P + EVYSKELKG+ISA+ASLQGHLLIA GPKIILH W G+ELNG AF+DA
Sbjct: 1165 GKDPQDPGSWAREVYSKELKGSISAIASLQGHLLIAIGPKIILHSWNGSELNGAAFFDA- 1223
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
PLYVVSLNIVKNFIL GDIHKSIYFL WKE GAQL LLAKDFGSLDC+ATEFLIDGSTLS
Sbjct: 1224 PLYVVSLNIVKNFILFGDIHKSIYFLCWKEDGAQLTLLAKDFGSLDCYATEFLIDGSTLS 1283
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1315
L+VSD +KN+QIF YAPK ESWKGQKLLSRAEFH+GAHV KF RLQML T P
Sbjct: 1284 LLVSDSRKNLQIFSYAPKSMESWKGQKLLSRAEFHLGAHVNKFHRLQMLPT--------P 1335
Query: 1316 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
GS ++NR+A+LFGTLDG+I +APLDELTFRRL +LQ+KLVD V HVAG+NPR+FRQF
Sbjct: 1336 GSARSNRYAVLFGTLDGAIDYLAPLDELTFRRLHTLQRKLVDCVSHVAGVNPRAFRQFRC 1395
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
+GKAHRPGPD+IVDCELLSHY+MLPL+EQLEIA Q GTTR+ +LSNL DLAL TSFL
Sbjct: 1396 DGKAHRPGPDNIVDCELLSHYDMLPLDEQLEIARQIGTTRAHVLSNLRDLALSTSFL 1452
>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
Length = 1413
Score = 1450 bits (3754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 773/1486 (52%), Positives = 989/1486 (66%), Gaps = 127/1486 (8%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AA K++H PTG++ C S FITHS P P S S +PNLV+
Sbjct: 1 MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
ANV+EIY VR QE G ++S GE KR M GI+AA LELVC Y
Sbjct: 51 VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RL G V+S+AIL +G D R RD+IILAF AK SVL FDD+ L+ +SMH FE PEW
Sbjct: 110 RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
+HLKRGRE F GPLV+ D QGRC GVL+Y Q++++KA+Q GLV ++D SG S
Sbjct: 169 IHLKRGREKFPGGPLVRADSQGRCAGVLIYKSQLVMMKAAQEAYGLVEEDDP--SGNIVS 226
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++ TC ++ALS
Sbjct: 227 ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
I+T K+HP +W LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N A
Sbjct: 287 INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
S E+PRS FS+ELDAAHATWL D ALLSTKTG LV L +++DGR VQRL+LSK+
Sbjct: 347 PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S + TIG+ FF+GSRLGDSLLVQF S ++ LS G+ + +KR+
Sbjct: 407 GSVLSSSLCTIGDMFFFVGSRLGDSLLVQFGSASTSNSLSQSYD---GEDDIMVRPSKRM 463
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFS-------- 519
R L D N + L Y SA ++++ F F+VRDSL NIGP++D +
Sbjct: 464 R------LDDDANEQSLYQYKSAVSDSQK-NMNFLFSVRDSLCNIGPIRDITGRSQNPSE 516
Query: 520 -------------YG----LRINADASATGISKQSNYEL------------VELPGCKGI 550
+G L I + + Q+N L V+LPGC G+
Sbjct: 517 QPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGV 576
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTVYH+S + + A DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +G
Sbjct: 577 WTVYHRSGQ--------IPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGP 628
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
+I+AGNLFGRRR+ QV+++G RILDG+ TQDL G E G+ E S S ADPY
Sbjct: 629 SISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFADPY 679
Query: 671 VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
VLL M DGS +L+VGD T TVSV TP + S P+S+CTLY+D+GP PWLR+ + D W
Sbjct: 680 VLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGDVW 739
Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT 790
+ GV +A DQGD+Y +VC SG +E ++P+ C++ V++ G + D
Sbjct: 740 QTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLADN 794
Query: 791 YMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTI 850
R A K ++ EEG + R E + +KVV++ + W + RPF+F +L+DGT+
Sbjct: 795 --RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDGTL 851
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG-- 908
L Y+A+++EG ++ + + D S RNLRF R LD EE +
Sbjct: 852 LSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNADE 897
Query: 909 -APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
Q+I FK++ G QG FL+G +P W M+FRE++R+HPQ DG IVAFT LHNVNC H
Sbjct: 898 VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNCQH 957
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK 1027
G IYVT++ LKIC+L + YDN WPVQK IPLK TPHQ+ + + N+Y L++S V
Sbjct: 958 GLIYVTNEASLKICRLSNILNYDNDWPVQK-IPLKGTPHQMAHHPDLNIYVLVLSFSVSV 1016
Query: 1028 PLNQVLSLLIDQEVGHQIDNHNLSS-VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
P + VL D GHQID S +D + V+++EVR+LEP G PW+T+ TI
Sbjct: 1017 PTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQVDDFEVRLLEPMAQGVPWETKDTIKF 1076
Query: 1087 QSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
Q +EN LTVR+V++ N T++ E LLAIGT Y+QGEDVA+RGR++L S G + +P+
Sbjct: 1077 QPAENVLTVRIVSIKNAATEQVENLLAIGTGYLQGEDVASRGRIILVSLGEDPSDPKVWA 1136
Query: 1147 TEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1206
E+YSKELKGAISALA+LQGHLL+A GPKIILH W G+EL G AF+DAP LYVVSLNIVK
Sbjct: 1137 KELYSKELKGAISALAALQGHLLLAIGPKIILHTWNGSELIGTAFFDAP-LYVVSLNIVK 1195
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
NF+L GD HKSIYFL WKE+GAQL LLAKDFGSLDC+ATEFLIDGSTLSL+VSD +KNIQ
Sbjct: 1196 NFVLFGDFHKSIYFLCWKEEGAQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNIQ 1255
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1326
+F YAPK +ESWKGQKLL R EFH+G+HVTKFLRLQML T PGS +TNRFAL
Sbjct: 1256 VFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQMLQT--------PGSSRTNRFALC 1307
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
FGTLDG IG I PLDELTFRRLQ+LQ+KLVD VPHVAGLNP+++RQF +NG+ H+ GPD+
Sbjct: 1308 FGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPDN 1367
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
VD E L YE L L++Q+ IA Q GTTR QI +NL D++L TSF
Sbjct: 1368 TVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413
>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
Length = 1413
Score = 1449 bits (3751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 773/1488 (51%), Positives = 991/1488 (66%), Gaps = 131/1488 (8%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AA K++H PTG++ C S FITHS P P S S +PNLV+
Sbjct: 1 MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
ANV+EIY VR QE G ++S GE KR M GI+AA LELVC Y
Sbjct: 51 VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RL G V+S+AIL +G D R RD+IILAF AK SVL FDD+ L+ +SMH FE PEW
Sbjct: 110 RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
+HLKRGRE F GPLV+ D QGRC GVL+Y Q++++KA+Q GLV ++D SG S
Sbjct: 169 IHLKRGREKFPGGPLVRADSQGRCAGVLIYKCQLVMMKAAQEAYGLVEEDDP--SGNIVS 226
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++ TC ++ALS
Sbjct: 227 ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
I+T K+HP +W LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N A
Sbjct: 287 INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
S E+PRS FS+ELDAAHATWL D ALLSTKTG LV L +++DGR VQRL+LSK+
Sbjct: 347 PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-GDIEADAPSTKR 466
SVL+S + TIG+ FF+GSRLGDSLLVQF G++ S+ L+ + G+ + +KR
Sbjct: 407 GSVLSSSLCTIGDKFFFVGSRLGDSLLVQF----GSASTSNSLEHSYDGEDDIMVRPSKR 462
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSY----- 520
+R L D + E SLY S ++S + F F+VRDSL NIGP++D +
Sbjct: 463 MR------LDD--DASEQSLYQYKSGVSDSQKNMNFLFSVRDSLCNIGPIRDITCRSQNP 514
Query: 521 --------------------GLRINADASATGISKQSNYEL------------VELPGCK 548
L I + + Q+N L V+LPGC
Sbjct: 515 SEQPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCV 574
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
G+WTVYH+S + + A DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +
Sbjct: 575 GVWTVYHRSGQ--------IPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTE 626
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
G +I+AGNLFGRRR+ QV+++G RILDG+ TQDL G E G+ E S S AD
Sbjct: 627 GPSISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFAD 677
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
PYVLL M DGS +L+VGD T TVSV TP + S P+S+CTLY+D+GP PWLR+ + D
Sbjct: 678 PYVLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGD 737
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
W + GV +A DQGD+Y +VC SG +E ++P+ C++ V++ G +
Sbjct: 738 VWQTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLA 792
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D+ R A K ++ EEG + R E + +KVV++ + W + RPF+F +L+DG
Sbjct: 793 DS--RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDG 849
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
T+L Y+A+++EG ++ + + D S RNLRF R LD EE +
Sbjct: 850 TLLSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNA 895
Query: 909 ---APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
Q+I FK++ G QG FL+G +P W M+FRE++R+HPQ DG IVAFT LHNVNC
Sbjct: 896 DEVRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNC 955
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
HG IYVT++ LKIC+L + YDN WPVQK IPLK TPHQ+ + + N+Y L++S V
Sbjct: 956 QHGLIYVTNEASLKICRLSNILNYDNDWPVQK-IPLKGTPHQMAHHPDLNIYVLVLSFSV 1014
Query: 1026 LKPLNQVLSLLIDQEVGHQIDNHNLSS-VDLHRTYTVEEYEVRILEPDRAGGPWQTRATI 1084
P + VL D GHQID S +D + V+++EVR+LEP G PW+T+ TI
Sbjct: 1015 SVPTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQVDDFEVRLLEPMAQGVPWETKDTI 1074
Query: 1085 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
Q +EN LTVR+V++ N T++ E LLAIGT Y+QGEDVA+RGR++L S G + +P+
Sbjct: 1075 KFQPAENVLTVRIVSIKNAATEQVENLLAIGTGYLQGEDVASRGRIILVSLGEDPSDPKV 1134
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1204
E+YSKELKGAISALA+LQGHLL+A GPKIILH W G+EL G AF+DAP LYVVSLNI
Sbjct: 1135 WAKELYSKELKGAISALAALQGHLLLAIGPKIILHTWNGSELIGTAFFDAP-LYVVSLNI 1193
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
VKNF+L GD HKSIYFL WKE+GAQL LLAKDFGSLDC+ATEFLIDGSTLSL+VSD +KN
Sbjct: 1194 VKNFVLFGDFHKSIYFLCWKEEGAQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKN 1253
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
IQ+F YAPK +ESWKGQKLL R EFH+G+HVTKFLRLQML T PGS +TNRFA
Sbjct: 1254 IQVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQMLQT--------PGSSRTNRFA 1305
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
L FGTLDG IG I PLDELTFRRLQ+LQ+KLVD VPHVAGLNP+++RQF +NG+ H+ GP
Sbjct: 1306 LCFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGP 1365
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
D+ VD E L YE L L++Q+ IA Q GTTR QI +NL D++L TSF
Sbjct: 1366 DNTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413
>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Cucumis sativus]
Length = 741
Score = 1199 bits (3102), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 584/736 (79%), Positives = 641/736 (87%), Gaps = 5/736 (0%)
Query: 697 PAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVV 756
PAA SSKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG DQGDIY V
Sbjct: 11 PAAFGSSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVA 70
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
CY++G LEIFDVPNF VF VDKFVSG++H+VD + + K SE + NS +E GR
Sbjct: 71 CYDNGDLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRN 128
Query: 817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
E+ +MKV+E+AMQRWS HSRPFLF ILTDGTILCY AYLFE ++ SK DD VS S
Sbjct: 129 ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNS 188
Query: 877 LSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC 936
+S SN+S+SRLRNLRF R PLD RE+ P+G +R++IFKNISG+QG FL GSRP W
Sbjct: 189 VSSSNMSSSRLRNLRFLRVPLDIQGREDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 248
Query: 937 MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
MVFRERLRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQ
Sbjct: 249 MVFRERLRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 308
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH 1056
KV PLK TPHQ+TYF EKNLYP+I+S PV KPLNQVLS ++DQ+VGH ++NHNLS+ +L
Sbjct: 309 KV-PLKGTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQ 366
Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGT 1116
+TY+VEE+E+RILEP+++GGPWQTRATI M SSENALT+RVVTL NTTTKENETLLA+GT
Sbjct: 367 QTYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGT 426
Query: 1117 AYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1176
AYVQGEDVAARGRVLLFS G++ADN Q LV+EVYSKELKGAISALASLQGHLLIASGPKI
Sbjct: 427 AYVQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKI 486
Query: 1177 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1236
ILHKWTG ELNGIAFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL+LLAKD
Sbjct: 487 ILHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKD 546
Query: 1237 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1296
FGSLDC+ATEFLIDGSTLSL VSD+QKNIQIFYYAPK +ESWKGQKLLSRAEFHVGAHVT
Sbjct: 547 FGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVT 606
Query: 1297 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1356
KFLRLQML+TSSD+ + SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL
Sbjct: 607 KFLRLQMLSTSSDK-ACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLG 665
Query: 1357 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1416
D+VPHV GLNPRSFRQFHSNGK HR GPDSIVDCELL HYEMLPLEEQL+IAHQ GTTRS
Sbjct: 666 DAVPHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRS 725
Query: 1417 QILSNLNDLALGTSFL 1432
QILSNLNDL+LGTSFL
Sbjct: 726 QILSNLNDLSLGTSFL 741
>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
Length = 1423
Score = 678 bits (1750), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 505/1530 (33%), Positives = 762/1530 (49%), Gaps = 205/1530 (13%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFA +K +H PTG+ + + + TH D P PNLVV
Sbjct: 1 MSFAIHKQVHPPTGVDHAVAAYFTHPIGDGGP-----------------------PNLVV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
AN + ++ +R + SG+ G A SLE+V + L+G V S+A++
Sbjct: 38 MQANHLTVFAIRRD----PSADASGDAAL-----GAKAMSLEVVAEFDLNGTVGSIAVMR 88
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR- 179
+ +RD++++A ++K+SV+E+D S + +S+H +E+P G S R
Sbjct: 89 RRSGAPRNQRDALLIAVRESKLSVIEWDPSEMTVVPSSLHSWETPVG---TGGVPSALRV 145
Query: 180 ---GPLVKVDPQGRCGGVLVY--GLQMIIL----KASQGGSGLVGDEDTFGSGGGFSARI 230
PL DP+GRC VL+ G + L A G G +D G G +A +
Sbjct: 146 APLPPLAIADPEGRCAAVLLRAEGRSRLALCPAVDADADADGDGGGDDGDRRGQGPAASV 205
Query: 231 ESSHVINLR-DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S V++L DL + V+D F+HGY EPV++ILHERE TWA R+ + TC+++A+SI+
Sbjct: 206 RKSFVVDLTADLALSGVRDAAFLHGYGEPVVLILHEREPTWAARMPLVNDTCVLTAVSIN 265
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K+ +IW LP Y+L A+P P+GG +V+ N + + SQ +S ALALN A
Sbjct: 266 LDTKRCTVIWQREKLPCTCYRLCAMPDPLGGAIVLSNNFLLHESQESSKALALNPLAGGG 325
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ---RLDLSKT 406
S SVELD+AHA L L++TK G L+LL++ +GR + + L +
Sbjct: 326 TESA----LGVSVELDSAHAAVLSERQVLVTTKQGALMLLSLRVEGRRLAAHGAMHLRRA 381
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--KEEFGDIEADA 461
+VL+S + I L FLGSR+GDSLLV ML + K + G+ E
Sbjct: 382 GGAVLSSGMCLITKRLLFLGSRVGDSLLVSLKKKEAAGAAQMLPAAAPKKRKAGEAEPPK 441
Query: 462 PSTKRLRRSSS----DALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLK 516
P + +S D L+ M+ GE E + + + E ++F VRDS++ I P+
Sbjct: 442 PPPPPQKVGTSQDDEDELEAMLYGEGEAAAKAANAGRKE--DPGYTFTVRDSVLGISPII 499
Query: 517 DFSYGLRINADA---------SATGISKQSNYELVE---------------LPGCKGIWT 552
D + G + +A G K +++ LPG G WT
Sbjct: 500 DLTAGASASVQGDTEERAELVAACGHGKNGALAILQRGIQPELVTEVEAGTLPGLMGTWT 559
Query: 553 VYHKS---SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
VYH+S R + ++ AA D +H+YL+ISLE+ TMVLET + L EV+E+V+
Sbjct: 560 VYHESRDNERLRESGAAAAAANVDPFHSYLVISLESTTMVLETGEELREVSEAVELVTDA 619
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+AAGN+ GR+R+ QV + G RI +G QDLS +G S + +++ + DP
Sbjct: 620 ATLAAGNMHGRKRIAQVHKGGVRICEGPVKIQDLSAA-EMPAAGDVSPDLEIIAAQVLDP 678
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES--SKKPVSSCTLYHDKGP--------- 718
YVL MSDGS+R+L GD +V +P++ + + + ++S L D P
Sbjct: 679 YVLCRMSDGSLRVLKGDEEKGSVEAMSPSSYANLPTGESIASAALVDDSVPAAERPGLTT 738
Query: 719 -EP-WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT 776
EP +LR+T+T STGV P D+ V G LE++ +P+ +++
Sbjct: 739 REPGFLRRTAT----STGV---------LPEDEEGTVLAVTRVGGTLELYALPSCERIWS 785
Query: 777 VDKFVSGRTHIVDTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
D G + + + D + E+ + ++ + ++VE + +
Sbjct: 786 ADGLSEGLNVLAPGGAGDDVNVDGDGEVEPT----------DDYPAPEIVEFRLDAFPRA 835
Query: 836 HSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
H RP L A+ DG++L Y+A+L G N P LRF R
Sbjct: 836 HERPMLTALRGDGSVLVYRAFLCPPGAGNVGHEAKP------------------QLRFCR 877
Query: 895 TPLD------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
P++ + G+ R + G +G F+SG RP W +V R R+ P
Sbjct: 878 VPIELEGGGGGMVDTKALSGSRLTRFERVGDRGGIRGVFVSGPRPLWLLVRRSRVLALPI 937
Query: 949 LCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
+ V+FT HNVNC +GF+ T+ G ++ICQ+P Y+ WPV+K + L+ TPH
Sbjct: 938 RGEAQRTVSFTPFHNVNCLNGFMLGTAAGGVRICQIPGRMHYEAAWPVRK-LALRCTPHH 996
Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQV------LSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061
+ Y + LY L S PV ++V LS LI + + + V
Sbjct: 997 VQYLPDFRLYALSTSAPVKWKDHEVNEDDIHLSTLIKVRKANAMAKGGVEQV-------- 1048
Query: 1062 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG 1121
+ +R+L P WQ + E+ ++R V L NT T +++L +GTA G
Sbjct: 1049 --FSLRLLVPGTLECAWQ----YTVDPGEHVQSIRNVQLRNTMTGALQSMLVVGTALPGG 1102
Query: 1122 EDVAARGRVLLFS-----TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1176
ED RGRVL+F T R LV ++ K A +AL + GHL +A G K+
Sbjct: 1103 EDAPCRGRVLIFEVVWQMTDRGTKWQGQLVC---VRDAKMACTALEGVGGHLAVAIGTKL 1159
Query: 1177 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ--LNLLA 1234
I+H W G L +AF+D PL+ V++N+VKNFILLGDI K +F WK+ + L +A
Sbjct: 1160 IVHSWDGHSLMPVAFFDT-PLHTVTMNVVKNFILLGDIQKGAFFFRWKDTPDEKLLVQMA 1218
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1294
KDF +D ATEFL+DGSTLS++ +D N IF Y PK ESWKGQKLL++ FHVG+
Sbjct: 1219 KDFEGMDILATEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQKLLTKGAFHVGSP 1278
Query: 1295 VTKFLRLQMLATSSDRTGAAPGSD--------KTNRFALLFGTLDGSIGCIAPLDELTFR 1346
V + +R ++ A + AAPG + NR A+ FGTLDGS+G + P++E
Sbjct: 1279 VHRMVRFRLKAPT-----AAPGQTISPAEQKAQANRHAVFFGTLDGSLGILVPIEEAAHA 1333
Query: 1347 RLQSLQKKLVDSVPH--VAGLNPRSFRQFHS-NGKAHR-PGPDSIVDCELLSHYEMLPLE 1402
LQSLQ+ L + PH +AGLN R+ R + G+ R P P S++D LL+ YE +P +
Sbjct: 1334 SLQSLQRYLTYATPHAALAGLNARTHRHPKTVEGRPMRQPAPHSLLDGGLLAVYEHMPWK 1393
Query: 1403 EQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
Q + A + G TR L +L+ L+ T+F+
Sbjct: 1394 AQAKAAKEAGMTRDVALGHLHQLSARTAFM 1423
>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1386
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/1523 (31%), Positives = 747/1523 (49%), Gaps = 228/1523 (14%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS A ++ +H PTG+ + + + T D G PNL+V
Sbjct: 1 MSHAVHREVHPPTGVDHAVTAYFTRPVGD-----------------------GGDPNLIV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
+AN I +Y V G +ES L++ + G + S+++L
Sbjct: 38 ASANRITVYAV--NRRGDEES-------------------LDVCAEFDAQGAIGSMSVLR 76
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES-----PEWLHLKRGRE 175
+ +RD++++A + K+SV+E+D + + +SMH FES P L+ RE
Sbjct: 77 RRFGAPRNQRDALLIAIRERKLSVVEYDAATGDVCCSSMHSFESALGCNPLGTTLRMSRE 136
Query: 176 SFARGPLVKVDPQGRCGGVLV----YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
+ PLV DP+GRC V++ ++ +L + GG GLV ++D G G +A +
Sbjct: 137 A----PLVVSDPEGRCAAVVLREDGVAGKVRVLPSVDGGLGLVANDDE-GRVRGPAASVR 191
Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
S ++L + + ++D F+HGY EP + +L+E+ TWAGR + TC I ALS+
Sbjct: 192 ESFPLHLPGVRL--IRDACFLHGYGEPALAVLYEKTPTWAGRYNLSKDTCEIVALSVDVD 249
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
++ +IW NLP +YKL A+ P+GG LV + + + SQ +S L LN +
Sbjct: 250 KQKGTVIWRRQNLPSSSYKLTALLPPLGGALVFSQDFLLHESQESSSVLGLNTFGHG--G 307
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
QE + + LD A A+ + D L++TKTG L+LL + DGR ++R+ L + +VL
Sbjct: 308 PQE--GNDAEITLDGAQASVVSEDRVLVTTKTGALLLLALHTDGRSLRRMMLQRAGGAVL 365
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---ML-----------SSGLKEEFGDI 457
+S + + L FLGSR+GDSLLV+FT + ML + K++ ++
Sbjct: 366 SSGMCLLSRDLLFLGSRIGDSLLVKFTPKEEPTAPLMLPDAEDESEDEATEKSKDDDDEL 425
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
EA T + +DA+Q E G A + V+DSL+ + P+ D
Sbjct: 426 EALLYGTTKTETVQTDAVQ-----TEKKREGLAGIIPGLKVAGYDLKVKDSLLGVAPVVD 480
Query: 518 FSYGLRINADASATGISKQSNYELVE-----------------------------LPGCK 548
+ G ++ G +K EL+ LP +
Sbjct: 481 IAVGA-----SAPMGSNKNERTELITACGQGKNGALAILTRGVQPELVTEVESGTLPNLQ 535
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
G+WT+++ R + R + +H +L++S+++ TM++ET + L EV+ S+++
Sbjct: 536 GLWTLHY---RKEGSKEER-----EPFHHHLLLSMKSSTMIMETGEELQEVSASLEFITN 587
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
T+AA N+FG +QV G R+L G QD+ ++ G+ + S I D
Sbjct: 588 QATLAASNIFGHYCSVQVTGTGIRVLKGGVKVQDVGLQDMDAPKGA-----AIASAQILD 642
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK--------GPE- 719
PY+++ +SDGSIRLL GD +VS+ AI +S V++ L D G E
Sbjct: 643 PYIIVRLSDGSIRLLSGDEKQMSVSLMETGAIPTSS--VTAFALVDDSVEAADAAGGGER 700
Query: 720 --PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
W+ + +T+ ++ G GA + + + E G+LE+F +P+ ++
Sbjct: 701 KSGWIHRAATNGTITGLEGNKKSGA----CNNSEAIVALTREGGSLELFSLPSCTRIWCA 756
Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
D G MR + +T +N+ S ++V++ + + H
Sbjct: 757 DGLSEG--------MR--VLSPQTPVNAESS------------VPEIVDIRIDSFQDAHE 794
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
RP L A+ DGT+L Y+ ++ D+P+ + LRFSR +
Sbjct: 795 RPLLTAVRGDGTLLLYKGFIVPAGTTYEGQDEPLEKN--------------ELRFSRVNV 840
Query: 898 D-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLR 944
D A ++ GA RI G QG F++G P W +V R R+
Sbjct: 841 DVEGSGLNVAGIGAAGQLRDSLAGARLTRIGNVGEGQGVQGIFVAGPNPLWLIVRRSRVL 900
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKAT 1004
P +G +VAFTV HNVNC HGFI T+ G ++ICQ+PS Y+ WPV+KV LK T
Sbjct: 901 ALPTRGEGEVVAFTVFHNVNCPHGFILGTALGGVRICQMPSKMHYEAAWPVRKV-ALKCT 959
Query: 1005 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE-- 1062
PH ITY + LY L+ S PV + I+Q+ H I L+ V R +
Sbjct: 960 PHTITYLPDFKLYALVTSAPV-----PWVEREIEQDNVHGI---ALAKVRRERAKANDDM 1011
Query: 1063 --EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ 1120
+Y VR+L P WQ ++ E+ VR V L + T +LLA+GTA
Sbjct: 1012 ELQYSVRLLVPGSLDSAWQH----ALEPGEHVQCVRNVQLRDINTGALLSLLAVGTAMPG 1067
Query: 1121 GEDVAARGRVLLFST--GRNADNPQNLVTE---VYSKELKGAISALASLQGHLLIASGPK 1175
GED RGRV+LF R+A++ + +E K A +AL++L GHL++A G K
Sbjct: 1068 GEDTPCRGRVILFQMVWERDAESMDGYRWKGQVCCVREAKMACTALSALDGHLIVAVGTK 1127
Query: 1176 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL--L 1233
+ +H W G ELN +AF+D P++ VS+N+VKNFIL+GD+ K ++F WK G + ++ L
Sbjct: 1128 LTVHTWDGVELNSVAFFDT-PIHTVSINVVKNFILVGDLEKGLHFFRWKANGFEKSIIQL 1186
Query: 1234 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1293
+KDF +D +TEFLIDG+TLSL+ SD N +IF Y PK ESWKGQKLL R+ +HVG+
Sbjct: 1187 SKDFDRMDVVSTEFLIDGATLSLLGSDMSGNARIFGYDPKSLESWKGQKLLVRSAYHVGS 1246
Query: 1294 HVTKFLRLQMLATSSDRTGAAPGS--DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
+++ +R + T++ AAPG TNR A+ FGTLDG++G P DE T+ +L +L
Sbjct: 1247 PISRMVRFNVEGTTAK---AAPGERPKGTNRHAVFFGTLDGALGIFMPTDEPTYAKLHAL 1303
Query: 1352 QKKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1409
Q++L +V G NPR+FR + P ++D LLS +E L EQ +A
Sbjct: 1304 QRELNTTVRSPIGCNPRTFRTPKVFEGKHVQLLAPLDVLDGGLLSKFETLTFTEQRAVAE 1363
Query: 1410 QTGTTRSQILSNLNDLALGTSFL 1432
++G R L + L+ +F+
Sbjct: 1364 RSGVDRDLALGLIQHLSASNAFV 1386
>gi|414587801|tpg|DAA38372.1| TPA: hypothetical protein ZEAMMB73_993613 [Zea mays]
Length = 573
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/569 (58%), Positives = 411/569 (72%), Gaps = 34/569 (5%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--------ELDSELP-SKRG 51
MS+AAYKMMH PTGI +C +GFITHS AD ++DS + R
Sbjct: 1 MSYAAYKMMHLPTGIDHCAAGFITHSPADAAAFSTPAPAPTAAAGPDGDIDSTAARAPRR 60
Query: 52 IGPVPNLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYR-- 108
+GP PNLVV+AANV+E+Y VR + G++++ NS T ++DGIS A LELVCHYR
Sbjct: 61 VGPTPNLVVSAANVLEVYAVRAEVATGAEDAGNSSSTG--TILDGISGARLELVCHYRCK 118
Query: 109 ----------------LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
LHGN+ES+A+LS G RRDSI + F DAKI+ LEFDDSI+
Sbjct: 119 QMALASLHSLLAVNFRLHGNIESMAVLSDG---TENRRDSIAVTFNDAKITCLEFDDSIN 175
Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
GLR +SMHCFE PEW HLKRGRESFA GP++K DPQGRCG VLVYGLQ+IILKA+Q G
Sbjct: 176 GLRTSSMHCFEGPEWFHLKRGRESFAWGPIIKGDPQGRCGAVLVYGLQIIILKAAQVGQS 235
Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
LVG+++ + RIESS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAG
Sbjct: 236 LVGEDEPTRVLSSTAVRIESSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAG 295
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
R+S K TCM+SA SIS LKQHP+IWSA LPHDAY+LLAVP PI G+LV+ AN+IHYH
Sbjct: 296 RISSKSQTCMLSAFSISMGLKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYH 355
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
SQS SC+LALN+++ D S E+ ++SF VELD A ATWL +D+ + S+K G+++LLTVV
Sbjct: 356 SQSTSCSLALNSFSSQPDGSPEILKTSFHVELDVAKATWLSHDIVMFSSKNGEILLLTVV 415
Query: 393 YDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
YDGR VQRLDL K+ SVL+S TT+G+S FLGSRL DSLLVQF+CG TS+L L +
Sbjct: 416 YDGRAVQRLDLMKSKASVLSSGATTLGSSFIFLGSRLADSLLVQFSCGMPTSVLPD-LTD 474
Query: 453 EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E DIE+D P +KRL+R SD LQD+ + EELS + A N + + SF VRD+L+N+
Sbjct: 475 EPADIESDLPFSKRLKRIPSDVLQDVTSVEELSFHNKAVPNIVDSAEKISFVVRDALINV 534
Query: 513 GPLKDFSYGLRINADASATGISKQSNYEL 541
GPLKDF+YGLR N+D +A GI+KQSNYEL
Sbjct: 535 GPLKDFAYGLRTNSDPNAAGIAKQSNYEL 563
>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1469
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/1550 (31%), Positives = 773/1550 (49%), Gaps = 199/1550 (12%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFA +K +H PTG+ + + + TH P+ G G PNLVV
Sbjct: 1 MSFAIHKQVHPPTGVDHACAAYFTH---------PI--------------GSGAPPNLVV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRR---------VLMDGISAA------------ 99
AN + IY +R +G SG + ++ D IS A
Sbjct: 38 LQANRLTIYAIR--RDGDARDNPSGNATKEADDAAIAASLVADAISGAGATASATIDADD 95
Query: 100 ---SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
SLE+V + L+G V S+A L + +RD+++LA ++K+SV+EFD S L
Sbjct: 96 AEVSLEVVAEFDLNGTVGSIATLRRRFGAPREQRDALLLAVRESKLSVVEFDPSTLSLVC 155
Query: 157 TSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY---GLQMIILKASQG 209
+S+H +E+P G S R P+V DP+GRC VL+ G ++ +L
Sbjct: 156 SSLHSWETPPG---AGGVPSALRLAPTPPVVVADPEGRCAAVLLRAEGGTRLALLPTDND 212
Query: 210 GSGLVGDEDTFGSGG----GFSARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILH 264
+ G + + G G G +A ++ S+V++L R++ +++V+D F+HGY EPV+++LH
Sbjct: 213 AMDVDGGDGSEGKGRRTLRGTAAAVKKSYVVDLVREMGVRYVRDVCFLHGYGEPVLLVLH 272
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
E LTWA R + T +SA+S++ ++H +IW LPH Y+L A+P+P+GG +V+
Sbjct: 273 EERLTWAARATLVKDTMRLSAISLNVDARKHTVIWRRSALPHSCYRLTAMPAPLGGAIVL 332
Query: 325 GANTIHYHSQSASCALALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
N + + SQ +S ALALN A D + + ++ + LD A+A + AL++T
Sbjct: 333 SQNFLLHESQESSAALALNPLAGGGRGDDPAAKAAAAASAAALDGAYAAVISEKQALVTT 392
Query: 382 KTGDLVLLTVVYDGRVVQR---LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
K G L LL++ +GR + + L + +VL+S + + L FLGSR+GDSLLV
Sbjct: 393 KAGALYLLSLRIEGRRLATRGGMHLKRAGGAVLSSGMCLVTRRLLFLGSRVGDSLLVS-R 451
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG-EELSLYGSASNNTESA 497
C + + ++ + A + +R D V G SL +A+ +
Sbjct: 452 CSTARASTAAPGRRPRAAAAAATTAAAEVRLLPIRPQIDGVGGVSAASLRAAAAAHRAPD 511
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG----------------------ISK 535
++F VRDS++ I P+ D + G A AS +G + +
Sbjct: 512 HPGYTFTVRDSVLGISPVIDLTVG----ASASVSGDTIERTELIAACGHGKNGALAVLQR 567
Query: 536 QSNYELVE------LPGCKGIWTVYHKSS---RGHNADSSRMAAYDDEYHAYLIISLEAR 586
ELV LPG KG WTV+H S+ R + ++ A D YHAYL+ISL +
Sbjct: 568 GIQPELVTEVESGTLPGLKGTWTVHHDSADNERLRGSAAAAAAQAVDPYHAYLVISLASS 627
Query: 587 TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
TM+LET + L EV+E V+ T+ AGN FGR R++QV+++G R+ G QD++
Sbjct: 628 TMILETGEELKEVSEHVELVTDAATLCAGNAFGRERIVQVYDKGVRVAAGPVKVQDIAST 687
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-------VQTP-- 697
+++G G E +++ I+ PYVL +SDGS+ +L GD + T+ + P
Sbjct: 688 ELVADAGDG-EGIEIVAAEISFPYVLCRLSDGSLAVLKGDEESKTLVKLDVDALARLPPG 746
Query: 698 -----AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGADGGPLDQG 750
A + P ++ HD+ P +L++ +T +T + + D +
Sbjct: 747 GGIACATLVDDSTPAAAHGGLHDRSPG-FLKRATTATATTTTTTASASREDGDDDDDSRR 805
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
++ V GALE++ +P+ + +T + G + +
Sbjct: 806 PMFLAVTRTGGALELYSLPSCDKAWTANGLSEGVAVLSPA--------GSASAALVDRDA 857
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
+ ++VEL + ++ H RP L A+ DG +L Y+A+
Sbjct: 858 AAAADAGADRAPEIVELRVDAFARAHERPLLTALRADGAVLVYRAF-------------- 903
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA------PCQRITIFKNIS--- 921
+ +V+ L LRF+R P++ E GA P R+T F+ +
Sbjct: 904 -----TCAVAGPGGRALTQLRFARVPVEL---EGGGGGAVDLSALPGSRLTRFERVGDRG 955
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKI 980
G +G F+SG +P W + R R+ P + +V+FT HNVNC+ GFI T+ G ++I
Sbjct: 956 GIRGVFVSGPQPLWLLARRSRVLALPVRGEAQRVVSFTAFHNVNCHAGFILGTAAGGVRI 1015
Query: 981 CQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
CQ+P Y+ WPV+K + L+ TPH + Y + LY L S P + ++ E
Sbjct: 1016 CQIPGRMHYEAAWPVRK-LALRCTPHHVQYLPDFKLYALSTSAP---------AKWVEPE 1065
Query: 1041 VGHQIDNHNLSSVDLHRTYTV------EEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
V + D H + V R + E++ V++L P G +T + M E+
Sbjct: 1066 VAEE-DIHAATVVKTRRAKAMARGGVEEQFAVKLLVP----GSLETAWSRTMDPGEHVQA 1120
Query: 1095 VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
V+ V + N T ++LA+GTA GED RGRV+LF + + +
Sbjct: 1121 VKNVQVRNLRTGALHSMLAVGTAMPGGEDTPCRGRVILFEISWQMVDGETRRVPLLLLFF 1180
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
A++AL+ L+GHL++A G K+I+H W G EL +AF+D P ++ V++N+VKNF+ +GD+
Sbjct: 1181 DDALAALSGLEGHLVVAIGTKLIVHAWDGAELIPVAFFDTP-VHTVTINVVKNFVCIGDV 1239
Query: 1215 HKSIYFLSWKE--QGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1270
K YF WK+ + + NL LAKDF S+D +TEFL+DGSTLSL+ +D N +F Y
Sbjct: 1240 QKGAYFFRWKDDPRTGEKNLIQLAKDFESMDVLSTEFLVDGSTLSLLAADTAGNAYVFAY 1299
Query: 1271 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML----ATSSDRTGAAPGSDK--TNRFA 1324
PK SESWKGQKLL++A FHVG+ V + +R ++ A + R P K NR A
Sbjct: 1300 DPKSSESWKGQKLLTKASFHVGSPVHRMVRFKLKTPTGAGNDGRAAPTPAEIKANANRHA 1359
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN-GKAHR-P 1382
+ FGTLDGS+G + P++ T +L+ LQ+ L + AGLN RS+R + G+A R P
Sbjct: 1360 VFFGTLDGSLGILVPMESSTHAKLEVLQRWLNYNTAQNAGLNGRSYRAPKTTEGRAMRSP 1419
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
P +++D E+L +E L +Q E A G TR + L+ L+ L+ T+F+
Sbjct: 1420 APHNLLDGEMLQGFESLAWTKQAEAADAAGMTREEALTYLHTLSAKTAFM 1469
>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
Length = 432
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/411 (77%), Positives = 356/411 (86%), Gaps = 8/411 (1%)
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
SV +PLNQVLS + DQE H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T+
Sbjct: 30 SVCSFRPLNQVLSSMADQESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETK 89
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1141
+TIPMQ ENALTVR+VTL NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N
Sbjct: 90 STIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SEN 148
Query: 1142 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1201
QNLVTEVYSKE KGA+SA+ASLQGHLLIASGPKI L+KWTG EL +AFYDAP L+VVS
Sbjct: 149 SQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGAELTAVAFYDAP-LHVVS 207
Query: 1202 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1261
LNIVKNF+L GDIHKSIYFLSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD
Sbjct: 208 LNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDS 267
Query: 1262 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
KN+QIFYYAPKM ESWKGQKLLSRAEFHVGAH+TKFLRLQML T S+KTN
Sbjct: 268 DKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTN 321
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
RFALLFG LDG IGCIAP+DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HR
Sbjct: 322 RFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHR 381
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
PGPD+I+D ELL+HYEML L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 382 PGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 432
>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
Length = 389
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 311/397 (78%), Positives = 344/397 (86%), Gaps = 8/397 (2%)
Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ DQE+G I+N S DL + YTV+E+EVRI+E + G W+TR TIPMQS ENALTV
Sbjct: 1 MADQELGLHIENDVTSGDDLQKVYTVDEFEVRIMELGKPSGHWETRFTIPMQSFENALTV 60
Query: 1096 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1155
R+VTL NT+TKENETL+AIGTAYVQGEDVAARGRVLL+S R ++N QNLVTEVYSKE K
Sbjct: 61 RIVTLQNTSTKENETLMAIGTAYVQGEDVAARGRVLLYSFSR-SENSQNLVTEVYSKESK 119
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
GA+SA+ASLQGHLLIASGPKI L+KWTG+EL +AFYDAP L+VVSLNIVKNF+L GDIH
Sbjct: 120 GAVSAVASLQGHLLIASGPKITLNKWTGSELTAVAFYDAP-LHVVSLNIVKNFVLFGDIH 178
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
KSIYFLSWKEQG+QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD KN+QIFYYAPKM
Sbjct: 179 KSIYFLSWKEQGSQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDSDKNVQIFYYAPKMV 238
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
ESWKGQKLLSRAEFHVGAHV+KFLRLQML T S+KTNRFAL+FGTLDG IG
Sbjct: 239 ESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ------GLASEKTNRFALVFGTLDGGIG 292
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
CIAP+DELTFRRLQSLQ+KLVD+VPHV GLNPRSFR F SNGKAHRPGPD+I+D ELLSH
Sbjct: 293 CIAPVDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSH 352
Query: 1396 YEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
YEML LEEQLEIA Q GTTRSQILSN +D LGTSFL
Sbjct: 353 YEMLSLEEQLEIAQQIGTTRSQILSNFSDFLLGTSFL 389
>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oryzias latipes]
Length = 1456
Score = 608 bits (1567), Expect = e-171, Method: Compositional matrix adjust.
Identities = 461/1509 (30%), Positives = 732/1509 (48%), Gaps = 228/1509 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + S S + K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSSSDKSSDAKTR-------KEKLEQVASFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +D+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLTGAS----KDALLLSFKDAKLSVIEYDPGTHDLKTLSLHYFEEPE---LRDGFFQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +L+YG ++++L + + DE G G G + S++I
Sbjct: 135 NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKD---TLSDEQEGGVGEGPKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD + ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTSFPLRVQEEVKITLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT+ + T+ FLGSRLG+SLL+++T + G ++ + E P K+
Sbjct: 366 AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPAEDGNDKQ--EKEKQEPPNKK 423
Query: 467 LRRSSSD-----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGP 514
R SS L D V+ E+ +YGS A + T+ A TFSF V DS++NIGP
Sbjct: 424 KRVESSSNWTGCSASYFFVLSDEVD--EIEVYGSEAQSGTQLA--TFSFEVCDSILNIGP 479
Query: 515 LKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGCKG 549
+ S G + N + +G K ++ ELPGC
Sbjct: 480 CANASMGEPAFLSEEFQSNPEPDLEIVVCSGYGKNGALSVLQRSIRPQVVTTFELPGCHD 539
Query: 550 IWTVY----HKSSRG--HNADSSRMAAYDD---------EYHAYLIISLEARTMVLETAD 594
+WTV K S G AD+ + D + H +LI+S E TM+L+T
Sbjct: 540 MWTVISGEDKKESEGGEKEADAEKKEEQDKTEPPLEDDAKKHGFLILSREDSTMILQTGQ 599
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
+ E+ S + QG T+ AGN+ + +IQV G R+L+G + L F P +
Sbjct: 600 EIMELDTS-GFATQGPTVFAGNIGDNQYIIQVSPMGLRLLEG---VKQLHFIPVDL---- 651
Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT-----VSVQTPAAIESSK----- 704
S ++ S+ADPYV++ ++G + + V T +++Q P S+
Sbjct: 652 ---GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDTYMGKTHRLALQKPQISTLSRVIALC 708
Query: 705 -----------KPVSSC---------------TLYHDKG----PEPWLRKTSTDAWLSTG 734
+ SSC T+Y D E + + A ++ G
Sbjct: 709 AYRDVSGMFTTENKSSCSSKEDLILRSNSETETVYQDLSNTVDDEEEMLYGESGASMAAG 768
Query: 735 V-----GEAIDGADGGPLDQGDI----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
G A GG G + V+ E+G +EI+ +P++ VF V F G+
Sbjct: 769 KEEMSRGSAATAPPGGEGSAGKAEPSHWCVLIRENGVMEIYQLPDWRLVFLVKNFPVGQR 828
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+VD+ + S T+ + EE T QG + + +V L R SRP+L +
Sbjct: 829 VLVDS----SSGQSATQGDGKKEEVTRQGEIPLVKEVALVALGNNR-----SRPYLL-VH 878
Query: 846 TDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
+ +L Y+A+ + + P+N K +RF + P RE
Sbjct: 879 VENELLVYEAFPYDQQQPQNNLK-----------------------VRFKKVPHSINFRE 915
Query: 904 ETPH---------GAPCQRITI---------FKNISGHQGFFLSGSRPCWCMVF-RERLR 944
+ P G P + + + F++ISG+ G F+ G P W ++ R LR
Sbjct: 916 KKPKLKKDKKAEGGGPEENVAVKSRISRFRYFEDISGYSGVFICGPSPHWMLITSRGGLR 975
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKAT 1004
+HP DG I +F+ HN+NC GF+Y QG L+I LP+ +YD WPV+K IPL+ T
Sbjct: 976 LHPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRK-IPLRCT 1034
Query: 1005 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1064
H ++Y E +Y + SV + L I + G + + + + + E++
Sbjct: 1035 VHFVSYHVESKVYAVCTSV-------KELCTRIPRMTGEEKEFETIERDERYINPLQEKF 1087
Query: 1065 EVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQG 1121
++++ P W+T I ++ E+ ++ V L + T + +A GT +QG
Sbjct: 1088 SIQLISPVS----WETIPNTRIDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCVLQG 1143
Query: 1122 EDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKI 1176
E+V RGR+L+ P +T+ +Y KE KG ++AL G+L+ A G KI
Sbjct: 1144 EEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCHGYLVSAIGQKI 1203
Query: 1177 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1236
L +L G+AF D LY+ + +KNFIL D+ KSI L ++E+ L+L+++D
Sbjct: 1204 FLWALKDNDLTGMAFIDTQ-LYIHQMISIKNFILAADVMKSISLLRYQEESKTLSLVSRD 1262
Query: 1237 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1296
L+ ++ EF++D + L +VSD KN+ ++ Y P+ ES+ G +LL RA+F+ GAH+
Sbjct: 1263 AKPLEVYSIEFIVDNNQLGFLVSDRDKNLFVYMYLPEAKESFGGMRLLRRADFNAGAHIN 1322
Query: 1297 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1356
R M + +G+ N+ F TLDG IG + P+ E T+RRL LQ L
Sbjct: 1323 SLWR--MPCRGALDSGSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALT 1380
Query: 1357 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1416
+PH AGLNP++FR HSN ++ + +I+D ELL+ Y L E+ E+A + GTT+
Sbjct: 1381 TMLPHHAGLNPKAFRMMHSNRRSLQNAVKNILDGELLAKYLYLSTMERSELAKKIGTTQD 1440
Query: 1417 QILSNLNDL 1425
IL +L ++
Sbjct: 1441 IILDDLLEI 1449
>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
chinensis]
Length = 1469
Score = 606 bits (1562), Expect = e-170, Method: Compositional matrix adjust.
Identities = 458/1506 (30%), Positives = 716/1506 (47%), Gaps = 215/1506 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
T + HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQRVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTAG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD AHA ++ D ++S K G++ +LT+V DG R V+ K
Sbjct: 307 TTAFPLRTQDGVRLTLDCAHAAFISYDKMVISLKGGEIYVLTLVTDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFS-- 519
+ SS QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + +
Sbjct: 425 DPTGGWSGSSTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480
Query: 520 ---------------YGLRINADA--SATGI---------SKQSNYELV----------- 542
YGL A+ TG+ S + + E+V
Sbjct: 481 EPAFLSEEVGTGVAEYGLIGQAEGWGRRTGLTPAPVQFQNSPEPDLEIVVCSGYGKNGAL 540
Query: 543 ---------------ELPGCKGIWTVY-------HKSSRGHNADSSRMAAYDD--EYHAY 578
ELPGC +WTV ++ + + R A +D H +
Sbjct: 541 SVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKDEEETPKAEGTEQPRAAEAEDGVRRHGF 600
Query: 579 LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 601 LILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-- 657
Query: 639 MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-----TVS 693
L F P + + ++ ++ADPYV++ ++G + + + T ++
Sbjct: 658 -VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFLLKSDTYGGRHHRLA 709
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG-------------PEPWLRKTSTDAWLSTGVGEAID 740
+ P SK V + LY D EP R + L +D
Sbjct: 710 LHKPPLHHQSK--VITLCLYRDVSGMFTTESRLGGARDEPGARGSCEVEGLGAETSPTVD 767
Query: 741 G------ADGG----------------PLDQGDI--------YSVVCYESGALEIFDVPN 770
D G P D+ + ++ E+G +E++ +P+
Sbjct: 768 DEEEMLYGDSGSLFSPSKEETRRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEMYQLPD 827
Query: 771 FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
+ VF V F G+ +VD+ + T+ + EE T QG + + +V L
Sbjct: 828 WRLVFLVKNFPVGQRVLVDS----SFGQPATQAEARKEEATRQGELPLVKEVLLVALG-- 881
Query: 831 RWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL 890
+ SRP+L + D +L Y+A+ P ++ + N++ +
Sbjct: 882 ---SRQSRPYLL-VHVDQELLLYEAF----PHDSQLGQGNLKVRFKKVPHNINFREKKLK 933
Query: 891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQL 949
+ T E R F++I G+ G F+ G P W +V R LR+HP
Sbjct: 934 PSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMG 993
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
DG I +F HNVNC GF+Y QG L+I LP+ +YD WPV+K IPL+ T H +
Sbjct: 994 IDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVA 1052
Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
Y E +Y + S P + I + G + + + D + E + ++++
Sbjct: 1053 YHVESKVYAVATSTNA--PCTR-----IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLI 1105
Query: 1070 EPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAA 1126
P W+ A I ++ E+ ++ V+L + T + +A GT +QGE+V
Sbjct: 1106 SPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTC 1161
Query: 1127 RGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
RGR+L+ P +T+ +Y KE KG ++AL GHL+ A G KI L
Sbjct: 1162 RGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSL 1221
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
+EL G+AF D LY+ + VKNFIL D+ KSI L ++E+ L+L+++D L+
Sbjct: 1222 RASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLE 1280
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1301
++ +F++D + L +VSD +N+ ++ Y P+ ES+ G LL RA+FH+GAHV F R
Sbjct: 1281 VYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGLLLLRRADFHLGAHVNTFWR- 1339
Query: 1302 QMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1356
+ GA G K N+ F TLDG IG + P+ E T+RRL LQ L
Sbjct: 1340 ------TPCRGAVEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALT 1393
Query: 1357 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1416
+PH AGLNPR+FR H + + + +++D ELLS Y L E+ E+A + GTT
Sbjct: 1394 TMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLSRYLYLSTMERSELAKKIGTTPD 1453
Query: 1417 QILSNL 1422
IL +L
Sbjct: 1454 IILDDL 1459
>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oreochromis niloticus]
Length = 1456
Score = 605 bits (1559), Expect = e-170, Method: Compositional matrix adjust.
Identities = 455/1510 (30%), Positives = 732/1510 (48%), Gaps = 236/1510 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GN+ S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKADKSSDSKSR-------KEKLEQVASFSLFGNIMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLVGAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +LVYG ++++L + + DE G G G + S++I
Sbjct: 135 NVHIPVVRVDPENRCAVMLVYGTKLVVLPFRKDT---LTDEQESGVGEGPKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD + ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTAFPLRVQDEVKLTLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA---PS 463
SVLT+ + T+ FLGSRLG+SLL+++T + G + + + + D PS
Sbjct: 366 AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQETPAEEGKERQDKEKDKDKQEPPS 425
Query: 464 TKRLRRSSSD----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
K+ SS++ L D V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 426 KKKRVESSTNWTVCVILDFFVLSDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 481
Query: 513 GPLKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGC 547
GP + S G + N + +G K ++ ELPGC
Sbjct: 482 GPCANASMGEPAFLSEEFQSNPEPDLEVVVCSGYGKNGALSVLQRSIRPQVVTTFELPGC 541
Query: 548 KGIWTVYHKSSRGHNADSSRMAAY------------DDEYHAYLIISLEARTMVLETADL 595
+WTV + D + D + H +LI+S E TM+L+T
Sbjct: 542 HDMWTVISSDVKEDKTDKEEVEKEEEEKKTEPPLEDDAKKHGFLILSREDSTMILQTGQE 601
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ S + QG T+ AGN+ + +IQV G R+L+G + L F P +
Sbjct: 602 IMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGLRLLEG---VRQLHFIPVDL----- 652
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDP-----STCTVSVQTPAAIESSKKPVSSC 710
S ++ S+ADPYV++ ++G + + V T +++Q P I S + ++ C
Sbjct: 653 --GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDSYMGKTHRLALQKPQ-IPSQSRVITLC 709
Query: 711 -------------------------------TLYHDKG-----PEPWLRKTSTDAWLSTG 734
T+ HD E L S + +T
Sbjct: 710 AYRDVSGMFTTENKVSCSIKEDTIRSQSEAETIIHDMSNTVDDEEEMLYGDSNAS--ATP 767
Query: 735 VGEAIDGADGGPLDQGDI----------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGR 784
E I+ + P G + ++ E+G +EI+ +P++ VF V F G+
Sbjct: 768 AKEDINRSFVAPTTSGSEATSSKAEPTHWCMIIRENGVMEIYQLPDWRLVFLVKNFPVGQ 827
Query: 785 THIVDTYMREALKDSETEINSSSEEGT-GQGRKENIHSMK----VVELAMQRWSAHHSRP 839
+VD+ SS + T G+G+KE + V E+A+ +HS+P
Sbjct: 828 RVLVDS--------------SSGQSATQGEGKKEEVTRQGEIPLVKEVALVSLGNNHSKP 873
Query: 840 FLFAILTDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
+L + + +L Y+A+ + + P+N K +RF + P
Sbjct: 874 YLL-VHVEQELLIYEAFQYDQQQPQNNLK-----------------------VRFKKVPH 909
Query: 898 DAYTRE----------------ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCMVF 939
+ RE E G + R F++ISG+ G F+ G P W +V
Sbjct: 910 NINFREKKSKLKKDKKAESSATEESSGVKGRIARFRFFEDISGYSGVFICGPSPHWMLVT 969
Query: 940 -RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
R LR+HP DGSI +F+ HN+NC GF+Y QG L+I LP+ +YD WPV+K
Sbjct: 970 SRGALRLHPMTIDGSIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRK- 1028
Query: 999 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1058
IPL+ T H ++Y E +Y + SV +P + I + G + + + + +
Sbjct: 1029 IPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR-----IPRMTGEEKEYEVIERDERYIH 1081
Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTA 1117
E++ ++++ P TR I ++ E+ ++ V L + T + +A GT
Sbjct: 1082 PQQEKFSIQLISPVSWEAIPNTR--IDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTC 1139
Query: 1118 YVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+QGE+V RGR+L+ P +N +Y KE KG ++AL G+L+ A
Sbjct: 1140 LMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAI 1199
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
G KI L +L G+AF D LY+ + +KNFIL D+ KSI L ++E+ L+L
Sbjct: 1200 GQKIFLWVLKDNDLTGMAFIDTQ-LYIHQMFSIKNFILAADLMKSISLLRYQEESKTLSL 1258
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1292
+++D L+ ++ EF++D + L +VSD KN+ ++ Y P+ ES+ G +LL RA+F+ G
Sbjct: 1259 VSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAG 1318
Query: 1293 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1352
A++ F R+ + A D N+ F TLDG IG + P+ E T+RRL LQ
Sbjct: 1319 ANINTFWRMPCRGALDASSKKALTWD--NKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1376
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
L +PH AGLNP++FR HS+ ++ + +I+D ELL+ Y L + E+ E+A + G
Sbjct: 1377 NALTTMLPHHAGLNPKAFRMLHSDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIG 1436
Query: 1413 TTRSQILSNL 1422
TT+ IL +L
Sbjct: 1437 TTQDIILDDL 1446
>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Takifugu rubripes]
Length = 1444
Score = 600 bits (1548), Expect = e-168, Method: Compositional matrix adjust.
Identities = 440/1475 (29%), Positives = 721/1475 (48%), Gaps = 192/1475 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ GA+ RD+++L+F+DAK+SV+E+D H L+ S+H FE L L+ G
Sbjct: 82 ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE---LELRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +L+YG ++++L + + DE G G G + +++I
Sbjct: 135 NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDT---LTDEQEVGVGEGPKSSFLPTYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD + A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA-----PSTKRL 467
+ + T+ FLGSRLG+SLL+++T L G ++ + E D P +K+
Sbjct: 372 TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQDMPLEEGKDQQDKEKEKDMDKQEEPPSKKK 431
Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
R SS D V+ E+ +YGS A + T+ A T+SF V DS++NIGP + S G
Sbjct: 432 RVESSSNWTDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAFL 487
Query: 522 ---LRINADAS-----ATGISKQSNYELV------------ELPGCKGIWTVY------- 554
+ N + +G K ++ ELPGC +WTV
Sbjct: 488 SEEFQSNPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEPVQK 547
Query: 555 --HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
++ R + A D + H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 548 EQEETEREGKEKTEPPAEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 606
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
AGN+ + +IQV G R+L+G +TQ L F P + S ++ S+ADPYV+
Sbjct: 607 FAGNIGDNKYIIQVSPMGIRLLEG--VTQ-LHFIPVDL-------GSPIVHCSLADPYVV 656
Query: 673 LGMSDGSIRLLVGD-----PSTCTVSVQTPAAIESSK-------KPVS---------SCT 711
+ ++G + + V T +++Q P S+ + VS SC+
Sbjct: 657 IMTAEGVVTMFVLKIDSYMGKTHRLALQKPQISTQSRVIALCAYRDVSGMFTTENKVSCS 716
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGV---------GEAIDGADGGPLDQGDI---------- 752
+ D + LST + G++ G +++
Sbjct: 717 ITEDISIRSQSEAETIIQDLSTNIVDDEEEMLYGDSNTGPSKEEMNRSSFAGPSEGSYSK 776
Query: 753 -----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
+ ++ +SG +EI+ +P++ VF V F G+ +VD+ ++ E E
Sbjct: 777 AEPSHWCLITRDSGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSATQGEKE--GKK 834
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF--EGPENTS 865
EE T QG + + +V L +HSRP+L + D +L Y+A+ + + P+N
Sbjct: 835 EEVTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVDQELLIYEAFPYDQQQPQNNL 888
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PHGAPCQ----- 912
K +RF + P + RE+ G +
Sbjct: 889 K-----------------------VRFKKVPHNINFREKKSKLRKDKKAEGTAAEDSVAA 925
Query: 913 -----RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCN 966
R F++ISG+ G F+ G P W +V R LR+HP DG I +F+ HN+NC
Sbjct: 926 RGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMSIDGPIESFSPFHNINCP 985
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
GF+Y QG L+I LP+ +YD WPV+K IPL+ T H ++Y E +Y + S+
Sbjct: 986 KGFLYFNKQGELRISVLPTYLSYDAPWPVRK-IPLRCTVHYVSYHVESKVYAVCTSL--- 1041
Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
+ L I + G + + + + + +++ ++++ P TR I +
Sbjct: 1042 ----KELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPVSWEAIPNTR--IDL 1095
Query: 1087 QSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP--- 1142
+ E ++ V L + T + +A GT +QGE+V RGR+L+ P
Sbjct: 1096 EEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGRILILDVIEVVPEPGQP 1155
Query: 1143 --QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
+N +Y KE KG ++AL G+L+ A G KI L +L G+AF D L++
Sbjct: 1156 LTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDNDLTGMAFIDTQ-LHIH 1214
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
+ +KNFIL D+ KS+ L ++E+ L+L+++D L+ ++ EF++D + L +VSD
Sbjct: 1215 QMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSD 1274
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1320
KN+ ++ Y P+ ES+ G +LL RA+F+ GA++ F R M + G+
Sbjct: 1275 RDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTFWR--MPCRGALEAGSRKAMTWD 1332
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
N+ F TLDG +G + P+ E T+RRL LQ L + H AGLNP++FR H + ++
Sbjct: 1333 NKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNPKAFRMLHCDRRSL 1392
Query: 1381 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1415
+ +I+D ELL+ Y L + E+ E+A + GTT+
Sbjct: 1393 QNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQ 1427
>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
Length = 1420
Score = 597 bits (1538), Expect = e-167, Method: Compositional matrix adjust.
Identities = 456/1540 (29%), Positives = 728/1540 (47%), Gaps = 245/1540 (15%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A YK H PTG+ C + +R NLVV
Sbjct: 2 YAIYKETHPPTGVEFCVNCHFYSARES---------------------------NLVVAG 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+ ++ + Q+EGS +++ G + +R LELV + L GN+ESL +
Sbjct: 35 TTEVRVFRLCYQQEGSSSAESGGSSLKR---------KLELVGQHSLFGNIESLHAIRLA 85
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
G RDS++++F+DAK+S++++D H ++ S+H FE + +K + R P+
Sbjct: 86 G----NTRDSLLMSFKDAKLSIVDYDPGKHDIKTRSLHFFEDEK---IKSHCLAQDRAPV 138
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
V++DP+ RC +L YG +++L Q G +D+ S + S++I+++++D
Sbjct: 139 VRIDPERRCAVMLAYGTHLVVLPFRQEGGIDDTAQDSIISSSD-RPPVLPSYIIDVKEID 197
Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
K ++ D F+HGY EP ++IL+E TWAGR++ ++ TC + A+S++ + K HP++W
Sbjct: 198 EKTCNILDIQFLHGYYEPTLLILYEPLKTWAGRLAMRNDTCALVAVSLNMSQKAHPVVWQ 257
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE------ 354
LP D ++ VP PIGGVLV N + Y +QS Y VS++S E
Sbjct: 258 LSCLPFDCIYVMPVPKPIGGVLVCCMNALLYLNQSVP------PYGVSVNSIGENSTVFP 311
Query: 355 -LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
P+ ++ L+ ++A ++ ND + S K G++ ++T++ DG R V+ KT SVLT
Sbjct: 312 LKPQKGVTITLEGSNAIFIANDKLVFSLKGGEIYVVTLIADGVRSVRNFVFDKTAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
S + G+ FLGSRLG+SLLV++T + G + ++ D + +R + +
Sbjct: 372 SCVCECGDGYLFLGSRLGNSLLVKYT--EKPQDIVYGTENNAQSMQCD--NIERWQILNG 427
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----LRIN--- 525
L + + +EL +YG A +++F V DSL+NIGP G L ++
Sbjct: 428 SLLLIVDDLDELEVYG-AQQEAGVELTSYTFEVCDSLLNIGPCSCMDIGEPAFLSVSSYF 486
Query: 526 ADA--------SATGISKQSNYELV------------ELPGCKGIWTVYHKSSRG----- 560
ADA S +G K ++ ELPGC +WTV+ K +
Sbjct: 487 ADAQELDLEVVSCSGYGKNGALTVLQRSIRPQVVTTFELPGCTDMWTVFSKDQKKGAQTN 546
Query: 561 --HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
H S +++YH++LI+S E +M+L+T + EV +S + Q TI AGN
Sbjct: 547 AIHRYPSQPCTQGNEKYHSFLILSREDSSMILKTEQEIMEVDQS-GFSTQCATIYAGNFG 605
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
++QV G R+L+G Q + +SG S ++ S+ DPY +L M+DG
Sbjct: 606 NGSYILQVTPLGVRLLEGVNQLQHIPM-----DSGL----SNIVWCSVCDPYAVLLMADG 656
Query: 679 SIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------------KGPEP 720
S+ L+ + S ++V P+ +SSK V +C Y D K P P
Sbjct: 657 SVILIEFIKSASGPKLTVSRPSLSQSSK--VCACCTYKDMSGLFTTENSNLEEVSKVPSP 714
Query: 721 WLRKTS----------------------TDAWLSTGVGEAIDGADGGPLD------QGDI 752
T+ T L+ E P++ Q
Sbjct: 715 KPEMTAPPRQEKESLTIDEEDELLYGGDTSLTLTFEPPEPSKAESAAPVEVFEEPLQPSY 774
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ +VC E+G +EI+ +P F VF V F IVD+ DS SS E
Sbjct: 775 WCLVCRENGVMEIYSLPGFTRVFFVKNFSKAPRVIVDS------GDSGASTQSSVSEE-- 826
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
S+ V E+ + + R L A++ D +L Y+A+ + E
Sbjct: 827 -------ESLNVREVLLTGLGYKNRRATLVAVM-DQDLLIYEAFSYPTVEGH-------- 870
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-------------CQRITIFKN 919
NLRF + + RE+ P P + +F +
Sbjct: 871 ---------------LNLRFKKLQHNIQIREKKPKQEPKNDSETKSGLDPKVAMLRVFND 915
Query: 920 ISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
IS + G F+ GS P W V R HP DG + F HNVNC GF+Y ++G L
Sbjct: 916 ISSYSGIFVCGSYPFWIFVTNRGAFHWHPMSIDGPVTCFAAFHNVNCPKGFLYFNTRGEL 975
Query: 979 KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI- 1037
+I LP+ +YD+ WPV+KV PL+ TPH ++Y E Y ++ S +P ++ +
Sbjct: 976 RISVLPTHLSYDSPWPVRKV-PLRYTPHMVSYNRESKTYAIVTSEQ--EPCKKIPRVTAE 1032
Query: 1038 DQEVGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
D+E I D Y + E + ++++ P W+ IP +
Sbjct: 1033 DKEFVDTIR-------DARFIYPSTERFVLQLISPIS----WEV---IPNTRHDLDEWEH 1078
Query: 1097 VVTLFNTTTKENET------LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1148
V T+ N ET + +GT + GE++A RGR+L+F P +T+
Sbjct: 1079 VTTMKNLLLHSEETHTGRKGFICVGTTQLYGEEIAVRGRILIFDIIEVVPEPGQPLTKNK 1138
Query: 1149 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1205
+Y KE KG ++AL + G+L+ G KI + +T +L G+AF D LY+ SL +
Sbjct: 1139 FKLLYEKEQKGPVTALNQVNGYLVSGIGQKIYIWNFTDNDLVGMAFIDTQ-LYIHSLVTI 1197
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1265
+NF++ D+ KSI L +E+ L ++KD +L+ +A +F IDG + +VSD +KN+
Sbjct: 1198 RNFVIAADVCKSITLLRLQEETKTLAFVSKDPKNLEVYAADFFIDGPQIGFLVSDVEKNL 1257
Query: 1266 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1325
+F Y P+ ES GQ+LL RA+ +VG H+T F R+ A A+ K R
Sbjct: 1258 VLFTYQPEAIESQGGQRLLQRADINVGTHITSFFRIAAKA----HLKASGEKSKEMRQLT 1313
Query: 1326 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1385
FGTLDG++G + P+ E TFRRL LQ KLVD +PHVAGLNP++FR +
Sbjct: 1314 CFGTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVAGLNPKAFRMLQWRKRKLCNPHR 1373
Query: 1386 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+++D +LL Y L E+ E+A + GTT +QI+ ++ D+
Sbjct: 1374 NVLDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMMDI 1413
>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
[Crassostrea gigas]
Length = 1369
Score = 588 bits (1515), Expect = e-164, Method: Compositional matrix adjust.
Identities = 440/1432 (30%), Positives = 700/1432 (48%), Gaps = 181/1432 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+E + + L GN+ S+ + GA RDS++L+F +AK+SV+E+D H L+ TS+H
Sbjct: 5 MECLATFTLFGNIMSMKYVKLPGA----LRDSLLLSFSEAKLSVVEYDPGTHDLQTTSLH 60
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE P +K G + P V+VDP GRC +LVYG M+IL + GD
Sbjct: 61 FFEEPS---MKGGFFTNYCIPEVRVDPDGRCAAMLVYGTHMVILPFRRDVMVEEGD---- 113
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G + I SS++I+LR+ D K +VKDF F+HGY EP + IL E TWAGR + +
Sbjct: 114 NLAGTSKSPILSSYIIDLRNFDEKIINVKDFQFLHGYYEPTVFILFEPLQTWAGRTAVRA 173
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC I A+S++ K HP+IWS +LP D ++LAVP PIGGV+++ N++ Y +QS
Sbjct: 174 DTCSIVAISLNLQEKVHPVIWSLGSLPFDCCQVLAVPRPIGGVIIIAVNSLLYLNQSVP- 232
Query: 339 ALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
Y VSL+S S P + + LD A ++ D +LS K G+L +LT+
Sbjct: 233 -----PYGVSLNSISAQSTLFPLRVQEGVRIALDCCQAAFMSYDKIVLSLKGGELYVLTL 287
Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
V DG R V+ + K+ SVLTS + + FLGSRLG+SLL+++T + + + L
Sbjct: 288 VVDGMRSVRSFNFDKSAASVLTSCMCICEDGFLFLGSRLGNSLLLKYTEKASECLENGDL 347
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVR 506
++ + D P+ K+ + S + V N +L +YGSA N T + +++F V
Sbjct: 348 DKK----KEDEPAAKKKKVEGSTEIASDVSQIENLYDLEVYGSAENPTSTTITSYTFEVC 403
Query: 507 DSLVNIGPL------------KDFSYGLRINADASAT-GISKQSNYELV----------- 542
D++ NIGP ++FS + + T G K ++
Sbjct: 404 DNIWNIGPCGNIVMGEPAFLSEEFSSCEDPDIEMVMTSGYGKNGALSVLQRSIRPQVVTT 463
Query: 543 -ELPGCKGIWTVYH--KSSRGHNADSSRMAAYDDEY---HAYLIISLEARTMVLETADLL 596
ELPGC +WTV + + ++S DD H++LI+S +M+LET +
Sbjct: 464 FELPGCLDMWTVKSLVPKEKSEDKENSMEDDSDDNIEGGHSFLILSRSDSSMILETGQEM 523
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
E+ S + Q TI AGN+ G R ++QV + R+L+G Q + ++GS
Sbjct: 524 NELDHS-GFSTQTTTIFAGNIGGDRYIVQVSDTSLRLLEGVRQIQHIPL-----DTGS-- 575
Query: 657 ENSTVLSVSIADPYVLLGMSDGSI--------------RLLVGDPSTCTVSVQTPAAIES 702
V+ S+ADPY++L +G I RL+VG PS +S + + S
Sbjct: 576 ---PVVQCSLADPYIVLLTQEGQILMFTLRTESVGLGVRLVVGKPS---ISQHSKVEVIS 629
Query: 703 SKKPVSSCTLYHDK------GPEPWLRKTSTDAWLSTGV-----------GEAIDGADGG 745
+ K VS ++ P+ KT T+ S GE
Sbjct: 630 AYKDVSGLFTCMNQMEDVQVTPDTKATKTVTERSFSIDAKTADEEDELLYGETESNVFNS 689
Query: 746 PLDQGDI-------------------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH 786
+ G + ++C E+G LEI+ +P++ V+ V F G+
Sbjct: 690 SFNMGQTAEMESPTKEKKQTEAKPTYWLLLCRENGVLEIYSIPDYKKVYYVKNFPMGQKL 749
Query: 787 IVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILT 846
+VD+ ++ G Q K N + EL M SRP L A +
Sbjct: 750 LVDS----------VQVTDKLSSGERQ-EKVNAECPALKELLMVGLGYKDSRPHLLARVE 798
Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
D Y++E S D R + + R + + + + + +EE
Sbjct: 799 D------DLYIYEAFSYPQSSIDNHLKLRFKKIQHDLILREKRSKSKKKDPEEFQKEEKK 852
Query: 907 HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
G ++ FK+++G+ G F+ G+ P W V R LR+HP DG + F+ HN+NC
Sbjct: 853 VG----KMRYFKDVAGYSGVFVCGAYPHWIFVTSRGSLRIHPMGIDGPVWCFSEFHNINC 908
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
HGF+Y G L+I LP+ TYD WPV+KV PL+ TPH + Y E +Y ++ S P
Sbjct: 909 PHGFLYFNKMGELRISVLPTHLTYDAPWPVRKV-PLRCTPHFVAYHFENKIYAVVTSTPE 967
Query: 1026 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQT--RA 1082
+ N++ + I+ D Y T+ + +++ P W+
Sbjct: 968 I--CNKLPKTTTEDREWDTIEK------DERFIYPTIPRFTLQLYSPTS----WEVVPNT 1015
Query: 1083 TIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1141
I + E+ ++++ + L + T ++ + +GT GE+V +RGRV++
Sbjct: 1016 KIECEEWEHVVSMKTIRLRSEETLSGFKSYIVMGTNLSLGEEVTSRGRVIIADIIEVVPE 1075
Query: 1142 P-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1196
P ++ + +Y KE KG ++ALA + G L+ A G K+ + + +L G+AF D
Sbjct: 1076 PGMPLTKHKIKTLYEKEQKGPVTALADINGLLITAIGQKLYIWQLKDNDLMGVAFIDTH- 1134
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
+Y+ +L +K+ IL GDI KS+ ++E+ L+++++D L+ + +FLID + L
Sbjct: 1135 IYIHTLVTIKHIILAGDILKSVSVYQYQEEHKVLSIVSRDPRPLEVYTADFLIDNTQLCC 1194
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSD-RTGA 1313
+VSD KN+ ++ Y P+ ES GQ+L+ +A+F+ G++V+ R+ ++ SSD R
Sbjct: 1195 LVSDRMKNLVVYSYQPEARESHGGQRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTG 1254
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
AP R F TLDGS+G + PL E +RRL LQ LV +PHVAGLNPRS+R
Sbjct: 1255 AP----EKRHITYFATLDGSLGFVLPLSEKVYRRLFMLQNALVTHIPHVAGLNPRSYRHV 1310
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
R +I+D ELL Y L + E++EIA + GT+ QI+ +L ++
Sbjct: 1311 IGTFPELRNPQKNILDGELLWKYTNLSIMEKIEIAKRLGTSNDQIMDDLMEI 1362
>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
floridanus]
Length = 1418
Score = 574 bits (1480), Expect = e-160, Method: Compositional matrix adjust.
Identities = 437/1479 (29%), Positives = 717/1479 (48%), Gaps = 198/1479 (13%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV ANVI ++ + + ++ K + ++ LE + Y LHGN+ S+
Sbjct: 30 LVVAGANVIRVFRLIPDIDMTRREKYTENRPPKM--------KLECLAQYTLHGNIMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 82 AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
P+V+VDP+GRC +L+YG ++++L + S + D D S S+ I SS+
Sbjct: 135 HHIPIVRVDPEGRCAIMLIYGRKLVVLPFRKDPS--LDDGDLLDSAKLTSSNKTPILSSY 192
Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+I L+ L+ M +V D F++GY EP ++IL+E T++GR++ + TC + A+S++
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLYGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
+ HP+IWS NLP D Y+++ V P+GG L++ N++ Y +QS ++LN+ A + +
Sbjct: 253 RVHPIIWSVSNLPFDCYQVVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTN 312
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + L+ + ++ D ++S K+G+L +L++ D R V+ K SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISGDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRR 469
LTS + ++ FLGSRLG+SLL++FT ++ + E + E++ K+ ++
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENESEETPAKKAKQ 432
Query: 470 S------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
+SD L D+ + EEL +YGS ++ T ++ F V DSL+NIGP + S G
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSETH-TSIQITSYIFEVCDSLLNIGPCGNISMGEP 490
Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
N D + +G K ++ ELPGC+ +WTV
Sbjct: 491 AFLSEEFLHNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI---- 546
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
G + ++ A + HA+LI+S E TM+L+T + EV +S + QG T+ AGNL
Sbjct: 547 -GTLNNDEQVKAEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFAGNLG 604
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
R ++QV + G R+L G Q + ++ S ADPYV L DG
Sbjct: 605 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLSEDG 654
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP--EPWLRKTSTDAWLST--G 734
+ LL T + A + + + Y D L +T+ D +
Sbjct: 655 QVVLLTLREVRGTARLHAQPANLLFRPQIEALCTYRDVSGIFTTQLSETTDDEQVEEEHN 714
Query: 735 VGEA-----IDGADG-----------------GPLD----------------QGDIYSVV 756
V E ID D PLD + + +V
Sbjct: 715 VEEPSLLSNIDNEDDLLYGDAPAFQMPAPSYQKPLDGVSKKAPWWQRHLQEIKPTYWLLV 774
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
+SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 775 YRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQSAPVNEIPNPE-------- 826
Query: 817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 827 -----MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGYLK---------- 868
Query: 877 LSVSNVSASRLRNLRFSRTP--LDAYTREE-TPHGAPCQRITI---FKNISGHQGFFLSG 930
R + L P L +EE P A RI + F NI+G+ G F+
Sbjct: 869 --------LRFKKLEHGIIPGRLSPKPKEEDMPMNASETRICMMRYFSNIAGYNGVFICC 920
Query: 931 SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
P W + R LR HP DG I +F +NVNC GF+Y + L+IC LP+ +Y
Sbjct: 921 DYPHWIFLTGRGELRTHPMGIDGPITSFAAFNNVNCPQGFLYFNRKEELRICVLPTHLSY 980
Query: 990 DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
D WPV+KV PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 981 DAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1022
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1099
+ D + +T EE R L P + + W+T I ++ E+ ++ V+
Sbjct: 1023 FNGED--KEFTEEERPERFLYPSQEQFSIVLFSPVSWETIPNTKIELEQWEHVTCLKNVS 1080
Query: 1100 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1153
L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE
Sbjct: 1081 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1140
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D
Sbjct: 1141 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIAD 1199
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
++KSI L ++E+ L+L+++DF + + E+LID + L ++D + N+ +F Y P+
Sbjct: 1200 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFFLADGESNLALFMYQPE 1259
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
ES GQKL+ +A+FH+G V F R++ ++ ++ G+DK R ++ TLDG
Sbjct: 1260 SRESLGGQKLIRKADFHLGQKVNTFFRIRCRVSDPANDKKQFSGADK--RHVTMYATLDG 1317
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCE 1391
S+G I P+ E T+RRL LQ LV + H+AGLNP+S+RQ + + ++ P I+D +
Sbjct: 1318 SLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGD 1377
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
L+ Y LP E+ ++A + GT +I+ ++ ++ T+
Sbjct: 1378 LVWRYLFLPNNEKTDVAKKIGTRVQEIIEDITEIDRQTA 1416
>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus terrestris]
Length = 1417
Score = 573 bits (1478), Expect = e-160, Method: Compositional matrix adjust.
Identities = 429/1479 (29%), Positives = 709/1479 (47%), Gaps = 199/1479 (13%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L V AN+I I+ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
P+V+VDP+GRC +L+YG ++++L + S + D D + S + I SS++
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
I L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306
Query: 354 EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
E P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 307 ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE-ADAPST 464
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ E + +
Sbjct: 367 AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426
Query: 465 KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
K++++ +SD L D+ + EEL +YGS S Q T + F V DSL+NIGP +
Sbjct: 427 KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485
Query: 518 FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
S G + S+ + ELV ELPGC+ +W
Sbjct: 486 ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV G + ++ + HA+LI+S E TM+L+T + EV +S + QG T
Sbjct: 545 TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I AGNL R ++QV + G R+L G Q + ++ S ADPYV
Sbjct: 599 IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
L DG + LL T + AA + + + Y D+
Sbjct: 649 TLLSEDGQVMLLTLREGRGTAKLHAQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708
Query: 717 GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
PE + T + S G+ + +
Sbjct: 709 APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGISKRTPWWQKHLQEIK 768
Query: 751 DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
Y ++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 769 PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 828 ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL-- 870
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
+ L + LR P+ TR C + F NI+G+ G F+
Sbjct: 871 ---RFKKLDHGIIPGQLKPKLRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 920 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979
Query: 989 YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
YD WPV+KV PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 980 YDAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------Y 1021
Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVV 1098
+ D + +T EE R + P + + W+T I + E+ ++ V
Sbjct: 1022 RFNGED--KEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNV 1079
Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSK 1152
+L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+K
Sbjct: 1080 SLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAK 1139
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
E KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+
Sbjct: 1140 EQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIA 1198
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D++KSI L ++E+ L+L+++DF + + E+LID + L +V+D + N+ +F Y P
Sbjct: 1199 DVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNMALFMYQP 1258
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLD 1331
+ ES GQKL+ +A+FH+G V F R++ L+ ++ G+DK R ++ +LD
Sbjct: 1259 ESRESLGGQKLIRKADFHLGQKVNTFFRIRCRLSDPANDKKHFSGADK--RHVTMYASLD 1316
Query: 1332 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
GS+G I P+ E T+RRL LQ LV + H+AGLNP+++R + S+ + I+D +
Sbjct: 1317 GSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGD 1376
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
L+ Y LP E++++A + GT +I+ +L ++ T+
Sbjct: 1377 LVWRYFYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1415
>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
cpsf [Tribolium castaneum]
Length = 1413
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 432/1478 (29%), Positives = 708/1478 (47%), Gaps = 209/1478 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV + ANVI+++ + + ET + LE V Y L GN+ S+
Sbjct: 29 SLVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
++ + RD+++LAF+DAK+SV+E+D H L+ S+H FE + +K G
Sbjct: 82 QSVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTH 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESS 233
P+V+ DP+ RC + V+G ++++L + + D D G G A I +S
Sbjct: 135 HYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILAS 194
Query: 234 HVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I L+D + ++ D F+HGY EP ++IL E T+AGRV+ + TC ++A+S++
Sbjct: 195 YMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQ 254
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP D K + + P+GG L+ N + Y +QS + Y VSL+S
Sbjct: 255 QKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNS 308
Query: 352 SQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
E P+ + LD A AT+L++D +LS K G+L +LT++ D R V+
Sbjct: 309 IAENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHF 368
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADA 461
K SVLT+ I+ N+ FLGSRLG+SLL++FT C ++ E
Sbjct: 369 EKAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIE 417
Query: 462 PSTKRLRRSSS----------DALQDMV--------NGEELSLYGSASNNTESAQKTFSF 503
PS KRL+ S+S D L D + + EEL +YG+ + ++ F
Sbjct: 418 PSAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVF 476
Query: 504 AVRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYEL 541
V DSL+NIGP ++FS L ++ + T + K ++
Sbjct: 477 EVCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQI 536
Query: 542 VE---LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
V LPGC +WTV+ + HA+LI+S E TM+L+T D + E
Sbjct: 537 VTTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINE 580
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
+ ++ + T+ AGNL + ++QV R+L G Q + E GS
Sbjct: 581 I-DNTGFATHIPTVYAGNLGNLKYIVQVTSSAVRLLQGINQLQHIPL-----ELGS---- 630
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG- 717
++ V+ DPY+ L +DG + L+ + + + S+ PV++ +Y D
Sbjct: 631 -PIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSG 689
Query: 718 -------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS-- 754
PE ++ ++ T + + + G D P + +Y
Sbjct: 690 LFTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNW 748
Query: 755 --------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
V E+ LEI+ +P+F + + G +VD E++ S
Sbjct: 749 WKKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISA 806
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
+ S++ E Q R+ ++ + VV L H SRP L L + + Y+ + F
Sbjct: 807 STPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF-- 857
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
P K + NVS R D + +E ++ F NI
Sbjct: 858 PRGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNI 906
Query: 921 SGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+G+ G F+ G+ P W M R LR HP DG +++F +NVNC GF+Y + L+
Sbjct: 907 AGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELR 966
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
I LP+ +YD WPV+KV PL+ TPH +TY E Y L+ S+ +P N+
Sbjct: 967 IGVLPTHLSYDAAWPVRKV-PLRCTPHFVTYHLESKTYCLVTSIA--EPSNKYYKF---- 1019
Query: 1040 EVGHQIDNHNLSSVDLHRTYTV---EEYEVRILEPDRAGGPWQT--RATIPMQSSENALT 1094
++ LS D + E++ + + P W I + E+
Sbjct: 1020 ----NGEDKELSVEDRGDRFPYPLQEKFSLMLFSP----VSWDVIPNTKIDLDEWEHVNC 1071
Query: 1095 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTE 1148
++ V+L T+ + +A+GT Y GEDV +RGR+L+F P +N E
Sbjct: 1072 LKNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKE 1131
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
+Y+K+ KG ++AL+ ++G L+ A G KI + + +L G+AF D +Y + +K+
Sbjct: 1132 IYAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYTHQILTIKSL 1190
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
+L+ D++KSI L ++E+ L+L+++DF + F+ E++ID +T+ +VSD +KN+ ++
Sbjct: 1191 LLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNLVLY 1250
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLF 1327
Y P+ ES GQ+LL +A+FH+G V F R++ L + G+DK R ++
Sbjct: 1251 MYQPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADK--RHITMY 1308
Query: 1328 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
TLDG +G I P+ E T+RRL LQ LV H+AGLNP++FR + S K S+
Sbjct: 1309 ATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSV 1368
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+D EL+ +Y L + E+LE++ + GT ++L +L+D+
Sbjct: 1369 IDGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDI 1406
>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus impatiens]
Length = 1417
Score = 570 bits (1470), Expect = e-159, Method: Compositional matrix adjust.
Identities = 428/1479 (28%), Positives = 709/1479 (47%), Gaps = 199/1479 (13%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L V AN+I I+ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
P+V+VDP+GRC +L+YG ++++L + S + D D + S + I SS++
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
I L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306
Query: 354 EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
E P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 307 ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ + + E +
Sbjct: 367 AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426
Query: 465 KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
K++++ +SD L D+ + EEL +YGS S Q T + F V DSL+NIGP +
Sbjct: 427 KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485
Query: 518 FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
S G + S+ + ELV ELPGC+ +W
Sbjct: 486 ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV G + ++ + HA+LI+S E TM+L+T + EV +S + QG T
Sbjct: 545 TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I AGNL R ++QV + G R+L G Q + ++ S ADPYV
Sbjct: 599 IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
L DG + LL T + AA + + + Y D+
Sbjct: 649 TLLSEDGQVMLLTLREGRGTAKLHVQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708
Query: 717 GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
PE + T + S GV + +
Sbjct: 709 APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGVSKRTPWWQKHLQEIK 768
Query: 751 DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
Y ++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 769 PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 828 ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
+ L + R P+ TR C + F NI+G+ G F+
Sbjct: 870 --LRFKKLDHGIIPGQLRPKPRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 920 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979
Query: 989 YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
YD WPV+KV PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 980 YDAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------Y 1021
Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVV 1098
+ D + +T EE R + P + + W+T I + E+ ++ V
Sbjct: 1022 RFNGED--KEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNV 1079
Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSK 1152
+L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+K
Sbjct: 1080 SLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAK 1139
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
E KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+
Sbjct: 1140 EQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIA 1198
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D++KSI L ++E+ L+L+++DF + + E+LID + L +V+D + N+ +F Y P
Sbjct: 1199 DVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNMALFMYQP 1258
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLD 1331
+ ES GQKL+ +A+FH+G V F R++ ++ ++ G+DK R ++ +LD
Sbjct: 1259 ESRESLGGQKLIRKADFHLGQKVNTFFRIKCRVSDPANDKKHFSGADK--RHVTMYASLD 1316
Query: 1332 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
GS+G I P+ E T+RRL LQ LV + H+AGLNP+++R + S+ + I+D +
Sbjct: 1317 GSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGD 1376
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
L+ Y LP E++++A + GT +I+ +L ++ T+
Sbjct: 1377 LVWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1415
>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
Length = 1409
Score = 564 bits (1453), Expect = e-157, Method: Compositional matrix adjust.
Identities = 430/1458 (29%), Positives = 701/1458 (48%), Gaps = 179/1458 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LVV N++ ++ + + +K T+RR LE + + L NV S+
Sbjct: 29 SLVVAGKNILRVFQLIPDID---PTKRDAYTERRP-----PKMKLECLSSFSLFANVMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ RD+++L+F +AK+ V+E+D H LR S+H FE + +K G +
Sbjct: 81 QAVSLAGSS----RDALLLSFREAKLCVVEYDPDSHDLRTLSLHYFEEED---MKGGWTN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------GSGGGFSARI 230
P V+VDP+GRC +LVYG +++IL + + D D S A +
Sbjct: 134 HYDIPYVRVDPEGRCAAMLVYGRKLVILPFRRESK--LDDPDIALLDPHSSSVATAKAPV 191
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
SS+ I LR++D +++V D F++GY EP ++IL+E T+AGR++ + TC + A+S+
Sbjct: 192 LSSYTITLREIDEKLENVIDIQFLYGYYEPTLLILYEPLKTFAGRIAVRSDTCAMIAVSL 251
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAV 347
+ + HP IWS NLP + + + VP P+GG L+ N + Y +QS +++N+ A
Sbjct: 252 NIQQRVHPAIWSVGNLPFNCTQAIPVPKPLGGTLIFSVNALIYLNQSIPPFGVSVNSIAE 311
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ + Q + + L+ + AT++ +D +LS KTG+L +L+++ D R V+ K
Sbjct: 312 NSTNFQLKIQEGVKITLEGSQATFISHDRLVLSLKTGELYVLSLLADNIRSVRGFHFDKA 371
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT+ + + FLGSRLG+SLL++FT + L E E PS +R
Sbjct: 372 AASVLTTCLCVCEDKYLFLGSRLGNSLLLRFTEKESSEAPIITLDESIR--EVPVPSKRR 429
Query: 467 LRRSSSDALQ----DMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
+ + D + D+ + +EL +YG+ ++ +F F V DSL+NIGP + S G
Sbjct: 430 RQDALGDWMASDVADIRDLDELEVYGTQEASSSVQITSFMFEVCDSLLNIGPCGNVSMGE 489
Query: 523 RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
+ ++ + ELV ELPGC +WTV
Sbjct: 490 PAFLSEEFSN-NRDPDLELVTTSGHGKNGAICVLQRTIRPQVVTTFELPGCLDMWTVI-- 546
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
G +DS A DD HA+LI+S + TM+L+T + EV S + QG TI AGN
Sbjct: 547 ---GPQSDSGPTQAEDDISHAFLILSQKDSTMILQTGQEINEVDHS-GFNTQGPTIFAGN 602
Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
L + ++QV + G R+L G Q + S+V+ S ADPYV L
Sbjct: 603 LASNKYIVQVSKAGVRLLRGLEQIQHIPL----------DLGSSVVHASTADPYVALLTE 652
Query: 677 DGSI------------RLLVGDPST---------CTVSVQTPAAIESSKKPVSSCTLYHD 715
DG + RL V P+ CT + ++++ + + T D
Sbjct: 653 DGQVVLLTLRESRGQGRLSVFKPTIPTNPRVSKICTYRDVSGLFTLTTEEELQNATFKSD 712
Query: 716 KGPEPWLRKTS--TDAWLSTG---------VGEAIDGADGGPLDQGDIYS---------V 755
++K + D L G + + + P + YS
Sbjct: 713 SKN---MKKEADDEDEMLYGGSEVKFQLLPITNTNEPSPPRPFVRWKKYSQEIKPNYWMF 769
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
V E+G L+I+ +P+F F + + G + D ++ + G
Sbjct: 770 VLRETGTLDIYSLPDFRPSFQIRRIGQGHRVLYDV------------LDMAQTSGMDGSD 817
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTS 874
IH + VV L H R + + T+ ++ YQA+ F +GP +
Sbjct: 818 DPEIHELLVVSLG------HLGRRPILLLRTENDLMIYQAFKFAKGPNLKIRF------- 864
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
R L + + R + Y E A R+ F NISG+ G F+ G P
Sbjct: 865 RRLPQTLILKERKAKFKVK------YENEVESERA--TRLRYFSNISGYNGVFVCGPNPH 916
Query: 935 WC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
W + R LR HP L DG + +F HNVNC GF+Y TS+ L+IC LP+ +YD W
Sbjct: 917 WLFLTARGELRSHPMLIDGRVTSFASFHNVNCPLGFLYFTSKCELRICILPTHLSYDAPW 976
Query: 994 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
PV+KV PL+ TPH +TY E Y LI S +P N+ ++ H +++ +
Sbjct: 977 PVRKV-PLRCTPHMVTYHLESKTYCLITSSS--EPSNEYFRF-NGEDKEHSVEDRD---- 1028
Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE--NALTVRVVTL-FNTTTKENET 1110
D +++ + + P W+ M+ E + V+ V L + T +
Sbjct: 1029 DRFPLPLQDKFSIVLFSP----VSWEVIPNTKMELDEWEHVTCVKTVNLSYEGTRSGLKG 1084
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQ 1165
+A+GT Y ED+ ++GR+L++ P +N VY+KE KG ++AL +
Sbjct: 1085 YVAVGTNYNYSEDITSKGRILIYDIIEVVPEPGQPLTKNRFKTVYAKEQKGPVTALCHVL 1144
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
G L+ A G KI + + +L GIAF D +Y+ + VK+ IL+ D++KSI L ++E
Sbjct: 1145 GFLVTAMGQKIYIWQLKDNDLVGIAFIDTQ-IYIHQMISVKSLILVADVYKSISLLRFQE 1203
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+ L+L+++DF + +A E L+D + + ++SD + NI ++ Y P+ +S GQKLL
Sbjct: 1204 EYRTLSLVSRDFRPCEVYAIELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSVGGQKLLR 1263
Query: 1286 RAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
+A+FH+G H+ + R++ L ++ G++K R +F TLDG++G + P+ E T
Sbjct: 1264 KADFHLGQHINSWFRIRCRLGDQAENYDFPIGAEK--RHISMFATLDGALGYLLPIPEKT 1321
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
+RRLQ LQ LV +PH+AGLNP++FR + S K IVD EL+ Y L + E+
Sbjct: 1322 YRRLQMLQNILVYHIPHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSLTVMEK 1381
Query: 1405 LEIAHQTGTTRSQILSNL 1422
++A + G+ I+ ++
Sbjct: 1382 QDVAKKMGSKMDDIIEDI 1399
>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Megachile rotundata]
Length = 1415
Score = 563 bits (1451), Expect = e-157, Method: Compositional matrix adjust.
Identities = 430/1469 (29%), Positives = 705/1469 (47%), Gaps = 181/1469 (12%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV N+I ++ + + +K K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGGNIIRVFRLIPDVDITKREKYTESRPPKM--------KLECLAQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG-SGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++F S S + + + E + K+
Sbjct: 369 ASVLTSCVCMCDDNYLFLGSRLGNSLLLRFIEKESENSQNMNENEITIEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS ++ T ++ F V DSL+NIGP + S
Sbjct: 429 VKQDFIGDWMASDVL-DIKDPEELEVYGSETH-TSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 GLRINADASATGISKQSNYELV--------------------------ELPGCKGIWTVY 554
G + S+ + ELV ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI 545
Query: 555 HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAA 614
G + ++ + HA+LI+S E TM+L+T + EV +S + QG T+ A
Sbjct: 546 -----GALNNDEQVRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFA 599
Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
GNL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 600 GNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVSLL 649
Query: 675 MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDKGPE 719
DG + LL T + A + + + Y D+ PE
Sbjct: 650 SEDGQVMLLTLREGRGTAKLHAQTANLLFRPQIEALCAYRDVSGIFTTQLPENVEDEVPE 709
Query: 720 --------PWLRKTSTDAWLSTGVGEAID--------GADG----GPLDQGDI------Y 753
P + + L G G A ++G P Q + Y
Sbjct: 710 EEHNTEEPPIVGNIDNEDDLLYGDGPAFQMPAPSQTKSSEGTSKRAPWWQKHLQEIKPTY 769
Query: 754 SVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 770 WLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTAPVNEIPNPE---- 825
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
M+V E+ M H +RP L L D + YQ Y + P+ K
Sbjct: 826 ---------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQTYRY--PKGHLK------ 867
Query: 873 TSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
L + + NLR D ET H C + F NI+G+ G F+
Sbjct: 868 ----LRFKKLDHGIIPGNLRPKPKEEDMSAMNETRH---CM-MRYFSNIAGYNGVFICSD 919
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DG I +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 920 YPHWIFLTGRGELRTHPMGIDGPITSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
WPV+KV PL+ TPH +TY E Y +I S+ +PL G +
Sbjct: 980 APWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSYYRF-----NGEDKEFTEE 1031
Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1108
D + E++ + + P W+T I + E+ ++ V+L T+
Sbjct: 1032 DRPDRFIFPSQEQFSIVLFSPVS----WETIPNTKIELDQWEHVTCLKNVSLAYEGTRSG 1087
Query: 1109 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALA 1162
+ + +GT Y GED+ +RGR+L+F P +N ++Y+KE KG I+A+
Sbjct: 1088 LKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAIT 1147
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+ G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D++KSI L
Sbjct: 1148 QVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIADVYKSISLLR 1206
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
++E+ L+L+++DF + + E+LID + L +V+D + NI +F Y P+ ES GQK
Sbjct: 1207 FQEEYRTLSLVSRDFRPAEVYTIEYLIDNNNLGFLVADGESNIALFMYQPESRESLGGQK 1266
Query: 1283 LLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
L+ +A+FH+G V F R++ ++ ++ G+DK R ++ +LDGS+G I P+
Sbjct: 1267 LIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSGADK--RHVTMYASLDGSLGYILPVP 1324
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E T+RRL LQ LV + H+AGLNP+++R + S + I+D +L+ Y LP
Sbjct: 1325 EKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYIRTQGNPARGIIDGDLVWRYLYLPN 1384
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
E++++A + GT +I+ +L ++ T+
Sbjct: 1385 NEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1413
>gi|414587798|tpg|DAA38369.1| TPA: hypothetical protein ZEAMMB73_163106, partial [Zea mays]
Length = 483
Score = 551 bits (1419), Expect = e-153, Method: Compositional matrix adjust.
Identities = 271/459 (59%), Positives = 328/459 (71%), Gaps = 11/459 (2%)
Query: 685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
DPSTCT+S+ PA SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID D
Sbjct: 33 ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
D DIY ++CYESG LEIF+VP+F VF+VD FVSG + D + R + KDS
Sbjct: 93 SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+S+ +KE ++K+VELAM RWS SRPFLF +L DGT+LCY AY FEG E+
Sbjct: 153 DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
+ S + N + SRLRNLRF R +D +R++ C RITIF N+
Sbjct: 210 VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDIS----CLVRPRITIFNNVG 265
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G++G FL G RP W V R+R RVHPQLCDG IVAFTVLHNVNC G IYVTSQG LKIC
Sbjct: 266 GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
QLPS YDNYWPVQKV PL TPHQ+TY+ E++LYPLIVSVP ++PLNQVLS + DQE+
Sbjct: 326 QLPSAYNYDNYWPVQKV-PLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQVLSSMADQEL 384
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1101
G ++N S DL YTV+E+EVRI+E ++ G W+TR+TIPMQS ENALTVR+VTL
Sbjct: 385 GLHMENDVTSGGDLQEVYTVDEFEVRIMELGKSNGRWETRSTIPMQSFENALTVRIVTLQ 444
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
NT+TKENETL+AIGTAYVQGEDVAARGRVLLFS ++ +
Sbjct: 445 NTSTKENETLMAIGTAYVQGEDVAARGRVLLFSFSKSEN 483
>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
tauri]
gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
[Ostreococcus tauri]
Length = 1473
Score = 551 bits (1419), Expect = e-153, Method: Compositional matrix adjust.
Identities = 390/1294 (30%), Positives = 603/1294 (46%), Gaps = 205/1294 (15%)
Query: 260 MVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+ IL+E+ TWAGR + TC I ALS+ ++ +IW NLP +YKL A+ P+G
Sbjct: 6 LAILYEKTPTWAGRYNLAKDTCEIVALSVDVDKQKSTVIWRRQNLPSSSYKLTALLPPLG 65
Query: 320 GVLVVGANTIHYHSQSASCALALNNYA------------------VSLDSSQELPRS--- 358
GVLV + + + SQ +S AL LN + + D+ P +
Sbjct: 66 GVLVFSQDFLLHESQESSSALCLNTFGRGGPQEGNDAETVARLAGMGEDAVANPPPACAA 125
Query: 359 -----SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
+ LD A A + D L++TK G L LL + DGR ++R+ L + +VL+S
Sbjct: 126 RAVDCGLEITLDGAQAAVVSEDRVLVTTKMGALFLLALHTDGRSLRRMMLQRAGGAVLSS 185
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSSGLKEEFGDIEADAPSTKRLRRS 470
+ + L FLGSR+GDSLLV+FT S + ML G +E E + S KR +
Sbjct: 186 GMCLLSRDLLFLGSRIGDSLLVKFTPKSEPAAPLMLPKGEDDEETVDEVEKGSGKRSKSG 245
Query: 471 SSDALQDMV-----------------NGEELS--LYGS-------ASNNTESAQKT---- 500
A++ + +EL LYG+ T++A+K
Sbjct: 246 DGAAIRKRAKSTEDPPPAPSTPSPEDDDDELEALLYGTTKAESVIGDETTQTAEKKREGL 305
Query: 501 -----------FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQ---- 536
+ F V+DSL+ + P+ D + G + D +A G K
Sbjct: 306 AGVVPGLKVAGYDFKVKDSLLGVAPVVDITVGASAPVGTDTAERTELVTACGQGKNGALA 365
Query: 537 -----------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
+ E LP +G+W ++ + + +R + +H +L++ L+
Sbjct: 366 ILTRGVQPELVTEVEAGTLPTLQGLWALHDRK------EGTR--EVREPFHNHLLLKLQ- 416
Query: 586 RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
EV+ S+++ T+AA N FG +Q+ E RIL QD++
Sbjct: 417 ------------EVSASLEFITDQATLAAANFFGHFCSLQITETSIRILKSGMKVQDVTL 464
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ GS + S I DPY+++ +SDG++RLL GD TVS+ A+ +S +
Sbjct: 465 ADIKAPKGS-----VIASAEILDPYIMIRLSDGTLRLLAGDEKKMTVSLMESGAMPTSSR 519
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
G W+ +++T+ ++ G GA +Q + + E G+LE+
Sbjct: 520 RTRLVEALKKSG---WIHRSATNGTITGLEGSKKSGAS----NQKEAIVAIAREGGSLEL 572
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
F +P+ ++ D G + T SE I ++V
Sbjct: 573 FSLPSCTRIWNADGLSEGSRVLSPTRPVH----SELRIP------------------EIV 610
Query: 826 ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
++ + + H RP L A+ DGT+L Y+ ++ S++P++
Sbjct: 611 DIRIDSFEEAHERPLLTAVRGDGTLLLYRGFIVPAGTTCEGSEEPLARG----------- 659
Query: 886 RLRNLRFSRTPLD-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
LRFSR +D A ++ G RI+ G QG F++G
Sbjct: 660 ---ELRFSRVNIDVEGSGLNVAGVGVAGQVRDSLAGTRLTRISNVGEGQGLQGIFVAGPN 716
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W +V R R+ P +G IVAFT HNVNC +GFI T+ G ++ICQ+PS Y+
Sbjct: 717 PLWLIVRRSRVLALPTRGEGEIVAFTDFHNVNCPYGFILGTAVGGVRICQMPSKMHYEAA 776
Query: 993 WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV-GHQIDNHNLS 1051
WPV+K I LK TPH + Y + LY L+ S V +D+E+ G + +LS
Sbjct: 777 WPVRK-IALKCTPHAVAYLPDFKLYALVTSANVP---------WVDREIDGENVHGLSLS 826
Query: 1052 SVDLHRTYTVE----EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1107
R + +Y VR+L P WQ ++ E+ VR V L + T
Sbjct: 827 KARRERAKAHDDMELQYSVRLLVPGSLDCVWQHT----LEPGEHVQCVRNVQLKDINTGH 882
Query: 1108 NETLLAIGTAYVQGEDVAARGRVLLFST-----GRNADNPQNLVTEVYSKELKGAISALA 1162
+ + LA+GTA GED RGRV LF+ +AD + +E K A +AL
Sbjct: 883 SLSYLAVGTAMPGGEDTPCRGRVYLFNMVWERDSESADGYRWKGQVCCVREAKMACTALE 942
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
L GHL++A G K+ +H W G ELN +AF+D P ++ VS+N+VKNFIL+GD+ K ++F
Sbjct: 943 GLGGHLIVAVGTKLTVHTWDGRELNSVAFFDTP-IHTVSINVVKNFILVGDLEKGLHFFR 1001
Query: 1223 WKEQGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
WK+ G + +L L+KDF +D ++EFLIDG+TLSL+ SD N + F Y PK ESWKG
Sbjct: 1002 WKDTGFEKSLIQLSKDFERMDVVSSEFLIDGTTLSLLGSDMSGNARTFGYDPKSIESWKG 1061
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
QKLL RA +HVG+ +++ +R + + S NRFA+ FGTLDG++G P
Sbjct: 1062 QKLLPRAAYHVGSPISRMVRFNVEGSKSKMASTDGKPKGANRFAVFFGTLDGALGIFMPT 1121
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEM 1398
D +T+ +L ++Q++L +V G NPR+FR + P ++D LLS +E
Sbjct: 1122 DPVTYEKLLAIQRELTTAVRSPIGCNPRTFRTPKVFEGKHVQLRAPLDVLDGGLLSKFET 1181
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
L EQ++IA R L + L+ +FL
Sbjct: 1182 LTFSEQVKIASSAQVDRDLTLGLIQQLSASNAFL 1215
>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
Length = 1392
Score = 547 bits (1409), Expect = e-152, Method: Compositional matrix adjust.
Identities = 424/1478 (28%), Positives = 698/1478 (47%), Gaps = 230/1478 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV + ANVI+++ + + ET + LE V Y L GN+ S+
Sbjct: 29 SLVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
++ + RD+++LAF+DAK+SV+E+D H L+ S+H FE + +K G
Sbjct: 82 QSVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTH 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESS 233
P+V+ DP+ RC + V+G ++++L + + D D G G A I +S
Sbjct: 135 HYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILAS 194
Query: 234 HVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I L+D + ++ D F+HGY EP ++IL E T+AGRV+ + TC ++A+S++
Sbjct: 195 YMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQ 254
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP D K + + P+GG L+ N + Y +QS + Y VSL+S
Sbjct: 255 QKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNS 308
Query: 352 SQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
E P+ + LD A AT+L++D +LS K G+L +LT++ D R V+
Sbjct: 309 IAENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHF 368
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADA 461
K SVLT+ I+ N+ FLGSRLG+SLL++FT C ++ E
Sbjct: 369 EKAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIE 417
Query: 462 PSTKRLRRSSS----------DALQDMV--------NGEELSLYGSASNNTESAQKTFSF 503
PS KRL+ S+S D L D + + EEL +YG+ + ++ F
Sbjct: 418 PSAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVF 476
Query: 504 AVRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYEL 541
V DSL+NIGP ++FS L ++ + T + K ++
Sbjct: 477 EVCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQI 536
Query: 542 VE---LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
V LPGC +WTV+ + HA+LI+S E TM+L+T D + E
Sbjct: 537 VTTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINE 580
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
+ ++ + T+ AG I ++ +L
Sbjct: 581 I-DNTGFATHIPTVYAG-----------------INQLQHIPLELG-------------- 608
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG- 717
S ++ V+ DPY+ L +DG + L+ + + + S+ PV++ +Y D
Sbjct: 609 SPIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSG 668
Query: 718 -------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS-- 754
PE ++ ++ T + + + G D P + +Y
Sbjct: 669 LFTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNW 727
Query: 755 --------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
V E+ LEI+ +P+F + + G +VD E++ S
Sbjct: 728 WKKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISA 785
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
+ S++ E Q R+ ++ + VV L H SRP L L + + Y+ + F
Sbjct: 786 STPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF-- 836
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
P K + NVS R D + +E ++ F NI
Sbjct: 837 PRGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNI 885
Query: 921 SGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+G+ G F+ G+ P W M R LR HP DG +++F +NVNC GF+Y + L+
Sbjct: 886 AGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELR 945
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
I LP+ +YD WPV+KV PL+ TPH +TY E Y L+ S+ +P N+
Sbjct: 946 IGVLPTHLSYDAAWPVRKV-PLRCTPHFVTYHLESKTYCLVTSIA--EPSNKYYKF---- 998
Query: 1040 EVGHQIDNHNLSSVDLHRTYTV---EEYEVRILEPDRAGGPWQT--RATIPMQSSENALT 1094
++ LS D + E++ + + P W I + E+
Sbjct: 999 ----NGEDKELSVEDRGDRFPYPLQEKFSLMLFSP----VSWDVIPNTKIDLDEWEHVNC 1050
Query: 1095 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTE 1148
++ V+L T+ + +A+GT Y GEDV +RGR+L+F P +N E
Sbjct: 1051 LKNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKE 1110
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
+Y+K+ KG ++AL+ ++G L+ A G KI + + +L G+AF D +Y + +K+
Sbjct: 1111 IYAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYTHQILTIKSL 1169
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
+L+ D++KSI L ++E+ L+L+++DF + F+ E++ID +T+ +VSD +KN+ ++
Sbjct: 1170 LLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNLVLY 1229
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLF 1327
Y P+ ES GQ+LL +A+FH+G V F R++ L + G+DK R ++
Sbjct: 1230 MYQPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADK--RHITMY 1287
Query: 1328 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
TLDG +G I P+ E T+RRL LQ LV H+AGLNP++FR + S K S+
Sbjct: 1288 ATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSV 1347
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+D EL+ +Y L + E+LE++ + GT ++L +L+D+
Sbjct: 1348 IDGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDI 1385
>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1446
Score = 533 bits (1374), Expect = e-148, Method: Compositional matrix adjust.
Identities = 430/1517 (28%), Positives = 694/1517 (45%), Gaps = 274/1517 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ GA+ RD+++L+F+DAK+SV+E+D H L+ S+H FE PE R++
Sbjct: 82 ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPEL------RDT 131
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
DE G G G + +++I
Sbjct: 132 LT-------------------------------------DEQELGVGEGPKSSFLPTYII 154
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + C I A+S++ K
Sbjct: 155 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQAQCSIVAISLNIMQKV 214
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 215 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 274
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD + A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 275 LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 334
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL----KEEFGDIEADAPSTKRLR 468
+ + T+ FLGSRLG+SLL+++T L G KE+ D++ L
Sbjct: 335 TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEMPLEEGKDKQEKEKDNDMDKQV-YVHTLN 393
Query: 469 RSSSDALQDMVNGE--ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
S+ + D E E+ +YGS A + T+ A T+SF V DS++NIGP + S G
Sbjct: 394 SFSAHSQHDFFVDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAF 451
Query: 526 ADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSR 559
G + + + E+V ELPGC +WTV +
Sbjct: 452 LSEEFQG-NPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEVK 510
Query: 560 GHNADSSRMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
++ D + H +LI+S E TM+L+T + E+ S + QG
Sbjct: 511 EDKKVPQSPGSFTATHYSLEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGP 569
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
T+ AGN+ + +IQV G R+L+G + L F P + S ++ S+ADPY
Sbjct: 570 TVFAGNIGDNKYIIQVSPMGIRLLEG---VKQLHFIPVDL-------GSPIVHCSVADPY 619
Query: 671 VLLGMSDGSI---------------RLLVGDPSTCTVSVQTPAAIESSKKPVS------- 708
V++ ++G + RL + P +S Q+ + + VS
Sbjct: 620 VVIMTAEGVVTMFVLKVDSYMGKTHRLALQKPQ---ISTQSRVIALCAYRDVSGMFTTEN 676
Query: 709 --SC---------------TLYHD------KGPEPWLRKTST-------DAWLSTGVGEA 738
SC T+ HD E L S+ + + + V
Sbjct: 677 KVSCAIAEDFNIRSQSETETVIHDLSSNIVDDEEEMLYGDSSSNAGPSKEEMIRSFVAPG 736
Query: 739 IDGADGGPLD-QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALK 797
++GGP + + +V ESG +EI+ +P++ VF V F G+ +VD+ ++
Sbjct: 737 PSVSEGGPSKAEPSHWCLVTRESGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSAT 796
Query: 798 DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
+ E EE T QG + + +V L +HSRP+L + + +L Y+A+
Sbjct: 797 QGDKE--GKKEEMTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVEQELLVYEAFP 848
Query: 858 F--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PH 907
+ + P+N K +RF + P + RE+
Sbjct: 849 YDQQQPQNNLK-----------------------VRFKKVPHNINFREKKSKLRKDKKAE 885
Query: 908 GAPCQ----------RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVA 956
GA + R F++ISG+ G F+ G P W +V R LR+HP DG I +
Sbjct: 886 GAAAEDGVAARGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMTIDGPIES 945
Query: 957 FTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN- 1015
F+ HN+NC GF+Y QG L+I LP+ +YD WPV+K IPL+ T H ++Y E
Sbjct: 946 FSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRK-IPLRCTVHYVSYHVESKA 1004
Query: 1016 ------LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
+Y + SV + L I + G + + + + + +++ ++++
Sbjct: 1005 SLSHCCVYAVCTSV-------KELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLI 1057
Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARG 1128
P TR I ++ E ++ V L + T + +A GT +QGE+V RG
Sbjct: 1058 SPVSWEAIPNTR--IDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRG 1115
Query: 1129 RVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
R+L+ P +N +Y KE KG ++AL G+L+ A G KI L
Sbjct: 1116 RILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKD 1175
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L G+AF D LY+ + +KNFIL D+ KS+ L ++E+ L+L+++D L+ +
Sbjct: 1176 NDLTGMAFIDTQ-LYIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVY 1234
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1303
+ EF++D S L +VSD KN+ ++ Y P+ ES+ G +LL RA+F+ GA++ F R M
Sbjct: 1235 SIEFMVDNSQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTFWR--M 1292
Query: 1304 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1363
+ G+ N+ F TLDG +G + P+ E T+RRL LQ L + H A
Sbjct: 1293 PCRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHA 1352
Query: 1364 GLNPRSF-------------------------RQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
GLNP++F R H + ++ + +I+D ELL+ Y
Sbjct: 1353 GLNPKAFRCVGADRTSAAMLSGMLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNKYLY 1412
Query: 1399 LPLEEQLEIAHQTGTTR 1415
L + E+ E+A + GTT+
Sbjct: 1413 LSMMERSELAKKIGTTQ 1429
>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
Length = 1410
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 429/1472 (29%), Positives = 687/1472 (46%), Gaps = 206/1472 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV N I +Y + + + ++ ++ ETK + LE V Y L GNV S+
Sbjct: 29 NLVTAGVNQIRVYRLVAESKPVEKESHTTETKS-------AKQKLECVADYELCGNVSSI 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S GA RD+++L FE+AK+S+ ++D L+ S+H FE + L+ G
Sbjct: 82 ESISLVGA----ARDALLLCFEEAKLSLCDYDPDTDDLKTISLHYFEDAD---LENG--C 132
Query: 177 FARG---PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RG V+VDP+GRC +L+YG +I+L + D + S + I S+
Sbjct: 133 CQRGLHHSEVRVDPEGRCAVMLIYGTHLIVLPFRKESPSDEIDATSCAS----KSPIMST 188
Query: 234 HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I+LR LD + +V D F+HGY EP ++IL+E TW RV+ + TC I A+S++
Sbjct: 189 YIIDLRTLDERVTNVVDIQFLHGYYEPTVLILYEPLPTWTCRVAVRKDTCSIVAISLNLQ 248
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP+D + VP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 249 DKTHPIIWSHSNLPYDCLRTFPVPKPIGGVIVFAVNSLLYLNQS------FPPYGVSLNS 302
Query: 352 SQEL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
P+ + LD A A ++ ND ++S K G+L +LT+V D R V+ L
Sbjct: 303 LTSFNTEFLLKPQEGVRMSLDCAQAEFIDNDKLVISLKGGELYVLTLVIDSMRAVRSFHL 362
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
K SVLT+ + G++ FLGSRLG+SLL+++ SS G+ +
Sbjct: 363 DKAAASVLTTCMCMCGDNYLFLGSRLGNSLLLRYQ--EKKPEASSSSDASPGEEQRKEKM 420
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPL------- 515
T + S + + + +EL +YG S ES T F F V DS++NIGP
Sbjct: 421 TLAIGLVGSSDVSKLDDLDELEVYGRDSQAVESEDITQFMFEVCDSIINIGPCGQVEMGE 480
Query: 516 -----KDFSYGLRINADASATG----------ISKQSNYELV---ELPGCKGIWTVYHKS 557
++FS+ + + T + +Q ++V ELPGC +WTV
Sbjct: 481 PAFLSEEFSHQEDPDLELVTTSGYGKNGAISILQRQIRPQVVTTFELPGCTDVWTVLGSP 540
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+D + HA+L++S +MVLET + E+ S + T+ A N+
Sbjct: 541 DEQQGSDEKLAGS-----HAFLLLSRADSSMVLETGQEIMELDHS-GFCTDAPTVHAANI 594
Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
R ++QV +L G Q L+ S S V+S S+ADP+VLL D
Sbjct: 595 GNGRYIVQVGPNAIWLLKGVERIQHLALDVS----------SPVVSCSLADPHVLLLCED 644
Query: 678 GSIRLLV-----GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG---------PEP--- 720
G + LV DP T+S+ T + SK V + LY D EP
Sbjct: 645 GQLLHLVLSVQGDDP---TLSLLTTKLHQKSK--VIAINLYRDTSGLFVVASSESEPSAT 699
Query: 721 --------------------------------------WLRKTSTDAWLSTGVGEAIDGA 742
W ++ S E +GA
Sbjct: 700 TTTEATETTTPQQQTEEGVDDEDDLLYGDSDISAITSTWQKQESEKEEKKEEEEEEAEGA 759
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
D P ++V+ +G LE++ +P++ F V F +G ++D+ L S
Sbjct: 760 DIQP----TYWAVIIRATGNLELYSLPDWQLCFLVKNFATGNKLLIDSMQAADLSASFVA 815
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
S++E V E+ + + + S+P L A + D + Y+ + G +
Sbjct: 816 PERSTQEVPF-----------VHEVMLHGFGVNGSQPLLMARVHD-ELYIYKVFSHVGSK 863
Query: 863 NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTR-----EETPHGAPCQRITIF 917
+ RL+ +RF R R E+ P R F
Sbjct: 864 --------------------AKGRLQ-VRFKRRSHGLIIRPRDREEKIPENKKWLRP--F 900
Query: 918 KNISGHQGFFLSGSRPCW-CMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ISG+ G F+ GS P W M R LR HP DG+I FT HNVNC GF+Y +S
Sbjct: 901 TDISGYSGVFICGSYPHWLIMTQRGTLRGHPMAIDGTIPCFTAFHNVNCPKGFLYFSSNE 960
Query: 977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
L+IC LP+ +YD WPV+KV PL+ TPH + Y + Y ++ S V P Q++ +
Sbjct: 961 ELRICVLPTHLSYDAPWPVRKV-PLRCTPHFVVYHPDSKTYSVVSSQQV--PCTQLVRVA 1017
Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
D E +I+ + D + ++ +++ P TR ++ E+ + ++
Sbjct: 1018 GDGE--KEIEA--VQKDDRFVFPIMNKFNIQLFSPVSWEPIPNTR--FDLEEWEHVMCIK 1071
Query: 1097 VVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVY 1150
+ L + T + + +GT EDV++RG++ ++ P +N + VY
Sbjct: 1072 TINLKSEGTLSGLKGYVVVGTNLNYNEDVSSRGKLTIYDVIDVVPEPGQPLTKNKIKVVY 1131
Query: 1151 SKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1210
+KE KG ++AL +QG L+ A G K+ + + +L GIAF D +Y+ + +KN I+
Sbjct: 1132 NKEQKGPVTALDGVQGFLVTAIGQKVYIWQLKDNDLAGIAFIDTQ-IYIHKMEALKNLII 1190
Query: 1211 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1270
+GD+ KSI L ++E L+L++KD L + +L+D ++L+ +V+D+ KN ++ Y
Sbjct: 1191 IGDVCKSISVLRYQEDMKVLSLVSKDVRPLAVYGVAYLVDETSLAFIVADKLKNFLVYCY 1250
Query: 1271 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
P + +S GQ+L+ +A+ ++G+ V F R++ SD + + + + TL
Sbjct: 1251 QPDLVQSQGGQRLIRKADINIGSLVNAFFRVK--CRVSDPSTSKTDQSLAMKHITYYVTL 1308
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
DGSIG + P+ E +RRL LQK L+ V AGLNP+++R + + +I+D
Sbjct: 1309 DGSIGYLLPISESLYRRLYMLQKMLIQQVQQTAGLNPKAYRTCQTEFRQLINIQRNIIDG 1368
Query: 1391 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+L Y L ++ E+A + GTT QI +L
Sbjct: 1369 DLAWKYLALTSHDRAEMAKRIGTTSHQIEDDL 1400
>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Rhipicephalus pulchellus]
Length = 1237
Score = 524 bits (1349), Expect = e-145, Method: Compositional matrix adjust.
Identities = 399/1306 (30%), Positives = 624/1306 (47%), Gaps = 199/1306 (15%)
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+HGY EP ++IL+E TW GR++ + TC I ALS++ + HP+IWS NLP D +
Sbjct: 3 FLHGYYEPTLLILYEPLRTWPGRIAIRQDTCCILALSLNLQQRVHPVIWSYTNLPFDCLR 62
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL-------PRSSFSVE 363
LLAVP P+GGVL++ +++ Y +QS Y VSL+S + P+ +
Sbjct: 63 LLAVPRPLGGVLIMAVDSLLYLNQSVP------PYGVSLNSFTDFSTSFPLKPQEGLKIS 116
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSL 422
LD A A +L D +LS K G+L +LT+ DG R V+ K SVLT+ +T +
Sbjct: 117 LDCAQACFLSYDRLVLSLKGGELYVLTLFNDGMRSVRNFYFDKAAASVLTTSMTLCEDGY 176
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----- 477
FLGSRLG+SLL+ +T + ++E + +A+ P +K+ R DA+ D
Sbjct: 177 LFLGSRLGNSLLLHYT-EKAAEVDDIAKRDEKTESDANDPPSKKKRM---DAIGDWMASD 232
Query: 478 --MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINAD 527
+++ +EL +YGS + T+ +++F V DSL+NIGP G N D
Sbjct: 233 VALIDPDELEVYGSETMATKQL-TSYTFEVCDSLINIGPCGKICMGEPAFLSEEFVQNTD 291
Query: 528 -----ASATGISKQSNYELV------------ELPGCKGIWTVY---------HKSSRGH 561
+ G K ++ ELPGC +WTV K+
Sbjct: 292 PDLELVTTAGYGKNGALCVLQRSVRPQVVTTFELPGCVHMWTVMGPPAEKKPPEKTEESD 351
Query: 562 NADSSRMAAYDD--EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFG 619
+ S AA HA+LI+S +M+L+T + E+ S + Q T+ AGNL
Sbjct: 352 DPASEDKAAEQPLTNTHAFLILSRADSSMILQTDQEINELDHS-GFSTQNPTVFAGNLGD 410
Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
R V+QV G R+LDG+ Q + S++++ S+ADP+V++ ++G
Sbjct: 411 GRYVLQVCPMGVRLLDGTRQLQHIPL----------DVGSSIVAGSLADPHVIIRSAEGL 460
Query: 680 I--RLLVGDPST-CTVSVQTPAAIESSKKPVSSCT------LYHDKGPEPWLRKTSTDAW 730
+ L GDP+ C ++V P K +S C L+ + EP + +
Sbjct: 461 VIHLTLRGDPAAGCRLAVLRPQLTAVKAKILSICVYKDVSGLFTTQYREP--DEPAKPEK 518
Query: 731 LSTGVGEAIDGADGGPLDQGD--------------------------------------- 751
E+ID + G LD D
Sbjct: 519 PLPPPKESIDMSSNGLLDDEDELLYGESEENPIQKEPVRMTSEEAPSVAESMFEIKEVAP 578
Query: 752 -IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEE 809
+ V E+G LEI+ +P + F V F G+ +VD+ A +++E ++ S E
Sbjct: 579 TYWLFVARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHE 638
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
+H + VV L ++ HSRP L A + D +L Y+A+ F
Sbjct: 639 SM-----PVVHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF----------- 676
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIF 917
T R + LRF + D + RE + P ++ + F
Sbjct: 677 -YETQREGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPF 726
Query: 918 KNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ISG+ G FL G RP W M R LR HP DG I F HNVNC GF++ QG
Sbjct: 727 SDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQG 786
Query: 977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
L+I LP+ TYD WPV+KV PL+ TPH + Y + Y ++ S P P N ++
Sbjct: 787 ELRISTLPTHLTYDAPWPVRKV-PLRCTPHFVNYHVDSKTYCVVTSQP--DPCNHLVRF- 842
Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALT 1094
G + + L + T++++ +++L P W+T + + E+
Sbjct: 843 ----TGEEKEYELLERDSRYIFPTMDKFSLQLLSPVS----WETIPNTRVDLDEWEHLTC 894
Query: 1095 VRVVTLFNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTE 1148
++ V L + TT + LA+GT Y GEDV +RGR+++ P +N +
Sbjct: 895 LKNVMLSSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLTKNKIKI 954
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
VYSKE KG ++AL+ + G LL A G KI + + EL G+AF D +Y+ S+ VKN
Sbjct: 955 VYSKEQKGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDTQ-IYIHSVVTVKNL 1013
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
IL+GD+ KS+ L ++E L+L+++D L+ +A EF ID + +S +V+D ++N+ ++
Sbjct: 1014 ILVGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLY 1073
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
Y P+ ES GQ+LL R +FHVG+ V R++ + S R +
Sbjct: 1074 MYQPESRESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMA 1133
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN----GKAHRPGP 1384
TLDGS+ + P+ E T+RRL LQ LV ++PH AGLNP+++R ++S G H+
Sbjct: 1134 TLDGSLAYVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHK--- 1190
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
+I+D EL+ + L E+ E++ + GTT +QI +L ++ T+
Sbjct: 1191 -NILDGELIWKFMHLSFMERSELSKKIGTTVTQITDDLLEIETYTA 1235
>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
Length = 1455
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 425/1487 (28%), Positives = 697/1487 (46%), Gaps = 191/1487 (12%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E I + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDE-EQIMEEIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPVNYMCAGERVEFEEDGATLRPHAESLQDVKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++ E G
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-----EVG-- 651
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 652 ---SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ G IV
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV--GNGAIV 826
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
T E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 827 LTDAMEFVPISLTTQENSKAGIVQACMPQHANSPLPLELSLTGLGLNGERPLLM-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R L N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEDIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
+GF+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1054
Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+
Sbjct: 1055 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1102
Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1103 ASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVP 1162
Query: 1141 NPQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1163 EPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT- 1221
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1222 NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLG 1281
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1315
+V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1282 FLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL 1341
Query: 1316 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
N+ +++GTLDG++G PL E +RR LQ LV H+ GLNP+ +R S
Sbjct: 1342 ---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLVSYQEHLCGLNPKEYRTLKS 1398
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1399 FKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445
>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
Short=dCPSF 160
gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
Length = 1455
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 421/1487 (28%), Positives = 698/1487 (46%), Gaps = 191/1487 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 829 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R + N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
+GF+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1054
Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+
Sbjct: 1055 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1102
Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1103 ASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVP 1162
Query: 1141 NPQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1163 EPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT- 1221
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1222 NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLG 1281
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1315
+V+D ++NI ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1282 FLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL 1341
Query: 1316 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S
Sbjct: 1342 ---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKS 1398
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1399 SKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445
>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
Length = 1455
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 418/1487 (28%), Positives = 699/1487 (47%), Gaps = 191/1487 (12%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YG+ + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGTGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGATLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGNAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + +V +SG LEI+ +P+ V+ V+ +G +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLIVARQSGTLEIYSMPDMKLVYLVNDVGNGAMVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 829 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R L N+ + ++ DA E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DAQEEIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
+GF+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1054
Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+
Sbjct: 1055 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1102
Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1103 ASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVP 1162
Query: 1141 NPQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1163 EPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT- 1221
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1222 NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLG 1281
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1315
+V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1282 FLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL 1341
Query: 1316 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S
Sbjct: 1342 ---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKS 1398
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
K ++D +L+ Y ++ E+ E+A + GT +IL++L
Sbjct: 1399 FKKQGINPSRCVIDGDLIWSYRLMANSERNEVAKKIGTRTEEILADL 1445
>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
Length = 1455
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 419/1495 (28%), Positives = 691/1495 (46%), Gaps = 207/1495 (13%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPTE------MRVAPKMRLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + + L + + R
Sbjct: 136 GRYFVPVVRVDPDSRCAVMLVYGKRLVVLPFRKDNTLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCQQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK-------- 451
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVDQQADQQL 429
Query: 452 ----------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+E D++ AP+ + RR + EEL +YGS + + +
Sbjct: 430 QRQQSEDQTLDEILDVDQLELAPTQAKSRR---------IEDEELEVYGSGAKASVLQLR 480
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
F F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 481 KFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLNDLKIELVAATGHSKNGAL 540
Query: 538 -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
N +++ EL GC +WTV+ D+++ + D+ H ++++S T
Sbjct: 541 SVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLLSQRNST 591
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 592 LVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 648
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
S V+ VSIADPYV L + +G + L + T + SS V
Sbjct: 649 --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 700
Query: 708 SSCTLYHD-----------------------------KGPEPWLRKTSTDAWLSTGVGEA 738
+ + Y D EP ++ + L G A
Sbjct: 701 VAISAYKDLSGLFTVKADDVNLTGSSSSAFGHSFGGYMKAEPHMKVEDEEDLLYGDAGNA 760
Query: 739 I------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKF 780
D A D + VV +SG LEI+ +P+ V+ V+
Sbjct: 761 FKMNSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV 820
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+G + D E + S T +S ++ +S +EL + + RP
Sbjct: 821 GNGAMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELTVLGLGLNGERPL 878
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
L + T +L YQ +F P+ K R L N+ + ++ D
Sbjct: 879 LL-VRTRVELLIYQ--VFRYPKGHLKI-----RFRKLEQLNLMDHQPSHIELDEN--DER 928
Query: 901 TREETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAF 957
E+ P Q++ F N+ G G + G PC+ + R LR+H L +G + +F
Sbjct: 929 EEMESYQMQPKYVQKLRPFANVGGLSGIMVCGVNPCFVFLTSRGELRIHRLLGNGDVRSF 988
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
+NVN +GF+Y + LKI LPS +YD+ WP++KV PL+ TP Q+ Y E +Y
Sbjct: 989 AAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSTWPIRKV-PLRCTPRQLVYHRENRVY 1047
Query: 1018 PLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAG 1075
LI +P+ + D+E+ + Y + ++E+ ++ P+
Sbjct: 1048 CLITQNE--EPMTKFYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET-- 1096
Query: 1076 GPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
W+ A+I + E+ ++V L + T + L IGT + ED+ +RG + +
Sbjct: 1097 --WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHI 1154
Query: 1133 FSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELN 1187
+ P +T EV+ KE KG +SA++ + G L+ G KI + + +L
Sbjct: 1155 YDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLI 1214
Query: 1188 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1247
G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF
Sbjct: 1215 GVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEF 1273
Query: 1248 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS 1307
++D S L +V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q
Sbjct: 1274 MVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKG 1333
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1367
+ N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP
Sbjct: 1334 LHQRQPFLYE---NKHFVVYGTLDGALGYCLPLPEKLYRRFLMLQNVLLSYQEHLCGLNP 1390
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +R + K I+D +L+ Y +L E+ E+A + GT +ILS+L
Sbjct: 1391 KEYRTIKAVKKQGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILSDL 1445
>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
Length = 1455
Score = 511 bits (1316), Expect = e-141, Method: Compositional matrix adjust.
Identities = 420/1492 (28%), Positives = 695/1492 (46%), Gaps = 201/1492 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
K SVLTS I + + FLGSRLG+SLL+ FT +++++ D+E
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVIT------LDDVEQ 423
Query: 460 DAPSTKRLRRSSSDALQDM-----------------VNGEELSLYGSASNNTESAQKTFS 502
+ +R + L+++ + EEL +YGS + + + F
Sbjct: 424 QSEQQQRNLQDEDQNLEEIFDVDQVEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFI 483
Query: 503 FAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS----- 537
F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 484 FEVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVF 543
Query: 538 ----NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
N +++ EL GC +WTV+ D+++ ++ +D+ H ++ +S T+VL
Sbjct: 544 VNCLNPQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMFLSQRNSTLVL 594
Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 595 QTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----- 648
Query: 651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSC 710
S V+ VSIADPYV L + +G + L + T + SS V +
Sbjct: 649 -----DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAI 703
Query: 711 TLYHD-------KG----------------------PEPWLRKTSTDAWLSTGVGEAI-- 739
+ Y D KG EP ++ + L G A
Sbjct: 704 SAYKDLSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKM 763
Query: 740 ----DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
D A D + VV +SG LEI+ +P+ V+ V+ +G
Sbjct: 764 NSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNG 823
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
+ D E + S T +S ++ +S +EL++ + RP L
Sbjct: 824 AMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL- 880
Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
+ T +L YQ +F P+ K R L N+ + ++ D
Sbjct: 881 VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEEI 931
Query: 904 ETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVL 960
E+ P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F
Sbjct: 932 ESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAF 991
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI 1020
+NVN +GF+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E +Y LI
Sbjct: 992 NNVNIPNGFLYFDTTYELKISVLPSYLSYDSIWPVRKV-PLRCTPRQLVYHRENRVYCLI 1050
Query: 1021 VSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPW 1078
+P+ + D+E+ + Y + ++E+ ++ P+ W
Sbjct: 1051 TQTE--EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----W 1097
Query: 1079 QT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
+ A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1098 EIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDI 1157
Query: 1136 GRNADNPQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIA 1190
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+A
Sbjct: 1158 IEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVA 1217
Query: 1191 FYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID 1250
F D +YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D
Sbjct: 1218 FIDT-NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVD 1276
Query: 1251 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
S L +V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1277 NSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQ 1336
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1370
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +
Sbjct: 1337 RQPFL---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEY 1393
Query: 1371 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
R S+ K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1394 RTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445
>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
Length = 1463
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 418/1499 (27%), Positives = 688/1499 (45%), Gaps = 207/1499 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSE------MRVAPKMRLECLATYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA--GAMRDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 137
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +L+YG ++++L + S L + + R
Sbjct: 138 GRYYVPEVRVDPDARCAVMLIYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVTRTP 197
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T AGR+ + TC++ A+S
Sbjct: 198 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDTCVLVAIS 257
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ NLP D +LL + PIGG LV+ N + Y +QS Y V
Sbjct: 258 LNIQQRVHPIIWTVNNLPFDCLRLLPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 311
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 312 SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVVSLRTGDLYVLTLCVDSMRTVR 371
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 372 NFHFHKAASSVLTSCICVCHMEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQQQQQA 431
Query: 448 ----SGLKEEFGDIEAD-------APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES 496
S E G ++ D APS + RR + EEL +YG+ + +
Sbjct: 432 AEEPSEEAEIEGILDMDQLEAATSAPSQAKSRR---------IEDEELEVYGTGAKASVL 482
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQ 536
+ F F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 483 QLRKFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLQDVKIELVAATGHSKN 542
Query: 537 S---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
N +++ EL GC +WTV+ D+++ + D+ H ++++S +
Sbjct: 543 GALSVFVNCINPQIITSFELEGCLDVWTVFD--------DATKKTSRQDQ-HDFMLLSQK 593
Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
T+VL+T + E+ E+ + V TI GNL R ++QV R R+L G+ + Q++
Sbjct: 594 NSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQNRFIVQVTTRHVRLLQGTRLVQNVP 652
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
S V+ V+IADPYV L + +G + L S T + SS
Sbjct: 653 I----------DVGSPVVQVAIADPYVCLRVFNGQVITLALRESRGTPRLAINKHTISSS 702
Query: 705 KPVSSCTLYHD-------------------------------KGPEPWLRKTSTDAWLST 733
V + Y D EP ++ + L
Sbjct: 703 PAVVAIAAYKDLSGLFTVKSDDILNLTGSGSNSAFGSTFGGYMKSEPHMKVEDEEDLLYG 762
Query: 734 GVGEAI------DGADGGPLDQGDIYS-------------VVCYESGALEIFDVPNFNCV 774
G A D A D + VV +SG LEI+ +P+ V
Sbjct: 763 DAGNAFKMNTMADLAKQSKQKNSDWWRRMLVQAAKPTYWLVVARQSGTLEIYSMPDMKLV 822
Query: 775 FTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSA 834
+ V+ +G + D E + S T +S ++ +S +EL++
Sbjct: 823 YLVNDVGNGAMVLTDAM--EFVPISLTSQENSKAGIVQSCMPQHANSPLPLELSLVGLGL 880
Query: 835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
+ RP L + T +L YQ +F P+ K R + N+ + ++
Sbjct: 881 NGERPLLL-VRTRLELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHVNLDD 932
Query: 895 TPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
+ Q++ F N+ G G + G PC+ + R LR+H L +G
Sbjct: 933 NEENEELESYNMQPKYVQKLRPFNNVGGMSGVMICGVNPCFLFLTSRGELRIHRLLGNGE 992
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAE 1013
+ +F +N+N +GF++ + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E
Sbjct: 993 VRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYDSTWPVRKV-PLRCTPRQLVYHRE 1051
Query: 1014 KNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEP 1071
+Y LI +P+ + D+E+ + Y + ++++ ++ P
Sbjct: 1052 NRVYCLITQTE--EPMTKFYRFNGEDKELSEESRGERF-------IYPIGSQFDMVLISP 1102
Query: 1072 DRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARG 1128
+ W+ A+I + E+ ++V L + T + L IGT + ED+ +RG
Sbjct: 1103 E----TWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRG 1158
Query: 1129 RVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ P +T EV+ KE KG +SA++ + G L+ G KI + +
Sbjct: 1159 NIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRD 1218
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ +
Sbjct: 1219 GDLIGVAFIDT-NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVY 1277
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1303
EF++D + L +V+D + N+ ++ Y P+ ES GQKLL +A++H+G V R+Q
Sbjct: 1278 GIEFMVDNTNLGFLVTDAESNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQC 1337
Query: 1304 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1363
+ N+ +++GTLDG++G PL E +RR LQ L+ H+
Sbjct: 1338 HQRGLHQRQPFL---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLC 1394
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
GLNP+ +R S+ + I+D +L+ Y +L E+ E+A + GT +IL++L
Sbjct: 1395 GLNPKEYRTLKSSKRLGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILADL 1453
>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
Length = 1373
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 412/1534 (26%), Positives = 687/1534 (44%), Gaps = 266/1534 (17%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
FA YK +H PT ++ C T + + NL++
Sbjct: 2 FACYKQLHPPTAVSFCLKARFTSANDE---------------------------NLIIVK 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL-AILSQ 121
N++E+Y+++ + +++ LV + L G ++S+ A+ Q
Sbjct: 35 NNIMEVYLIKP-----------------------NTSNIVLVKVFELFGVIDSIIAVCLQ 71
Query: 122 GGADNSRRRDSIILAFED-AKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G +++ +++ FED AK+SV+EFD+ L+ S+H E L+ G+ F
Sbjct: 72 G-----MKKEMLLINFEDEAKVSVVEFDEKRSDLKTLSLHYLEDD---FLREGKARFFHN 123
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------GLVGDEDTFGSGGGFSA 228
+ +DPQ R V++ +++IL Q G L GD++ G
Sbjct: 124 QPIILDPQNRFATVIICDSKLVILPFRQSGEDVSLSTEDNFLFALSGDQEEANENVGDQK 183
Query: 229 R-----IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
+ ++ +I+L DL +K+VKD+ F++GY EP ++ LHE E TW+GR++ K +T +
Sbjct: 184 KHHQPEVQRQVIIDLNDLGIKNVKDYCFLNGYNEPTILFLHENEQTWSGRLAAKSNTSTV 243
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-GGVLVVGANTIHYHSQSASCALAL 342
+A+S K +P IWS +LPHD KL+ + + GG LV+G N+I + +Q A+ L+
Sbjct: 244 TAVSFDLFRKYYPKIWSVGSLPHDCNKLIPLQEDVAGGALVIGMNSIIHINQCATYGLSF 303
Query: 343 NNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
N++AVS + + + ++F ++ D T++ D L+S K G+L + + G +
Sbjct: 304 NDFAVS-NPNLSINFNTFDGPALFFDTVAYTFIARDKLLVSLKDGELYTMYLESGGSRIN 362
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
+++ KT+ + S + T+ +L FLGS++GDS+L ++ S EE + A
Sbjct: 363 NINIKKTSNTTPASCMCTLKGNLIFLGSKIGDSVLYEYQEKVEVETSSLDTDEEMSSVFA 422
Query: 460 -----DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
+ KR D + EE ++ S S ++ ++ NIGP
Sbjct: 423 AGENFEPEKKKRKLADDDDFFAALEKDEEPTVIESFSKVSKKETTKVELKIKHVFTNIGP 482
Query: 515 LKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTVYHKSSRGHNADSSRMA 569
+ + + + D S G ++N + C GI TV ++S + + +
Sbjct: 483 ISHLTAAVTSSFDMS--GFKSKTNDNQLSAIACSGIGRHGCLTVLNRSLQPDIQSEATLP 540
Query: 570 ---------AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
+ E+ YLI+SLE +T V E+ L EVT + T+ G + R
Sbjct: 541 FLVKQVWTISQKTEHDLYLILSLEDKTKVFESKATLAEVTSKSMFVTNETTLNIGKI--R 598
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
++QV R + +L GS P S++ I DPYVLL DGS+
Sbjct: 599 ESIVQV-TRKSVMLIGS--------EPKQVHHSKKEIRSSI----ILDPYVLLHFYDGSL 645
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
LL D T IES+ +++ LY PE + G+ E
Sbjct: 646 VLLTHDNGRVT---SKQLDIESNHGKITAVCLYK-TNPE----------FEFFGINEK-- 689
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
+G V + GA EI VP+ CVF+ +F T + D
Sbjct: 690 --------EGKYLCCVYWTDGAFEILSVPDMTCVFSFSQFYQFHTTLFD----------- 730
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E + + + V E+A++ + P+L ++L+D T+ Y+++L
Sbjct: 731 -------EGQSSNTTQSEVKYPYVTEMALRGIGSDSEMPYLVSVLSDNTVHIYRSFL--- 780
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR------TPLDAYTREETPHGAPCQ-- 912
+ T+KS D +RL LRFS+ P+ ++ +
Sbjct: 781 -DRTTKSKD---------------NRLTRLRFSKFQHDDLLPISEIDKKSQTFTLNLKSK 824
Query: 913 -----------RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 961
++ FKNI G+ G F +G +P W LRVHP + FT H
Sbjct: 825 YLFPKSDLGRSQLIPFKNIGGYGGLFKTGEKPFWLFTEHSNLRVHPTQSRDPVTTFTPYH 884
Query: 962 NVNCNHGFIYVT-------SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
+ NC HGFIY+T Q L I L + ++ YWP +K++ LK+TP+ IT+ +
Sbjct: 885 HENCPHGFIYLTDKEQDNKKQSKLHISSLNANVKFNAYWPQRKIL-LKSTPNVITFHQDT 943
Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1074
N SVPV +L I G + +TV+ + +
Sbjct: 944 NTCLAFTSVPV----KAILPDSIPFPEGK-------CPPPAEQKHTVKLF---------S 983
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTL----------------FNTTTKENETLLAIGTAY 1118
G WQ E+A+ +VV L N+ ++ +++A+GTAY
Sbjct: 984 GHNWQEMDKFEFDLHESAVAAKVVYLSKEEYNDDTDISFEEPLNSRKQDLVSVVAVGTAY 1043
Query: 1119 VQGEDVAARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1174
VQ E RGR+LLF GR + NL++ S +KG I+ L + +++ + G
Sbjct: 1044 VQSERELCRGRLLLFDLDPILGRENEYKLNLIS---STSVKGPITTLEQVDRYIICSVGN 1100
Query: 1175 KIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
+I + W ++ +FYD Y SLN V+NFI+ GDI+KS+ FL WKE+G +L L
Sbjct: 1101 RIYTYYFDWEEKRMHITSFYDTQ-FYTASLNTVRNFIMFGDIYKSVSFLRWKEKGHRLIL 1159
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1292
LAKD L ++EFL++ L L V D KN+QIF Y P+ ES G+ L+ +FH+G
Sbjct: 1160 LAKDNRPLQVVSSEFLVNNDLLGLAVIDTSKNLQIFSYLPQHQESNDGRNLVPVCDFHIG 1219
Query: 1293 AHVTKFLRLQM----------LATSSDRTGAAPGSDKT----NRFALLFGTLDGSIGCIA 1338
+ +R+++ L +++ + D T N +LFG++DG+IG +A
Sbjct: 1220 TLINSLIRMKVRELPDDNTIRLGNVNEKPKQSGKKDITKTNPNHQFILFGSVDGAIGYVA 1279
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P++E+T RRL +LQ K+ + AGL+P+SFR + + +I+D +L+ +Y
Sbjct: 1280 PINEVTHRRLFALQLKMYTQLEQAAGLHPKSFRLYKPLERTEYNYKKNIIDGQLIWNYAN 1339
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
+ Q ++A Q GT IL ++ +L T F
Sbjct: 1340 INTILQRDLARQIGTNSDNILRSIQELNQATFFF 1373
>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
Length = 1459
Score = 507 bits (1305), Expect = e-140, Method: Compositional matrix adjust.
Identities = 419/1505 (27%), Positives = 693/1505 (46%), Gaps = 223/1505 (14%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV AN++++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA +D+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
P+V+VDP RC +LVYG ++++L + S DE F
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192
Query: 230 ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
I +S++I L++LD K +V D F+HGY EP ++IL+E T +GR+ + TC++
Sbjct: 193 RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCSGRIKVRSDTCVLV 252
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A+S++ + HP+IW+ +LP D +++ + PIGG LV+ N + Y +QS
Sbjct: 253 AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306
Query: 345 YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
Y VSL+SS + P+ + LD A+ ++ D ++S +TG+L +LT+ D R
Sbjct: 307 YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
V+ K SVLTS I + FLGSRLG+SLL+ FT +++++ +
Sbjct: 367 TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426
Query: 452 ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
EE D++ AP + RR + EEL +YGS +
Sbjct: 427 EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
+ + F F V DSL+N+ P+ G R+ + +ATG
Sbjct: 478 SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537
Query: 534 SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
SK N +++ EL GC +WTV+ D+++ + D+ H ++++
Sbjct: 538 SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
S T+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q
Sbjct: 589 SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
++ S V+ V+IADPYV L M +G ++R G P
Sbjct: 648 NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697
Query: 694 VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
+PA AI + K T+ D EP ++ + L
Sbjct: 698 TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGTGSGFGHSFGGYMKAEPNMKVEDEEDLL 757
Query: 732 STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
G A L Q + VV +SG LEI+ +P+
Sbjct: 758 YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
V+ ++ +G + D +L S E +S+ G Q ++ +S +EL++
Sbjct: 818 VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ RP L + T +L YQ +F P+ K R L N+ + ++
Sbjct: 875 GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
+ Q++ F N+ G G + G PC+ + R LR+H +
Sbjct: 927 EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
G + +F +NVN +GF+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y
Sbjct: 987 GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYH 1045
Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSL------LIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1065
E +Y LI +P+ + L ++ G + N S ++E
Sbjct: 1046 RENRVYCLITQTE--EPMTKYYRFNGEDKELSEESRGERFIYPNGS-----------QFE 1092
Query: 1066 VRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1122
+ ++ P+ W+ A+I + E+ ++V L + T + L IGT + E
Sbjct: 1093 MVLISPET----WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1148
Query: 1123 DVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKII 1177
D+ +RG + ++ P +T EV+ KE KG +SA++ + G L+ G KI
Sbjct: 1149 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIY 1208
Query: 1178 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1237
+ + +L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF
Sbjct: 1209 IWQLRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDF 1267
Query: 1238 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1297
L+ + EF++D S L +V+D ++N+ ++ Y P+ ES GQKL+ +A++H+G V
Sbjct: 1268 NPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNT 1327
Query: 1298 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1357
R+Q + N+ +++GTLDG +G PL E +RR LQ L+
Sbjct: 1328 MFRVQCHQRGVHQRQPFLYE---NKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLS 1384
Query: 1358 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1417
H+ GLNP+ FR S K I+D +L+ Y +LP ++ E+A + GT +
Sbjct: 1385 YQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEE 1444
Query: 1418 ILSNL 1422
ILS+L
Sbjct: 1445 ILSDL 1449
>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
Length = 1459
Score = 506 bits (1304), Expect = e-140, Method: Compositional matrix adjust.
Identities = 419/1505 (27%), Positives = 692/1505 (45%), Gaps = 223/1505 (14%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV AN++++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA +D+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
P+V+VDP RC +LVYG ++++L + S DE F
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192
Query: 230 ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
I +S++I L++LD K +V D F+HGY EP ++IL+E T GR+ + TC++
Sbjct: 193 RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLV 252
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A+S++ + HP+IW+ +LP D +++ + PIGG LV+ N + Y +QS
Sbjct: 253 AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306
Query: 345 YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
Y VSL+SS + P+ + LD A+ ++ D ++S +TG+L +LT+ D R
Sbjct: 307 YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
V+ K SVLTS I + FLGSRLG+SLL+ FT +++++ +
Sbjct: 367 TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426
Query: 452 ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
EE D++ AP + RR + EEL +YGS +
Sbjct: 427 EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
+ + F F V DSL+N+ P+ G R+ + +ATG
Sbjct: 478 SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537
Query: 534 SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
SK N +++ EL GC +WTV+ D+++ + D+ H ++++
Sbjct: 538 SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
S T+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q
Sbjct: 589 SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
++ S V+ V+IADPYV L M +G ++R G P
Sbjct: 648 NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697
Query: 694 VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
+PA AI + K T+ D EP ++ + L
Sbjct: 698 TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGSGSGFGHSFGGYMKAEPNMKVEDEEDLL 757
Query: 732 STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
G A L Q + VV +SG LEI+ +P+
Sbjct: 758 YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
V+ ++ +G + D +L S E +S+ G Q ++ +S +EL++
Sbjct: 818 VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ RP L + T +L YQ +F P+ K R L N+ + ++
Sbjct: 875 GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
+ Q++ F N+ G G + G PC+ + R LR+H +
Sbjct: 927 EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
G + +F +NVN +GF+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y
Sbjct: 987 GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYH 1045
Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSL------LIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1065
E +Y LI +P+ + L ++ G + N S ++E
Sbjct: 1046 RENRVYCLITQTE--EPMTKYYRFNGEDKELSEESRGERFIYPNGS-----------QFE 1092
Query: 1066 VRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1122
+ ++ P+ W+ A+I + E+ ++V L + T + L IGT + E
Sbjct: 1093 MVLISPET----WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1148
Query: 1123 DVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKII 1177
D+ +RG + ++ P +T EV+ KE KG +SA++ + G L+ G KI
Sbjct: 1149 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIY 1208
Query: 1178 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1237
+ + +L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF
Sbjct: 1209 IWQLRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDF 1267
Query: 1238 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1297
L+ + EF++D S L +V+D ++N+ ++ Y P+ ES GQKL+ +A++H+G V
Sbjct: 1268 NPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNT 1327
Query: 1298 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1357
R+Q + N+ +++GTLDG +G PL E +RR LQ L+
Sbjct: 1328 MFRVQCHQRGVHQRQPFLYE---NKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLS 1384
Query: 1358 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1417
H+ GLNP+ FR S K I+D +L+ Y +LP ++ E+A + GT +
Sbjct: 1385 YQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEE 1444
Query: 1418 ILSNL 1422
ILS+L
Sbjct: 1445 ILSDL 1449
>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
Length = 1450
Score = 503 bits (1295), Expect = e-139, Method: Compositional matrix adjust.
Identities = 414/1464 (28%), Positives = 685/1464 (46%), Gaps = 191/1464 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E +E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQSLE-EILDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCLN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G T +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGATVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 829 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R L N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKLDXXNLLDQQPTHIELDEN--DEQEEIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
+GF+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSIWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1054
Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+
Sbjct: 1055 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1102
Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1103 ASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVP 1162
Query: 1141 NPQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1163 EPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT- 1221
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1222 NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLG 1281
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1315
+V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1282 FLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL 1341
Query: 1316 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S
Sbjct: 1342 ---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKS 1398
Query: 1376 NGKAHRPGPDSIVDCELLSHYEML 1399
+ K I+D +L+ Y ++
Sbjct: 1399 SKKQGINPSRCIIDGDLIWSYRLM 1422
>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
1, partial [Ciona intestinalis]
Length = 1370
Score = 500 bits (1287), Expect = e-138, Method: Compositional matrix adjust.
Identities = 409/1477 (27%), Positives = 678/1477 (45%), Gaps = 216/1477 (14%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ C + NL+VTA
Sbjct: 2 YAWYRQIHAPTGVEQCVYCNFASEKEK---------------------------NLLVTA 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
A+ + +Y + E + +++N E + + L+ + ++L GNV + +
Sbjct: 35 ASQLTVYRLERNYEVTTKTENGEE-------NTVVKEKLQQIGSWQLFGNVVRMRSVRLA 87
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
GA + DS++L+F +AK+S++EFD + H ++ TS+H FE + K G P
Sbjct: 88 GA----KLDSVLLSFAEAKLSIIEFDQATHDIKTTSLHYFEDALY---KDGSYQRITLPK 140
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF--SARIESSHVINLRD 240
+ VDP+ RC + + + ++ + L D+ + R +S+ I+L
Sbjct: 141 IAVDPESRCVALQLTTKSVAVVPLRANTAALATDDGAAPQDNVSLQNKRSTTSYTIDLHA 200
Query: 241 LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
+D ++ + D F+HGY EP +++L E TWAGRV+ + TC I A+S++ + HP++
Sbjct: 201 VDARLQRIIDIQFLHGYNEPTLLVLFESLRTWAGRVAMRQDTCNIVAISLNMAEQLHPVV 260
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE---- 354
WS LP D VP PIGGVL+ N+I Y +QS Y SL+S+ E
Sbjct: 261 WSLNGLPFDCKYAYPVPKPIGGVLIFAVNSILYLNQSVP------PYGTSLNSTTENSTS 314
Query: 355 ---LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + LD +HA ++ + ++S K G+L +LT++ D R V+ K+ SV
Sbjct: 315 FPLKPQEDVCMTLDCSHAMFISPESLVISLKNGELYVLTLLVDSMRSVRNFHFDKSASSV 374
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
LTS +T + + FLGSRLG+SLL+++T + + E A KRL +
Sbjct: 375 LTSCLTVLDDGFLFLGSRLGNSLLLKYT-------EARPVFRNCYHTEEPAAKRKRLNTA 427
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ A D N +L +YG + +E ++ F V DSLVNIGP G A S
Sbjct: 428 ADWAASD-TNDIDLQMYGKDTVTSEPL-SSYKFEVCDSLVNIGPCGAAELGEP--AFLSE 483
Query: 531 TGIS-KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
+S ++S+ EL ELPGC +WTV +
Sbjct: 484 EFVSQRESDLELAILSGHGKNGAISVLQRSVKPQVVTTFELPGCIDMWTVKSVCEKTELP 543
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
++ + H+YLI+S E T++LET + EV E+ + + +++ GN+ G + +
Sbjct: 544 TKTQ-----QQQHSYLILSREESTLILETGKEIMEV-ENSGFNTREQSVFVGNIGGDKEL 597
Query: 624 I-QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
I QV G +L G + Q + E GS + SI DPY LL SDG + +
Sbjct: 598 ILQVCASGVWLLAGVKLLQHIPL-----ELGS-----PITQCSICDPYALLLTSDGDLIM 647
Query: 683 --LVGDPST--------CTVSVQTPAAIE--------------------------SSKK- 705
L D + C S+ IE SS K
Sbjct: 648 LTLTNDLDSENGVKLECCNPSINQVPQIEHVCLYKDTSGLFKTASGPSDVFLPEDSSNKG 707
Query: 706 -------------PVSSCTLYHDKGPEPWLRKTSTDAWL----------STGVGEAIDGA 742
P+SS T D+ E ++ D S E +DG
Sbjct: 708 VSDSEIPSSLPRTPLSSKTFTVDEEDELLYGESDPDVIFAPQFAPNVPKSPTQNEPLDGD 767
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
G ++ ++++ E+ LEI+ +P+ + V+T+ F G+ + ++ + S+ +
Sbjct: 768 KEGN-EEFTFWAIIARENRNLEIYSMPSLDLVYTIKNFSFGQKLLTNSGPVHSYSVSKDD 826
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
++S T K I + +V L + +S P L A + + IL Y+ + F PE
Sbjct: 827 KSTS----TRYSDKPRIFEILLVGLGYK-----NSSPHLIARIEE-EILIYEVFKFSAPE 876
Query: 863 NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG 922
K + S + V+ S + R P+ T+ + C R F NI G
Sbjct: 877 KFKKYN-----SLQIRFKKVNHSMM----IRRAPVTHETKTDQLEHRNCLRT--FSNIGG 925
Query: 923 HQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G FL G P W V R L HP DGS+ F HNVNC +GF+Y SQG L+IC
Sbjct: 926 YSGVFLCGPYPYWIFVTIRGALCCHPMSVDGSVSCFVPFHNVNCPNGFLYFNSQGELRIC 985
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
LP YD WP++K I L+ + H + Y E +Y L+ SV +P ++ L + E
Sbjct: 986 MLPPHMKYDTAWPMRK-ITLRCSVHFLAYSIEHKVYALVTSVS--EPCTRLPYLTFENER 1042
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
+ +L D ++++ V+++ P A A + M E+ ++ V L
Sbjct: 1043 EFE----DLEKGDRFIYPHIDKFSVQLISP--ASWDLVPNARLDMGEFEHITCMKNVWLS 1096
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1155
+ + L +GT V GE++++RG++++ P +N + ++YS+E K
Sbjct: 1097 CGQDSSARQNFLVLGTVNVFGEEMSSRGKIIILEVIEVVPEPGQPLTKNKLKQIYSEEQK 1156
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
G ++A+ L+G+LL A G KI + ++ + L G+AF D +Y+ ++F L+GDI
Sbjct: 1157 GPVTAVCGLEGNLLTAIGQKIFIWRFDENQSLRGLAFVDT-NVYIHHALSFRSFALVGDI 1215
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+SI L ++ L++ ++D L+ + + ++DG+ ++ +VSD +KN+ +F Y P+
Sbjct: 1216 QRSITLLRYQTDFKTLSVTSRDVRPLEVYTADLVVDGTGINFLVSDHEKNLVLFAYDPED 1275
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
ES G +L RA+ H+G+ R+ A DR+ P + GTLDGSI
Sbjct: 1276 HESHGGSRLTKRADMHIGSRANCMWRVA--ACGVDRSTGLPNQPYAGVHITMMGTLDGSI 1333
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
+ P+ E +RRL LQ ++ + H+AGLNP++FR
Sbjct: 1334 CHVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370
>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
Length = 1417
Score = 498 bits (1282), Expect = e-137, Method: Compositional matrix adjust.
Identities = 415/1462 (28%), Positives = 684/1462 (46%), Gaps = 173/1462 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y R+ + S++ T R M LE + Y L GN+ S+
Sbjct: 29 SLVTGGANVLKVY--RLIPDADATSRDKFTTTRPPNM------KLECMATYTLFGNIMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ +RD+++++F+DAK+SV++FD L+ S+H FE + +K G
Sbjct: 81 QSVSLAGS----QRDALLISFQDAKLSVVQFDPDNFELKTLSLHYFEEED---IKGGWTG 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGG---FSARI 230
P+V+VDP RC +LVYG ++++L K S V D I
Sbjct: 134 HYHTPIVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEVQDVKPMKKAPTQLIAKTPI 193
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S+VI L++ + + +V D F+HGY EP ++IL+E T+ GR++ + TCM+ ALS+
Sbjct: 194 LASYVIELKESEERIDNVIDIQFLHGYYEPTLLILYEPVKTFPGRIAVRSDTCMMVALSL 253
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAV 347
+ + HP+IW+ LP D + +A+ PIGG L++ N + Y +QS ++LN+ A
Sbjct: 254 NIQQRVHPVIWTVNCLPFDCLQAIAISKPIGGCLILSVNALIYLNQSVPPYGVSLNSIAD 313
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P+ + LDAA +++ + +LS K G+L +LT+ D R V+ SK
Sbjct: 314 HCTNFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRSFHFSKA 373
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT I + FLGSRLG+SLL++F + +++ EE + E KR
Sbjct: 374 ASSVLTCCICVVEEEYLFLGSRLGNSLLLRFKEKDESMVITIDDTEEVVEKEP-----KR 428
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI-- 524
LR + EEL +YGS T ++ F V DS++NIGP+ + G RI
Sbjct: 429 LR----------LEQEELEVYGSG-QKTSVQLTSYIFEVCDSILNIGPIGHMAVGERISE 477
Query: 525 ---NADASATGISKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAY--DDEYHA 577
+ + + + + E+V G G V S + S ++ D+ H+
Sbjct: 478 EEQDENKDVQFVPNKLDLEIVTSSGHGKNGALCVLQNSIKPQVITSFGLSGCLDVDDMHS 537
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
++I+S EA TMVL+T D + E+ E+ + TI GN+ G R ++QV + R+L G+
Sbjct: 538 FMILSQEAGTMVLQTGDEINEI-ENTGFATNVPTIHVGNIGGNRFIVQVTTKSIRLLQGT 596
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV-----GDPSTC-- 690
+ Q++ + +VSIADPYV + S+G + L G P
Sbjct: 597 RLLQNIPI----------DLGCPLAAVSIADPYVCVRSSEGRVITLALREGKGTPRLAVN 646
Query: 691 --TVSVQTPAAIE-SSKKPVSS--CTLYHD------------------KGPEPWLRKTST 727
T+S TPA + S K VS T Y D PEP ++
Sbjct: 647 KNTIST-TPAVVAISVYKDVSGMFTTKYEDFYDGSKAGSSAYSSGFGYMKPEPHMKIEDE 705
Query: 728 DAWLSTGVGEAI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVP 769
+ L G + D A D + ++G LEI+ +P
Sbjct: 706 EDLLYGESGRSFKMTSMADMAIETKKKNTDFWRKFMQPVKPTFWLYAVRDNGNLEIYSMP 765
Query: 770 NFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN---SSSEEGTGQGRKENIHSMKVVE 826
+ V+ + +G + D+ L+ +T + +S+ + G N+ +++
Sbjct: 766 DLKLVYLITNIGNGNKVLQDSMEFVPLQVGQTAADADVTSNAFTSPFGFNPNLLPKEILM 825
Query: 827 LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
+A+ H +RP LF L + +L Y+ Y + SK + R S V+
Sbjct: 826 VAL---GHHGTRPMLFVRL-ENDLLVYRVYRY------SKGHLKLRFRR--VPSGVTGPI 873
Query: 887 LRNLRFSRTPLDAYTREETPHGAPCQ-----RITIFKNISGHQGFFLSGSRPCWCMVFRE 941
+ P D + H I F N++G+ G + G +P + M+
Sbjct: 874 FKIAPRQSAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKP-YIMLLTS 932
Query: 942 R--LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
R LR H + F +NVNC +GF+Y Q LKI P +YD+ WPV+K I
Sbjct: 933 RGELRAHRLYAKTIMKGFAPFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWPVRK-I 991
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1059
PL+++P QI Y E +Y +++ +EV ++ N +L
Sbjct: 992 PLRSSPKQIVYHKENKVYCVVMDA---------------EEVCNKYYRFNGEDKELTEEN 1036
Query: 1060 TVE--------EYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKEN 1108
E ++ V ++ P W+ +I + E+ + ++ V+L +
Sbjct: 1037 KGERFLYPMAHKFSVVLVTP----SAWEIIPETSINLDEWEHVIALKNVSLSYEGARSGF 1092
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALAS 1163
+ +A+GT + ED+ +RGR+LL+ P +T EV KE KG +SA+
Sbjct: 1093 KEYIAVGTNFNYSEDITSRGRLLLYDIIEVVPEPGKPLTRYKFKEVIVKEQKGPVSAITH 1152
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
+ G L+ A G K+ L + +L G+AF D ++V L +K+ IL+ D++KS+ L +
Sbjct: 1153 VSGFLVGAVGQKVYLWQLKDDDLVGVAFIDT-NIFVHQLVSIKSLILVADVYKSVSLLRF 1211
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E L+L+++D+ L+ F E+++D L +VSDEQ NI + Y P+ ES+ GQ+L
Sbjct: 1212 QEDYRTLSLVSRDYQPLNVFQIEYVVDNHNLGFLVSDEQCNIITYMYQPESRESFGGQRL 1271
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
L + ++HVG + R+Q D S+ + F TLDG IG + PL E
Sbjct: 1272 LRKCDYHVGQKINSMFRVQCDFHEMDYKR---NSNYECKHTTYFATLDGGIGYVLPLPEK 1328
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
T+RRL LQ L+ PH+ GLNP++FR + K +VD +L+ + LP E
Sbjct: 1329 TYRRLFMLQNVLMTHSPHLCGLNPKAFRTIKTVKKLPINPARCVVDGDLIWTFLTLPANE 1388
Query: 1404 QLEIAHQTGTTRSQILSNLNDL 1425
+LE+A + GT I ++L ++
Sbjct: 1389 KLEVAKKIGTRIDDICADLMEI 1410
>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Acyrthosiphon pisum]
Length = 1335
Score = 494 bits (1273), Expect = e-136, Method: Compositional matrix adjust.
Identities = 413/1441 (28%), Positives = 674/1441 (46%), Gaps = 215/1441 (14%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV N++ +Y + + + K E + Y L GN+ L
Sbjct: 30 LVVAGVNILRVYRLVPTDTTCQPPK----------------TKFECLAQYTLFGNIMCL- 72
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
Q D+++L+F +AK S++E+D +H LR S+H FE ++ K G
Sbjct: 73 ---QSVTLCPSSPDALLLSFSEAKFSLVEYDRDMHSLRTLSLHYFEDDKF---KNGHTQH 126
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
PL++VDP GRC LVYG ++L G D++ SA++ S+ I
Sbjct: 127 WSPPLIRVDPDGRCVVGLVYGSYFVVLPF-----GRTIDDN------AKSAQVMPSYTIP 175
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
+ +D M ++ DF F+HGY EP ++IL+E T+AGR++ + TC + A+S++ H
Sbjct: 176 ISKIDPKMNNIMDFDFLHGYYEPTLLILYEPVKTFAGRIAVRKDTCAMVAISLNIQQHVH 235
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQE 354
P+IWS +LP+D K++AV PIGGVL++ N++ Y +QS +ALN+ A +L +
Sbjct: 236 PVIWSLDSLPYDCQKVIAVSRPIGGVLIMAVNSLIYLNQSVPPFGVALNSIAKTLTNFPL 295
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTS 413
+ ++ LD A AT++ +D + S GDL ++T+ D R V+ K SVLT+
Sbjct: 296 GQQEDINLVLDRATATFISSDKLVTSLCNGDLYVITLYADSMRAVRSFHFEKCASSVLTT 355
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
IT +S FLGSRLG+SLL+++ S ++ D PS KR + +D
Sbjct: 356 CITVCLDSYLFLGSRLGNSLLLRYYARSQSN--------------DDEPSIKRKKTDETD 401
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+D+V EL +YGS T +++SF V DS++NIGP S G A S
Sbjct: 402 --EDLV---ELEVYGSEVQ-TSICLESYSFEVCDSIINIGPCSQASIGE--PAYISDEFS 453
Query: 534 SKQSNYELVELPG--CKGIWTVYHKSSRGHNADSSRMAAYDD--------EYHAYLIISL 583
S + + EL+ G G +V H+S + + + Y D ++H ++I++
Sbjct: 454 SDEHDVELLCTSGHGKNGALSVLHRSIKPQLVTTFHLDGYKDMWTVHGENDFHTFMILTN 513
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
T++L+T + E+ +S Y + T+ N+ + VIQV R+L+GS Q +
Sbjct: 514 VDSTLILQTGQEINEL-DSSGYATREHTVFVCNM--NKFVIQVLRYSVRLLNGSEQLQSV 570
Query: 644 S--FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------RLLVGDPSTCTV 692
S FG S ++ S +PY +L DG + R+L+ P+
Sbjct: 571 SLDFG------------SPIIHGSSCNPYAVLLTEDGQVIVLTVKSTGRILLMRPTNFEQ 618
Query: 693 SVQTP--------AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
QT + + SS P + L GP K D ++S V + + G
Sbjct: 619 IPQTKTLAVYRDVSGLFSSTMPQAEIPLV---GP-----KLQHDHFVSDSVEDEEEMLYG 670
Query: 745 GPLDQGD--------------------------IYSVVCYESGALEIFDVPNFNCVFTVD 778
D + V+ ++G +EI+ +P+F
Sbjct: 671 DARDPSSRETPHNSVSNKNTMWWLKFLEVPTPTYWVVLTRDNGYMEIYTLPDFKI----- 725
Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK-ENIHSMKVVELAMQRWSAHHS 837
Y + +S + S EEG +K E I + +V L Q
Sbjct: 726 -----------KYRAANIDESPMILKDSLEEGCYFPKKTEIIKEILIVPLGYQ-----DK 769
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
RP +F L D ++ Y + PE T K +RF + +
Sbjct: 770 RPIMFVRL-DNEVVIYGIH--RHPEGTLK-----------------------MRFHK--M 801
Query: 898 DAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGS 953
+ ++ G P + ++ F ++GH G F+ G P ++ R LR HP DG
Sbjct: 802 TSLLTFQSRSGNPLEGTSLLRYFSKVAGHNGVFICGQNPHLILLTVRGELRCHPLHIDGP 861
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAE 1013
I+ F HNVNC+ GF+Y S L+I LP+ +YD WP++KV PL+ TPH I Y E
Sbjct: 862 IMCFAPFHNVNCSQGFLYFNSDHKLRISILPTHLSYDEPWPLRKV-PLRKTPHFIAYHLE 920
Query: 1014 KNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDR 1073
Y ++ S L + D+E+ + + L + H +T+E + EP
Sbjct: 921 TKTYCVVTSSSELSASYYRFNGE-DKELTTE-ERDPLFPLPSHEVFTLELFSPASWEP-- 976
Query: 1074 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLL 1132
+I + E+ ++ V L + + +A+GT Y ED+ +RGR+ L
Sbjct: 977 -----IPDTSIETEDWEHITCLKNVALAYEGARSGLKGYIAMGTNYSYSEDITSRGRIFL 1031
Query: 1133 FSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELN 1187
F P +N + +Y+KE KG ++A+ + G L+ A G KI + + +L
Sbjct: 1032 FDIIDVVPEPGKPLTKNKIKMIYAKEQKGPVTAITHVVGFLVTAVGQKIYIWQLKDNDLI 1091
Query: 1188 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1247
GIAF D +YV + +K+ IL+ D+ KSI L ++E+ L+L+ +D L+ F F
Sbjct: 1092 GIAFIDTE-VYVHQMLSIKSLILVADLFKSITLLRFQEEYRTLSLVCRDSKPLEVFDINF 1150
Query: 1248 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS 1307
LID + L + SD +N+ ++ Y P ES+ GQ L+ R +F++G++V F RL+ ++
Sbjct: 1151 LIDNTELGFLASDRDQNLLLYLYQPMARESYGGQHLVRRGDFNIGSNVNSFFRLRCKQST 1210
Query: 1308 --SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
DR A GSDK R ++ TLDGSIG I P+ E +RRL +LQ LV ++ H+AGL
Sbjct: 1211 VAPDRREAI-GSDK--RHVTMYTTLDGSIGYIVPIHEKNYRRLLTLQNMLVKNITHLAGL 1267
Query: 1366 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHY-EMLPLEEQLEIAHQTGTTRSQILSNLND 1424
NP+++R F + ++D EL+ + + ++ EIA++ G ++L ++ +
Sbjct: 1268 NPKAYRSFKATAPERMNQARRVIDGELVWMFVTCMNARQRNEIANKVGVKTIELLQDIYE 1327
Query: 1425 L 1425
L
Sbjct: 1328 L 1328
>gi|414587799|tpg|DAA38370.1| TPA: hypothetical protein ZEAMMB73_163106 [Zea mays]
Length = 461
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 241/421 (57%), Positives = 293/421 (69%), Gaps = 11/421 (2%)
Query: 685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
DPSTCT+S+ PA SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID D
Sbjct: 33 ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
D DIY ++CYESG LEIF+VP+F VF+VD FVSG + D + R + KDS
Sbjct: 93 SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+S+ +KE ++K+VELAM RWS SRPFLF +L DGT+LCY AY FEG E+
Sbjct: 153 DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
+ S + N + SRLRNLRF R +D +R++ C RITIF N+
Sbjct: 210 VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDI----SCLVRPRITIFNNVG 265
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G++G FL G RP W V R+R RVHPQLCDG IVAFTVLHNVNC G IYVTSQG LKIC
Sbjct: 266 GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
QLPS YDNYWPVQKV PL TPHQ+TY+ E++LYPLIVSVP ++PLNQVLS + DQE+
Sbjct: 326 QLPSAYNYDNYWPVQKV-PLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQVLSSMADQEL 384
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1101
G ++N S DL YTV+E+EVRI+E ++ G W+TR+TIPMQS ENALTVR+VTL
Sbjct: 385 GLHMENDVTSGGDLQEVYTVDEFEVRIMELGKSNGRWETRSTIPMQSFENALTVRIVTLQ 444
Query: 1102 N 1102
N
Sbjct: 445 N 445
>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
Length = 1434
Score = 483 bits (1243), Expect = e-133, Method: Compositional matrix adjust.
Identities = 408/1486 (27%), Positives = 701/1486 (47%), Gaps = 204/1486 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y RV + +++ R M LE V YRL+GN++S+
Sbjct: 29 SLVTGGANVLKVY--RVIPDADPATRDKYTAARPPNM------KLECVASYRLNGNIKSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ RD+++++F DAK+SV++FD L+ S+H FE + ++ G
Sbjct: 81 QSVSLAGS----LRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
P+V+VDP RC +LVYG ++++L + S L + + A+ I
Sbjct: 134 HYHIPMVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIELQDVKPIKKAPMQLVAKTPI 193
Query: 231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S++I L+DLD K +V D F+HGY EP ++IL+E T+ GR++ + TC + ALS+
Sbjct: 194 LASYIIELKDLDEKIDNVIDIQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 253
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
+ + HP+IW+ +LP D + + + PIGG LV+ N++ Y +QS Y VS
Sbjct: 254 NIQQRVHPVIWTVNSLPFDCIQAIPINKPIGGCLVMCVNSLIYLNQSVP------PYGVS 307
Query: 349 LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQR 400
L+SS + P+ + LDAA +++ + +LS K G+L +LT+ D R V+
Sbjct: 308 LNSSADHSTSFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRN 367
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+K SVLTS I + FLGSRLG+SLL++F + +++ ++ G +E +
Sbjct: 368 FHFNKAAASVLTSCICVCEDEYLFLGSRLGNSLLLRFKEKDESLVITI---DDSGAVEKE 424
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
KR R + +YGS T ++ F V D+++NIGP+ +
Sbjct: 425 P---KRPRLEEEEL----------EVYGSGYK-TSVQLTSYIFEVCDNVLNIGPIAHMAV 470
Query: 521 GLRINADASATG-----ISKQSNYELVE--------------------------LPGCKG 549
G R+ + + + + + E+V L GC
Sbjct: 471 GERVAEEDAENQPDVQIVQNKLDIEVVTSSGHGKNGALCVLQSSIKPQVITSFGLSGCVD 530
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+WTV+ ++ +R A HA++I+S E TMVL+T + + E+ E+ +
Sbjct: 531 VWTVFDEAV-------ARRAEDGPSTHAFMILSQEGGTMVLQTGEEINEI-ENTGFATTV 582
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TI GN+ R ++QV + R+L G+ + Q++ + SV+I DP
Sbjct: 583 PTIHVGNIGTNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVAIVDP 632
Query: 670 YVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTLYHDK--- 716
YV + S+G + L G P T+S TPA + S+ + VS L+ K
Sbjct: 633 YVCVRSSEGRVITLALREGKGTPRLAVNKNTIS-PTPAVVAISAYRDVSG--LFTKKIED 689
Query: 717 --------------------GPEPWLRKTSTDAWLSTGVGE----------AIDGADGGP 746
PEP ++ + L G AI G GG
Sbjct: 690 VYDLSRGGAASAYSSGFGSMKPEPHMKIEDEEDLLYGESGRSFKMTSMADMAIAGKSGGS 749
Query: 747 LD---------QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT--YMREA 795
D + + ++G LEI+ +P+ V+ + +G + D+ ++
Sbjct: 750 ADFWMKYMQQVKPTYWLFAARDNGTLEIYSMPDLKLVYLITNVGNGNKVLSDSMEFVPLP 809
Query: 796 LKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA 855
+ S ++ ++SS G G ++ +++ +A+ ++ SRP LF I + +L Y+
Sbjct: 810 MGKSASQEDASSAFGASFGVSASLLPKEILMVAL---GSYGSRPLLF-IRLEHDLLIYRV 865
Query: 856 YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP----- 910
+ + SK + R LS S V+ R S E+ A
Sbjct: 866 FRY------SKGHLKLRFKR-LSTS-VTCPVFRTPEPSGAGATEAANEQQQARATKVLYE 917
Query: 911 -CQRITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
I F N+SG+ G + G +P + + LR H + AF +NVNC +G
Sbjct: 918 NISMIRYFANVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFNNVNCPNG 977
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
F+Y Q LKI P+ +YD+ WPV+K IPL+++P QI Y E +Y +++ +
Sbjct: 978 FLYFDEQYELKISIFPTYLSYDSVWPVRK-IPLRSSPKQIVYHRENKVYCVVMDAEEI-- 1034
Query: 1029 LNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIP 1085
N+ D+E+ + HR + V ++ P W+ +I
Sbjct: 1035 CNKYYRFNGEDKELTEENKGERFLYPMGHR------FSVVLVTP----AAWEVVPETSIN 1084
Query: 1086 MQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
++ E+ + ++ V+L + + +A+GT + ED+ +RGR+LL+ P
Sbjct: 1085 LEEWEHVIALKNVSLTYEGARSGLKEYIAVGTNFNYSEDITSRGRLLLYDIIEVVPEPGK 1144
Query: 1145 LVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
+T EV K+ KG +SA++ + G L+ A G K+ L + +L G+AF D ++V
Sbjct: 1145 PLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYLWQMKDDDLVGVAFIDTN-IFV 1203
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
+ +K+ IL+ D++KS+ L ++E+ L+++++D+ L+ F E+++D + L +VS
Sbjct: 1204 HQMVSIKSLILVADVYKSVSLLRFQEEYRTLSVVSRDYHPLNVFQVEYVVDNANLGFLVS 1263
Query: 1260 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1319
D+Q N+ + Y P+ ES+ GQ+LL ++++H+G V R+Q +D D
Sbjct: 1264 DDQCNLITYMYQPESRESFGGQRLLRKSDYHLGQQVNCMFRVQCDFHETDVMKRTLNYD- 1322
Query: 1320 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
N+ F TLDG IG + PL E T+RRL LQ L+ PH GLNP+++R K
Sbjct: 1323 -NKHTTFFATLDGGIGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLNPKAYRTIKQTRKL 1381
Query: 1380 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+VD +L+ + LP E+ E+A + GT +I ++L ++
Sbjct: 1382 PINPSRCVVDGDLVWSFLELPANEKHEVAKKIGTRIEEICADLMEI 1427
>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
Length = 1426
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 416/1499 (27%), Positives = 668/1499 (44%), Gaps = 244/1499 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + + K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDAVQRQKLNPSEMRLAPKM------RLECLASYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGA----MRDALLISFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + R
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+ + T
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDT------- 248
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
+ PIGG LV+ N + Y +QS Y V
Sbjct: 249 -----------------------FFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 279
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 280 SLNSSADNSTAFPLKPQDNVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 339
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 340 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEATVEQQT 399
Query: 448 -----SGLKEE--FGDIEA-DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
L EE D+E +AP + RR + EEL +YGS + + +
Sbjct: 400 IEQSPEELAEESPVYDVEQHEAPPQSKSRR---------IEDEELEVYGSGAKASVLQLR 450
Query: 500 TFSFAVRDSLVNIGPLKDFSYG-----------LRINAD---------ASATGISKQS-- 537
F F V DSL+N+ P+ G LR +AD +ATG SK
Sbjct: 451 KFIFEVCDSLINVAPINYMCAGERVEFEEDGATLRPHADNLNDLKIELVAATGHSKNGAL 510
Query: 538 -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
N +++ EL GC +WTV+ ++R A ++R + H ++++S + T
Sbjct: 511 SVFVNCINPQIITSFELEGCLDVWTVFDDATR--KATTAR-----QDQHDFMLLSQRSST 563
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 564 LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 620
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
S V+ VSIADPYV L + +G + L + T + SS V
Sbjct: 621 --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 672
Query: 708 SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
+ Y D EP ++ + L G
Sbjct: 673 VAIAAYKDLSGLFTCKADDVLNLTGSSGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGS 732
Query: 738 AI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDK 779
A D A D + VV +SG LEI+ +P+ V+ V+
Sbjct: 733 AFKLNSMADLAKQSKQKNSDWWRRQLIQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 792
Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
+G + D E + S T+ NS + G ++ +S +EL + H R
Sbjct: 793 IGNGALVLSDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGER 848
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P L + T +L YQ + + + + L + + LD
Sbjct: 849 PLLL-VRTRLELLIYQVFRY------------AKGHLKIRFRKLEQLHLLEQQPTHIELD 895
Query: 899 AYTREETP----HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
EE Q++ F N+ G G + G PC+ + R LR+H L +G
Sbjct: 896 GEDVEEAESYNMQAKYVQKLRYFANVGGLAGIMVCGVNPCFVFLTSRGELRIHRLLGNGD 955
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAE 1013
+ +F +NVN HGF+Y + LKI LPS +YD WPV+KV PL+ TP Q+ Y E
Sbjct: 956 VRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKV-PLRCTPRQLVYHRE 1014
Query: 1014 KNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEP 1071
+Y LI +P+ + D+E+ + Y + +E+ ++ P
Sbjct: 1015 NRVYCLITQKE--EPMTKYYRFNGEDKELSEECRGERF-------IYPIGSLFEMVLISP 1065
Query: 1072 DRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARG 1128
+ W+ A+I + E+ ++V L + T + L IGT + ED+ +RG
Sbjct: 1066 E----TWEIVPDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRG 1121
Query: 1129 RVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ P +T EV+ KE KG +SA++ + G L+ G KI + +
Sbjct: 1122 NIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRD 1181
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF ++ F
Sbjct: 1182 GDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFNPMEVF 1240
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1303
EF++D S L +V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q
Sbjct: 1241 GIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQC 1300
Query: 1304 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1363
+ N+ +++G+LDG++G PL E +RR LQ L+ H+
Sbjct: 1301 HQRGLHQRQPFL---YENKHLVIYGSLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLC 1357
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
GLNP+ +R S K I+D +L+ + ML E+ E+A + GT +IL++L
Sbjct: 1358 GLNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAHSERNEVAKKIGTRTEEILADL 1416
>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
Length = 1431
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 408/1502 (27%), Positives = 663/1502 (44%), Gaps = 245/1502 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + ++ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDATQRQKLNPSEMRLAPKM------RLECLASYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S G RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGG----MRDALLVSFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N + Y +QS Y V
Sbjct: 243 ----------------------KVFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 335 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVESAATAAA 394
Query: 448 --------SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+ + ++ D + S A + EEL +YGS + + +
Sbjct: 395 TGAGEQQQQAIDQSPPQMDEDQVYDVEQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLR 454
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
F F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 455 KFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLTDLKIELVAATGHSKNGAL 514
Query: 538 -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
N +++ EL GC +WTV+ ++R + + E H ++++S + T
Sbjct: 515 SVFVNCINPQIITSFELDGCLDVWTVFDDATR-------KPSTARQEQHDFMLLSQRSST 567
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 568 LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 624
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
S V+ VSIADPYV L + +G + L + T + SS V
Sbjct: 625 --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSAPAV 676
Query: 708 SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
+ Y D EP ++ + L G
Sbjct: 677 VAIAAYKDLSGLFTCKADDVLNLTGSTGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGN 736
Query: 738 AI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDK 779
A D A D + VV +SG LEI+ +P+ V+ V+
Sbjct: 737 AFKLNSMADLAKQSKQKNTDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 796
Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
+G + D E + S T+ NS + G ++ +S +EL++ H R
Sbjct: 797 VGNGALVLTDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELSLVGLGQHGDR 852
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P L + T +L YQ + + L + +L L T ++
Sbjct: 853 PLLL-VRTRLELLIYQVFRY--------------AKGHLKIRFRKLEQLHLLDQQPTHIE 897
Query: 899 AYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLC 950
EET Q++ F N+ G G + G PC+ + R LR+H L
Sbjct: 898 LINEEETDEAESYNMQPKYVQKLRYFNNVGGLAGIMVCGVNPCFIFLTARGELRIHRLLG 957
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
+ + +F +NVN HGF+Y + LKI LP+ +YD WPV+KV PL+ TP Q+ Y
Sbjct: 958 NAEVRSFAAFNNVNIPHGFLYFDTTYELKISVLPTYLSYDAAWPVRKV-PLRCTPRQLVY 1016
Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRI 1068
E +Y LI +P+ + D+E+ + Y + +E+ +
Sbjct: 1017 HRENRVYCLITQKE--EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSLFEMVL 1067
Query: 1069 LEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1125
+ P+ W+ A+I + E+ ++V L + T + L IGT + ED+
Sbjct: 1068 ISPE----TWEIVPDASIQFEPWEHVTAFKLVKLSYEGTRSGLKEYLCIGTNFNYSEDIT 1123
Query: 1126 ARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHK 1180
+RG + ++ P +T EV+ KE KG +SA++ + G L+ G KI + +
Sbjct: 1124 SRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQ 1183
Query: 1181 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1240
+L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF L
Sbjct: 1184 LRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPL 1242
Query: 1241 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1300
+ F EF++D S L +V+D ++NI ++ Y P+ ES GQKLL +A++H+G V R
Sbjct: 1243 EVFGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFR 1302
Query: 1301 LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1360
+Q + N+ +++GTLDG++G PL E +RR LQ L+
Sbjct: 1303 VQCHQRGLHQRQPFLYE---NKHFVIYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQD 1359
Query: 1361 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1420
H+ GLNP+ +R + K I+D +L+ Y ML E+ E+A + GT +IL+
Sbjct: 1360 HLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGTRTEEILA 1419
Query: 1421 NL 1422
+L
Sbjct: 1420 DL 1421
>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
Length = 1420
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 409/1489 (27%), Positives = 661/1489 (44%), Gaps = 230/1489 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + ++ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDAAQRQKLNPTEMRLAPKM------RLECLASYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S G RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGG----MRDALLISFKDAKLSVLQLDADTQALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + R
Sbjct: 136 GRYHVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N I Y +QS Y V
Sbjct: 243 ----------------------KVFPIQKPIGGCLVMTVNAIIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
K SVLTS I FLGSRLG+SLL+ FT +++++ E + +A
Sbjct: 335 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDMENAVEQQA 394
Query: 460 DAPSTKRL----------RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSL 509
+ +L + S A + EEL +YGS + + + F F V DSL
Sbjct: 395 VEQAPPQLDEEQVYDVDQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSL 454
Query: 510 VNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------NYE 540
+N+ P+ G R+ + +ATG SK N +
Sbjct: 455 INVAPINYMCAGERVEFEEDGSTLRPHAESLNEVKIELVAATGHSKNGALSVFVNCINPQ 514
Query: 541 LV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
++ EL GC +WTV+ ++R + E H ++++S + T+VL+T +
Sbjct: 515 IITSFELDGCLDVWTVFDDATR-------KPTTARQEQHDFMLLSQRSSTLVLQTGQEIN 567
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 568 EI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----------DV 616
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-- 715
S V+ VSIADPYV L + +G + L + T + SS V + Y D
Sbjct: 617 GSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAIAAYKDLS 676
Query: 716 ----------------KGP------------EPWLRKTSTDAWLSTGVGEAI------DG 741
GP EP ++ + L G A D
Sbjct: 677 GLFTCKADDVLNLTGSSGPGFVNSFGGYMKAEPHMKVEDEEDLLYGDAGNAFKLNSMADL 736
Query: 742 ADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
A D + VV +SG LEI+ +P+ V+ V+ +G + D
Sbjct: 737 AKQSKQKNSDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGALVLND 796
Query: 790 TYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
E + S T+ NS + G ++ +S +EL + H RP L + T
Sbjct: 797 AM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGERPLLL-VRTRL 851
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP-- 906
+L YQ + + + + L + + + LD EE
Sbjct: 852 ELLIYQVFRY------------AKGHLKIRFRKLEQLHLLDQQPTHIELDGDEAEEAESY 899
Query: 907 --HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNV 963
Q++ F N+ G G + G P + + R LR+H L + + +F +NV
Sbjct: 900 NMQPKYVQKLRYFSNVGGLAGIMVCGMNPVFVFLTARGELRIHRLLGNADVRSFAAFNNV 959
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
N HGF+Y + LKI LPS +YD WPV+KV PL+ TP Q+ Y E +Y LI
Sbjct: 960 NIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKV-PLRCTPRQLVYHRENRVYCLITQK 1018
Query: 1024 PVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQT- 1080
+P+ + D+E+ + Y + +E+ ++ P+ W+
Sbjct: 1019 E--EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSLFEMVLISPET----WEIV 1065
Query: 1081 -RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1138
A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1066 PDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEV 1125
Query: 1139 ADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1193
P +T EV+ KE KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1126 VPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAFID 1185
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ F EF++D S
Sbjct: 1186 TN-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVFGIEFMVDNSN 1244
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
L +V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q
Sbjct: 1245 LGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHHRQP 1304
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R
Sbjct: 1305 FLYE---NKHLVIYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTI 1361
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ K I+D +L+ Y ML E+ E+A + GT +IL+++
Sbjct: 1362 KTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGTRTEEILADM 1410
>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
Length = 1420
Score = 468 bits (1204), Expect = e-128, Method: Compositional matrix adjust.
Identities = 411/1487 (27%), Positives = 677/1487 (45%), Gaps = 226/1487 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N + Y +QS Y V
Sbjct: 243 ----------------------KVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 335 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 394
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 395 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 453
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 454 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 513
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 514 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 564
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 565 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 613
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 614 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 673
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 674 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 733
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G +
Sbjct: 734 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 793
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 794 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 850
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R + N+ + ++ D E+
Sbjct: 851 ELLIYQ--VFRYPKGHLKI-----RFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 901
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 902 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 961
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
+GF+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E +Y LI
Sbjct: 962 PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1019
Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+
Sbjct: 1020 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1067
Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1068 ASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVP 1127
Query: 1141 NPQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1128 EPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN 1187
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1188 -IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLG 1246
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1315
+V+D ++NI ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1247 FLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL 1306
Query: 1316 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S
Sbjct: 1307 ---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKS 1363
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1364 SKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1410
>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
Length = 1503
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 398/1500 (26%), Positives = 679/1500 (45%), Gaps = 224/1500 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y R+ + ++ R M LE + YRL GN+ SL
Sbjct: 42 SLVTGGANVLKVY--RIIPDADPATREKYSATRPPNM------KLECMASYRLFGNIMSL 93
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ +RD+++++F DAK+SV++FD L+ S+H FE + ++ G
Sbjct: 94 QSVSLAGS----QRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 146
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGGF---SARI 230
PLV+VDP RC +LVYG ++++L K S + D I
Sbjct: 147 HYHIPLVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEMQDVKPIKKTPTLLIAKTPI 206
Query: 231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S++I L+DLD K +V D F+HGY EP ++IL+E T+ GR++ + TC + ALS+
Sbjct: 207 LASYIIELKDLDEKIDNVIDVQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 266
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
+ + HP+IW+ +LP D + + + PIGG LV+ N++ Y +QS Y VS
Sbjct: 267 NIQQRVHPVIWTVNSLPFDCLQAVPISKPIGGCLVMCVNSLIYLNQSVP------PYGVS 320
Query: 349 LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
L+SS + P+ + LDAA +++++ +LS K G+L +LT+ D
Sbjct: 321 LNSSADHSTNFPLKPQDGVRISLDAAQVCFIESEKLVLSLKGGELYVLTLCADS------ 374
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
I FLGSRLG+SLL++F + +++ ++ G +E +
Sbjct: 375 ----------MRSICVCETEYLFLGSRLGNSLLLRFREKDESLVITI---DDSGTVEKEQ 421
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
KR R + +YGS T ++ F V DS++NIGP+ + G
Sbjct: 422 ---KRQRLEEEEL----------EVYGSGYK-TSVQLTSYIFEVCDSVLNIGPIAHMAVG 467
Query: 522 LRINAD-------------------ASATGISKQSNYELVE------------LPGCKGI 550
RI + +A+G K +++ L GC +
Sbjct: 468 ERICEEEMEEGAEVQFVPNKLDVEVVTASGHGKNGALCVLQSSIKPQVITSFGLSGCLDV 527
Query: 551 WTVYHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
WTV+ +++ +R DD HA++I+S E TMVL+T + + E+ E+ +
Sbjct: 528 WTVFDEAAGPGGVTGTRKP--DDAPPPNHAFMILSQEGATMVLQTGEEINEI-ENTGFAT 584
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
TI GN+ R ++QV + R+L G+ + Q++ + SVSI
Sbjct: 585 DVPTIHVGNIGSNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVSIV 634
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD------------ 715
DPYV + S+G + L T + S+ PV + + Y D
Sbjct: 635 DPYVCVRSSEGRVITLALREGKGTPRLAVNKNTISASPPVIAISAYRDVSGMFTRKLEDS 694
Query: 716 ----KG---------------PEPWLRKTSTDAWLSTGVGEAID---GADGGPLDQG--- 750
KG PEP ++ + L G + AD D+G
Sbjct: 695 FDVSKGGGATSAYSSGFGSMKPEPNMKIEDEEDLLYGESGRSFKVTSMADMALADKGGGN 754
Query: 751 -------------DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL- 796
+ + ++G LEI+ +P+ + + +G + D+ L
Sbjct: 755 ADFWLKYMQQIKPTYWLLAARDNGNLEIYSMPDLKLAYLISNVGNGNKVLSDSMEFVPLP 814
Query: 797 --KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
K ++ ++S G G S+ E+ M ++ SRP LF I + +L Y+
Sbjct: 815 MAKPGTSQEEATSAFGASFGSGGVPVSLLPKEILMVALGSYGSRPILF-IRLEQDLLIYR 873
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR- 913
+ + + S+ + V A RL NL + A T P+G Q
Sbjct: 874 VFRYAKGHLKLRFKRLTSSVTCPAFRTVPA-RLANLP-DKPATGATTDATEPNGKDTQEH 931
Query: 914 -----------ITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLH 961
I F N+SG+ G + G +P + + LR H + AF +
Sbjct: 932 ATKVQYENISMIRYFGNVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFN 991
Query: 962 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
NVNC +GF+Y Q LKI LP+ +YD+ WPV+K IPL+++P QI Y E +Y +++
Sbjct: 992 NVNCPNGFLYFDEQYQLKISILPTYLSYDSVWPVRK-IPLRSSPKQIVYHRENRVYCVVM 1050
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE--------EYEVRILEPDR 1073
+E+ ++ N +L E ++ V ++ P
Sbjct: 1051 DA---------------EEICNKYYRFNGEDKELTEENKGERFLYPMGHQFSVVLVNP-- 1093
Query: 1074 AGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRV 1130
W+ I ++ E+ ++++ V+L + + +A+GT + ED+ +RGR+
Sbjct: 1094 --AAWEIVPDTAIALEEWEHVVSLKNVSLAYEGARSGLKEYIAVGTNFNYSEDITSRGRL 1151
Query: 1131 LLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
LL+ P +T EV K+ KG +SA++ + G L+ A G K+ L + +
Sbjct: 1152 LLYDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYLWQMKDDD 1211
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
L G+AF D ++V + +K+ IL+ D++KS+ L ++++ L+L+++D+ L+ +
Sbjct: 1212 LVGVAFIDTN-IFVHQMVSIKSLILVADVYKSVSLLRFQDEFRTLSLVSRDYHPLNVYQV 1270
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1305
E+++D + L +V+D+Q N+ + Y P+ ES+ GQ+LL + ++H+G V R+Q
Sbjct: 1271 EYVVDNTNLGFLVADDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAMFRVQCDF 1330
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
SD D N+ F TLDG G + PL E T+RRL LQ L+ PH GL
Sbjct: 1331 HESDVMRRTLNYD--NKHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGL 1388
Query: 1366 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
NP+++R + +VD +L+ + LP E+ E+A + GT +I ++L ++
Sbjct: 1389 NPKAYRTIKQSRALPINPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICADLMEI 1448
>gi|242075248|ref|XP_002447560.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
gi|241938743|gb|EES11888.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
Length = 374
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 220/359 (61%), Positives = 267/359 (74%), Gaps = 14/359 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-------ELDSELPSK-RGI 52
MS+AAYKMMHWPT I +C +GFITHS AD ++DS S R +
Sbjct: 1 MSYAAYKMMHWPTSIDHCAAGFITHSPADAAAFSSAAPAAAASGPDGDIDSAAASAPRRV 60
Query: 53 GPVPNLVVTAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHG 111
GP PNLVV+AANV+E+Y VR G+++ NS T ++DGIS A LELVCHYRLHG
Sbjct: 61 GPTPNLVVSAANVLEVYAVRADSATGAEDVGNSSSTG--AILDGISGARLELVCHYRLHG 118
Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
N+ES+A+LS G RRDSI + F+DAKI+ +EFDDS +GLR +SMHCFE PEW HLK
Sbjct: 119 NIESMAVLSDG---TENRRDSIAVTFKDAKIACMEFDDSTNGLRTSSMHCFEGPEWFHLK 175
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
RGRESFA GP++K DPQGRCG VLVYGLQMIILKA++ G LVG+++ + RIE
Sbjct: 176 RGRESFAWGPIIKADPQGRCGAVLVYGLQMIILKAAEVGQSLVGEDEPTRMLSSTAVRIE 235
Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
SS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAGR+S K TCM+SA SIS
Sbjct: 236 SSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAGRISSKSQTCMLSAFSISMG 295
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
LKQHP+IWSA LPHDAY+LLAVP PI G+LV+ AN+IHYHSQS SC+LALN+++ D
Sbjct: 296 LKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYHSQSTSCSLALNSFSSQPD 354
>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
Length = 1626
Score = 428 bits (1100), Expect = e-116, Method: Compositional matrix adjust.
Identities = 428/1603 (26%), Positives = 653/1603 (40%), Gaps = 376/1603 (23%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +H PT + +C + ++TH++ Q P+P+LVV +
Sbjct: 7 AVCTQVHPPTAVTHCTAAWLTHAQRQ--------QGSGSADGDDGGGSGDPLPDLVVVRS 58
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
+E+Y VR E G + ++ A SL+ + RL G ES+A+L +G
Sbjct: 59 TQLELYSVRGSEAGGPATTHT-------------AQSLDQLASCRLFGVAESVAVL-RGR 104
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
A +RD ++L F DAK+SVL +D H L +S+H FE LK GR F PL
Sbjct: 105 APG--QRDVLLLTFRDAKLSVLHWDAGRHELAPSSLHYFEGDA--SLKLGRTVFPYPPLA 160
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG-------------------- 223
DP GRCG V+++ Q+ +L A D + FG G
Sbjct: 161 VTDPLGRCGAVIIFRHQLAVLPAV--------DSELFGLGLSAAEEDEEEAAATAALGLA 212
Query: 224 --------------------------GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIE 257
+A + +S+V NL +K V+D F+HGY E
Sbjct: 213 PPDGGGAADGEAGAPRGGAAAAAAGLPAAAAAVGNSYVDNLGKAGIKEVRDACFLHGYSE 272
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
PV+++LHE E TWAG + K TC+++ALS++ T K HP IW A LP DAY+L A +P
Sbjct: 273 PVLMVLHEAEPTWAGNLRQKKDTCVLTALSLNLTRKHHPKIWGAQELPSDAYRLSA--AP 330
Query: 318 IGGVLVVGANTIHYHSQSASCALALN---------------------------------- 343
GGVLV+ + + ++ Q + L+
Sbjct: 331 CGGVLVLCQHLVLHYRQGQQSGVVLHPSALPPAAAPPPLLFDPQAMAEAGGPGPASAAYA 390
Query: 344 -NYAVSL------------DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
+AV + D+SQ ++ V D A WL + ALL ++G L+ L
Sbjct: 391 RQHAVDVHPETVPAAVRFCDASQA---AALKVTADGASVCWLSPESALLCLRSGQLLQLA 447
Query: 391 VVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS----------------------LFFLG 426
++ G + L +++ + S ++ + L FLG
Sbjct: 448 LLPQQAGGSARHLAVARAGAAPHPSCCCSLSGAHRAPHMPGSAAAAAAGQAPQPALVFLG 507
Query: 427 SRLGDSLLVQFT----CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--- 479
S GDSLLV+ T G+ ++ D AD P++KRLR +
Sbjct: 508 SAAGDSLLVRATPAAAAGTKRPAEAATGAAGEEDGTADEPASKRLRLEGIEVGSAAAAVE 567
Query: 480 --------------------------------NGEELSLYGSA--------------SNN 493
EE +YG+A +
Sbjct: 568 ATAAAAAAAQGAAAAAAEARAAAGGGPAGSDSEDEEALIYGTALYSSAAGVAPAAAAAVP 627
Query: 494 TESAQ-KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWT 552
T S Q + + V DSL NIGPL+DF+ A A G + G G T
Sbjct: 628 TPSWQLQRYQLKVLDSLANIGPLRDFAVA---EPAAGAGGEAVPPALVGCSGEGKGGTLT 684
Query: 553 VYHKS----------------SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
V +S G + A + +HAYL++S + T VL T + L
Sbjct: 685 VLRRSVVPDVITEHRGAASASGGGSGQAAGEAAGQEGGHHAYLLLSFQGATKVLATGEEL 744
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD-----LSFGPSNSE 651
EVTESV++ V T+AAG++ RR+ Q F +G R+LDG QD L+ + +
Sbjct: 745 REVTESVEFAVDTPTLAAGSVCCGRRIAQAFPQGLRLLDGEESVQDVWASELAAPAAAAA 804
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQT--------------- 696
+G ++S + DPYVLL ++DG+ R L DP C +S +
Sbjct: 805 AGGAPGGGAIVSADMCDPYVLLYLADGTARFLTADPVACRLSAASAAGAGPEAAAAAEAA 864
Query: 697 -----PAAIESSKKPVSSCTLYHD-------KGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
P A E +++C+L+ D + P+ + G A
Sbjct: 865 EAALRPVAAEER---ITACSLFADSCGWLAARLPQTQQQTQQQQQQQGQQDGGTTAQAAA 921
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
G +Y+VVC SGA +++ +P + VF+ ++G ++ T A +
Sbjct: 922 SGGGCGAVYAVVCRASGACQLYALPAWQPVFSSSTSLAGGPALL-TGSGGAGGVAAAAAA 980
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-----------------PFLFAILTD 847
+++ E +VVE+ + + + P L A+ D
Sbjct: 981 AAAAAAAAGVEDEMDGPGEVVEVRLVSFGPAAAGRRDAAAARASPAPACEPPLLLALTAD 1040
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-----NLRFSRTPLDAYTR 902
+L YQA+ ++ T R + L LR R
Sbjct: 1041 HQLLAYQAFSASPGSGGTRGSSGSGTPRFRRLRLDLPPLLPPAGGPQLRLRRLHCFEGLG 1100
Query: 903 EETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD---------GS 953
EE P+ G F++G P W + R L HP
Sbjct: 1101 EEAPY----------------SGVFVAGQHPHWLVASRGGLLPHPHFLPQPAGPGAAAVG 1144
Query: 954 IVAFTVLHNVNCNHGFIYVTS--QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
FT HNVNC HGFI TS + ++I QLP + D WP Q+V +K TP ++ ++
Sbjct: 1145 AAGFTPFHNVNCPHGFIVATSGARSGIQISQLPPRTRLDAPWPRQRV-SIKGTPLKVAHY 1203
Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1071
AE +++ VLS + G + +E +EVR + P
Sbjct: 1204 AEADMF-------------AVLSSRQGRARGRGV---------------MEGHEVRWVWP 1235
Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
GG WQ + E AL+V V L + T LLA+G A GED GR+L
Sbjct: 1236 ---GGGWQGVGRHQRRPGERALSVGAVRLKDHATGATVPLLAVGAALPAGEDYPCGGRLL 1292
Query: 1132 LFSTGRNADNP----QNLVTEVYSKELKGAISALASLQGHLLIASGPKI----------- 1176
LF R Q +Y++E KG +++++ L+G+LL+ASG +I
Sbjct: 1293 LFEVTRGDGGGGGGGQWAGRLIYTREFKGPVTSVSGLEGYLLLASGNRIETCSLSSTTIT 1352
Query: 1177 ------ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
+ T ++ AFYD P L + SLN+VKNF+LLGD S+ F+ +K++G QL
Sbjct: 1353 STADDGTVAATTTWKVQRSAFYDGPVL-LTSLNVVKNFVLLGDCQHSVQFVRYKDEGRQL 1411
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1290
+LL+KDF D AT+FLI+GS+L L D +++ YAP SWKGQ+L++ FH
Sbjct: 1412 SLLSKDFNRADTAATQFLINGSSLHLASCDSAGTLRLLSYAPSHPASWKGQRLVAWGSFH 1471
Query: 1291 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
VG + RL++ +S + D+T R A+L + GS
Sbjct: 1472 VGEAASCMRRLRLHPSSPE--------DRTVRQAVLLSSAAGS 1506
>gi|449477808|ref|XP_004155129.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 643
Score = 404 bits (1039), Expect = e-109, Method: Compositional matrix adjust.
Identities = 188/255 (73%), Positives = 220/255 (86%), Gaps = 1/255 (0%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFAAY+MMHWPTGI NC S +ITHSRAD+VP + +++LDS+ +R IGPVPNLVV
Sbjct: 1 MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTS-HSDDLDSDWHPRRDIGPVPNLVV 59
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TA NV+E+YVVRV EEG +ESK+SGE KR +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60 TAGNVLEVYVVRVLEEGGRESKSSGEVKRGGIMDGVSWASLELVCHYRLHGNVESMAILS 119
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239
Query: 241 LDMKHVKDFIFVHGY 255
LD+KHVKDF+FVH Y
Sbjct: 240 LDVKHVKDFVFVHVY 254
Score = 368 bits (945), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 189/258 (73%), Positives = 208/258 (80%), Gaps = 26/258 (10%)
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
GDIE DA + KR+RRSSSDALQDMV G+ELSLYGSA+NNTESAQK FSFAVRDSL+NIGP
Sbjct: 342 GDIEVDAHTAKRMRRSSSDALQDMVGGDELSLYGSAANNTESAQKIFSFAVRDSLINIGP 401
Query: 515 LKDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCK 548
LKDFSYGLRINAD +ATGI+KQSNYELV ELPGCK
Sbjct: 402 LKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEVELPGCK 461
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
GIWTVYHK++RG ADSSRM DDEYHAYLIISLEARTMVL T +LLTEVTESVDYFV
Sbjct: 462 GIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIISLEARTMVLVTGELLTEVTESVDYFVH 521
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQDL+ + +ESG+ SE TVLS SI+D
Sbjct: 522 GRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNLVVNGNESGNASEGCTVLSASISD 581
Query: 669 PYVLLGMSDGSIRLLVGD 686
PYVLL M+DGSIRLLVG+
Sbjct: 582 PYVLLTMTDGSIRLLVGE 599
>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
Length = 1105
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 325/1158 (28%), Positives = 539/1158 (46%), Gaps = 181/1158 (15%)
Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T +
Sbjct: 2 VLTLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT----EKLQ 57
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG------EELSLYGS-ASNNTESAQK 499
+ ++E + E + + +R S+ G +E+ +YGS A + T+ A
Sbjct: 58 ETPMEEGKENEEKEKEPPNKKKRVDSNWAGCPKKGNLPDELDEIEVYGSEAQSGTQLA-- 115
Query: 500 TFSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV---- 542
T+SF V DS++NIGP S G + N + +G K ++
Sbjct: 116 TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSI 175
Query: 543 --------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIIS 582
ELPGC +WTV + + G + + + D + H +LI+S
Sbjct: 176 RPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILS 235
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
E TM+L+T + E+ S + QG T+ AGN+ + +IQV G R+L+G
Sbjct: 236 REDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQ 291
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG---------------SIRLLVGDP 687
L F P + S ++ S+ADPYV++ ++G S RL + P
Sbjct: 292 LHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKP 344
Query: 688 STCT------------------------------VSVQTPAAIESSKKPVSSCT------ 711
T ++++T + E+ + +S+
Sbjct: 345 QIHTQSRVITLCAYRDVSGMFTTENKVSFLAKEEIAIRTNSETETIIQDISNTVDDEEEM 404
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
LY + P K + + G + + ++ E+G +EI+ +P++
Sbjct: 405 LYGESNPLTSPNKEESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPDW 464
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
VF V F G+ +VD+ + S T+ EE T QG +I +K E+A+
Sbjct: 465 RLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVALVS 515
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
HSRP+L A + + +L Y+A+ ++ + + S L+ +R
Sbjct: 516 LGYSHSRPYLLAHV-EQELLIYEAFPYDQQQ--------------------AQSNLK-VR 553
Query: 892 FSRTPLDAYTREET--------PHG---------APCQRITIFKNISGHQGFFLSGSRPC 934
F + P + RE+ P G R F++ISG+ G F+ G P
Sbjct: 554 FKKMPHNINYREKKVKVRKDKKPEGQGEDSLGVKGRVARFRYFQDISGYSGVFICGPSPH 613
Query: 935 WCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
W +V R +R+HP DG+I +F+ HN+NC GF+Y QG L+I LP+ +YD W
Sbjct: 614 WMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPW 673
Query: 994 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
PV+K IPL+ T H ++Y E +Y + SV +P + I + G + + +
Sbjct: 674 PVRK-IPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR-----IPRMTGEEKEFETIERD 725
Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLL 1112
+ + +++ ++++ P TR + ++ E+ ++ V L + T + +
Sbjct: 726 ERYIHPQQDKFSIQLISPVSWEAIPNTR--VDLEEWEHVTCMKTVALKSQETVSGLKGYV 783
Query: 1113 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGH 1167
A+GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL G
Sbjct: 784 ALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCSGF 843
Query: 1168 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
L+ A G KI L +L G+AF D LY+ + +KNFIL D+ KSI L ++ +
Sbjct: 844 LVSAIGQKIFLWSLKYNDLTGMAFIDTQ-LYIHQMYSIKNFILAADVMKSISLLRYQPES 902
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
L+L+++D L+ ++ EF++D + L +VSD KN+ ++ Y P+ ES+ G +LL RA
Sbjct: 903 KTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRA 962
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1347
+F+VG+HV F R+ T A D N+ F TLDG +G + P+ E T+RR
Sbjct: 963 DFNVGSHVNAFWRMPCRGTLDTANKKALTWD--NKHITWFATLDGGVGLLLPMQEKTYRR 1020
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1407
L LQ L +PH AGLNP++FR H + + + +I+D ELL+ Y L E+ E+
Sbjct: 1021 LLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILDGELLNKYLYLSTMERSEL 1080
Query: 1408 AHQTGTTRSQILSNLNDL 1425
A + GTT IL +L ++
Sbjct: 1081 AKKIGTTPDIILDDLLEI 1098
>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
Length = 1595
Score = 380 bits (977), Expect = e-102, Method: Compositional matrix adjust.
Identities = 306/1012 (30%), Positives = 469/1012 (46%), Gaps = 164/1012 (16%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQSNY---------ELV 542
+ F+V+DSL+ I P+ D + G + D +A G K ELV
Sbjct: 668 YKFSVKDSLLCISPVVDLTVGASAPVGTDLDPRTELVAACGHGKNGALAILTRGITPELV 727
Query: 543 ------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA--RTMVLETAD 594
LPG + W + N + R D+ + +LI+SL + TMVLET +
Sbjct: 728 TEVESGALPGLRACWAT---RTEDDNDGTVRPKRKDELFDEHLILSLSSTKTTMVLETGE 784
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG- 653
L EV++ VD+ V T+A +F R + QV + R + + F + +
Sbjct: 785 ELREVSKEVDFIVDEETLACERIFNGRAIAQVTKTKIR-----FTRKGKKFAVDDIDLAF 839
Query: 654 -SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-------ESSKK 705
G E + + I + + L +SDGSIR+++GD T T ++ ++
Sbjct: 840 LKGGEGAQITLAIIQNDAIALRLSDGSIRIILGDSKTNTFTLLEKVGELFASDNHSNTGS 899
Query: 706 PVSSCTLYHD----------------KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
V++ TLY D + P WL +T + G E D + +
Sbjct: 900 DVTAFTLYDDSVACTDSFGGGGGGLNRAP-GWLERT------ACGDREEKDESK----EN 948
Query: 750 GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
++ G L ++ +P+ +++ G RE L + T I+S
Sbjct: 949 NNVVFATISRDGTLALYSLPSLKKLWSSGGVSDG---------REILAPNSTGIDSIDFN 999
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
+ K + +++ A +A + RP L DG++L YQA F+ P +
Sbjct: 1000 DECEVEKYTVSDIRLDAFA----NAAYERPLLTCFRADGSVLAYQA--FKSPSSN----- 1048
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT--REETPHGAPCQ---RITIFKNIS--- 921
LRF+R P++ T E T + Q R+T +NI
Sbjct: 1049 -------------------ELRFARVPIEIETAGSELTNNDVSVQGGSRLTRIENIGDGR 1089
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI-VAFTVLHNVNCNHGFIYVTSQGILKI 980
G G F+SG P W +V R R+ P +G +AF HNVNC GFI T++G +++
Sbjct: 1090 GIAGVFVSGLNPIWLIVRRGRVLALPTRGEGGARIAFAPFHNVNCPKGFILATNEGGIRV 1149
Query: 981 CQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
C+LP + WPV+K + L+ TP ITY + LY L+ S V P D E
Sbjct: 1150 CRLPGKMHIEAQWPVRK-LALRCTPRAITYMNDFKLYALVTSASV--PWK-------DFE 1199
Query: 1041 VGHQIDNHNLSSVDLHRTYT------VEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
+ + D+H + + V+++ +R+L P WQ ++ E+ L
Sbjct: 1200 I-DETDSHARALYRFRKEKAKSEGNVVQQFAIRLLVPGTLETAWQK----AVEPGEHILC 1254
Query: 1095 VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST--GRNADNPQNLVTEVY-S 1151
V+ V + + +T ++LAIGTA GED RGR+LLF+ R D E+
Sbjct: 1255 VKNVQIRDQSTGALLSMLAIGTAMPGGEDTPCRGRILLFAIMWERARDGGVRWRGELKCE 1314
Query: 1152 KELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILL 1211
K K A SA+ S+ G ++A G K+ H W G LN IAFYD PLY +L VKNF+L
Sbjct: 1315 KPSKMACSAIESVDGTFMVAIGTKLTAHSWDGKHLNPIAFYDT-PLYTTTLCCVKNFLLC 1373
Query: 1212 GDIHKSIYFLSWKE-QGAQ-LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
GD+HKSI F+ WK+ QG + L+ L KD+ LDC A+EF+IDG TLSL+ +D N +F
Sbjct: 1374 GDLHKSIRFVRWKDSQGEKTLSQLGKDYEVLDCIASEFMIDGGTLSLLAADANGNAHVFQ 1433
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
YAPK++ESWKG KLL ++ +H G+ + K +R Q+ G K NR A+ FG+
Sbjct: 1434 YAPKLAESWKGDKLLPKSAYHAGSLIRKMVRFQI----------GVGEQKQNRHAVFFGS 1483
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA------GLNPRSFRQFHSN--GKAHR 1381
DG +G +P+DE TF L+ LQ + ++ + GLN +++R S+ A +
Sbjct: 1484 SDGGLGIFSPVDEHTFLNLEKLQDAMRSNIVASSNSINPLGLNSKTYRALKSSEGSVARQ 1543
Query: 1382 PGPDSIVDCELLSHYE-MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
P +IVD LLS +E L + Q +A + G TR Q LS + SF+
Sbjct: 1544 TPPRTIVDGGLLSKFEHSLSITAQTRVAAKAGLTRDQALSLARTIIAEQSFM 1595
Score = 142 bits (358), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 100/325 (30%), Positives = 163/325 (50%), Gaps = 72/325 (22%)
Query: 181 PLV-KVDPQGRCGGVLVYGLQMIILK----------------ASQGGSGLVGDEDTFGSG 223
P++ + DP+GRC VL+ + +K +S G + ++ G G
Sbjct: 208 PIIGRADPEGRCAAVLLRNEEKAKVKIMPASETSTSSNYIKESSNGSKKMTTKKE--GEG 265
Query: 224 GGF-SARIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKH 278
+ A I SS +++R + V+D F+HGY EPV++IL+E TW+GR+S +
Sbjct: 266 TVYVPATIGSSFDLDVRKILGPSAAFVRDCCFLHGYGEPVLMILYESNPPTWSGRLSLRM 325
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC + A+SI T K++ ++W+ LP AY L VP+P+GGVLV+ + I Y SQS+S
Sbjct: 326 DTCKLVAVSIDCTKKKYTIVWTREKLPSAAYSLFPVPNPLGGVLVLSSGHILYESQSSSA 385
Query: 339 ALALN----------NYAVSLD------------------------SSQELPRSSFSVEL 364
+ N+A + SS E ++ F V+L
Sbjct: 386 TYISDFLGKGGPQEGNFAEEIARNNGVEGQAAHANPVPHVNSNKNVSSYETTQNEFQVQL 445
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTVVYD------------GRVVQRLDLSKTNPSVLT 412
DAA ++ +VA++S+KTG L+ TV+ + GR +R+ + K+ +VL+
Sbjct: 446 DAAKIEMIRENVAIISSKTGQLI--TVILETVGGAASVGSKVGRRCRRIRVLKSGNAVLS 503
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
S + +G L F+GSR+GDSLL+ +
Sbjct: 504 SGLAAVGKDLLFIGSRVGDSLLIGY 528
>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1744
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 372/1499 (24%), Positives = 633/1499 (42%), Gaps = 338/1499 (22%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 278 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 337
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A +
Sbjct: 338 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 397
Query: 352 SQELPRS------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
P S +V L +LQ LL+ +G + +L++ Y+
Sbjct: 398 QSVFPLSEAVYETPEHETVQLNVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSR 457
Query: 395 ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
GR L L S+ S + F+GSR GDS+L
Sbjct: 458 GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 516
Query: 444 SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ K+E I + AP K S A ++ + ++L LYG+A E A
Sbjct: 517 TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 571
Query: 498 QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
+ + + +R D L +IG + G+ NAD+
Sbjct: 572 ATSSTECTNGVGVSSVKTEENGAPEQDTGSYDYELRQIDVLPSIGQITSIELGVENNADS 631
Query: 529 S--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
+ + G + ++ EL GC+ +WTV + R
Sbjct: 632 NEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATRSSDGR- 690
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R++Q+F+
Sbjct: 691 -----SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGSTLAAANLFNKQRIVQIFK 745
Query: 629 RGARIL------------------DGSYM----------------TQDLSFGPSNSESGS 654
+GAR++ +G+ TQ+++ G
Sbjct: 746 QGARVMMEVPEEETSNGQEKSAKTEGAEDEEEDDEDDGPRVKLVCTQEITLEGDVECGGM 805
Query: 655 GSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAI 700
+ S+V +SV + DPY+LL ++DGS+RLL+GD +SV P +
Sbjct: 806 NVDTSSVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDLELSVIDPEIDYAEGISEANGSA 865
Query: 701 ESSKKPVSSCTLYHD----------------------------------------KGPEP 720
+ SK SS L++D P P
Sbjct: 866 DMSKHGSSSACLFYDWAGMFVENAWVEEEQEERHEATQSRAKRAEDDDDMDALYSSKPSP 925
Query: 721 WLRKT-STDAWLSTGVGEAIDGADGGPLDQ---GDIYSVVCYESGALEIFDVPNF----- 771
+ T +T + ST DG+ PL Q + +C+ G+L +F +P+F
Sbjct: 926 KVATTNATKSTPSTATPRNEDGSVSIPLLQQKDAKMMCGMCFGDGSLHVFSLPDFKKRGV 985
Query: 772 ---------NCVFTVDKFVSGRTHIVDTYMREALKDSETEIN-SSSEEGTGQGRKENIHS 821
+ V T++ + GR V L +N S+S G+ +K + +
Sbjct: 986 FPYLTFAPQSLVNTLEHYQVGRNKTVK------LSAPVLGLNASTSSANDGRIKKSHTIN 1039
Query: 822 MKVVELAMQRW--------SAHHSRPFLFAILTDGTILCYQAY-LFEGPENTSKSD-DPV 871
V ++ + R + + SR + L +G +L Y A FE + + + PV
Sbjct: 1040 SPVADIVIHRVGPSEGQHNAQYLSRMVMLVFLANGDLLMYSAAPKFESLKPRANGEIAPV 1099
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI---------TIFKNISG 922
+ ++ L + +A E A ++ T F N++
Sbjct: 1100 FHFVRVGTELITRPFLPPKARTNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTCFHNVNN 1159
Query: 923 HQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNHGFIYVTSQG 976
G F G+ P W + R P +C+ + +++FT H+ NC +GFIY S+G
Sbjct: 1160 MSGAFFRGAHPMWILGDRGHASFVP-MCNAAPRVSVPVLSFTSFHHWNCPNGFIYFHSRG 1218
Query: 977 ILKICQLPSGSTYDNYWP-----VQKVIPLKATPHQITYFA-----------EKNLYPLI 1020
L++C+LPS T P V + AT H + Y E Y ++
Sbjct: 1219 ALRVCELPSSKT-STILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAEALEAPTYAVV 1277
Query: 1021 VSV--------------------------PVLKPL-NQVLSLLIDQEVGHQIDNHNLSSV 1053
S P PL + V++ + ++ D
Sbjct: 1278 CSARLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFADYETD------- 1330
Query: 1054 DLHRTYTVEE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT------ 1104
H +T E+ YE+R+++ D G W R + + E L+V+++ L++++
Sbjct: 1331 --HMAHTEEDVYELRLVQTDEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEEV 1387
Query: 1105 -------TKENETLLAIGTAYV--QGEDVAARGRVLLFST----------GRNADNPQNL 1145
K+ L +GT +V GED + RGR+LL+ G + L
Sbjct: 1388 ASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGATSGKLPKL 1447
Query: 1146 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1205
+ +GAIS ++ L ++L A G K+I++++ +L G AFYDA +Y+V+L++V
Sbjct: 1448 RLVFIKEHRQGAISMVSQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MYIVTLSVV 1506
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1265
K+F++ GD++KS++FL W+E QL LLAKD+ L ATEF + L+L+ D +N+
Sbjct: 1507 KDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDENL 1566
Query: 1266 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKT 1320
+ +AP+ ES GQ+LL ++FH+G V+ R ++ A+ S + AAP S+
Sbjct: 1567 HVMQFAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYV 1626
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
N + GT +G +G + P+ E FRRL +LQ +V+++P LNPR FR +N +
Sbjct: 1627 N----VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRR 1682
Query: 1381 RPGPDS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
PD+ +D +L + L Q E+A GTT ++ NL ++ TS
Sbjct: 1683 CGRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATS 1741
>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 1561
Score = 368 bits (944), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 370/1485 (24%), Positives = 629/1485 (42%), Gaps = 323/1485 (21%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 108 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYYLTVISINMK 167
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 168 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 227
Query: 350 -------DSSQELPR---SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
D+ E P + +V L +LQ+ LL+ G + +L++ Y+
Sbjct: 228 QSVFPLSDAVYETPEHETAQLNVVLYDCQFEYLQDKELLLTMPCGQVYVLSLPYEDTSSR 287
Query: 395 ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
GR L L S+ S + F+GSR GDS+L
Sbjct: 288 GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 346
Query: 444 SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ K+E I + AP K S A ++ + ++L LYG+A E A
Sbjct: 347 TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 401
Query: 498 QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
+ + + +R D L +IG + G+ NAD+
Sbjct: 402 ATSSTECTNGVGVSSVKTEENGAPEQDTGPYDYELRQIDVLPSIGQITSIELGVENNADS 461
Query: 529 S--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
+ + G + ++ EL GC+ +WTV + R
Sbjct: 462 NEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATRSSDGR- 520
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R++Q+F+
Sbjct: 521 -----SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIVQIFK 575
Query: 629 RGARIL------------------DGSYM----------------TQDLSFGPSNSESGS 654
+GAR++ +G+ TQ+++ G
Sbjct: 576 QGARVMMEVPEEETSNGQEKSGKAEGAEDEEEDDEDDGPRVKLVCTQEITLEGDVECGGM 635
Query: 655 GSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAI 700
+ S+V +SV + DPY+LL ++D S+RLL+GD +SV P +
Sbjct: 636 NVDTSSVGIVSVDVVDPYILLLLTDVSVRLLMGDEEDLELSVIDPEIDYAEGISEANGSA 695
Query: 701 ESSKKPVSSCTLYHD-------------KGPEPWLRK-TSTDAWLSTGVGEAIDGADGGP 746
+ SK SS L++D P P + +T + ST DG+ P
Sbjct: 696 DMSKHGSSSACLFYDWAEDDDDMDALYSSKPSPKVATMNATKSMPSTATPRNEDGSVSIP 755
Query: 747 LDQ---GDIYSVVCYESGALEIFDVPNF--------------NCVFTVDKFVSGRTHIVD 789
L Q + +C+ G+L +F +P+F + V T++ + GR V
Sbjct: 756 LLQQKDAKMMCSMCFGDGSLHVFSLPDFKKRGVFPYLTFAPQSLVNTLEHYQVGRNKTVK 815
Query: 790 TYMREALKDSETEIN-SSSEEGTGQGRKENIHSMKVVELAMQRW--------SAHHSRPF 840
L +N S+S G+ +K + + V ++ + R + + SR
Sbjct: 816 ------LSAPALGLNASTSSANDGRIKKSHTINSPVADIVIHRVGPSEGQHNAQYLSRMV 869
Query: 841 LFAILTDGTILCYQAY-LFEGPENTSKSD-DPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
+ L +G +L Y A FE + + + PV + ++ L + +
Sbjct: 870 MLVFLANGDLLMYSAAPKFESLKPRANGEIAPVFHFVRVGTELITRPFLPPKARTNAHNE 929
Query: 899 AYTREETPHGAPCQRI---------TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQL 949
A E A ++ T F N++ G F G+ P W + R P
Sbjct: 930 AGNNPEVNTSAVLAKLRAGFRYPMLTCFYNVNNMSGAFFRGAHPMWILGDRGHASFVPMC 989
Query: 950 CDGS-------------------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
S +++FT H+ +C +GFIY S+G L++C+LPS T
Sbjct: 990 VPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSCPNGFIYFHSRGALRVCELPSSKT-S 1048
Query: 991 NYWP-----VQKVIPLKATPHQITYFA-----------EKNLYPLIVSV----------- 1023
P V + AT H + Y E Y ++ S
Sbjct: 1049 TILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAEALEAPTYAVVCSARLKPADADRAT 1108
Query: 1024 ---------------PVLKPL-NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEV 1066
P PL + V++ + ++ D H +T E+ YE+
Sbjct: 1109 EVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFADYETD---------HMAHTEEDVYEL 1159
Query: 1067 RILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT-------------TKENETL 1111
R+++ D G W R + + E L+V+++ L++++ K+
Sbjct: 1160 RLVQTDEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEEVASTSPEWNKKKRPY 1218
Query: 1112 LAIGTAYV--QGEDVAARGRVLLFST----------GRNADNPQNLVTEVYSKELKGAIS 1159
L +GT +V GED + RGR+LL+ G + L + +GAIS
Sbjct: 1219 LVVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGATSGKLPKLRLVFIKEHRQGAIS 1278
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
++ L ++L A G K+I++++ +L G AFYDA +Y+V+L++VK+F++ GD++KS++
Sbjct: 1279 MVSQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MYIVTLSVVKDFVMYGDVYKSVH 1337
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
FL W+E QL LLAKD+ L ATEF + L+L+ D +N+ + +AP+ ES
Sbjct: 1338 FLRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRG 1397
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFALLFGTLDGSI 1334
GQ+LL ++FH+G V+ R ++ A+ S + AAP S+ N + GT +G +
Sbjct: 1398 GQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN----VMGTSEGGV 1453
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS-------- 1386
G + P+ E FRRL +LQ +V+++P LNPR FR +N + PD+
Sbjct: 1454 GALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRILKTNAQRRCGRPDAWSKKKWKK 1513
Query: 1387 -IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
+D +L + L Q E+A GTT + NL ++ TS
Sbjct: 1514 SFLDAFVLFRFLQLDYVAQKELARCIGTTPEVAMHNLLEVQHATS 1558
>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 1241
Score = 354 bits (908), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 309/1106 (27%), Positives = 494/1106 (44%), Gaps = 149/1106 (13%)
Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+LT+ DG R V+ + K SVLT+ +T FLGSRLG+SLL+ +T M
Sbjct: 198 VLTLFNDGMRSVRNFNFDKAAASVLTTSMTLCEEGYLFLGSRLGNSLLLHYT-EKAAEME 256
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
+G KE+ ++ D +++ +EL +YGS + T+ +++F V
Sbjct: 257 EAGKKED---------------KAEGDVNVALIDPDELEVYGSETLATKQL-TSYTFEVC 300
Query: 507 DSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV----------- 542
DSL+NIGP G N+D + G K ++
Sbjct: 301 DSLINIGPCGKICMGEPAFLSEEFTQNSDPDLELVTTAGYGKNGALCVLQRSVRPQVVTT 360
Query: 543 -ELPGCKGIWTVYHKSSRGHNA------DSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
ELPGC +WTV + D A HA+LI+S +M+L+T
Sbjct: 361 FELPGCVHMWTVMGPPTEKKKKEASEESDEQAADATLTNTHAFLILSRADSSMILQTDQE 420
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ S + Q T+ AGNL R V+QV G R+LDG+ Q +
Sbjct: 421 INELDHS-GFSTQNPTVFAGNLGDGRYVLQVCPMGVRLLDGTRQLQHIPL---------- 469
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTP--AAIESSKKPVSSC 710
S ++ S+ADP+VL+ G + L GDP++ C ++V P A+ S + +C
Sbjct: 470 DVGSPIVGGSLADPHVLIRSEGGLVVHLTLRGDPASGCRLAVLRPQLTAVVSHRANALTC 529
Query: 711 --------------TLYHD----KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
LY D + + +R T+ + + + + P
Sbjct: 530 HCIAVSGVLDDEDELLYGDSEDTRATKEPVRVTAMET--ESETANVFELKEVKP----TF 583
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGT 811
+ V E+G LEI+ +P++ F V F G+ +VD+ A +++E ++ S E
Sbjct: 584 WVFVARENGVLEIYSLPDYKLCFLVKNFPMGQRVLVDSVQMTAPSGTKSEKLSDMSHE-- 641
Query: 812 GQGRKENIHSMKVV-ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
M VV E+ M SRP L A + D +L Y+A+ F +
Sbjct: 642 ---------CMPVVHEILMVGLGVRQSRPLLLARV-DEDLLIYEAFPFYETQREGH---- 687
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
L ++ + R +T EE + + F +ISG+ G FL G
Sbjct: 688 ----LKLRFKKLNHDIILRSRKYKTQKPENEEEEKAFQSRLW-LQPFSDISGYSGVFLCG 742
Query: 931 SRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-----ILK---IC 981
RP W M R LR HP DG + F HNVNC GF++ Q +L +
Sbjct: 743 HRPHWLFMSSRGELRYHPMFVDGPVYCFAPFHNVNCPKGFLHFNKQSDSYALLLHSYWLS 802
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
QLPS + K H+ +FA + + P L + D
Sbjct: 803 QLPSPKRHGERLLFNCPSHKKICIHRCHFFALQQKAADFLWPPPFVTTVSPLPFVADSR- 861
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVT 1099
+ T++++ +++L P W+T + + E+ ++ V
Sbjct: 862 --------------YIFPTMDKFSLQLLSPVS----WETIPNTRVDLDEWEHLTCIKNVM 903
Query: 1100 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1153
L + T + LA+GT Y GEDV +RGR+ + P +N + VYSKE
Sbjct: 904 LSSEGTSTGMKGYLALGTNYCYGEDVTSRGRITILDIIDVVPEPGQPLTKNKIKIVYSKE 963
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
KG ++AL+ + G LL A G K+ + + L G+AF D +Y+ S+ VKN IL+GD
Sbjct: 964 QKGPVTALSQVVGFLLSAIGQKMYIWQLKDNGLVGVAFIDTQ-IYIHSVVTVKNLILVGD 1022
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ KS+ L ++E L+L+++D L+ FA EF ID S +S +V+D ++N+ ++ Y P+
Sbjct: 1023 VFKSVSLLRYQEASRTLSLVSRDVRPLEVFAVEFFIDNSQMSFLVTDSERNMILYMYQPE 1082
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
ES GQ+LL R +FH+G+ V R++ + + R + TLDGS
Sbjct: 1083 SRESCGGQRLLRRGDFHIGSPVVSMFRIKCRMGEVAKHDRRLAASVDGRHITMLATLDGS 1142
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+G + P+ E T+RRL LQ LV ++PH AGLNP++FR +HS + +I+D EL+
Sbjct: 1143 LGYVLPVPEKTYRRLLMLQNVLVTNMPHYAGLNPKAFRMYHSQRRVLGNPHKNILDGELI 1202
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQIL 1419
+ L E+ E++ + GTT +Q++
Sbjct: 1203 WKFMHLSFMERSELSKKIGTTVTQVV 1228
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+++VDP RC +LV+ + ++ + + +E G G + + + L +
Sbjct: 58 PMIRVDPCNRCAAMLVFSRTIAVVPFRKDTAA---EEQETGPTFGNKPPLLDWYPVALTE 114
Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
LD K +V D F+HGY EP ++IL+E TW G+
Sbjct: 115 LDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGK 149
Score = 40.0 bits (92), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+ L +LD K +V D F+HGY EP ++IL+E TW G V + M S + +
Sbjct: 158 VALTELDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGYVLTLFNDGMRSVRNFNFDKA 217
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
++ ++M L + Y L S +G L+ +HY ++A
Sbjct: 218 AASVLTTSMTLCEEGYLFLG--SRLGNSLL-----LHYTEKAA 253
>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Metaseiulus occidentalis]
Length = 1500
Score = 350 bits (898), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 276/966 (28%), Positives = 442/966 (45%), Gaps = 158/966 (16%)
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
ELPGC +WTV S+R + D ++ H +LI+S TM+L+T + E+ S
Sbjct: 603 ELPGCTDLWTVRSSSTRSPDVD--------EDSHQFLILSRPDSTMILQTGQEINELDHS 654
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
+ Q TI AGNL R +IQV R+L+G Q + S ++
Sbjct: 655 -GFCTQSPTIFAGNLADGRYIIQVCPNSVRLLEGVKQLQQVPI----------DVGSPLV 703
Query: 663 SVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTPAAIESSKKPVSSCTLYHD---- 715
S SIAD +VL+ DG + L GD +T +SV P +K +++ +Y D
Sbjct: 704 SASIADLHVLVMSQDGLVIQLTLRGDDTTGYKLSVLKPQ-FPGAKSKITALCIYKDVSGL 762
Query: 716 -----KGPEPWLR-KTSTDAWLSTGV------------------GEAIDGAD--GGPLDQ 749
+ PE + KT + T V G ++D D G L+
Sbjct: 763 FVTKIQKPEDIAKPKTEAKTKVKTEVAKKVLRSADFDDEDELLYGSSVDIKDLVAGGLNA 822
Query: 750 GDI-----------------------------YSVVCYESGALEIFDVPNFNCVFTVDKF 780
+I + + E+GALEI+ P++ + V F
Sbjct: 823 ANIVPTTQTKDTAEEEDYEENVRKIAPVEPTFWVFLARENGALEIYSFPDYKLRYFVKNF 882
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+ + ++ A +T S+SE KV+E+ + H SRP
Sbjct: 883 -----PLCNKILQNAAATGQTTSASTSEAQLP----------KVMEIFVCALGMHQSRPL 927
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR------ 894
LFA + D + Y+AY F N L+ L+F R
Sbjct: 928 LFARV-DSELHIYEAYPF---------------------VNQKEGHLK-LQFRRLQHAVT 964
Query: 895 -TPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDG 952
P Y ++E + I F+++ G+ G F+ G RP W + R LR HP L DG
Sbjct: 965 MEPRRVYKQKEGDPTLSLRWIRAFQDVCGYNGVFVCGRRPHWIFLTARGELRAHPMLNDG 1024
Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFA 1012
I +F HNVNC GF++ G L+IC LPS YD WP++K IP+ TPH + Y
Sbjct: 1025 RIYSFATFHNVNCEKGFLFFNKYGELRICALPSYLNYDAPWPMRK-IPIYETPHSVNYHV 1083
Query: 1013 EKNLYPLIVS-------VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1065
+ Y + S VP L ++ I++E I TV+++
Sbjct: 1084 DSRTYCVATSKEETATCVPKLANEDKEFEP-IERESSRFIPP------------TVDKFA 1130
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDV 1124
+ + P TR +PM+ E V+ V + + T E L+A+GT + GED+
Sbjct: 1131 LELWSPVSWEAIPNTR--MPMEDWEKITCVKNVMIASEGTTSGEKGLIAVGTIHNFGEDI 1188
Query: 1125 AARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1179
A+GR+LL P ++ V + SK ++AL S++GHL+ A G K+ L
Sbjct: 1189 TAKGRILLIDIIEVVPEPGQPLTRSKVKTILSKPQNAPVTALCSVKGHLMAAVGQKLFLF 1248
Query: 1180 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1239
+ +L G+AF D +Y++S +K+FIL+GD+HKSI L ++E+ L +++KD
Sbjct: 1249 QLKDNDLVGMAFLDTQ-IYILSAISIKSFILIGDVHKSITLLRYQEESKTLAVVSKDTKP 1307
Query: 1240 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1299
+ ++ E+L+D S ++ + +D Q NI ++ Y P+ E++ GQ+L+ R +F++G+ +
Sbjct: 1308 VQIYSIEYLVDNSQMAFLATDAQCNILVYMYQPENRETFGGQRLIRRGDFNIGSRINTMF 1367
Query: 1300 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1359
R++ R+ SD R L+ +LDG+ G + P+ E T+RRL LQ L
Sbjct: 1368 RIRCRLAEVPRSERRLLSDLEARHVTLYASLDGAFGYLLPISEKTYRRLLMLQNVLNSYC 1427
Query: 1360 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
HV GLNP++FR ++ +A +IVD +L++ + L E+ E+A + GTT QI
Sbjct: 1428 QHVGGLNPKAFRIMQTDVRALSNPQKNIVDGDLINVFMDLNFNEKAEVARKIGTTVHQIQ 1487
Query: 1420 SNLNDL 1425
+L ++
Sbjct: 1488 LDLAEI 1493
Score = 184 bits (466), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 199/404 (49%), Gaps = 61/404 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS----AASLELVCHYRLHGN 112
NLVV VI++Y R++ DG++ A LE + GN
Sbjct: 29 NLVVAGGTVIKVY--------------------RLVCDGLNETDDKAKLEHQQTFNCFGN 68
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ + + + RDS++ F++ KIS++E+D + H L+ ++ E E+ K
Sbjct: 69 ISGMEKIRLNAS-----RDSLLFVFKETKISLVEYDPATHELQTLAIRSLEKEEY---KE 120
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFS 227
G +F L+KVDP RC VL+YG + I+ A+ + + T + GF
Sbjct: 121 GFYNFVGNTLIKVDPLNRCAAVLIYGKHLAIIPFVKKDATDLSDPIASSKSTQTNTSGFL 180
Query: 228 ARIESSHVINLRDLD----MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
+ I L DLD + ++ D F++GY EP +++L+E TW GRV+ + TC I
Sbjct: 181 ----EYYTIRLIDLDEEKGVNNIHDMTFLNGYYEPTLLLLYEPIRTWTGRVAIRQDTCSI 236
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
ALS++ + HP +WS LP +++K+L VP PIGGVL++ N + Y +QS
Sbjct: 237 MALSLNVYQRVHPPVWSFSGLPFNSFKVLPVPKPIGGVLILSVNALLYLNQSVPA----- 291
Query: 344 NYAVSLDSSQELPRSSFSVE--------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
Y VSL+ E +SF ++ LD +L LLS GDL +L++ DG
Sbjct: 292 -YGVSLNCFTEC-STSFPLKDQAGPPLTLDCCRCEFLSETKILLSVANGDLYVLSLFTDG 349
Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
R + + + K + + + I+ F+GSR+G+SLL+++T
Sbjct: 350 MRSINQFEFKKIATTTVATCISLCEPGYLFVGSRIGNSLLLRYT 393
>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
Length = 1477
Score = 348 bits (892), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 256/898 (28%), Positives = 429/898 (47%), Gaps = 169/898 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGI----------------SAAS 100
NL+V NV+E+Y + E+ + T+ DG+ + S
Sbjct: 30 NLIVAKTNVLEVYALHRHEDSKARPIDRQSTRP---TDGVISLRGEEPKDAPPYAGTQHS 86
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+ LV L GN+ES+A + G +D+++L+F DAKISVLEFD + + LR S+H
Sbjct: 87 MRLVLSSSLFGNIESMAAVRFPGTS----KDALLLSFRDAKISVLEFDIATNDLRTISLH 142
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE +K G + + P ++VDPQ RC +L + ++++L Q S +
Sbjct: 143 YFED---YKVKEGHDHYIHVPELRVDPQQRCAAMLAFDRKLVVLPFRQHASLM-----EI 194
Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
+GG ++ S +++LR + + +VKDF+F+ GY EP ++IL+E TW+GRV+ +T
Sbjct: 195 ENGGQEDQPVKPSFLLDLRAMGIINVKDFVFLQGYYEPTLLILYEPTQTWSGRVAVNRNT 254
Query: 281 CMISALSIST-----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI------ 329
C+ +A+S++ HP++WSA LP+D +L+AVP PIGG L + N++
Sbjct: 255 CVAAAVSLNLWQHRGQTSAHPVVWSAEFLPYDTQRLIAVPGPIGGALALSTNSLLYLNQV 314
Query: 330 -------------------HYHSQSASCALALNNYA-VSLDSSQELP---RSSFSVELDA 366
H+ +Q+++ L LN +A + L P ++ + LDA
Sbjct: 315 SFPYRLILPAHGADVSITSHHDTQASASCLPLNVFADLYLSPQTPFPSAGKNRVGIALDA 374
Query: 367 AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNS----- 421
A +L +D L+S K G+L + ++ DGR V + L+K SV+TS + T+
Sbjct: 375 ARDVFLADDQLLVSLKGGELYIFHLLSDGRTVNDIQLTKAGSSVITSCMATLSGEGADER 434
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSSDA----- 474
FLGSR+GDSLL+Q+T ++ +G + F DI+ + + +A
Sbjct: 435 FLFLGSRVGDSLLLQYTTADASAPKQNGATKGSLFDDIKKEEDNDDDDEDEEEEASGEGE 494
Query: 475 LQDMVNGE-ELSLYGSASNNTESAQK-----TFSFAVRDSLVNIGPLKDFSYGLRIN-AD 527
+++ +GE E+ +G + +K T+ F V DSLVN+GP+ DF+ G + A
Sbjct: 495 VKEEPDGEGEVDEFGRRIREEDRRKKKGLLTTYKFKVCDSLVNVGPITDFAIGESFDPAS 554
Query: 528 ASATGISKQSNYELV---------------------------ELPGCKGIWTVYHKSSRG 560
S Q + E+V +L GCK WT+YH+S
Sbjct: 555 VSMAEQEGQRSVEIVTCSGQGKNGSLCVLQHGVRPELVHASADLAGCKAFWTLYHRSEER 614
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT-ESVDYFVQGRTIAAGNLFG 619
++ EYHAYL++S E +T V+ D L E++ E D+ V T+ AGNLF
Sbjct: 615 QGEEA--------EYHAYLLLSEEEQTRVI-AGDGLDELSNEETDFNVAAPTVDAGNLFE 665
Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
+ R++QV + G +LDG TQ + S + + SIADPYVL+ M+DG+
Sbjct: 666 QTRIVQVHQHGLILLDGVKATQRI------------STPGQIAAASIADPYVLVLMADGA 713
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
+RL DP++ + VQT + + + L++ G A+
Sbjct: 714 LRLYFADPTSSKL-VQTSLQNIHEVRDIMAMHLFY---------------------GGAM 751
Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
G D+ I++ + ++G L+I+ VP F+ VF+ ++ +G I + MR + +
Sbjct: 752 RGKKARTNDE--IFAAIAKDNGRLDIYSVPEFDLVFSAERAANGPRLINNVLMRPPPQSA 809
Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
+ ++ + S ++ E+A+ S P LF L +G +L Y+ +L
Sbjct: 810 AAQQSADTT------------SARIAEIALHSIGNIPSLPHLFLYLDNGELLLYRGFL 855
Score = 303 bits (776), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 190/538 (35%), Positives = 277/538 (51%), Gaps = 59/538 (10%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
+RI F + G F+SGS P W R R++P D + AF HN NC HGFIY
Sbjct: 967 RRIHYFGTVGKSNGVFISGSAPAWVFAQRGYARLYPMKLDTFVRAFAEFHNANCPHGFIY 1026
Query: 972 VTSQGILKICQLPS--GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
+G LKICQLP+ G+ + V + +PL TP +I Y Y +V++
Sbjct: 1027 FNHEGTLKICQLPAAEGAIHWELPGVVRKVPLGRTPREIAYHPPSRTY--VVALATPVTT 1084
Query: 1030 NQVLSLLIDQE-------------VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1076
D E +G + + E +E+ ++ P
Sbjct: 1085 VVPTPPETDMERQEREREEEESREMGIEPEEKQRDMGPREIAMMEERHELHLISPRT--- 1141
Query: 1077 PWQTRATIPMQSSENALTVRVVTLFNTTTKENETL----LAIGTAYVQGEDVAARGRVLL 1132
WQ + ++ E+ LT+ V+ L + ++ N L L I V GE+
Sbjct: 1142 -WQILHHVELEPKEHVLTLSVLKLGDNYSQVNRELRPPHLLIYEIDVTGEE--------- 1191
Query: 1133 FSTGRNADNPQNLVTEVYSKELK-----GAISALASLQGHLLIASGPKIILHKWTGTELN 1187
Q +T Y K +K G +SA ASLQG+L+IA GPKI + + G
Sbjct: 1192 ----------QCKLTMAYQKPMKEKPMKGPVSAAASLQGYLIIAVGPKIWVFNFDGGSTE 1241
Query: 1188 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1247
+AFYDAP Y+VS+ +KNF+L GDI+KSI+FL WK+ +QL LLAKD G + FATE+
Sbjct: 1242 AVAFYDAPH-YIVSIKTLKNFVLCGDIYKSIFFLRWKDSASQLALLAKDVGRVSVFATEY 1300
Query: 1248 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS 1307
++D L+L++SDE++N+Q+ YAP +ES GQ L+ R +F+VG + KF+RL M S
Sbjct: 1301 VVDKQNLALLMSDERQNLQVTAYAPHTAESRGGQLLVPRGDFNVGQSINKFVRLPMTLPS 1360
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1367
G+ R AL FGTL G +G +AP+DE FRRL LQ L+ ++PH AGL+P
Sbjct: 1361 --------GTTSLQRHALWFGTLSGGVGYLAPMDESVFRRLGMLQSALLSAIPHTAGLHP 1412
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+++R + R +I+D LLS Y L Q +IA + GT+R +IL++L +
Sbjct: 1413 QAYRALQRE-RLLRNRKHTILDGLLLSRYLALDSATQQQIALKLGTSRERILNDLQGI 1469
>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 820
Score = 340 bits (872), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 220/684 (32%), Positives = 344/684 (50%), Gaps = 49/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 162 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 217
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 218 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLVYEAF----PHDSQLGQGNLK 267
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ R + R S+ + E R F++I G+ G F+ G
Sbjct: 268 VRFKKVPHNINF-REKKPRPSKKKAEGGAEEGPGARGRVARFRYFEDIYGYSGVFICGPS 326
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 327 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 386
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S I + G + + +
Sbjct: 387 PWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNTPCTRIPRMTGEEKEFETIE 438
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 439 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 494
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 495 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 554
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 555 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 613
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 614 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 673
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 674 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 726
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 727 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLY 786
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 787 LSTMERGELAKKIGTTPDIILDDL 810
>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata TFB-10046
SS5]
Length = 1422
Score = 339 bits (870), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 356/1476 (24%), Positives = 647/1476 (43%), Gaps = 199/1476 (13%)
Query: 52 IGPVPNLVVTAANVIEIYVVRVQ-------------EEGSKE---SKNSGETKRRVLMDG 95
+G NLVV N++ ++ VR++ E+G GE + V +G
Sbjct: 40 LGVATNLVVARQNLLRVFEVRIEAAPLPSQEKLLADEQGRGRRGMEAVEGEVEMDVGGEG 99
Query: 96 ISAAS------------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAF 137
+A L LV +RLHG V L + Q A + D ++++F
Sbjct: 100 FVSAGIVKSAGQHARQRQRTVTRLYLVRQHRLHGIVTGLGRV-QTMASLEDKLDRLLVSF 158
Query: 138 EDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLV 196
+DAKI++LE+ + H L S+H +E +P+ L R +++DP RC + +
Sbjct: 159 KDAKIALLEWSEVSHDLSTISIHTYERAPQMLAFDSARALTE----LRIDPNSRCAALTL 214
Query: 197 YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFV 252
G + IL + + L D D GG S + S +++L ++D ++++ D F+
Sbjct: 215 PGDAVAILPFYESQAELDMDVDQ----GGVSRDVPYSPSFILSLPEVDNDIRNIIDIAFL 270
Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
G+ P + +L E + TW GR++ T + L++ + +P+I S LP+D+ +L+
Sbjct: 271 PGFNNPTLAVLFETQRTWTGRLAEFKDTVRLRILTLDVVTRTYPIIGSVDGLPYDSMRLV 330
Query: 313 AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
A P+ +GGV+V+ AN + + QS + A+A N +A + S P L+ + A +
Sbjct: 331 ACPAALGGVIVLTANAVLHIDQSGKNVAVAANGWAARV-SEFPTPAPERDETLEGSRAVF 389
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ + LL + G +V + ++ +GR+V ++D+ + + + + + + + L +GS G
Sbjct: 390 VSDKTFLLVYRDGSIVPVELILEGRMVTKIDMGQRLAQTTIPTVVCAVQDDLVLVGSTAG 449
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-----DMVNGEELS 485
S+L++ T E DI DA S + ++ ++ D ++ E+
Sbjct: 450 PSILLKVT-------------HEEEDITPDAGSARENGAANGNSTNGATYDDPMDSEDED 496
Query: 486 LYGSASNNTESAQKTFS--------------FAVRDSLVNIGPLKDFSYGLRINAD---- 527
LYG S T + A+ DSL GP+ D ++ L N +
Sbjct: 497 LYGGTDMMVTSTSGTLTVGGTAALEKRRILRLALADSLCGHGPISDMAFILGRNGERHVP 556
Query: 528 -------ASATG--------ISKQSNYELVELPGCKGIWTVYHKSS---RGHNADSSRMA 569
TG + + +L + G +G+W+ + + G N + A
Sbjct: 557 ELLAGVGVGHTGGLARFQRDLPARVKRKLHRISGNRGVWSFPVRRAVKVAGMNIERPTGA 616
Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ--GRTIAAGNLFGRRRVIQVF 627
A D +I+S +A + + + + VD + TI AG F R ++QV
Sbjct: 617 ADWD----TVIVSTDATPSPGLSRVAVKDSSTDVDILTRLPAITIGAGPFFQRTAILQVV 672
Query: 628 ERGARIL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
R+L DGS + +DL + + + + SI+DP+V++ D ++ L
Sbjct: 673 NNAIRVLEADGSERQVIKDLD---------GTTPRAKIRACSISDPFVVVVREDDTLGLF 723
Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK------GPEPWLR-KTSTDAWLSTGVG 736
VG+ + + + + + + Y D G L+ K +A ST +
Sbjct: 724 VGETGKGKLRRKDMSMLGDKASRYLAASFYQDHSGLFQVGTARSLKGKEKANAPASTTIE 783
Query: 737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL 796
A+D +G + V+C G +EI+ +P VF+ + DT+
Sbjct: 784 AAMDEG------RGSQWLVLCRPQGVVEIWALPKLTLVFSCGGVSDIPPVMADTF----- 832
Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ S ++ Q ++ E+ + RP L +L GT+ Y
Sbjct: 833 ---DLATPSPVQDPPRQAEDHDVE-----EILISPIGETTPRPHLLVLLRSGTVAVYDTA 884
Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
E P PV+T R + ++ R+ + P++ + P AP I
Sbjct: 885 PVELP--------PVATGREAGL-QLAFVRIMSRAVDTAPIERAEKRGAP--APRHLIPF 933
Query: 917 FKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
++S G FL+G +P W + + +R+ P + + + +FT F+ T +
Sbjct: 934 STSVS---GVFLTGGKPGWILGTDKTAVRLVPAV-NQVVHSFTACSLWGNRGEFLMNTDE 989
Query: 976 GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1035
G + +P D P +P + P+ T A + ++++ L+
Sbjct: 990 GPCLVEWMPD-LRLDEELP-SFFMP-RGRPY--TSIAYEATTGMVIAASSLRS-----RF 1039
Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1095
++ E G+ + + + T + + +++P+ W T +E TV
Sbjct: 1040 VLFDEDGNTVWKPDAEFIS---DPTTDTSSLELIDPET----WTTVDGFEFAFNEMINTV 1092
Query: 1096 RVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNLVTEVY--- 1150
R V L +T+ ++ +A+GT +GED+A +G +F D+ Q ++
Sbjct: 1093 RTVNLETVSTEAGSKDFIAVGTTVFRGEDLAVKGATYIFEVIEVVPDDTQQRRHKLKLWC 1152
Query: 1151 SKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFI 1209
E KG +SAL + G+L+ + G K+ + + E L G+AF D +YV SL +KN +
Sbjct: 1153 RDEAKGPVSALCGINGYLVSSMGQKVFVRAFDLNERLVGVAFMDV-GIYVTSLRTLKNLL 1211
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
L+GD KS++F++++E +L LL KDF + EF +++V +DEQ ++IF
Sbjct: 1212 LIGDAVKSVWFVAFQEDPFKLQLLGKDFQRAALTSAEFFFGFGEMTIVSTDEQNVLRIFR 1271
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
Y P +E+ GQKLL + EF+ + +L +SD P S +++
Sbjct: 1272 YDPMHAEAQDGQKLLCQTEFNTQSDARG--TTTILRRTSDEDILLPQS------KIMYCG 1323
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
DGS+ + P++E F+RL LQ ++ ++ HVAGL+P++FR ++ A RP I+D
Sbjct: 1324 TDGSLSALLPVEEHVFKRLHLLQGQMTRNIQHVAGLHPKAFRVVRNDFTA-RPLARGILD 1382
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
LL+ +E LPL Q+E Q G +R IL + L
Sbjct: 1383 SNLLAKFEELPLSRQVEFTKQIGQSREVILGDWMQL 1418
>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Sarcophilus harrisii]
Length = 1449
Score = 337 bits (864), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 225/704 (31%), Positives = 352/704 (50%), Gaps = 82/704 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ ESGA+EI+ +P++ VF V F G+ +VD+ + T+ ++ EE T
Sbjct: 790 WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L ++ +RP+L + D +L Y+A+ +
Sbjct: 846 QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + S L+ +RF + P + RE+ P + R
Sbjct: 887 -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGAEEGAGARGRVARFR 938
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 939 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S L
Sbjct: 999 QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNALC 1050
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
I + G + + + D + E + ++++ P W+ A I ++ E+
Sbjct: 1051 TRIPRMTGEEKEFETIERDDRYIHPLQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1106
Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1148
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1107 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1166
Query: 1149 --VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VK
Sbjct: 1167 KVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVK 1225
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
NFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+
Sbjct: 1226 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSAQLGFLVSDRDRNLM 1285
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----N 1321
++ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K N
Sbjct: 1286 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPTKKSIVWEN 1338
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1339 KHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQ 1398
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+++D ELL+ Y L E+ E+A + GTT IL +L ++
Sbjct: 1399 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEI 1442
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 224/687 (32%), Positives = 353/687 (51%), Gaps = 99/687 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S +S + E K + LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303
Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
+S + P + + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
K SVLT+ + T+ FLGSRLG+SLL+++T SS + ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSAREAPSREVSDKD 423
Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539
Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
+WTV ++++G + SS D + H +LI+S E TM+L+T +
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGPEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675
>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Felis catus]
Length = 1432
Score = 336 bits (861), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 226/701 (32%), Positives = 350/701 (49%), Gaps = 82/701 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G LEI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 773 WCLLVRENGTLEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 828
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 829 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 871
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
L N+ +RF + P + RE E GA + R
Sbjct: 872 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 921
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 922 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 981
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S + P +
Sbjct: 982 QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR--- 1035
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
I + G + + + D + E + ++++ P W+ A I ++ E+
Sbjct: 1036 --IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1089
Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1148
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1090 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1149
Query: 1149 --VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VK
Sbjct: 1150 KVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVK 1208
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
NFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+
Sbjct: 1209 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1268
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----N 1321
++ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K N
Sbjct: 1269 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWEN 1321
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1322 KHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQ 1381
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1382 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILEDL 1422
Score = 300 bits (768), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 211/631 (33%), Positives = 334/631 (52%), Gaps = 76/631 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 57 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 112
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 113 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 166
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 167 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 226
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 227 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 286
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG R
Sbjct: 287 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 346
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
V+ K SVLT+ + T+ FLGSRLG+SLL+++T +S ++E
Sbjct: 347 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA--- 402
Query: 457 IEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDS 508
+ + P +K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS
Sbjct: 403 ADKEEPPSKKKRVDSTVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDS 458
Query: 509 LVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---E 543
++NIGP + + G L I + + + + K ++V E
Sbjct: 459 ILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFE 518
Query: 544 LPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLETA 593
LPGC +WTV ++S+G A+ S + A DD H +LI+S E TM+L+T
Sbjct: 519 LPGCYDMWTVIAPVRKEQEETSKGEGAEQEPSTLEAEDDGRRHGFLILSREDSTMILQTG 578
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 579 QEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL--- 631
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 632 ----GSPIVQCAVADPYVVIMSAEGHVTMFL 658
>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Monodelphis domestica]
Length = 1449
Score = 336 bits (861), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 224/704 (31%), Positives = 352/704 (50%), Gaps = 82/704 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ ESGA+EI+ +P++ VF V F G+ +VD+ + T+ ++ EE T
Sbjct: 790 WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L ++ +RP+L + D +L Y+A+ +
Sbjct: 846 QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + S L+ +RF + P + RE+ P + R
Sbjct: 887 -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGTEEGAGARGRVARFR 938
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 939 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S L
Sbjct: 999 QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNALC 1050
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
I + G + + + + + E + ++++ P W+ A I ++ E+
Sbjct: 1051 TRIPRMTGEEKEFETIERDERYIHPLQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1106
Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1148
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1107 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1166
Query: 1149 --VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VK
Sbjct: 1167 KVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVK 1225
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
NFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+
Sbjct: 1226 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSAQLGFLVSDRDRNLM 1285
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----N 1321
++ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K N
Sbjct: 1286 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSIVWEN 1338
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1339 KHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQ 1398
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+++D ELL+ Y L E+ E+A + GTT IL +L ++
Sbjct: 1399 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEI 1442
Score = 306 bits (783), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 224/687 (32%), Positives = 354/687 (51%), Gaps = 99/687 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S +S + E K + LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303
Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
+S + P + + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
K SVLT+ + T+ FLGSRLG+SLL+++T S+ + ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAAREAPSREVSDKD 423
Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539
Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
+WTV ++++G A+ SS D + H +LI+S E TM+L+T +
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGAEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675
>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
putorius furo]
Length = 1399
Score = 335 bits (859), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 218/684 (31%), Positives = 344/684 (50%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 741 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 796
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 797 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 846
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 847 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAAARGRVARFRYFEDIYGYSGVFICGPS 906
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 907 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 966
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S + P + I + G + + +
Sbjct: 967 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFEAIE 1018
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1019 RDDRYVHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1074
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1075 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1134
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1135 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1193
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1194 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1253
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1254 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1306
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H++ +A + +++D ELL+ Y
Sbjct: 1307 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRLLHADRRALQNAVRNVLDGELLNRYLY 1366
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1367 LSTMERGELAKKIGTTPDIILDDL 1390
Score = 298 bits (762), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 210/635 (33%), Positives = 329/635 (51%), Gaps = 81/635 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 22 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 77
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVL----VYGLQMIILKASQGGSGLVGD 216
FE PE L+ G P V+VDP GRC +L +YG ++++L + + +
Sbjct: 78 YFEEPE---LRDGFVQNVHAPRVRVDPDGRCAAMLTAMLIYGSRLVVLPFRRES---LAE 131
Query: 217 EDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRV 274
E G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV
Sbjct: 132 EHEGLMGEGQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRV 191
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +Q
Sbjct: 192 AVRQDTCCIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQ 251
Query: 335 SA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
S +ALN + + + LD A A ++ D ++S K G++ +LT++
Sbjct: 252 SVPPYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLIT 311
Query: 394 DG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++ T L
Sbjct: 312 DGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY-----TEKLQEAPAG 366
Query: 453 EFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
+ + D P +K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF
Sbjct: 367 AVRETDKDEPPSKKKRVESAVGWSGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 422
Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
V DS++NIGP + + G L I + + + + K ++V
Sbjct: 423 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 482
Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
ELPGC +WTV + + G + S + A DD H +LI+S E TM+
Sbjct: 483 TTFELPGCYDMWTVIAPARKEQEETPKGDGAEQEPSALEADDDGRRHGFLILSREDSTMI 542
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 543 LQTGQEIMELDTS-GFATQGPTVFAGNIGDGRYIVQVSPLGIRLLEG---VSQLHFIPVD 598
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 599 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 626
>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Bos taurus]
Length = 1490
Score = 335 bits (858), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 217/684 (31%), Positives = 343/684 (50%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 831 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 886
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 887 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 936
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 937 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 996
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 997 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1056
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P +V + G + + +
Sbjct: 1057 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIE 1108
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
+ + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1109 RDERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1164
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1165 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1224
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1225 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1283
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1284 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1343
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1344 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1396
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1397 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLY 1456
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1457 LSTMERGELAKKIGTTPDIILDDL 1480
Score = 295 bits (754), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 209/635 (32%), Positives = 327/635 (51%), Gaps = 82/635 (12%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+
Sbjct: 114 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 169
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLV 214
H FE PE L+ G P V+VDP GRC +L+YG ++++L ++ GLV
Sbjct: 170 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 226
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
G+ G + S++I++R LD K ++ D F+HGY EP ++IL E TW G
Sbjct: 227 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 278
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
+V+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV++ N++ Y
Sbjct: 279 KVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 338
Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
+QS +ALN+ + + + LD A A ++ D ++S K G++ +LT+
Sbjct: 339 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 398
Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
+ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+
Sbjct: 399 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 457
Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
E D E KR+ + S QD V+ E+ +YGS A + T+ A T+SF
Sbjct: 458 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 512
Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
V DS++NIGP + + G L I + + + + K ++V
Sbjct: 513 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 572
Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
ELPGC +WTV + G + A DD H +LI+S E TM+
Sbjct: 573 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 632
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 633 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 688
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 689 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 716
>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
taurus]
gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
Length = 1444
Score = 334 bits (857), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 217/684 (31%), Positives = 343/684 (50%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 891 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 950
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 951 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1010
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P +V + G + + +
Sbjct: 1011 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIE 1062
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
+ + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1063 RDERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1118
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1119 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1178
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1179 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1237
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1238 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1297
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1298 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1350
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1351 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLY 1410
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1411 LSTMERGELAKKIGTTPDIILDDL 1434
Score = 304 bits (779), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 221/678 (32%), Positives = 345/678 (50%), Gaps = 86/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 309
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 369
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T S+ E D E KR+
Sbjct: 370 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRV 427
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 428 DATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 484 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 543
Query: 557 SSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 544 VRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFA 602
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + S ++ ++
Sbjct: 603 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAV 652
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 653 ADPYVVIMSAEGHVTMFL 670
>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Canis lupus familiaris]
Length = 1460
Score = 334 bits (856), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 217/684 (31%), Positives = 342/684 (50%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 801 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 856
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 857 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 906
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 907 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 966
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 967 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1026
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S + P + I + G + + +
Sbjct: 1027 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFETIE 1078
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1079 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1134
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1135 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1194
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1195 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1253
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1254 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1313
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1314 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1366
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1367 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1426
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1427 LSTMERGELAKKIGTTPDIILDDL 1450
Score = 301 bits (771), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 212/629 (33%), Positives = 331/629 (52%), Gaps = 72/629 (11%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 85 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 140
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 141 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 194
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 195 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 254
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 255 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 314
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG R
Sbjct: 315 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 374
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+ E D
Sbjct: 375 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAAD 432
Query: 457 IEADAPSTKRLRRSS-----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLV 510
E KR+ ++ QD V+ E+ +YGS A + T+ A T+SF V DS++
Sbjct: 433 KEEPPSKKKRVDCAAGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSIL 488
Query: 511 NIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELP 545
NIGP + + G L I + + + + K ++V ELP
Sbjct: 489 NIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELP 548
Query: 546 GCKGIWTVY-------HKSSRGHNA--DSSRMAAYDD-EYHAYLIISLEARTMVLETADL 595
GC +WTV ++S+G A +SS + A DD H +LI+S E TM+L+T
Sbjct: 549 GCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQTGQE 608
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 609 IMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL----- 659
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 660 --GSPIVQCAVADPYVVIMSAEGHVTMFL 686
>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
[Mus musculus]
gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
Length = 1441
Score = 334 bits (856), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 218/684 (31%), Positives = 341/684 (49%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1059
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1060 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1115
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1175
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1176 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1234
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1235 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1294
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1295 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1347
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1348 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1407
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1408 LSTMERSELAKKIGTTPDIILDDL 1431
Score = 307 bits (787), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Otolemur garnettii]
Length = 1441
Score = 334 bits (856), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 224/701 (31%), Positives = 348/701 (49%), Gaps = 82/701 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 880
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
L N+ +RF + P + RE T GA + R
Sbjct: 881 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGVRGRVARFR 930
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 931 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 990
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S P +
Sbjct: 991 QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR--- 1044
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
I + G + + + D + E + ++++ P W+ A I ++ E+
Sbjct: 1045 --IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1098
Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1148
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1099 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1158
Query: 1149 --VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VK
Sbjct: 1159 KVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVK 1217
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
NFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+
Sbjct: 1218 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1277
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----N 1321
++ Y P+ ES+ G +LL RA+FHVGAHV F R + GA G K N
Sbjct: 1278 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGPSKKSVVWEN 1330
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1331 KHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQ 1390
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1391 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1431
Score = 300 bits (767), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 217/678 (32%), Positives = 343/678 (50%), Gaps = 89/678 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEVLTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASATRESADKEEPPSKKKRV 424
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 425 DPSVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAS 540
Query: 555 -----HKSSRGHNADSSR---MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
++ +G + D H +LI+S E TM+L+T + E+ S +
Sbjct: 541 VRKEEEETPKGEGTEQESGVPEGEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667
>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Callithrix jacchus]
Length = 1394
Score = 333 bits (855), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 216/684 (31%), Positives = 342/684 (50%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 735 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 790
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ S +
Sbjct: 791 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLSQGNLK 840
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 841 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 900
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 901 PHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 960
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 961 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIE 1012
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1013 RDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGL 1068
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P+ +T +Y KE KG ++AL
Sbjct: 1069 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVTEPRQTLTXXKFKVLYEKEQKGPVTALCH 1128
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1129 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1187
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1188 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1247
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1248 LRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVMWENKHITWFATLDGGIGLLL 1300
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1301 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLY 1360
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1361 LSTMERSELAKKIGTTPDIILDDL 1384
Score = 267 bits (682), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 201/672 (29%), Positives = 322/672 (47%), Gaps = 124/672 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSAEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ ++ F C +G +
Sbjct: 367 ASVLTTSVSGTEG----------------FLCAAGGKSVP-------------------- 390
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------ 521
QD +E+ +YGS + + + T+SF V DS++NIGP + + G
Sbjct: 391 --------QD--EXDEIEVYGSETQSG-TQLATYSFEVCDSILNIGPCANAAMGEPAFLS 439
Query: 522 ----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 440 EEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEE 499
Query: 555 -HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
+ G + S A DD H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 500 ENPKGEGTEQEPSTPEADDDSRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 558
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV+
Sbjct: 559 FAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFVPVDL-------GAPIVQCAVADPYVV 608
Query: 673 LGMSDGSIRLLV 684
+ ++G + + +
Sbjct: 609 IMSAEGHVTMFL 620
>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cricetulus griseus]
Length = 1441
Score = 333 bits (855), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 218/684 (31%), Positives = 341/684 (49%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1059
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1060 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1115
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1175
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1176 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1234
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1235 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1294
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1295 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLL 1347
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1348 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1407
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1408 LSTMERSELAKKIGTTPDIILDDL 1431
Score = 302 bits (774), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
++ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
griseus]
Length = 1419
Score = 333 bits (855), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 218/684 (31%), Positives = 341/684 (49%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 760 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 815
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 816 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 865
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 866 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 925
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 926 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 985
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 986 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1037
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1038 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1093
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1094 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1153
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1154 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1212
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1213 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1272
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1273 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLL 1325
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1326 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1385
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1386 LSTMERSELAKKIGTTPDIILDDL 1409
Score = 299 bits (765), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 82/671 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA--DAPSTKRLRRS 470
+ + T+ FLGSRLG+SLL+++T SS ++E A + P +K+ R
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS-VREAADKASAHNEEPPSKKKRVD 430
Query: 471 SSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 PTAGWTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGE 486
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 546
Query: 555 ----HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
++ R + + ++ A D H +LI+S E TM+L+T + E+ S +
Sbjct: 547 RKEEEEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 605
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 606 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 655
Query: 668 DPYVLLGMSDG 678
DPYV++ ++G
Sbjct: 656 DPYVVIMSAEG 666
>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
Length = 1439
Score = 333 bits (855), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 218/684 (31%), Positives = 341/684 (49%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1006 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1057
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1058 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1113
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1114 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1173
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1174 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1232
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1233 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1292
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1293 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLL 1345
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1346 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1405
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1406 LSTMERSELAKKIGTTPDIILDDL 1429
Score = 308 bits (789), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/671 (32%), Positives = 345/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429
Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 EFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
Length = 1431
Score = 333 bits (854), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 379/1514 (25%), Positives = 641/1514 (42%), Gaps = 249/1514 (16%)
Query: 57 NLVVTAANVIEIY-VVRVQ------EEGSKESKNSGETKRRVLMDGI---------SAAS 100
NL+V + ++++ +VRVQ E+ S N + M+ + +
Sbjct: 29 NLIVAKDDFLQVFDIVRVQRDSDDVEDAFGSSMNLRMEENDAFMETNMQLIRTHEHTVYT 88
Query: 101 LELVCHYRLHGNVESLAILSQ--GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L LV R+ G ++ LA++ GG D ++L AK+S+L +D L S
Sbjct: 89 LRLVYQTRVFGTIKDLAVVKPKLGGFTT----DLLVLLTNYAKVSILVWDSLTQQLSTVS 144
Query: 159 MHCFES---PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
MH +ES P+ + E+ A+ + VDP+ C + YG M I+ + +
Sbjct: 145 MHYYESVVPPKPI----AEETPAQ---LIVDPESTCCVLRFYGDMMAIIPFRKPEDLEME 197
Query: 216 DEDTFGSGG-GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
D + S V+ LD + V D F+ GY E + +L+ E T
Sbjct: 198 DANAQSEKPVDVQCVYLPSFVLTASQLDYSIARVLDSKFLEGYREATLALLYCPEQTSTV 257
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY- 331
+ + T ++ +++ + +I S NLP+D Y +L VP+P+GG L++G N I Y
Sbjct: 258 FLPVRKDTVSLAVITLDIEQRASAVITSIHNLPYDIYCILPVPAPLGGSLLLGGNEIIYV 317
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDL 386
S ++ +A+N + + + Q RSSF +EL+ L ++ LL TG L
Sbjct: 318 DSAGSTVGIAVNPFYRNATNFQLEDRSSFQLELEGTIGVPLSSPRTESVSVLLIHPTGQL 377
Query: 387 VLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTC 439
L + DG+ V+ LDL + N ++L S +T + + FLGS+ GDS LVQ++
Sbjct: 378 FYLDFLMDGKNVKNLDLHPASDELNNALLQSGVTCALPVADHELFLGSQTGDSYLVQWSR 437
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
S + + D E DA D L D +Y + S K
Sbjct: 438 RSINNQTQEEGTLTYKDEENDA-------DEEVDELDD--------IYDTGSKEKAKRNK 482
Query: 500 -----TFSFAVRDSLVNIGPLKDFSYG---------------LRINADASATGIS----- 534
V D L N+GP+ +F G L + + TG S
Sbjct: 483 FVELGPLRLEVHDVLSNVGPIIEFCTGKAGSLAYFPQDNHGPLEVTC-VTGTGKSGSLVV 541
Query: 535 -KQSNYELVE----LPGCKGIWTVYHKSSRGHNADSSRMAAY-DDEYHAYLIISLEARTM 588
++S +VE GC+ +WT+ H + R N S Y DDEY YL++S E +
Sbjct: 542 FRRSISPVVEGKFNFEGCQSLWTI-HVTGRLKNPRSHGSERYLDDEYDTYLVVSKEKESF 600
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGP 647
V + EV +S D+ +G TI G L G R++Q+ R+ D + ++ Q ++ G
Sbjct: 601 VFTAGETFDEVEDS-DFNTKGSTINVGGLLGGMRIVQICTTSLRVYDPNIHLVQRINLG- 658
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
+ V++ S+ DPYV+L + G I L D T + + K V
Sbjct: 659 ---------KKQNVVAASVCDPYVVLVLLGGRILLYSMDAETQRL---IKMDLHKQLKNV 706
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWL-----STGVGE-AIDGADGGP--------------- 746
+ +LY +P +++ ++ L S G + +DG D P
Sbjct: 707 KAASLYSTN--DPVMQELFSELDLGRNNSSPGKSDIQMDGVDTQPDRPSMPAGNQVTETN 764
Query: 747 ---LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
LD+ + V ++ G L++ +P ++CV D F + T + + L
Sbjct: 765 VSTLDEQSFAAHFVLFVLHDDGRLKVLHLPTYSCVLECDVF------DLPTVLYDGLSSE 818
Query: 800 E-TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
TE++ SS+E +VE+ L I Y+ ++
Sbjct: 819 RVTEMHESSQE--------------LVEVLATDLGDEAKEAHLLIRSRMNEITVYKPFVC 864
Query: 859 EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET------------- 905
P T K++ LRFS+ P + TRE T
Sbjct: 865 SNPV-THKTE---------------------LRFSKIPQEGMTRESTECSLQDLVAETEQ 902
Query: 906 ---PHGAPCQ------------RITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQL 949
P A Q R+ + I H F++G++P + + + + HP L
Sbjct: 903 ENAPKDASEQKPQKSSSTVDKPRMVALQRIGNHSAVFITGAKPFFLLKTAHSVAKFHPLL 962
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
+ I++ H + G+I+V + IC+ YD+ W +KV + + H I
Sbjct: 963 SECRILSLASFHTEHAPKGYIFVDENYDINICRFQDDINYDHRWGYKKV-NVGRSVHGIA 1021
Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
Y K +Y + S L P + E G+ + L RT + + ++
Sbjct: 1022 YHPTKMVYAIATST--LTPYE------VTDEEGNVVYPLKNEGEYLPRTNS---GMLELV 1070
Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARG 1128
P W E L VR+V L + TK + +A+GT+ +GED+A RG
Sbjct: 1071 SP----LTWTVIDRYKFLDYEIPLCVRLVNLEISDVTKLRKPFIAVGTSITKGEDIAVRG 1126
Query: 1129 RVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
LF P + T V +E+KG ++ ++ + G+LL G K+I+
Sbjct: 1127 STYLFEIIDVVPQPGHPETRHKLKLVTREEIKGTVAVVSEINGYLLSGQGQKVIVRALED 1186
Query: 1184 TE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+ L G+AF D VV+ ++ +N ++ GDI +SI F+ + E+ ++ L AK L
Sbjct: 1187 EDHLVGVAFIDLGSYTVVAKSL-RNLLIFGDIRQSISFVGFAEEPYRMTLFAKGQDPLSV 1245
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1302
+ +FL+ G +L V+D + N++I Y P+ ES G++L++R + HVG H+ + L
Sbjct: 1246 SSADFLVQGQSLYFAVADMRGNLRILAYDPENPESHSGERLVTRGDIHVG-HIITAIHL- 1303
Query: 1303 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHV 1362
+ DR G D+ + FA + DGS+ + P+ E +RRL +Q L + + V
Sbjct: 1304 VPKMKKDRPGEV-DYDEGDEFACITTNSDGSLQALCPISERVYRRLNIIQNYLANRIETV 1362
Query: 1363 AGLNPRSFRQFHS----NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
GLNPRS+R ++ N HR I+D L+ H+ + + + E+A++ G S I
Sbjct: 1363 GGLNPRSYRLINTVSSLNNATHR-----ILDGGLIEHFSYMSVAHRQEMAYKCGVPISTI 1417
Query: 1419 LSNLNDLALGTSFL 1432
+++L +L +++
Sbjct: 1418 MNDLVELDEALNYM 1431
>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Desmodus rotundus]
Length = 1444
Score = 332 bits (851), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 223/701 (31%), Positives = 346/701 (49%), Gaps = 82/701 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAFAHD------------- 881
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + L+ +RF + P + RE+ P + R
Sbjct: 882 -------SQLGQGNLK-VRFKKVPHNINFREKKPKPSKKKADGGGAEEGAGARGRVARFR 933
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 934 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 993
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S P +
Sbjct: 994 QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR--- 1047
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
I + G + + + D + E + ++++ P W+ A I ++ E+
Sbjct: 1048 --IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1101
Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1148
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1102 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1161
Query: 1149 --VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VK
Sbjct: 1162 KVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVK 1220
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
NFIL D+ KSI L ++E L+L+++D L+ ++ +F++D + L +VSD +N+
Sbjct: 1221 NFILAADVMKSISLLRYQEDSKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1280
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----N 1321
++ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K N
Sbjct: 1281 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWEN 1333
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1334 KHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQ 1393
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+I+D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1394 NAVRNILDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1434
Score = 300 bits (768), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 216/675 (32%), Positives = 346/675 (51%), Gaps = 80/675 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTSGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRTTLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---- 468
S + T+ FLGSRLG+SLL+++T +S ++E + + PS+K+ R
Sbjct: 375 SSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEAPASAVREA---ADKEEPSSKKKRVDPT 430
Query: 469 ---RSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 VGWSGGQSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 FLSEEFQNSPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVVAPVRK 546
Query: 555 --HKSSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
++ +G + + D H +LI+S E TM+L+T + E+ S + QG
Sbjct: 547 EQEETPKGEGTEQEPITPETEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + S ++ ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655
Query: 670 YVLLGMSDGSIRLLV 684
V++ ++G + + +
Sbjct: 656 CVVIMSAEGHVAMFL 670
>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Equus caballus]
Length = 1450
Score = 332 bits (851), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 217/684 (31%), Positives = 341/684 (49%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 791 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 846
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 847 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 896
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 897 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 956
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 957 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1016
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1017 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIE 1068
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1069 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1124
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1125 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1184
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1185 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1243
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1244 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1303
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1304 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1356
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1357 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1416
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1417 LSTMERGELAKKIGTTPDIILDDL 1440
Score = 303 bits (775), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 219/681 (32%), Positives = 350/681 (51%), Gaps = 86/681 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ + T+ FLGSRLG+SLL+++T +S ++E E + P +K+ R S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430
Query: 473 -------------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP +
Sbjct: 431 VGWSGSPRAAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANA 486
Query: 519 SYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
+ G L I + + + + K ++V ELPGC +WTV
Sbjct: 487 AMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 546
Query: 554 Y-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
++ +G + S+ A D H +LI+S E TM+L+T + E+ S
Sbjct: 547 IAPVRKEQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS- 605
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ QG T+ AGN+ R ++QV G R+L+G L F P + S ++
Sbjct: 606 GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQ 655
Query: 664 VSIADPYVLLGMSDGSIRLLV 684
++ADPYV++ ++G + + +
Sbjct: 656 CAVADPYVVIMSAEGHVTMFL 676
>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like isoform 1 [Equus caballus]
Length = 1444
Score = 332 bits (851), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 217/684 (31%), Positives = 341/684 (49%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 891 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 950
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 951 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1010
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1011 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIE 1062
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1063 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1118
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1119 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1178
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1179 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1237
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1238 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1297
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1298 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1350
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1351 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1410
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1411 LSTMERGELAKKIGTTPDIILDDL 1434
Score = 305 bits (782), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 219/675 (32%), Positives = 350/675 (51%), Gaps = 80/675 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ + T+ FLGSRLG+SLL+++T +S ++E E + P +K+ R S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430
Query: 473 -------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 VGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 FLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 546
Query: 555 --HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
++ +G + S+ A D H +LI+S E TM+L+T + E+ S + QG
Sbjct: 547 EQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + S ++ ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655
Query: 670 YVLLGMSDGSIRLLV 684
YV++ ++G + + +
Sbjct: 656 YVVIMSAEGHVTMFL 670
>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Saimiri boliviensis boliviensis]
Length = 1390
Score = 332 bits (850), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 223/701 (31%), Positives = 349/701 (49%), Gaps = 82/701 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 731 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 787 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
LS N+ +RF + P + RE E GA + R
Sbjct: 830 ----LSQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSAEEGAGARGRVARFR 879
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG + +F HN+NC GF+Y
Sbjct: 880 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNR 939
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S P +
Sbjct: 940 QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR--- 993
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
I + G + + + + + E + ++++ P W+ A I +Q E+
Sbjct: 994 --IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHV 1047
Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1148
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1048 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1107
Query: 1149 --VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VK
Sbjct: 1108 KVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVK 1166
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
NFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+
Sbjct: 1167 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1226
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----N 1321
++ Y P+ ES+ G +LL RA+FHVGAHV F R + GA G K N
Sbjct: 1227 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWEN 1279
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1280 KHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQ 1339
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1340 NAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1380
Score = 273 bits (699), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 203/645 (31%), Positives = 322/645 (49%), Gaps = 105/645 (16%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--------------------------GGV 321
++ T K HP+IWS +LP D + LAVP PI GGV
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGENPGGAEGSAGRGAVSLPTSLCPPPGGV 226
Query: 322 LVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
++ N++ Y +QS +ALN+ + + + LD A AT++ D ++S
Sbjct: 227 VIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVIS 286
Query: 381 TKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 287 LKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTE 346
Query: 440 G----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNT 494
+++ +G KEE + +T QD V+ E+ +YGS A + T
Sbjct: 347 KLQEPPASAVREAGDKEEPPSKKKRVDATAGWSAGGKSVPQDEVD--EIEVYGSEAQSGT 404
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATG 532
+ A T+SF V DS++NIGP + + G L I + + +
Sbjct: 405 QLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSV 462
Query: 533 ISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYL 579
+ K ++V ELPGC +WTV + G + S A DD H +L
Sbjct: 463 LQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEENPKGEGTEQEPSTPEADDDSRRHGFL 522
Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
I+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 523 ILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNVGDDRYIVQVSPLGIRLLEG--- 578
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 579 VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616
>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cavia porcellus]
Length = 1440
Score = 331 bits (848), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 217/684 (31%), Positives = 340/684 (49%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 781 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 836
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 837 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 887 VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 946
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 947 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1007 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIE 1058
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1059 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1114
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1115 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1174
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1175 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1233
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1234 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1293
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1294 LRRADFHVGAHVNTFWR-------TPCRGATEGPSKKSVVWENKHITWFATLDGGIGLLL 1346
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1347 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1406
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1407 LSTMERGELAKKIGTTPDIILDDL 1430
Score = 271 bits (694), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 209/678 (30%), Positives = 332/678 (48%), Gaps = 90/678 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + + A + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + I ++SV+E+D H L+ S+H FE PE L+ G
Sbjct: 83 ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 480 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 539
Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 540 VRKEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 598
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 599 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 648
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 649 ADPYVVIMSAEGHVTMFL 666
>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
[Mus musculus]
Length = 1450
Score = 331 bits (848), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 215/676 (31%), Positives = 337/676 (49%), Gaps = 48/676 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1059
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1060 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1115
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1175
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1176 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1234
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1235 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1294
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1295 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1347
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1348 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1407
Query: 1399 LPLEEQLEIAHQTGTT 1414
L E+ E+A + GTT
Sbjct: 1408 LSTMERSELAKKIGTT 1423
Score = 307 bits (787), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Papio anubis]
Length = 1389
Score = 330 bits (846), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 215/684 (31%), Positives = 339/684 (49%), Gaps = 48/684 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 730 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 785
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 786 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 835
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 836 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGXRGRVARFRYFEDIYGYSGVFICGPS 895
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 896 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 955
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S I + G + + +
Sbjct: 956 PWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNTPCARIPRMTGEEKEFETIE 1007
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1008 RDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGL 1063
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1064 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1123
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1124 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1182
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1183 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1242
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1243 LRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLL 1295
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1296 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLY 1355
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1356 LSTMERSELAKKIGTTPDIILDDL 1379
Score = 272 bits (696), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 202/644 (31%), Positives = 321/644 (49%), Gaps = 104/644 (16%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-------------------------GGVL 322
++ T K HP+IWS +LP D + LAVP PI GGV+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGEYPGSGWGCVEGALSLPTSLCPPPGGVV 226
Query: 323 VVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
V N++ Y +QS +ALN+ + + + LD A AT++ D ++S
Sbjct: 227 VFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRIQEGVRITLDCAQATFISYDKMVISL 286
Query: 382 KTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 287 KGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEK 346
Query: 441 ----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTE 495
+++ + KEE + +T QD V+ E+ +YGS A + T+
Sbjct: 347 LQEPPASAVREAADKEEPPSKKKRVDATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQ 404
Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATGI 533
A T+SF V DS++NIGP + + G L I + + + +
Sbjct: 405 LA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVL 462
Query: 534 SKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
K ++V ELPGC +WTV + G ++ A DD H +LI
Sbjct: 463 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 522
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 523 LSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 578
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 579 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 615
>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Gorilla gorilla gorilla]
Length = 1440
Score = 330 bits (845), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 215/686 (31%), Positives = 340/686 (49%), Gaps = 52/686 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 781 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 836
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 837 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 887 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 946
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 947 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHN 1049
WPV+K IPL+ T H + Y E +Y + S P + I + G + +
Sbjct: 1007 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1056
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
+ + + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1057 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1112
Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1113 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTAL 1172
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1173 CHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLL 1231
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 1232 RYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGM 1291
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGC 1336
+LL RA+FHVGAHV F R + GA G K N+ F TLDG IG
Sbjct: 1292 RLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGL 1344
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1345 LLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRY 1404
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1405 LYLSTMERSELAKKIGTTPDIILDDL 1430
Score = 287 bits (735), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 210/675 (31%), Positives = 336/675 (49%), Gaps = 84/675 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP G C +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGTCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T +S ++E + + P +K+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---ADKEEPPSKKK 422
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR---DSLVNIGPLKDFSYG--- 521
R ++ G + A + AV DS++NIGP + + G
Sbjct: 423 RVDATAGWSGEGRSRAGQERGQVTQGWSGAGAPLTVAVPQVCDSILNIGPCANAAMGEPA 482
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 FLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 542
Query: 555 ----HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+ G + S A DD H +LI+S E TM+L+T + E+ S + QG
Sbjct: 543 EEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 601
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADP
Sbjct: 602 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADP 651
Query: 670 YVLLGMSDGSIRLLV 684
YV++ ++G + + +
Sbjct: 652 YVVIMSAEGHVTMFL 666
>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 3 [Cricetulus griseus]
Length = 1449
Score = 330 bits (845), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 215/676 (31%), Positives = 337/676 (49%), Gaps = 48/676 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1059
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1060 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1115
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1175
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1176 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1234
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1235 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1294
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1295 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLL 1347
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1348 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLY 1407
Query: 1399 LPLEEQLEIAHQTGTT 1414
L E+ E+A + GTT
Sbjct: 1408 LSTMERSELAKKIGTT 1423
Score = 303 bits (775), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
++ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
[Bos grunniens mutus]
Length = 1417
Score = 330 bits (845), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 214/676 (31%), Positives = 339/676 (50%), Gaps = 48/676 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 775 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 830
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 831 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 880
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 881 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 940
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 941 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1000
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P +V + G + + +
Sbjct: 1001 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIE 1052
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
+ + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1053 RDERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1108
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1109 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1168
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1169 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1227
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1228 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1287
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIA 1338
L RA+FHVGAHV F R + GAA G K N+ F TLDG IG +
Sbjct: 1288 LRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLL 1340
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1341 PMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLY 1400
Query: 1399 LPLEEQLEIAHQTGTT 1414
L E+ E+A + GTT
Sbjct: 1401 LSPMERGELAKKIGTT 1416
Score = 280 bits (717), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 206/634 (32%), Positives = 320/634 (50%), Gaps = 89/634 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L SV+E+D H L+ S+H
Sbjct: 66 LELVASFSFFGNVMSMASVQLAGA----KRDALLL-------SVVEYDPGTHDLKTLSLH 114
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
FE PE L+ G P V+VDP GRC +L+YG ++++L ++ GLVG
Sbjct: 115 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVG 171
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ G + S++I++R LD K ++ D F+HGY EP ++IL E TW GR
Sbjct: 172 E--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGR 223
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
V+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +
Sbjct: 224 VAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLN 283
Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
QS +ALN+ + + + LD A A ++ D ++S K G++ +LT++
Sbjct: 284 QSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLI 343
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+
Sbjct: 344 TDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA-- 401
Query: 452 EEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAV 505
E D E KR+ + S QD V+ E+ +YGS A + T+ A T+SF V
Sbjct: 402 REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEV 457
Query: 506 RDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV- 542
DS++NIGP + + G L I + + + + K ++V
Sbjct: 458 CDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVT 517
Query: 543 --ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVL 590
ELPGC +WTV + G + A DD H +LI+S E TM+L
Sbjct: 518 TFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMIL 577
Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 578 QTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL 633
Query: 651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 634 -------GSPIVQCAVADPYVVIMSAEGHVTMFL 660
>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
Length = 1365
Score = 330 bits (845), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 215/686 (31%), Positives = 340/686 (49%), Gaps = 52/686 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 706 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 761
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 762 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 811
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 812 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 871
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 872 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 931
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHN 1049
WPV+K IPL+ T H + Y E +Y + S P + I + G + +
Sbjct: 932 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 981
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
+ + + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 982 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1037
Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1038 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTAL 1097
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1098 CHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLL 1156
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 1157 RYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGM 1216
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGC 1336
+LL RA+FHVGAHV F R + GA G K N+ F TLDG IG
Sbjct: 1217 RLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGL 1269
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1270 LLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRY 1329
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1330 LYLSTMERSELAKKIGTTPDIILDDL 1355
Score = 286 bits (733), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 202/620 (32%), Positives = 322/620 (51%), Gaps = 80/620 (12%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYA 346
++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLT 226
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 227 TGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDK 286
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADA 461
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE +
Sbjct: 287 AAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRV 346
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + +
Sbjct: 347 DATAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAV 402
Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
G L I + + + + K ++V ELPGC +WTV
Sbjct: 403 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 462
Query: 555 --------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVD 604
+ G + S DD+ H +LI+S E TM+L+T + E+ S
Sbjct: 463 PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-G 521
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
+ QG T+ AGN+ R ++QV G R+L+G L F P + + ++
Sbjct: 522 FATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQC 571
Query: 665 SIADPYVLLGMSDGSIRLLV 684
++ADPYV++ ++G + + +
Sbjct: 572 AVADPYVVIMSAEGHVTMFL 591
>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
[Heterocephalus glaber]
Length = 1440
Score = 330 bits (845), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 216/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SSGQPTTQGEARKEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W +V LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGLRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAP 1007
Query: 993 WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1052
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 WPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIER 1059
Query: 1053 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1060 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1115
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1164
+A GT +QGE+V RGRV + P +T+ +Y KE KG ++AL
Sbjct: 1116 GYVAAGTCLMQGEEVTCRGRVRDWERIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1175
Query: 1165 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1176 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1234
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1235 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1294
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1339
RA+FHVGAHV F R + GA+ G K N+ F TLDG IG + P
Sbjct: 1295 RRADFHVGAHVNTFWR-------TPCRGASEGPSKKSVVWENKHITWFATLDGGIGLLLP 1347
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1348 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1407
Query: 1400 PLEEQLEIAHQTGTTRSQILSNL 1422
E+ E+A + GTT IL +L
Sbjct: 1408 STMERGELAKKIGTTPDIILDDL 1430
Score = 303 bits (776), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 221/684 (32%), Positives = 351/684 (51%), Gaps = 101/684 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEGLTKNDKTTEGKSHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE P
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEE--------PP 418
Query: 464 TKRLRRSSSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
+K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP
Sbjct: 419 SKKKRVDSAAGWAGNKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPC 474
Query: 516 KDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGI 550
+ + G L I + + + + K ++V ELPGC +
Sbjct: 475 ANAAVGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDM 534
Query: 551 WTVYH---------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVT 600
WTV + G + S A DD H +LI+S E TM+L+T + E+
Sbjct: 535 WTVIAPVRKEEEETPKAEGSEQEPSAPEAQDDGRRHGFLILSREDSTMILQTGQEIMELD 594
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
S + QG T+ AGN+ R ++QV G R+L+G L F P + +
Sbjct: 595 TS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAP 643
Query: 661 VLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 644 IVQCAVADPYVVIMSAEGHVTMFL 667
>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1442
Score = 329 bits (844), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 216/686 (31%), Positives = 341/686 (49%), Gaps = 52/686 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHN 1049
WPV+K IPL+ T H + Y E +Y + S P + I + G + +
Sbjct: 1009 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1058
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
+ + + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1059 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1114
Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1115 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTAL 1174
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1175 CHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLL 1233
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 1234 RYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGM 1293
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGC 1336
+LL RA+FHVGAHV F R + GA G K N+ F TLDG IG
Sbjct: 1294 RLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGL 1346
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1347 LLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRY 1406
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1407 LYLSTMERSELAKKIGTTPDIILDDL 1432
Score = 304 bits (779), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 219/677 (32%), Positives = 347/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G ++ A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pan paniscus]
gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
troglodytes]
Length = 1442
Score = 329 bits (844), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 215/686 (31%), Positives = 340/686 (49%), Gaps = 52/686 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHN 1049
WPV+K IPL+ T H + Y E +Y + S P + I + G + +
Sbjct: 1009 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1058
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
+ + + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1059 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1114
Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1115 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTAL 1174
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1175 CHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLL 1233
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 1234 RYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGM 1293
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGC 1336
+LL RA+FHVGAHV F R + GA G K N+ F TLDG IG
Sbjct: 1294 RLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGL 1346
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1347 LLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRY 1406
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1407 LYLSTMERSELAKKIGTTPDIILDDL 1432
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
sapiens]
gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
[Homo sapiens]
gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
Length = 1443
Score = 329 bits (844), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 215/686 (31%), Positives = 340/686 (49%), Gaps = 52/686 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 784 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 840 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHN 1049
WPV+K IPL+ T H + Y E +Y + S P + I + G + +
Sbjct: 1010 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1059
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
+ + + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1060 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1115
Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTAL 1175
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1176 CHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLL 1234
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 1235 RYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGM 1294
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGC 1336
+LL RA+FHVGAHV F R + GA G K N+ F TLDG IG
Sbjct: 1295 RLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGL 1347
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1348 LLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRY 1407
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNL 1422
L E+ E+A + GTT IL +L
Sbjct: 1408 LYLSTMERSELAKKIGTTPDIILDDL 1433
Score = 304 bits (779), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S DD+ H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669
>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
Length = 1442
Score = 329 bits (843), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 213/681 (31%), Positives = 338/681 (49%), Gaps = 43/681 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 784 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 840 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHN 1049
WPV+K IPL+ T H + Y E +Y + S P + I + G + +
Sbjct: 1010 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1059
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
+ + + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1060 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1115
Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTAL 1175
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1176 CHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLL 1234
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 1235 RYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGM 1294
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
+LL RA+FHVGAHV F R AT + N+ F TLDG IG + P+
Sbjct: 1295 RLLRRADFHVGAHVNTFWRTPCRATEGLSKKSVVWE---NKHITWFATLDGGIGLLLPMQ 1351
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1352 EKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLST 1411
Query: 1402 EEQLEIAHQTGTTRSQILSNL 1422
E+ E+A + GTT IL +L
Sbjct: 1412 MERSELAKKIGTTPDIILDDL 1432
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S DD+ H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669
>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
rerio]
Length = 1449
Score = 328 bits (842), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 217/697 (31%), Positives = 357/697 (51%), Gaps = 68/697 (9%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + S T+ EE T
Sbjct: 790 WCLLVRENGVMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTR 845
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG +I +K E+A+ +HSRP+L A + + +L Y+A+ ++ +
Sbjct: 846 QG---DIPLVK--EVALVSLGYNHSRPYLLAHV-EQELLIYEAFPYDQQQ---------- 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PHG---------APCQRIT 915
+ S L+ +RF + P + RE+ P G R
Sbjct: 890 ----------AQSNLK-VRFKKMPHNINYREKKVKVRKDKKPEGQGEDTLGVKGRVARFR 938
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++ISG+ G F+ G P W +V R +R+HP DG+I +F+ HN+NC GF+Y
Sbjct: 939 YFQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNK 998
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
QG L+I LP+ +YD WPV+K IPL+ T H ++Y E +Y + SV +P +
Sbjct: 999 QGELRISVLPTYLSYDAPWPVRK-IPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR--- 1052
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
I + G + + + + + +++ ++++ P TR + ++ E+
Sbjct: 1053 --IPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR--VDLEEWEHVTC 1108
Query: 1095 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE----- 1148
++ V L + T + +A+GT +QGE+V RGR+L+ P +T+
Sbjct: 1109 MKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKV 1168
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
+Y KE KG ++AL G L+ A G KI L +L G+AF D LY+ + +KNF
Sbjct: 1169 LYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLTGMAFIDTQ-LYIHQMYSIKNF 1227
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
IL D+ KSI L ++ + L+L+++D L+ ++ EF++D + L +VSD KN+ ++
Sbjct: 1228 ILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVY 1287
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
Y P+ ES+ G +LL RA+F+VG+HV F R+ T A D N+ F
Sbjct: 1288 MYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWD--NKHITWFA 1345
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
TLDG +G + P+ E T+RRL LQ L +PH AGLNP++FR H + + + +I+
Sbjct: 1346 TLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNIL 1405
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
D ELL+ Y L E+ E+A + GTT IL +L ++
Sbjct: 1406 DGELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEI 1442
Score = 292 bits (747), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 209/641 (32%), Positives = 334/641 (52%), Gaps = 78/641 (12%)
Query: 94 DGIS-AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
DG S LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H
Sbjct: 58 DGKSRKEKLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTH 113
Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
L+ S+H FE PE L+ G P+V+VDP+ RC +LVYG +++L +
Sbjct: 114 DLKTLSLHYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT-- 168
Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTW 270
+ DE G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW
Sbjct: 169 -LADEQEGIVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTW 227
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
GRV+ + TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++
Sbjct: 228 PGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLL 287
Query: 331 YHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
Y +QS ++LN+ + P+ + LD + A+++ +D ++S K G++ +L
Sbjct: 288 YLNQSVPPFGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVL 347
Query: 390 TVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
T++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T + +
Sbjct: 348 TLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEE 407
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEELSLYGS-ASNNTESAQKT 500
G + E + + P K+ R S+ A L D ++ E+ +YGS A + T+ A T
Sbjct: 408 GKENEEKEKQ---PPNKKKRVDSNWAGCPGKGNLPDELD--EIEVYGSEAQSGTQLA--T 460
Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV----- 542
+SF V DS++NIGP S G + N + +G K ++
Sbjct: 461 YSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSIR 520
Query: 543 -------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIISL 583
ELPGC +WTV + + G + + + D + H +LI+S
Sbjct: 521 PQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSR 580
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
E TM+L+T + E+ S + QG T+ AGN+ + +IQV G R+L+G L
Sbjct: 581 EDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQL 636
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
F P + S ++ S+ADPYV++ ++G + + V
Sbjct: 637 HFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 670
>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit, partial [Rhipicephalus pulchellus]
Length = 726
Score = 328 bits (841), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 225/701 (32%), Positives = 354/701 (50%), Gaps = 75/701 (10%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGTGQG 814
V E+G LEI+ +P + F V F G+ +VD+ A +++E ++ S E
Sbjct: 73 VARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHESMPV- 131
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
+H + VV L ++ HSRP L A + D +L Y+A+ F T
Sbjct: 132 ----VHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF------------YETQ 169
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIFKNISG 922
R + LRF + D + RE + P ++ + F +ISG
Sbjct: 170 REGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPFSDISG 220
Query: 923 HQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G FL G RP W M R LR HP DG I F HNVNC GF++ QG L+I
Sbjct: 221 YSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRIS 280
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
LP+ TYD WPV+KV PL+ TPH + Y + Y ++ S P P N ++
Sbjct: 281 TLPTHLTYDAPWPVRKV-PLRCTPHFVNYHVDSKTYCVVTSQP--DPCNHLVRF-----T 332
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVT 1099
G + + L + T++++ +++L P W+T + + E+ ++ V
Sbjct: 333 GEEKEYELLERDSRYIFPTMDKFSLQLLSP----VSWETIPNTRVDLDEWEHLTCLKNVM 388
Query: 1100 LFNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1153
L + TT + LA+GT Y GEDV +RGR+++ P +N + VYSKE
Sbjct: 389 LSSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLTKNKIKIVYSKE 448
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
KG ++AL+ + G LL A G KI + + EL G+AF D +Y+ S+ VKN IL+GD
Sbjct: 449 QKGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDTQ-IYIHSVVTVKNLILVGD 507
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ KS+ L ++E L+L+++D L+ +A EF ID + +S +V+D ++N+ ++ Y P+
Sbjct: 508 VFKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPE 567
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
ES GQ+LL R +FHVG+ V R++ + S R + TLDGS
Sbjct: 568 SRESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGS 627
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN----GKAHRPGPDSIVD 1389
+ + P+ E T+RRL LQ LV ++PH AGLNP+++R ++S G H+ +I+D
Sbjct: 628 LAYVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHK----NILD 683
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
EL+ + L E+ E++ + GTT +QI +L ++ T+
Sbjct: 684 GELIWKFMHLSFMERSELSKKIGTTVTQITDDLLEIETYTA 724
>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
Length = 1003
Score = 324 bits (830), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 269/963 (27%), Positives = 429/963 (44%), Gaps = 153/963 (15%)
Query: 543 ELPGCKGIWTVY------HKSSRGHNADSSRMAAYDDEY-----------------HAYL 579
+LPGC +WTV G A+S+ + H +L
Sbjct: 107 DLPGCLDMWTVIGIPPESKPQEEGEKAESAGSEEKPEGEKEETKEEGPPDVDLTNSHGFL 166
Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
I+S E TMVL+T + E+ S + QG T+ AGN+ + +IQV G R+L G
Sbjct: 167 ILSREDSTMVLQTGKEIMELDHS-GFSTQGPTVYAGNIGNNKYIIQVSPYGIRLLQGVKQ 225
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTP 697
Q L F S+ + S+ADPY L+ DG I LL V DP +
Sbjct: 226 LQHLPFD---------SKGPAFVLASVADPYALVMSEDGQILLLTLVNDPYGSGHRLSAK 276
Query: 698 AAIESSKKPVSSCTLYHDKG------------PEPWLRKTSTDAW------LSTGV---- 735
+ K + Y D P P + K + + + TGV
Sbjct: 277 KIDMAGKSQAITVCAYRDTSGLFTVSSPSTTTPAPEVEKDAAEPAAEDAVAMETGVDDED 336
Query: 736 ----GEAIDGADGGPLDQGDI-------------------YSVVCYESGALEIFDVPNFN 772
GE G + + ++ + V+C E+G+LEI+++P+F+
Sbjct: 337 EMLYGEPSAKPSGPAVVREEVKPSTSTVQEPVVKEVEPTHWCVICRENGSLEIYNLPDFS 396
Query: 773 CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V+ V F +G +VD++ + + K+ V E+ M
Sbjct: 397 LVYLVKNFPTGMKLLVDSFQSTSSASTSQS------------DKQGDQLASVKEILMVGL 444
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
SRP L A + D +L Y+A+ P + S P T + V + + R
Sbjct: 445 GHKGSRPHLLARV-DEDLLIYEAF----PYHLS----PSYTMLKIRFKKVQHNLILRERK 495
Query: 893 SRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQL 949
A +EE+ G+ Q F +ISG+ G F+ GS P W M R LR+HP
Sbjct: 496 GGKTKKAGDQEESDGQTGSRIQHFRTFTDISGYSGLFICGSSPHWLFMTSRGALRIHPMS 555
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
DG++ F+ HNVNC GF+Y G L+I LP+ +YD WPV+KV PL+ TPH +
Sbjct: 556 IDGAVTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKV-PLRCTPHFVA 614
Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
Y E +Y V+ + N++ + D++ ++ D + ++++ ++++
Sbjct: 615 YHMECKVY--AVAASTFEMCNRIPRMAGDEKEYDAVEKD-----DRYIYPMLDKFNIQLM 667
Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR 1129
P TR MQ EN E +G +V + G+
Sbjct: 668 SPVSWEIIPNTRG---MQLEENY-------------AECTCSFLVGINFV----LFVAGQ 707
Query: 1130 VLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1184
+++ P +N + E+Y KE KG +SAL G+LL A G KI L ++
Sbjct: 708 IVILDVIEVVPEPGQPLTKNKIKELYGKEQKGPVSALCGCNGYLLSAIGQKIFLWEFRNN 767
Query: 1185 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1244
+L G+AF D +Y+ + +KN+++L D+ KSI L ++ D L+ +
Sbjct: 768 DLIGVAFIDTQ-VYIHTAISIKNYVILADVFKSISLLRYQ-----------DMRPLETYC 815
Query: 1245 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--Q 1302
EF +D + + +VSD QKN ++ Y P+ ES+ GQ+L+ RA+F+VG+HV F R+ +
Sbjct: 816 VEFFVDNAQIGFLVSDAQKNFLLYSYQPEARESYGGQRLVRRADFNVGSHVNTFFRVRCK 875
Query: 1303 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHV 1362
++ S +R A K R +F TLDG +G + P+ E T+RRL LQ L+ +P
Sbjct: 876 IMDPSGERRRDADTVAK--RHVTMFATLDGGLGALLPMAEKTYRRLLMLQNTLMTHMPFP 933
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
AGLNP++FR N ++ +I+D ELL + L + E+ E+A + GT+ I +L
Sbjct: 934 AGLNPKAFRMLKHNHRSLINACRNILDGELLWKFLHLSVVERSELARKIGTSPETITEDL 993
Query: 1423 NDL 1425
D+
Sbjct: 994 MDI 996
>gi|215701517|dbj|BAG92941.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 265
Score = 323 bits (828), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 161/246 (65%), Positives = 191/246 (77%), Gaps = 1/246 (0%)
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
GYIEPV+VILHE+E TWAGR+ KHHTCMISA SIS TLKQHP+IWSA NLPHDAY+LLA
Sbjct: 19 GYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLKQHPVIWSAANLPHDAYQLLA 78
Query: 314 VPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
VP PI GVLV+ AN+IHYHSQS SC+L LNN++ D S E+ +S+F VELDAA ATWL
Sbjct: 79 VPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQVELDAAKATWLS 138
Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
ND+ + STK G+++LLTVVYDGRVVQRLDL K+ SVL+S +T+IGNS FFLGSRLGDSL
Sbjct: 139 NDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNSFFFLGSRLGDSL 198
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
LVQF+ + S+L E DIE D P +KRL+R SD LQD+ + EELS A N
Sbjct: 199 LVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSVEELSFQNIIAPN 258
Query: 493 NTESAQ 498
+ ESAQ
Sbjct: 259 SLESAQ 264
>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
Length = 1468
Score = 318 bits (815), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 350/1473 (23%), Positives = 596/1473 (40%), Gaps = 270/1473 (18%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVL----------------MDGI 96
N++V ++++I+ E ++K G+ RR+L
Sbjct: 29 NVLVAKTSLLQIFTTTTYETELNSALADAKQPGDIDRRILDADEEQTFAADIALQRSQVE 88
Query: 97 SAASLELVCHYRLHGNV---ESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
S L LV Y L G+V + + +LS GG ++++ +F+DAK S++E+D
Sbjct: 89 SVTKLVLVAEYPLSGSVTGLQRIKLLSTRSGG-------EAVLASFKDAKCSLMEWDPET 141
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGVLVYGLQMII 203
+ + S+H +E RE F P+V DP RC + G + I
Sbjct: 142 NSITTISLHYYE----------REEFC-SPVVSDGLPTELVADPGSRCAALRFSGDMLAI 190
Query: 204 LKASQGG-----------------------------SGLVGDEDTFGSGG---------G 225
+ Q +G EDT G G
Sbjct: 191 IPFRQREDEELSLGRGDADEVMGDEDGDNDDWDPEMAGTARGEDTIMGEGDVKTTDATEG 250
Query: 226 FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
S V+++ LD + HV F+H Y EP IL+ TW G ++ + I
Sbjct: 251 KDRPYHPSFVLSVSQLDDAISHVISLTFLHEYREPTFGILYSPRRTWTGLLAAEGRKDTI 310
Query: 284 SALSISTTLKQHP--LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
S + I+ L+Q I S LP+D +K++ + P GG L+VG N IH + +
Sbjct: 311 SYIVITLDLEQKASTPILSVSGLPYDIFKVVPLAPPTGGSLLVGGNELIHVDQAGKTTGV 370
Query: 341 ALNNYAVSLDSSQELP-RSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
A+N + L +S +EL+ + L+++ LL TK G+ V++ DGR
Sbjct: 371 AVNPFCRRSTGFAGLADQSDLCLELEGSQVVELESEGGDMLLFTKRGEGVIVGFRMDGRN 430
Query: 398 VQRLDLSKTNP---SVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
V + ++K N S++ ++T +G F+G GD+ ++++ G+K
Sbjct: 431 VSGVKITKLNNHPGSIVGGRVSTAVGLGGRRLFVGCIEGDARVLKWRRKGERKKAGEGIK 490
Query: 452 EE----------FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
EE +G +E SS + NG N+ +Q +
Sbjct: 491 EEVLENEDEDDVYGALEDMDDDLYGGGGDSSFRKDSLTNGRR--------NSEAKSQGEY 542
Query: 502 SFAVRDSLVNIGPLKDFSYG------------------LRINADASATGISKQSNYELV- 542
F D L N+GP +D + G L + + + S+ S ++
Sbjct: 543 IFQTHDRLTNLGPFRDITLGKPTFPEESRERQKGVSPELELVTTSGPSNTSEDSGISIIR 602
Query: 543 -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE-----YHAYLIISLEAR 586
+ P C+ +WTV +S+ NA DD + +L ++
Sbjct: 603 KSISPTIVGRFDFPQCQALWTVRARSANTSNAAVGLGGEEDDRSVEESFDRFLFVTKNDE 662
Query: 587 TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
+ V D EV D+ +G TI G + R++QV R+ D +
Sbjct: 663 SQVFRVGDTFEEV-RGTDFESEGETIEVGVVGNGMRIVQVVSEQVRVYDCDLQLSQII-- 719
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
P E +G E V + DPY+LL DGS + D + ++ + AI+ K
Sbjct: 720 PMFDEE-TGEEGPNVHRARVCDPYILLIKVDGSPAVYKMDSTNLELAEERADAIKFDKYQ 778
Query: 707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ-GDIYSVVCYESGALEI 765
S C KG + +D P++ D + G L+I
Sbjct: 779 -SGCIYASTKG-----------------IFIPLDA----PVENVKDYLLFLLTVEGGLQI 816
Query: 766 FDVPN-FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKV 824
+D+ N +F+ + F +T D+ T ++ E+ K+ I + V
Sbjct: 817 YDLSNPVTPLFSAESF--------NTLYPLLRTDNPTSPTANREK---HRSKQLIIEILV 865
Query: 825 VELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS--KSDDPVSTSRSLSVSNV 882
++ + P+L A ++ + Y+ ++ P KS +P S LS+S
Sbjct: 866 ADMG----DSIFKEPYLIARSSNNDLTFYKPFISSSPSTLRFIKSPNPHIASNELSLSAG 921
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRE 941
+ + R L T N++G+ FL G+ P + + +
Sbjct: 922 TKNIFRPL------------------------TAVYNLAGYSAVFLPGADPSFVIKTAKS 957
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
R+H +L + + + H+ + GF+YV S GI+++ +P+ T+D W +KV P
Sbjct: 958 SPRIH-KLAGTGVRSLSSFHSAGADRGFVYVDSLGIVRVALMPAEFTFDGNWGYKKVTPG 1016
Query: 1002 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061
+ + YF N+Y ++S +P + + +E G N++ D +
Sbjct: 1017 EHV-QSLAYFPPMNVY--VISTSKRQPFD------LAEEDG------NIAKDDTTLQPEI 1061
Query: 1062 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQ 1120
+ +++L P W +E AL V+ ++L + TKE + L+++GTA +
Sbjct: 1062 DSGTLKLLSPQT----WTAVDEYKFAHNEIALVVKTISLEVSEHTKERKQLVSVGTAIFR 1117
Query: 1121 GEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPK 1175
GED +ARG + +F P T V +E+KG +SA+ + G+LL A G K
Sbjct: 1118 GEDHSARGGIYVFEVIEVVPEPNRPETNRKLKLVTREEVKGTVSAICGVNGYLLAAQGQK 1177
Query: 1176 IILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1234
I++ + L +AF D LYV + IL GD KS++F + E+ ++ L
Sbjct: 1178 IMVRGLKEDQSLLPVAFLDMC-LYVSVAKNLDGMILFGDFMKSVWFAGFSEEPYKMTLFG 1236
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1294
KD L+ + EFL DG+ L VV D + NI Y P+ +S GQ+L+ RA+F G
Sbjct: 1237 KDTQKLEIISAEFLPDGNQLYFVVVDAESNIHTLQYDPEHPKSLAGQRLIRRADFFSGHE 1296
Query: 1295 VTKFLRLQM----LATSSDRTGAAPGSDKT------------NRFALLFGTLDGSIGCIA 1338
++ L L+ SS+ A +D + + +L GT GS+ I
Sbjct: 1297 ISTLTMLPFSPYSLSASSNSHLPADATDTSPLHHHHQNQQQQQEYFVLAGTQTGSLAMIR 1356
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
+ E +RRL +Q ++V+ HVAGLNPR +R
Sbjct: 1357 TIPETAYRRLNIVQGQIVNGEEHVAGLNPREYR 1389
>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Aspergillus oryzae 3.042]
Length = 1389
Score = 310 bits (794), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 338/1394 (24%), Positives = 592/1394 (42%), Gaps = 197/1394 (14%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
P S +G+ FLGS DS+L+ G S+ SSG K+ +
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E D+ +S D +D + E+ + G + ++F D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
GPLKD ++G + S ELV +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560
Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
C +WT +S H ++ + A + E H Y+++S +A + E +++ +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613
Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
+ F + TI G L G+ RV+Q+ R DG DL P E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
S E +S S+ DPYV + D ++ LL D S V+ I +SK +SC
Sbjct: 669 DTS--EERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
LY DK TG+ +I A L Q + + + L I+ +P+
Sbjct: 725 LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
+ + G + E K S T +E + + V +L
Sbjct: 768 KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
WS S P+L + Y+ ++ T +P + L +N+ R+
Sbjct: 809 WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
+ D + EE P + I NISG F G P + + L
Sbjct: 858 -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
G + + C GFI + S+G++ +CQ+P G D W +Q+ IP+ + Y
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQ-IPIGEQVDHLAYS 972
Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1071
+ +Y + S L D E+ + N S V+ ++++ P
Sbjct: 973 SSSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRSSLKVVSP 1021
Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRV 1130
W + + +E+ + V+ ++L + T E + ++ +GTA+ +GED+A+RG V
Sbjct: 1022 KT----WTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCV 1077
Query: 1131 LLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KW 1181
+F + +P+ + V + +KGA++AL+ + QG L++A G K I+ K
Sbjct: 1078 YVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKE 1137
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKD 1236
G+ L +AF D +++VK ++ D K ++F + E+ +++L AKD
Sbjct: 1138 DGSLLP-VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKD 1192
Query: 1237 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1296
L+ A +FL DG+ L ++V+D N+ + Y P+ +S G +LLSR++FH G ++
Sbjct: 1193 LDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFIS 1252
Query: 1297 KFLRLQMLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1352
L + SS++ A K R +L + +GS+G + + E ++RRL +LQ
Sbjct: 1253 TLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQ 1312
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+L +++ H GLNPR+FR S+G A R ++D +LL + + + ++EIA + G
Sbjct: 1313 SQLTNTIEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKVEIASRVG 1368
Query: 1413 TTRSQILSNLNDLA 1426
+I ++ ++
Sbjct: 1369 ANEWEIKADFEAIS 1382
>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1424
Score = 310 bits (794), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 348/1475 (23%), Positives = 622/1475 (42%), Gaps = 203/1475 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
N+VV +NV+ I+ VR +E ++ E RR V MDG
Sbjct: 47 NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105
Query: 96 ----------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKI 142
+ + V + LHG V E++ I+S N D ++++F+DAKI
Sbjct: 106 GTVKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFKDAKI 161
Query: 143 SVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQM 201
++LE+ D IH L S+H +E +P+ + L +S ++VDP RC + + +
Sbjct: 162 ALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLPKDAI 217
Query: 202 IILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIE 257
IL Q + L V ++D S +++L D +++HV DF+F+ G+
Sbjct: 218 AILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLPGFNN 274
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + +L + E TW+GR+ T + ++ +P+I + LP D L+ +
Sbjct: 275 PTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVPCVAS 334
Query: 318 IGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAAHATW 371
+GGV+++ +NTI Y S AL +N ++ S S +P +S ++ L+ HA
Sbjct: 335 LGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGCHAVL 393
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLG 430
+ + + K G + + +V DG+ V +L ++ + + S + I FLGS +G
Sbjct: 394 VDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLGSIVG 453
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ L ++A + ++ + + +
Sbjct: 454 ASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH---------S 504
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG------------ 532
+N + ++ DSL GP+ D ++ L D +ATG
Sbjct: 505 NNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFTLFQR 564
Query: 533 -ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-- 588
+ ++ +L + G +GIW+ + R + R + + +IIS +A
Sbjct: 565 DLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDANPSPG 624
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLSFGP 647
V A ++ ++ + G TI AG+ F R ++ V R+L+ G + +DL
Sbjct: 625 VSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDLD--- 681
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIESSKK 705
+ + SI DP+VL+ D +I L +G+ + + +P +SS+
Sbjct: 682 ------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDKSSRY 735
Query: 706 PV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+SC P D +++ + ++ + + ++
Sbjct: 736 LAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLILVRP 789
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
G +EI+ +P F+ + D+Y AL + RK N
Sbjct: 790 QGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPRKSN- 837
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ V ++ + P+L L G I+ Y+A P + S+
Sbjct: 838 -DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------SIPP 884
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLS 929
S VS +++ ++ + + EET ++ I + S G F +
Sbjct: 885 SRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSGVFFT 944
Query: 930 GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT---SQGILKICQLPSG 986
G RP W + + G I ++ H++ H F + S+G + G
Sbjct: 945 GDRPSWIVATNK----------GGIRIYSSGHHIV--HSFTSCSLWESKGDFLVYS-DEG 991
Query: 987 STYDNYWP---VQKVIPLKATPHQITYFAEKNLYP---LIVSVPVLKPLNQVLSLLIDQE 1040
+ + P + V+P + P Y A P LIV+ ++ + E
Sbjct: 992 PSLLEWMPDLCLDSVLPSRNIPRSRAY-ANVVYDPSAMLIVAASSMQ-----ANFASFDE 1045
Query: 1041 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
G+ I S+V L + + + ++ P+ W T +E + VTL
Sbjct: 1046 DGNIIWEPEASNVSLPK---CDCSTLELIAPEA----WITMDGYEFAPNEYVNALECVTL 1098
Query: 1101 FNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNL-----VTEVYSKE 1153
+T+ ++ +A+GT+ +GED+A +G LF D QNL + + +
Sbjct: 1099 ETLSTETGSKDFIAVGTSIDRGEDLAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDD 1158
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
KG ++AL + G+L+ + G KI + + E L G+AF D +YV SL +VKNF+L+G
Sbjct: 1159 AKGPVTALCGINGYLVSSMGQKIFIRAFDMDERLVGVAFLDVG-VYVTSLRVVKNFLLIG 1217
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D KSI+F++++E +L +LAKD +F TLS+V D ++++ Y P
Sbjct: 1218 DAVKSIWFVAFQEDPYKLVVLAKDVHRTHVTNADFFFTDDTLSIVTEDGDGILRMYAYDP 1277
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
ES GQ LL R EFH + L ++A + P + + F+ DG
Sbjct: 1278 DDPESKNGQHLLCRTEFHNHSECRSSL---VIARRTKEESVLPQAKILSAFS------DG 1328
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
S+ + P+D+ +F+RLQ LQ +L ++ HVAGLNPR++R N +P I+D +L
Sbjct: 1329 SLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVR-NDFVSKPLSKDILDGQL 1387
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
LS +E LP+ Q E+ Q GT R+ +L + +LA+
Sbjct: 1388 LSAFESLPISRQNEMTKQIGTERNIVLHDWMELAI 1422
>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1428
Score = 309 bits (791), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 348/1479 (23%), Positives = 622/1479 (42%), Gaps = 207/1479 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
N+VV +NV+ I+ VR +E ++ E RR V MDG
Sbjct: 47 NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105
Query: 96 --------------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFE 138
+ + V + LHG V E++ I+S N D ++++F+
Sbjct: 106 GTVKSTGKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFK 161
Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
DAKI++LE+ D IH L S+H +E +P+ + L +S ++VDP RC + +
Sbjct: 162 DAKIALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLP 217
Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
+ IL Q + L V ++D S +++L D +++HV DF+F+
Sbjct: 218 KDAIAILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLP 274
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
G+ P + +L + E TW+GR+ T + ++ +P+I + LP D L+
Sbjct: 275 GFNNPTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVP 334
Query: 314 VPSPIGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAA 367
+ +GGV+++ +NTI Y S AL +N ++ S S +P +S ++ L+
Sbjct: 335 CVASLGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGC 393
Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLG 426
HA + + + K G + + +V DG+ V +L ++ + + S + I FLG
Sbjct: 394 HAVLVDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLG 453
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
S +G S+L++ L ++A + ++ + +
Sbjct: 454 SIVGASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH------ 507
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------- 532
++N + ++ DSL GP+ D ++ L D +ATG
Sbjct: 508 ---SNNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFT 564
Query: 533 -----ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR 586
+ ++ +L + G +GIW+ + R + R + + +IIS +A
Sbjct: 565 LFQRDLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDAN 624
Query: 587 TM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDL 643
V A ++ ++ + G TI AG+ F R ++ V R+L+ G + +DL
Sbjct: 625 PSPGVSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDL 684
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIE 701
+ + SI DP+VL+ D +I L +G+ + + +P +
Sbjct: 685 D---------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDK 735
Query: 702 SSKKPV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV 755
SS+ +SC P D +++ + ++ + + +
Sbjct: 736 SSRYLAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLI 789
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
+ G +EI+ +P F+ + D+Y AL + R
Sbjct: 790 LVRPQGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPR 838
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
K N + V ++ + P+L L G I+ Y+A P +
Sbjct: 839 KSN--DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------ 884
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQG 925
S+ S VS +++ ++ + + EET ++ I + S G
Sbjct: 885 SIPPSRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSG 944
Query: 926 FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT---SQGILKICQ 982
F +G RP W + + G I ++ H++ H F + S+G +
Sbjct: 945 VFFTGDRPSWIVATNK----------GGIRIYSSGHHIV--HSFTSCSLWESKGDFLVYS 992
Query: 983 LPSGSTYDNYWP---VQKVIPLKATPHQITYFAEKNLYP---LIVSVPVLKPLNQVLSLL 1036
G + + P + V+P + P Y A P LIV+ ++ +
Sbjct: 993 -DEGPSLLEWMPDLCLDSVLPSRNIPRSRAY-ANVVYDPSAMLIVAASSMQ-----ANFA 1045
Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
E G+ I S+V L + + + ++ P+ W T +E +
Sbjct: 1046 SFDEDGNIIWEPEASNVSLPK---CDCSTLELIAPEA----WITMDGYEFAPNEYVNALE 1098
Query: 1097 VVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNL-----VTEV 1149
VTL +T+ ++ +A+GT+ +GED+A +G LF D QNL + +
Sbjct: 1099 CVTLETLSTETGSKDFIAVGTSIDRGEDLAVKGATYLFEIVEVVPDYSQNLKRWYKLKLL 1158
Query: 1150 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNF 1208
+ KG ++AL + G+L+ + G KI + + E L G+AF D +YV SL +VKNF
Sbjct: 1159 ARDDAKGPVTALCGINGYLVSSMGQKIFIRAFDMDERLVGVAFLDVG-VYVTSLRVVKNF 1217
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
+L+GD KSI+F++++E +L +LAKD +F TLS+V D ++++
Sbjct: 1218 LLIGDAVKSIWFVAFQEDPYKLVVLAKDVHRTHVTNADFFFTDDTLSIVTEDGDGILRMY 1277
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
Y P ES GQ LL R EFH + L ++A + P + + F+
Sbjct: 1278 AYDPDDPESKNGQHLLCRTEFHNHSECRSSL---VIARRTKEESVLPQAKILSAFS---- 1330
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
DGS+ + P+D+ +F+RLQ LQ +L ++ HVAGLNPR++R N +P I+
Sbjct: 1331 --DGSLSSLTPVDDASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVR-NDFVSKPLSKDIL 1387
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
D +LLS +E LP+ Q E+ Q GT R+ +L + +LA+
Sbjct: 1388 DGQLLSAFESLPISRQNEMTKQIGTERNIVLHDWMELAI 1426
>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
norvegicus]
gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 1386
Score = 308 bits (790), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 219/669 (32%), Positives = 345/669 (51%), Gaps = 75/669 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429
Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485
Query: 522 -------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH---------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 ENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+ AG
Sbjct: 546 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAG 604
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
N+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 NIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMS 654
Query: 676 SDGSIRLLV 684
++G + + +
Sbjct: 655 AEGHVTMFL 663
Score = 281 bits (718), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 202/679 (29%), Positives = 317/679 (46%), Gaps = 89/679 (13%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 778 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 833
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 834 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 883
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 884 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 943
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 944 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1003
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1004 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1055
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENE 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1056 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET---- 1107
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHL 1168
V G LKG ++A L QG
Sbjct: 1108 ---------VSG--------------------------------LKGYVAAGTCLMQGEE 1126
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
+ G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1127 VTCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESK 1184
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+
Sbjct: 1185 TLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRAD 1244
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDEL 1343
FHVGAHV F R + GAA G K N+ F TLDG IG + P+ E
Sbjct: 1245 FHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEK 1297
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E
Sbjct: 1298 TYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTME 1357
Query: 1404 QLEIAHQTGTTRSQILSNL 1422
+ E+A + GTT IL +L
Sbjct: 1358 RSELAKKIGTTPDIILDDL 1376
>gi|74212803|dbj|BAE33365.1| unnamed protein product [Mus musculus]
Length = 741
Score = 308 bits (788), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
musculus]
Length = 1388
Score = 308 bits (788), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/671 (32%), Positives = 344/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
Score = 281 bits (719), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 202/679 (29%), Positives = 317/679 (46%), Gaps = 89/679 (13%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1006 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1057
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENE 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1058 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET---- 1109
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHL 1168
V G LKG ++A L QG
Sbjct: 1110 ---------VSG--------------------------------LKGYVAAGTCLMQGEE 1128
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
+ G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1129 VTCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESK 1186
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+
Sbjct: 1187 TLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRAD 1246
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDEL 1343
FHVGAHV F R + GAA G K N+ F TLDG IG + P+ E
Sbjct: 1247 FHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEK 1299
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E
Sbjct: 1300 TYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTME 1359
Query: 1404 QLEIAHQTGTTRSQILSNL 1422
+ E+A + GTT IL +L
Sbjct: 1360 RSELAKKIGTTPDIILDDL 1378
>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
[Macaca mulatta]
Length = 1436
Score = 307 bits (787), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 183/524 (34%), Positives = 276/524 (52%), Gaps = 34/524 (6%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
R F++I G+ G F+ G P W +V R LR+HP DG + +F HNVNC GF+Y
Sbjct: 923 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLY 982
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S
Sbjct: 983 FNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TN 1034
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSS 1089
I + G + + + + + E + ++++ P W+ A I +Q
Sbjct: 1035 TPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEW 1090
Query: 1090 ENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1148
E+ ++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1091 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 1150
Query: 1149 -----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1203
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ +
Sbjct: 1151 NKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMI 1209
Query: 1204 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1263
VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +
Sbjct: 1210 SVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDR 1269
Query: 1264 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT--- 1320
N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R + GA G K
Sbjct: 1270 NLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVV 1322
Query: 1321 --NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
N+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + +
Sbjct: 1323 WENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRR 1382
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1383 TLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1426
Score = 286 bits (732), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 218/704 (30%), Positives = 344/704 (48%), Gaps = 116/704 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR---------------- 506
T QD V+ E+ +YGS A + T+ A T+SF VR
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVRLRQQGPHPSQCPQRPL 482
Query: 507 --------DSLVNIGPLKDFSYG--LRINADASATGISKQSNYELV-------------- 542
DS++NIGP + + G ++ + S + + E+V
Sbjct: 483 TFAVPQVCDSILNIGPCANAAMGEPAFLSEEVPRVVNSPEPDLEIVVCSGHGKNGALSVL 542
Query: 543 ------------ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
ELPGC +WTV + G ++ A DD H +LI
Sbjct: 543 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 602
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
+S E TM T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 603 LSREDSTM---TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 655
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 656 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 692
>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
musculus]
Length = 1417
Score = 307 bits (786), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 56 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 108
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 109 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 161
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 162 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 218
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 219 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 278
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 279 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 338
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 339 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 398
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 399 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 456
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 457 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 512
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 513 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 572
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 573 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 631
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 632 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 681
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 682 VIMSAEGHVTMFL 694
Score = 281 bits (718), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 202/679 (29%), Positives = 317/679 (46%), Gaps = 89/679 (13%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 809 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 864
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 865 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 914
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 915 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 974
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 975 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1034
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1035 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1086
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENE 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1087 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET---- 1138
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHL 1168
V G LKG ++A L QG
Sbjct: 1139 ---------VSG--------------------------------LKGYVAAGTCLMQGEE 1157
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
+ G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1158 VTCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESK 1215
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+
Sbjct: 1216 TLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRAD 1275
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDEL 1343
FHVGAHV F R + GAA G K N+ F TLDG IG + P+ E
Sbjct: 1276 FHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEK 1328
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E
Sbjct: 1329 TYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTME 1388
Query: 1404 QLEIAHQTGTTRSQILSNL 1422
+ E+A + GTT IL +L
Sbjct: 1389 RSELAKKIGTTPDIILDDL 1407
>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Pan troglodytes]
Length = 1296
Score = 306 bits (783), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNEALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
Score = 300 bits (769), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 195/580 (33%), Positives = 292/580 (50%), Gaps = 42/580 (7%)
Query: 860 GPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA-PCQRITIFK 918
G E + DD S S S S+ R S+ P D R+ P A P + +
Sbjct: 732 GSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPAD---RDPAPFRAEPTHWCLLVR 788
Query: 919 NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
F+ G P W +V R LR+HP DG + +F HNVNC GF+Y QG
Sbjct: 789 ENGTMXXXFICGPSPPWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGE 848
Query: 978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSL 1035
L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S P +
Sbjct: 849 LRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR-------- 899
Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1093
I + G + + + + + E + ++++ P W+ A I +Q E+
Sbjct: 900 -IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVT 954
Query: 1094 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1148
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 955 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1014
Query: 1149 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1207
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKN
Sbjct: 1015 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1073
Query: 1208 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1267
FIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ +
Sbjct: 1074 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1133
Query: 1268 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1322
+ Y P+ ES+ G +LL RA+FHVGAHV F R + GA G K N+
Sbjct: 1134 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENK 1186
Query: 1323 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1187 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQN 1246
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1247 AVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1286
>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1223
Score = 305 bits (782), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 210/689 (30%), Positives = 338/689 (49%), Gaps = 54/689 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V C E+G LE++ +P+ F V F G +VD S S TG
Sbjct: 560 WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 604
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+++ +V E+ + + ++ A++ D I+ Y+A+ P NT + +
Sbjct: 605 DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 659
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPL-DAYTREETPHGAP---------CQRITIFKNISG 922
R + + + + R S+ P + T+ ET A R+ F N+
Sbjct: 660 V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKPETETEAESDTKTTSRRVNRLREFHNVQT 718
Query: 923 HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+SGS P W V R LR HP DG+I F HNVNC +GF+Y + L+IC
Sbjct: 719 YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 778
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
LPS +YD WPV+KV PL+ TPH + Y E Y ++ SV Q + +
Sbjct: 779 VLPSHLSYDAPWPVRKV-PLRCTPHFVAYHVETKTYAVVTSV-------QETKTHVWKVT 830
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
G +I + D T + +++ P TR I +++EN ++VV L
Sbjct: 831 GEEIGEEPVERDDRFVPTTKVVFSIQLFSPVSWDAIPNTR--IEYEAAENVTCLKVVNLS 888
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1155
T + + + T +V ED+ RG V ++ P +N + +Y K K
Sbjct: 889 CEGTMTGKKGYVVVATTHVYSEDLQTRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQK 948
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
G +SAL + G LL G K+ + ++ +L G+AF D +Y+ + VK FIL+ D+
Sbjct: 949 GPVSALCEVMGFLLTCIGQKVYMWQFKDNDLIGLAFIDTQ-IYIHNAVSVKQFILITDVM 1007
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
K YFL ++ Q L+L+++D L+ F EF++D ++ +VSD KN+ +F+Y P+
Sbjct: 1008 KGAYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAP 1067
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDGS 1333
ES G LL R + ++G+ V F+R++ T S+++ + P R + F TLDGS
Sbjct: 1068 ESHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGP---VLRRQVVFFATLDGS 1124
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+G + P+ E T+RRL LQ L + +PHV GLNP+S+R S+ + +I+D +LL
Sbjct: 1125 LGLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLL 1184
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
Y L + E+ E A + GT+ QI+S+L
Sbjct: 1185 LKYCHLSVVERNEFAKKIGTSVDQIISDL 1213
Score = 182 bits (461), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 141/456 (30%), Positives = 221/456 (48%), Gaps = 60/456 (13%)
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
RV+ + TC I ALS++ K HP+IWS +LP+D ++ AVP PIGGVL++ N++ Y
Sbjct: 11 RVAVRQDTCSIVALSLNMAQKVHPIIWSQSSLPYDCMQVQAVPKPIGGVLILAVNSLLYL 70
Query: 333 SQSASCALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGD 385
+QS + Y VSL+S S P + + +D AT++ D LS K G+
Sbjct: 71 NQS------IPPYGVSLNSLTDWSTAFPLKTQEGVKLSMDCTQATFISYDRLALSLKDGE 124
Query: 386 LVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
+ +LT++ DG R V+ L K SVLT+ I +G+ FLGSRLG+SLL+++T +
Sbjct: 125 IYVLTLLVDGMRSVRGFHLDKAAASVLTTCICPMGDGFLFLGSRLGNSLLLKYTEKVSET 184
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----MVNGEELSLYGSASNNTESAQKT 500
+ K E + P+ K +SD + + + +EL +YG T + +
Sbjct: 185 SPTDASKTEEPKPGEEPPTKKMRSDDASDWMASDTKFLDDPDELEVYGKQVQKTGTQLTS 244
Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELVE---- 543
+SF + DSL+NIGP + G + N D + +G K +++
Sbjct: 245 YSFEICDSLLNIGPCGNMIMGEPAFLSEEFQGNVDPDLELVTTSGYGKNGALSVLQRTIR 304
Query: 544 --------LPGCKGIWTVYHKSSRGHNAD----SSRMAAYDDEYHAYLIISLEARTMVLE 591
LPGC +WTV KS + AD S + D + HA+LI+S + +MVL+
Sbjct: 305 PQVVTTFNLPGCLDMWTV--KSLKEAKADEKSEESEASPEDKDRHAFLILSKQDSSMVLQ 362
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
T +TEV + Q TI A N+ R ++QV + +++G Q +
Sbjct: 363 TGQEITEVAAG-GFSTQAPTIFASNMGDDRYIVQVMNKSICLMEGVEQIQHMVL------ 415
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
S + S+ADPY+LL +G L+ P
Sbjct: 416 ----DVGSPIKQCSLADPYLLLLTENGDPILMTLKP 447
>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1428
Score = 305 bits (780), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 219/677 (32%), Positives = 347/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G ++ A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
Score = 260 bits (664), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 168/557 (30%), Positives = 274/557 (49%), Gaps = 36/557 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S I + G + + +
Sbjct: 1009 PWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNTPCARIPRMTGEEKEFETIE 1060
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1061 RDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGL 1116
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALAS 1163
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCH 1176
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L +
Sbjct: 1177 CNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRY 1235
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +L
Sbjct: 1236 QEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRL 1295
Query: 1284 LSRAEFHVGAHVTKFLR 1300
L RA+FHVGAHV F R
Sbjct: 1296 LRRADFHVGAHVNTFWR 1312
>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Nasonia vitripennis]
Length = 1415
Score = 304 bits (778), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 208/689 (30%), Positives = 348/689 (50%), Gaps = 62/689 (8%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
V ++G LE++ +P + + F G+ + D+ + ++ +G+ Q
Sbjct: 773 VYRDNGTLEVYSLPELRLSYLIKNFGFGQNILHDS------------MEFTTIQGSQQNE 820
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
N ++V E+A+ H +RP L L D + YQ Y + P+ K
Sbjct: 821 PVN-PEVQVREIAVVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGHLK--------- 867
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSR 932
L + + + + FSR +EE R+ + F NI+G+ G F+ G
Sbjct: 868 -LRFKKIDHNFI--VGFSRIG----PKEEDMPSMNDTRLCMMRYFSNIAGYNGVFIGGDY 920
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W + R LR HP DG + +F +NVNC GF+Y + L+IC LP+ +YD
Sbjct: 921 PHWIFLTGRGELRAHPMNIDGPVKSFAPFNNVNCPQGFLYFNRKDELRICVLPTHLSYDA 980
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNL 1050
WPV+KV PL+ TPH +TY E Y ++ S +PL D+E + N
Sbjct: 981 PWPVRKV-PLRCTPHFVTYHLESKTYCVVTSTA--EPLKSYYRFNGEDKEFTEEERNERF 1037
Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1108
L+ T E++ + + P W T I + E+ ++ V+L T+
Sbjct: 1038 ----LYPTQ--EQFSIVLFSPVS----WDTIPNTKIDLDQWEHVTCLKNVSLAYEGTRSG 1087
Query: 1109 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALA 1162
+ + IGT Y GED+ +RGR+ +F P +N ++Y+KE KG ++A+
Sbjct: 1088 LKGYIVIGTNYNYGEDITSRGRIFIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPVTAIT 1147
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+ G L+ A G KI + + +L G+AF D +YV + +K+ IL+ D++KS+ L
Sbjct: 1148 QVSGFLVSAIGQKIYIWQLKDNDLVGVAFIDTQ-IYVCQMLSIKSLILVADVYKSVSLLR 1206
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
++ + L+L+++DF + + +A E+ I + L +V+D + NI IF Y P+ S+S GQK
Sbjct: 1207 FQPEYKTLSLVSRDFRTTEIYAIEYFIQNNELGFIVADGESNISIFSYQPESSQSLGGQK 1266
Query: 1283 LLSRAEFHVGAHVTKFLRLQMLAT-SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
L+ +A+ H+G + F R++ T S++ T G+DK R ++ TLDGS+G I P+
Sbjct: 1267 LIRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADK--RHVTMYATLDGSLGYILPVP 1324
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E T+RRL LQ LV + H+AGLNP++FR + S + I+D +L+ Y LP+
Sbjct: 1325 EKTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGIIDGDLVRKYLDLPV 1384
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
E++EIA + GT +I+ +++++ TS
Sbjct: 1385 NEKIEIAKKIGTGAQEIMDDMHEIYKQTS 1413
Score = 294 bits (753), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 210/667 (31%), Positives = 334/667 (50%), Gaps = 78/667 (11%)
Query: 58 LVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
LVV AN+I ++ ++ + G KE + LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDPGKKEKFTESRPPK---------MRLECLAQYTLHGNVMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G+ RDS++L+F +AK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 81 QAVQLIGSP----RDSLLLSFREAKLSVVEYDPEIHSLRTVSLHYFEEEE---IKDGWTN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+GRC +L+YG ++++L + GD I SS++I
Sbjct: 134 HHHVPIVRVDPEGRCAVMLIYGRKLVVLPFRKDPILDEGDLIENPKSSSHKTPILSSYMI 193
Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L+ L+ M ++ D F+HGY EP ++IL+E T+AGR++ + TC + A+S++ K
Sbjct: 194 VLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQKV 253
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ +AV P+GG L++ N++ Y +QS ++LN+ + +
Sbjct: 254 HPIIWSVSNLPFDCYQAVAVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLTDNCTNFP 313
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
P+ + L+++ ++ D ++S KTG+L +L++ D R V+ K SVLT
Sbjct: 314 LKPQEGVKISLESSQVAFISPDRLVISLKTGELYVLSLFADSMRSVRGFHFDKAAASVLT 373
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEEFGDIEADAPSTKRLRRS- 470
S + ++ FLGSRLG+SLL++FT + S L+ + TK+++
Sbjct: 374 SCVCLCDDNYLFLGSRLGNSLLLRFTEKESEKINDISMLEMSLNSSNSQEQPTKKIKLDY 433
Query: 471 -----SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG---- 521
+SD L D+ + EEL +YGS + T ++ F V DSL+NIGP + S G
Sbjct: 434 LEDWMASDVL-DIKDPEELEVYGSET-QTSIQITSYIFEVCDSLLNIGPCGNISMGEPAF 491
Query: 522 ----LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTVYHKSSRG 560
N++ + +G K +++ LPG + IWTV +
Sbjct: 492 LSEEFSNNSEPDVELVTTSGYGKNGALCVLQRSIRPQVITTFDLPGYENIWTVIDSTVSD 551
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
+ A + H +LI++ + TMVL+T + EV + + QG TI AGNL
Sbjct: 552 NRAKTETEGT-----HGFLILTQDDSTMVLQTGQEINEVVDQSGFSTQGTTIFAGNLGSN 606
Query: 621 RRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
R +IQV + G R+L G +M DL ++ S ADPYV L
Sbjct: 607 RYIIQVTQMGVRLLQGLEQIQHMPMDLG--------------CPIVHASCADPYVSLLSE 652
Query: 677 DGSIRLL 683
DG + LL
Sbjct: 653 DGQVVLL 659
>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666 SS1]
Length = 1438
Score = 303 bits (776), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 369/1499 (24%), Positives = 635/1499 (42%), Gaps = 211/1499 (14%)
Query: 47 PSKRGIGPVPNLVVTAANVIEIYVVRV------------QEEGSKESKNSGETKRRVLMD 94
P + P+ NLVV +N++ I VR +E K + + V MD
Sbjct: 31 PDSQKALPLFNLVVARSNLLRILEVREVPTLRPIHLDDERERRGNVRKGTEPVEGEVEMD 90
Query: 95 ---------GISAAS----------LELVCHYRLHGNV---ESLAILSQGGADNSRRRDS 132
G S AS V YRLHG V E++ I+S D
Sbjct: 91 EQGEGYVNMGASTASNGAPRPTVLRFYFVRDYRLHGTVTGLETVRIMSS----LEDEMDR 146
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRC 191
++++F+DAKI++LE+ H L S+H +E +P+ L L +S ++ DP +C
Sbjct: 147 LLVSFKDAKIALLEWSTDTHSLSTVSIHTYERAPQLLSL----DSNMFTAQLRTDPLSQC 202
Query: 192 GGVLVYGLQMIILKASQGGSGL-VGDED-TFGSGGGFSARIESSHVINLR---DLDMKHV 246
+ + IL Q L V D+D T +S S +++L D +++V
Sbjct: 203 AALSLPKDAFAILPFYQTQVDLDVMDQDQTRARDVPYSP----SFILDLAAEVDERIRNV 258
Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
DF+F+ G+ P + +L + + TW GR+ T + +++ + +P+I S LP+
Sbjct: 259 VDFVFLPGFSHPTVAVLFQAQQTWTGRLKEYKDTMRLFIFTLNVVTRSYPIITSVEGLPY 318
Query: 307 DAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS---- 361
D ++ P+ +GGV+V+ +N+ IH S ALA+N + + ++P ++ +
Sbjct: 319 DCLSVVPCPAALGGVVVLTSNSVIHIDQASRRVALAVNGW---MPRVSDMPVTALAQGDQ 375
Query: 362 --VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSD 414
+EL+ + T++ + + K G + + DG+VV +L +S T PSV
Sbjct: 376 GRLELEGSRMTFVDDKTLFIVLKDGTIHPVEFFVDGKVVSKLSISPPLAQTTTPSV---- 431
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
I I N FF+GS G S L++ SG++E+ D + + K + D+
Sbjct: 432 IRKITNEHFFVGSTAGPSALLKV----------SGVEEDIQD-DVEEIDGKTAPAAVVDS 480
Query: 475 LQDMVNGEELSLYGSA--------------SNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ M ++ LYGS+ + +T + ++ DSL GP+ D ++
Sbjct: 481 VDGMDIDDDDDLYGSSKADPTPTANGNAVETTSTTRKRTVIHLSLCDSLPAHGPISDMTF 540
Query: 521 GLRINAD------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGH 561
+ N D +ATG + L + LP G +G+W++ + +
Sbjct: 541 SMTKNGDRAVPELVAATGSGLLGGFTLFQRDLPIRTKRKLHAIGGARGVWSLPVRQAVRV 600
Query: 562 NADSSRMAAYD-DEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQG-RTIAAGNL 617
N S + + +IIS +A + A ++ ++ + G T+ A
Sbjct: 601 NGVSYQTPQNPLRSDNDTIIISTDATPSPGISRIATRSSKTDLNITTRIPGVTTVGAAPF 660
Query: 618 FGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
F ++ V R+L DGS P G+ + + + + SI DPYV +
Sbjct: 661 FQGTAILHVLSNAIRVLEPDGSERQ------PIKDMDGN-NYRAKIKNCSICDPYVFVLR 713
Query: 676 SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKGP-----EPWLRKTSTD 728
D +I L +G+ + + +P ++S+ ++ C G L ++T
Sbjct: 714 EDETIGLFIGETERGKIRRKDMSPMGDKTSRY-IAGCFFSDTTGTFQAHVNSSLNGSNTT 772
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
+T +++ A Q + ++ G +EI+ +P VF+ + + +V
Sbjct: 773 KQNATSTLQSVMNAG-----QKTQWLLLVRPQGVMEIWTLPKLTLVFSTTALATLQPLLV 827
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D+ AL S Q RK + + ++ + RP LF +L G
Sbjct: 828 DSLDPPAL----------SSLPQDQPRKP--QELDIDQILVAPLGETSPRPHLFVLLRSG 875
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--P 906
+ Y+A FE P + DP SR S+ V ++ + F D +E++
Sbjct: 876 QLAIYEAVSFELP-----TGDPEPASRP-SILPVKLVKVLSRAFDIQHPDEQPQEKSVLA 929
Query: 907 HGAPCQRITI-FKNISGHQ----GFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
QR+ I F + G F +G RPCW + + +RVH + +FT
Sbjct: 930 ELKKIQRLFIPFVTSPAPEKTFTGVFFTGDRPCWILGTDKGGIRVHSS-GHAVVHSFTPC 988
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY--FAEKNLYP 1018
+ F+ T +G + +P + +P + P Y
Sbjct: 989 SLWDSKGDFLLYTDEGPCLLEWMPDVQLHTE-------LPSRFMPRSRAYTNVVFDPFTC 1041
Query: 1019 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1078
LIV LK Q S D + D N+S T + + ++ PD W
Sbjct: 1042 LIVGAASLK--AQFTSFDEDGNQTWEPDAPNISYP------TTDCSTLELITPDA----W 1089
Query: 1079 QTRATIPMQSSENALTVRVVTLFNTTTKENE-TLLAIGTAYVQGEDVAARGRVLLFSTGR 1137
T S+E V V L +T + + +A+GT +GED+A +G +F
Sbjct: 1090 LTMDGYEFASNEIVNAVECVMLETQSTDSGQKSFIAVGTTINRGEDLAVKGATYIFEIVE 1149
Query: 1138 NADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIA 1190
+P V + + KG ++AL + G+L+ + G KI + E L G+A
Sbjct: 1150 VVPDPSFGVKRWFKLRMRCRDDAKGPVTALCGMDGYLVSSMGQKIFVRALDLDERLVGVA 1209
Query: 1191 FYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLI 1249
F D +YV SL +KN +++ D KS++F++++E +L +LAKD + CF + +F
Sbjct: 1210 FLDVG-VYVTSLRALKNLLIISDAVKSVWFVAFQEDPYKLTVLAKDAQQV-CFTSADFFF 1267
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
LS+V DE+ +++++Y P ES GQ+LL AEFH L +
Sbjct: 1268 ANQQLSIVTCDEEGILRMYHYNPHDPESKNGQRLLCHAEFHGQIEYRSSLTIA------- 1320
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
R P ++ + L+ G+ DGS+ + P++E F+RL LQ +L +V H A LNPR+
Sbjct: 1321 RRTKGPDTE-IPQAKLICGSPDGSLSALVPVEEAAFKRLHLLQGQLTRNVQHTAALNPRA 1379
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
FR N + +D LL +E LP+ Q+EI Q GT R +L + +ALG
Sbjct: 1380 FRAVR-NEYVSKTLHKGFLDGLLLRSFEDLPVSRQIEITRQIGTERRLVLKDW--VALG 1435
>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 isoform 2 [Cricetulus griseus]
Length = 1388
Score = 303 bits (775), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------HK 556
L I + + + + K ++V ELPGC +WTV +
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 557 SSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
+ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 APRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
Score = 281 bits (718), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 202/679 (29%), Positives = 317/679 (46%), Gaps = 89/679 (13%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1006 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1057
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENE 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1058 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET---- 1109
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHL 1168
V G LKG ++A L QG
Sbjct: 1110 ---------VSG--------------------------------LKGYVAAGTCLMQGEE 1128
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
+ G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1129 VTCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESK 1186
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+
Sbjct: 1187 TLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRAD 1246
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDEL 1343
FHVGAHV F R + GAA G K N+ F TLDG IG + P+ E
Sbjct: 1247 FHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEK 1299
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E
Sbjct: 1300 TYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTME 1359
Query: 1404 QLEIAHQTGTTRSQILSNL 1422
+ E+A + GTT IL +L
Sbjct: 1360 RSELAKKIGTTPDIILDDL 1378
>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
[Harpegnathos saltator]
Length = 1214
Score = 301 bits (772), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 205/698 (29%), Positives = 343/698 (49%), Gaps = 77/698 (11%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 570 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYMLHDSMESTTLQSAPINETLNPE------ 623
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
++V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 624 -------LQVREVLMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGHLK-------- 665
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
L + + SR P E+ P A RI + F NI+G+ G F+
Sbjct: 666 --LRFKKLDHGIIPG-HLSRKP----KEEDVPVNANETRICMMRYFSNIAGYNGVFICSD 718
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DGS+ +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 719 YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 778
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
WPV+KV PL+ TPH +TY E Y +I S +PL +
Sbjct: 779 APWPVRKV-PLRCTPHFVTYHLESKTYCVITSTS--EPLKSY---------------YRF 820
Query: 1051 SSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTL 1100
+ D + +T E+ R L P + + W+T I + E+ ++ V+L
Sbjct: 821 NGED--KEFTEEDRPERFLYPSQEQFCIVLFSPVSWETIPNTKIELDQWEHVTCLKNVSL 878
Query: 1101 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKEL 1154
T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE
Sbjct: 879 AYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 938
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
KG I+A+ + G L+ A G KI + + +L GIAF D +Y+ + +K+ IL+ D+
Sbjct: 939 KGPITAITQVSGFLVTAVGQKIYIWQLKDNDLVGIAFIDTQ-IYIHQMLSIKSLILIADV 997
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+KSI L ++E+ L+L+++DF + + E+LID + L +++D + N+ +F Y P+
Sbjct: 998 YKSISLLRFQEKCRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLIADGESNLALFMYQPES 1057
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDG 1332
ES GQKL+ +A+FH+G + F R++ T +SD+ SD + ++ +LDG
Sbjct: 1058 RESLGGQKLIRKADFHLGQKINTFFRIKCRVTDVASDKKHF---SDADKKHVTMYASLDG 1114
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
S+G + P+ E T+RRL LQ LV + H+AGLNP+++R + S + I+D +L
Sbjct: 1115 SLGYVLPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDL 1174
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
+ Y LP E+ ++A + GT +I+ ++ ++ T+
Sbjct: 1175 VWRYLSLPNNEKADVAKKIGTRVQEIIEDITEIDRQTA 1212
Score = 216 bits (549), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 152/475 (32%), Positives = 243/475 (51%), Gaps = 52/475 (10%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
M +V D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + HP+IWS
Sbjct: 1 MDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVHPIIWSVS 60
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQELPRSSFS 361
NLP D Y+ + V P+GG L++ N++ Y +QS ++LN+ A + + P+
Sbjct: 61 NLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTNFPLRPQDGVK 120
Query: 362 VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGN 420
+ L+ A +L D ++S K+G+L +L++ D R V+ K SVLTS + +
Sbjct: 121 ISLEGAQVAFLSADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASVLTSCVCMCED 180
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRRS------SSD 473
+ FLGSRLG+SLL++FT ++ S E D + + P K+ ++ +SD
Sbjct: 181 NYLFLGSRLGNSLLLRFTEKEPETIKSLDDGEINIEDNDNEEPPAKKAKQDFLGDWMASD 240
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----------R 523
L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S G
Sbjct: 241 VL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFLSEEFAHN 298
Query: 524 INAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
N D + +G K ++ ELPGC+ +WTV G + ++
Sbjct: 299 QNPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI-----GSLNNDEQV 353
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
+ + HA+LI+S E TMVL+T + EV +S + QG T+ AGNL R ++QV +
Sbjct: 354 KSETEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLGANRYIVQVTQ 412
Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
G R+L G Q + ++ S ADPYV+L DG + LL
Sbjct: 413 MGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVILLSEDGQVMLL 457
>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1439
Score = 300 bits (768), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 206/689 (29%), Positives = 333/689 (48%), Gaps = 54/689 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V C E+G LE++ +P+ F V F G +VD S S TG
Sbjct: 776 WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 820
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+++ +V E+ + + ++ A++ D I+ Y+A+ P NT + +
Sbjct: 821 DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 875
Query: 873 TSRSLSVSNVSASRLRNLRFSRTP----------LDAYTREETPHGAPCQRITIFKNISG 922
R + + + + R S+ P + R+ F N+
Sbjct: 876 V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKTETETEAESDTKTQTRRVNRLREFHNVQT 934
Query: 923 HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+SGS P W V R LR HP DG+I F HNVNC +GF+Y + L+IC
Sbjct: 935 YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 994
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
LPS +YD WPV+KV PL+ TPH + Y E Y ++ SV Q + +
Sbjct: 995 VLPSHLSYDAPWPVRKV-PLRCTPHFVAYHVETKTYAVVTSV-------QETKTHVWKVT 1046
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
G +I + D T + +++ P TR I +++EN ++VV L
Sbjct: 1047 GEEIGEEPVERDDRFVPTTKVVFSIQLFSPVSWDAIPNTR--IEYEAAENVTCLKVVNLS 1104
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1155
T + + + T +V ED+ RG V ++ P +N + +Y K K
Sbjct: 1105 CEGTMTGKKGYVVVATTHVYSEDLQTRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQK 1164
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
G +SAL + G LL G K+ + ++ +L G+AF D +Y+ + VK FIL+ D+
Sbjct: 1165 GPVSALCEVMGFLLTCIGQKVYMWQFKDNDLIGLAFIDTQ-IYIHNAVSVKQFILITDVM 1223
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
K YFL ++ Q L+L+++D L+ F EF++D ++ +VSD KN+ +F+Y P+
Sbjct: 1224 KGAYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAP 1283
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDGS 1333
ES G LL R + ++G+ V F+R++ T S+++ + P R + F TLDGS
Sbjct: 1284 ESHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGP---VLRRQVVFFATLDGS 1340
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+G + P+ E T+RRL LQ L + +PHV GLNP+S+R S+ + +I+D +LL
Sbjct: 1341 LGLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLL 1400
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
Y L + E+ E A + GT+ QI+S+L
Sbjct: 1401 LKYCHLSVVERNEFAKKIGTSVDQIISDL 1429
Score = 286 bits (732), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 219/726 (30%), Positives = 349/726 (48%), Gaps = 105/726 (14%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ +C H + P ++ NLVV
Sbjct: 2 YAFYREIHPPTGVEHC---VYCHFFS------------------PDQQ------NLVVAK 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+ + +Y + + + +K + + K + LE + + G V S+ Q
Sbjct: 35 GSELTVYSM-ITVDSNKPTDKESKPKNK----------LEEAATFHIFGKVMSM----QS 79
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
RD+++L+F +AK+S++E+D ++H L+ SMH FE E K G P+
Sbjct: 80 AQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHPV 136
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
VKVDP RC +L YG ++++L + GLV D D S + S+VI L ++D
Sbjct: 137 VKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEMD 194
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ +V D F+HGY EP ++IL+E TWAGRV+ + TC I ALS++ K HP+IWS
Sbjct: 195 DPICNVLDIQFLHGYYEPTLLILYEPLRTWAGRVAVRQDTCSIVALSLNMAQKVHPIIWS 254
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS----SQELP 356
+LP+D ++ AVP PIGGVL++ N++ Y +QS + Y VSL+S S P
Sbjct: 255 QSSLPYDCMQVQAVPKPIGGVLILAVNSLLYLNQS------IPPYGVSLNSLTDWSTAFP 308
Query: 357 ---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + +D AT++ D LS K G++ +LT++ DG R V+ L K SVLT
Sbjct: 309 LKTQEGVKLSMDCTQATFISYDRLALSLKDGEIYVLTLLVDGMRSVRGFHLDKAAASVLT 368
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ I +G+ FLGSRLG+SLL+++T + S K E + PS K +S
Sbjct: 369 TCICPMGDGFLFLGSRLGNSLLLKYTEKVSETSPSDASKTEEPKPGEEPPSKKMRSDDAS 428
Query: 473 DALQD----MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
D + + + +EL +YG T + ++SF + DSL+NIGP + G
Sbjct: 429 DWMASDTKFLDDPDELEVYGKQVQKTGTQLTSYSFEICDSLLNIGPCGNMIMGEPAFLSE 488
Query: 522 -LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTV--YHKSSRGH 561
+ N D + +G K +++ LPGC +WTV K+
Sbjct: 489 EFQGNVDPDLELVTTSGYGKNGALSVLQRTIRPQVVTTFNLPGCLDMWTVKSLKKAKADE 548
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
++ S + D + HA+LI+S + +MVL+T +TEV + Q TI A N+ R
Sbjct: 549 KSEESETSPEDKDRHAFLILSKQDSSMVLQTGQEITEVAAG-GFSTQAPTIFASNMGDDR 607
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
++QV + +++G Q + S + S+ADPY+LL +G
Sbjct: 608 YIVQVMNKSICLMEGVEQIQHMVL----------DVGSPIKQCSLADPYLLLLTENGDPI 657
Query: 682 LLVGDP 687
L+ P
Sbjct: 658 LMTLKP 663
>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 984
Score = 300 bits (768), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 288/1107 (26%), Positives = 465/1107 (42%), Gaps = 239/1107 (21%)
Query: 423 FFLGSRLGDSLLVQFT--CGSGTSMLSSGLKE---EFGDIEADAPSTKRLRRSSSDALQD 477
FLGSRLG+SLL+++ T +++G K+ + + + P+ K+ +SD +
Sbjct: 6 LFLGSRLGNSLLLKYVEKAQESTDSVTNGAKKTEEDEETNKEEPPNKKKRTDDASDWIAS 65
Query: 478 MV-----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRI 524
V + +EL +YGS + + +++F V DS++NIGP G +
Sbjct: 66 DVALLAEDVDELEVYGSQTQ-AGTQLTSYTFEVCDSIMNIGPCTKAVMGEPVFLSEEFQT 124
Query: 525 NAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSR 567
N D + +G SK ++ ELPGC +WTV + N D +
Sbjct: 125 NPDPDMELVALSGYSKNGALSVLQRSIRPQVVTTFELPGCIDMWTVVGPPEK-ENKDQPK 183
Query: 568 MAAYDD---------EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
++ HA+LI+S + +M+L T + E+ S + QG T+ AGNL
Sbjct: 184 EKTEEEGDKKPDALTNGHAFLILSRDDSSMILSTGQEIMELDHS-GFSTQGPTVYAGNLG 242
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
++QV G R+L+G Q + S ++ S++DPY LL G
Sbjct: 243 NNAYILQVSPMGVRLLEGVNQLQHIPL----------DLGSPIVLCSVSDPYALLMSEKG 292
Query: 679 SI--------------RLLVGDP---------STCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ RL + P + C + SSK +S
Sbjct: 293 ELVLLTLKPDGFAGGHRLAISRPQIPQISRILTLCAYKDTSGMFTTSSKMESTSDETEEK 352
Query: 716 KGPEPWLRKTSTDAWLSTG-------VGEAIDGADGGPLDQGD----------------- 751
K +P + S + +S GE+ D + P + +
Sbjct: 353 KITKPSVADISMTSEISNVDDEDEMLYGES-DASLFSPTKKEEKSSFLQTREVLSETKPT 411
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
+ + E+G LEI+ +P+F F V F G +VD+Y ++ +S+ G+
Sbjct: 412 YWCAMSRENGVLEIYSLPDFKLAFLVKNFPMGFKVMVDSY----------QMTASAPGGS 461
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ +++ V EL + + + L A + D + Y+A+
Sbjct: 462 SKSDQQHDMMPIVKELLLIGLGHKNKKTHLLARV-DEDLYIYEAF--------------- 505
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
T S+ N LR LRF + F+ G
Sbjct: 506 -THDQSSLDN----HLR-LRFRKV-------------------------------FVCGP 528
Query: 932 RPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W M R LR HP DGS+ F HN+NC GF+Y G L+IC LP+ +YD
Sbjct: 529 YPHWLFMTSRGALRSHPMHIDGSVTCFAPFHNINCPKGFLYFNKHGELRICVLPTHLSYD 588
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNH 1048
WPV+KV PL+ TPH I+Y E Y ++ SV P L+ I + G +
Sbjct: 589 ALWPVRKV-PLRCTPHFISYHIESKTYAVVTSVSEPCLR---------ICKMTGDDKEFE 638
Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTK 1106
++ D T+E++ +++ P W+ I + E+ ++ V L + T
Sbjct: 639 DVERDDRFIFPTIEKFSLQLFSP----LSWEAIPNTKIDTEDWEHITGLKTVFLKSEGTV 694
Query: 1107 EN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISA 1160
+ +A+ T V GE+V RGR+L+F P +N + +Y KE KG ++
Sbjct: 695 SGLKGFIAVSTTIVYGEEVTCRGRILIFDVIEVVPEPGQPLTKNKLKLLYDKEQKGPVTT 754
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
L ++G L A G KI L + +L G+AF D +++ +L +KNFIL DI KS+
Sbjct: 755 LCDIEGLLAAAIGQKIFLWAFRNNDLIGVAFIDT-QIHIHTLCTIKNFILAADIRKSVSL 813
Query: 1221 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
L + ++ L+L+ + ES+ G
Sbjct: 814 LRFSDEDRSLSLVTR----------------------------------------ESFGG 833
Query: 1281 QKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1338
Q+LL RA+F+ G+HV F R+ ++ ++++ P R +F TLDGSIG +
Sbjct: 834 QRLLRRADFNAGSHVCSFFRMRSKLSDPATEKLLTGPME---RRHVTMFATLDGSIGYLI 890
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
P+ E T+RRL LQ L H AGLNP+ FR K+ +I+D +LL Y
Sbjct: 891 PMTEKTYRRLLMLQNALTTQTLHTAGLNPKGFRMVKHQTKSLENTHKNILDGDLLWKYTF 950
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNLNDL 1425
L + E+ E+A + GT+ QIL +L D+
Sbjct: 951 LSVNERTELAKKIGTSVEQILDDLMDV 977
>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Apis florea]
Length = 1583
Score = 300 bits (767), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 210/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV AN+I ++ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ E + E + K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIVLEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
G N D + +G K ++ ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
G + ++ + HA+LI+S E TM+L+T + EV +S + QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
NL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650
Query: 676 SDGSIRLL 683
DG + LL
Sbjct: 651 EDGQVMLL 658
Score = 290 bits (743), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 200/680 (29%), Positives = 327/680 (48%), Gaps = 78/680 (11%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 772 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 826 -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868
Query: 875 RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
R + L P L R+E R + F NI+G+ G F+
Sbjct: 869 -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 918 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977
Query: 989 YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
YD WPV+KV PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 978 YDAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------Y 1019
Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVV 1098
+ D + +T EE R + P + + W+T I + E+ ++ V
Sbjct: 1020 RFNGED--KEFTEEERPDRFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNV 1077
Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSK 1152
+L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+K
Sbjct: 1078 SLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAK 1137
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
E KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+
Sbjct: 1138 EQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIA 1196
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D++KSI L ++E+ L+L+++DF + + E+LID + L +V+D + NI +F Y P
Sbjct: 1197 DVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNIALFMYQP 1256
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
+ ES GQKL+ +A+FH+G V F R++ S SD R ++ +LDG
Sbjct: 1257 ESRESLGGQKLIRKADFHLGQKVNTFFRIR-CRISDPANDKKHFSDADKRHVTMYASLDG 1315
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
++G I P+ E T+RRL LQ LV + H+AGLNP+++R + S+ + I+D +L
Sbjct: 1316 NLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDL 1375
Query: 1393 LSHYEMLPLEEQLEIAHQTG 1412
+ Y LP E++++A +
Sbjct: 1376 VWRYLYLPNNEKIDVAKKIA 1395
>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
Length = 1441
Score = 299 bits (765), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 361/1491 (24%), Positives = 604/1491 (40%), Gaps = 231/1491 (15%)
Query: 57 NLVVTAANVIEIYVVRV--------QEEGSKESKNSGETKRRVLMD--GISAA------- 99
NLVV +++++I+ ++ Q+ ++N+G + R+ D G+ A+
Sbjct: 28 NLVVAKSSLLQIFRTKIVSTEIDASQQGSGARTRNAGRYESRLANDDDGLEASFLGGDSL 87
Query: 100 ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
L LV L G + LA + + N R D++++AF+DA++S++E+D
Sbjct: 88 AFKTDRTNNTKLVLVSEISLSGTITGLAKIK---SQNLRSGGDALLVAFKDARLSLVEWD 144
Query: 149 DSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII 203
H L S+H +E E W +F + DP GRC + G+ + I
Sbjct: 145 AERHDLSTVSIHYYEQDELQGSPWAPPLSNFTNF-----LAADPGGRCAALKFGGMNLAI 199
Query: 204 LKASQG---------------GSGLVGDEDTFGSGGGF--SARIESSHVINLRDLD--MK 244
L Q G V E +GG S V+ L +LD +
Sbjct: 200 LPFKQADEDIDMDDDWDEDLDGPRPVKQEAAVVNGGSSIKETPYSPSFVLRLSNLDPSLL 259
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
H F+H Y EP IL + + + K H + ++ + I S L
Sbjct: 260 HPVHLAFLHEYREPTFGILAST-VNASNSLGRKDHLAYM-VFTLDLQQRASTTILSVPGL 317
Query: 305 PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
P D +++ +P+P+GG L+VGAN IH +A+N S +S ++
Sbjct: 318 PQDLFRVQPLPAPVGGALLVGANELIHIDQSGKPNGVAVNPLTKQCTSFGLSDQSDLNLR 377
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT-- 417
L+ L + L+ G + L+T DGR V LD+ S+T S++ ++T
Sbjct: 378 LEECTIDVLSAEELLVILSDGRMALVTFRIDGRTVSGLDVKLLPSETGGSLIPGRVSTLS 437
Query: 418 -IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDAL 475
IG S+ F GS GDSL+ +T S ++ G DI+ D
Sbjct: 438 RIGKSVMFAGSEEGDSLVFGWTKKQNQSGRKKSRLQDVGLDIDMADEEDLDEDEDEDDLY 497
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ 529
+ ++ ++ +ASN E +F + D L++I P++ +YG ++A S
Sbjct: 498 AEEPTPKQQAV-ATASNVKEG---DLTFRIHDRLLSIAPIQSMTYGQPVDAPGSEEEQNS 553
Query: 530 -----------ATGISKQSNYELV------------ELPGCKGIWTVYHKSS--RGHNAD 564
G +K S ++ E P +G WTV K + D
Sbjct: 554 AGVRSELQLVCGVGRNKSSAMAIMNLAIPPKVIGRFEFPEARGFWTVCAKKPVPKSLQGD 613
Query: 565 SSRMAAYDD-----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIA 613
A +D +Y ++I++ E + TA +T + G TI
Sbjct: 614 KGPGAIGNDYGTSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLTGTEFDPAAGFTIE 673
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AG + R+IQV + R DG + P E +T S SIADPY+L+
Sbjct: 674 AGTMGKDNRIIQVLKSEVRCYDGDLGLSQIV--PMMDEETGAEPRAT--SASIADPYLLI 729
Query: 674 GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
+D S+ + S+ +E +K DK +T WL+
Sbjct: 730 IRNDQSVFI---------ASIHDDNELEEVEK--------EDK-------TLATTKWLTG 765
Query: 734 GVGEAIDGADGGPLDQGD--------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
+ +G G + GD I + SGAL I+ +P+
Sbjct: 766 CLYTDTNGVFGE--ESGDKKAKLPESILMFLLSASGALYIYRLPDL-------------- 809
Query: 786 HIVDTYMREALKDSETEINS--SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
Y+ E L T +++ ++ +GT KE + + V +L P+L
Sbjct: 810 -CKPVYVAEGLSYIPTGLSADYAARKGTA---KETVSEILVADLG----DTTAKSPYLIL 861
Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
+ + Y+ Y ++ + ++L + S L +++P + E
Sbjct: 862 RHANDDLTMYEPYRYQLGAG-------LEFPKTLFFQKIPNSVL-----AKSPAEETDDE 909
Query: 904 ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
E H A C + NI G+ FL G P + + + + L ++ A + H
Sbjct: 910 EVTHQAKCLALRRCNNIGGYSTVFLPGPSPSFIIKSSKSMPKVLPLQGAAVTAISSFHTE 969
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C HGFIY S I+++ QLP ++ K IP+ + Y Y + +
Sbjct: 970 GCEHGFIYADSHNIVRVSQLPKDWSFAETGLAVKKIPIGEDIVAVAYHPPSQSYVVACNT 1029
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD-LHRTYTVEEYEVRILEPDRAGGPWQTRA 1082
P +P E+ D H + + L T+E ++++ P W
Sbjct: 1030 P--EPF----------ELPRDDDYHKEWAREVLPFKPTLERGTLKLIGPIT----WTVVD 1073
Query: 1083 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1141
TI M+ EN L V + L + T E + L+ +GTA +GED+ RG V +++
Sbjct: 1074 TIVMEPCENVLCVETLNLEVSEATNERKLLIGVGTAITKGEDLPTRGAVYVYNVADVIPE 1133
Query: 1142 PQNLVT----EVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAF 1191
P T ++ +KE +GA++AL+ + QG +L+A GPK ++ K GT L +AF
Sbjct: 1134 PGKPETGKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGPKCMVRGLKEDGTLLP-VAF 1192
Query: 1192 YDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
D YV S + L+ D K ++F + E+ ++ L K L+ +FL
Sbjct: 1193 MDM-NCYVTSAKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSNTRLEVLNADFLP 1251
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQMLATSS 1308
+G LS+V D + +I I + P+ +S +G LL R F GA HVTK L L +
Sbjct: 1252 NGKELSIVACDAEGHIHILQFDPEHPKSLQGHLLLHRTSFSTGAHHVTKSLLLPSTLSPD 1311
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
++ + LL + G + + PL E +RRL SL +L +S+ H AGLNP+
Sbjct: 1312 NKEDNEENGATSRPHILLLASPTGVLAALRPLSETAYRRLSSLAAQLTNSLTHAAGLNPK 1371
Query: 1369 SFRQFHSNGKAHRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
+R + G D+ IVD +L+ + L ++ E+A + G T
Sbjct: 1372 GYRM--PSATCPPAGVDAGIGRHIVDGTILARFSELGRAKRGEVAGRAGYT 1420
>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Apis mellifera]
Length = 1415
Score = 299 bits (765), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 205/698 (29%), Positives = 338/698 (48%), Gaps = 78/698 (11%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 772 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 826 -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868
Query: 875 RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
R + L P L R+E R + F NI+G+ G F+
Sbjct: 869 -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 918 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977
Query: 989 YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
YD WPV+KV PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 978 YDAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------Y 1019
Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVV 1098
+ D + +T EE R + P + + W+T I + E+ ++ V
Sbjct: 1020 RFNGED--KEFTEEERPDRFIFPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNV 1077
Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSK 1152
+L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+K
Sbjct: 1078 SLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAK 1137
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
E KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+
Sbjct: 1138 EQKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIA 1196
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D++KSI L ++E+ L+L+++DF + + E+LID + L +V+D + NI +F Y P
Sbjct: 1197 DVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNIALFMYQP 1256
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
+ ES GQKL+ +A+FH+G V F R++ S SD R ++ +LDG
Sbjct: 1257 ESRESLGGQKLIRKADFHLGQKVNTFFRIR-CRISDPANDKKHFSDADKRHVTMYASLDG 1315
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
++G I P+ E T+RRL LQ LV + H+AGLNP+++R + S+ + I+D +L
Sbjct: 1316 NLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDL 1375
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
+ Y LP E++++A + GT +I+ +L ++ T+
Sbjct: 1376 VWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1413
Score = 293 bits (750), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 209/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV AN+I ++ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ + + E + K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
G N D + +G K ++ ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
G + ++ + HA+LI+S E TM+L+T + EV +S + QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
NL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650
Query: 676 SDGSIRLL 683
DG + LL
Sbjct: 651 EDGQVMLL 658
>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
Length = 1532
Score = 298 bits (763), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 208/681 (30%), Positives = 334/681 (49%), Gaps = 76/681 (11%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ T IN
Sbjct: 740 LVYRDSGTLEIYSLPDLRLSYLIRNFGFGQYVLHDSMESTTLQS--TPINEIPHP----- 792
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQ Y + P+ K
Sbjct: 793 ------DMQVREILMVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGYLK-------- 835
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
L + + R S P E+ P RI + F NI+G+ G F+
Sbjct: 836 --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNTSDTRICVMRYFSNIAGYNGVFICSD 888
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DGS+ +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 889 YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 948
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
WPV+KV PL+ TPH +TY E Y +I S +PL +
Sbjct: 949 APWPVRKV-PLRCTPHFVTYHLESKTYCVITSTA--EPLKSY---------------YRF 990
Query: 1051 SSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTL 1100
+ D + +T EE R L P + + W+T I + E+ ++ V+L
Sbjct: 991 NGED--KEFTEEERPDRFLYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVSL 1048
Query: 1101 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKEL 1154
T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE
Sbjct: 1049 AYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 1108
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D+
Sbjct: 1109 KGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIADV 1167
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+KSI L ++E+ L+L+++DF + + E+LID + L +V+D + N+ +F Y P+
Sbjct: 1168 YKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFIVADGESNLALFMYQPES 1227
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGS 1333
ES GQKL+ +A+FH+G V F R++ T ++ G+DK R ++ +LDGS
Sbjct: 1228 RESLGGQKLIRKADFHLGQKVNTFFRIRCRVTDPANDKKQFSGADK--RHVTMYASLDGS 1285
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCEL 1392
+G I P+ E T+RRL LQ LV + H+AGLNP+S+R + + ++ P I+D +L
Sbjct: 1286 LGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDL 1345
Query: 1393 LSHYEMLPLEEQLEIAHQTGT 1413
+ Y LP E+ ++A + GT
Sbjct: 1346 VWRYLFLPNNEKADLAKKIGT 1366
Score = 293 bits (749), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 206/621 (33%), Positives = 328/621 (52%), Gaps = 63/621 (10%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LE + Y LHGN+ S+ + G+ +RDS++L+F DAK+SV+E+D IH LR S+
Sbjct: 32 KLECLAQYTLHGNIMSMQAVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSL 87
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-D 218
H FE E +K G + P+V+VDP+GRC +L++G ++++L + S GD D
Sbjct: 88 HYFEEEE---IKDGWTNHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLD 144
Query: 219 TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
T A I SS++I L+ L+ M +V D F+HGY EP ++IL+E T+AGR++
Sbjct: 145 TAKLTSSNKAPILSSYMIVLKSLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFAGRIAV 204
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS- 335
+ TC + A+S++ + HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS
Sbjct: 205 RQDTCAMVAISLNIQQRVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAFNSLIYLNQSI 264
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
++LN+ A + + P+ + L+ A ++ D ++S K+G+L +L++ D
Sbjct: 265 PPYGVSLNSLADTSTNFPLKPQEGVKMSLEGAQVAFISADRLVISLKSGELYVLSLFADS 324
Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEE 453
R V+ K SVLTS + ++ FLGSRLG+SLL++FT ++ + +G +
Sbjct: 325 MRSVRGFHFDKAAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNGGEIT 384
Query: 454 FGDIEADAPSTKRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
+ E++ K+ ++ +SD L D+ + EEL +YGS + +T ++ F V D
Sbjct: 385 IEENESEETPAKKAKQDFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCD 442
Query: 508 SLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV------------ 542
SL+NIGP + S G N D + +G K ++
Sbjct: 443 SLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTF 502
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
ELPGC+ +WTV N D + A + HA+LI+S E TM+L+T + EV +S
Sbjct: 503 ELPGCEDMWTVIGTL----NNDEIKTEA--EGSHAFLILSQEDSTMILQTGQEINEVDQS 556
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
+ QG T+ AGNL R ++QV + G R+L G Q + ++
Sbjct: 557 -GFSTQGSTVFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIV 605
Query: 663 SVSIADPYVLLGMSDGSIRLL 683
S ADPYV L DG + LL
Sbjct: 606 HASCADPYVTLLSEDGQVMLL 626
>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1-like [Ailuropoda
melanoleuca]
Length = 1469
Score = 298 bits (762), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 210/631 (33%), Positives = 333/631 (52%), Gaps = 76/631 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 104 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 159
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 160 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 213
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 214 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 273
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 274 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 333
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG R
Sbjct: 334 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 393
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
V+ K SVLT+ + T+ FLGSRLG+SLL+++T +S ++E
Sbjct: 394 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA--- 449
Query: 457 IEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDS 508
+ + P +K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS
Sbjct: 450 ADKEEPPSKKKRVDSTVGWSGGKSMPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDS 505
Query: 509 LVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---E 543
++NIGP + + G L I + + + + K ++V E
Sbjct: 506 ILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFE 565
Query: 544 LPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLETA 593
LPGC +WTV ++ +G A+ S + A DD H +LI+S E TM+L+T
Sbjct: 566 LPGCYDMWTVIAPVRKEQEETPKGEGAEQEPSALEADDDGRRHGFLILSREDSTMILQTG 625
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 626 QEIMELDTS-GFATQGPTVFAGNIGDSRYIVQVSPLGIRLLEG---VNQLHFIPVDL--- 678
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 679 ----GSPIVQCAVADPYVVIMSAEGHVTMFL 705
Score = 286 bits (731), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 214/708 (30%), Positives = 336/708 (47%), Gaps = 106/708 (14%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 820 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 875
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 876 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 918
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
L N+ +RF + P + RE E GA + R
Sbjct: 919 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 968
Query: 916 IFKNISGHQG-------FFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNH 967
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC
Sbjct: 969 YFEDIYGYSGGGGACPQVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPR 1028
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK 1027
GF+Y QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S +
Sbjct: 1029 GFLYFNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM-- 1085
Query: 1028 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIP 1085
P + I + G + + + D + E + ++++ P W+ A I
Sbjct: 1086 PCTR-----IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIE 1136
Query: 1086 MQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
++ E+ ++ V+L + T + +A GT +QGE+V RGR+L+ P
Sbjct: 1137 LEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQ 1196
Query: 1145 LVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
+T+ +Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+
Sbjct: 1197 PLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYI 1255
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
+ VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VS
Sbjct: 1256 HQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVS 1315
Query: 1260 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1319
D +N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K
Sbjct: 1316 DRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSK 1368
Query: 1320 T-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
N+ F TLDG IG + P+ E T RLQ S R H
Sbjct: 1369 KSVVWENKHITWFATLDGGIGLLLPMQEKT-NRLQPAX----------------SPRMLH 1411
Query: 1375 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ + + +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1412 VDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1459
>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
Length = 1393
Score = 296 bits (758), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 338/1464 (23%), Positives = 629/1464 (42%), Gaps = 199/1464 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD + S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
++ ES+ + V G TI+ G R+RV+QV + R D +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
TQ ++ ++ +S S+AD + + D ++ L D S V
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719
Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+ S K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
L ++ V + + ++ + G + ++ SSE G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
V +L + WSA P+L ++ Y+ ++ VST +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S+ N R P + + + + + I +ISG F+ G+ + +
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRT 905
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
+L + + + L C+ GFIY+ SQ ++ C+LP + +D W +++V
Sbjct: 906 SASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRVH 965
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRT 1058
+ H + Y +Y VL + L D E+ + N +S R
Sbjct: 966 LGEQVDH-LAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARG 1017
Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1117
+ +++ P+ W + + + E + ++ ++L + T E + ++ +GTA
Sbjct: 1018 SFI-----KLVSPNT----WSIIDSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTA 1068
Query: 1118 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1170
+ +GED+ +RG + +F + +P + T+ + + +KGA++AL+ + QG +L+
Sbjct: 1069 FARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLV 1128
Query: 1171 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1226
A G K ++ K G+ L +AF D YV + +K +LGD K ++F + E+
Sbjct: 1129 AQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGVWFAGYSEE 1186
Query: 1227 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
+++L AKD L+ A EFL DG L +VV+D NI + Y P+ +S G +LLSR
Sbjct: 1187 PYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSR 1246
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDE 1342
++FH+G + L SS++ ++ G D N+ L L T +GS+G I + E
Sbjct: 1247 SKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPE 1306
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
++RRL +LQ +L +++ H GLNPR+FR S+G A R ++D LL + + +
Sbjct: 1307 ESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQ 1362
Query: 1403 EQLEIAHQTGTTRSQILSNLNDLA 1426
+ EIA + G +I ++L ++
Sbjct: 1363 RKTEIAGRVGAREWEIKADLEAIS 1386
>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
Length = 1403
Score = 296 bits (758), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 358/1479 (24%), Positives = 627/1479 (42%), Gaps = 215/1479 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + +++++ + GS ++ +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++ +AK+S++E+D H + TS+H +E + +H+
Sbjct: 81 GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133
Query: 173 GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
+ A P + VDP RC VL +G + + IL Q G LV
Sbjct: 134 WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192
Query: 215 -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
DE +G F SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 193 HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T + + + S ++ + + S LP+D +K++A+P P+GG L++G N
Sbjct: 253 ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312
Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
+H + A+ +N +A S +S + L+ + L +N LL G
Sbjct: 313 ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372
Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ +L+ DGR V + L K PS +G F GS DS+
Sbjct: 373 KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
L+ G S S+ K+ D SS + +D + E LY
Sbjct: 429 LI------GWSRPSTRPKDPPVQGAGD---DNIAELSSDEEEEDDEDIYEDDLYATPVPT 479
Query: 488 GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
G+ + + S + T ++F + D L N+GP++D + G + D S +N E
Sbjct: 480 GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 539
Query: 541 LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
LV G G W+V+ K + + S
Sbjct: 540 LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 599
Query: 575 YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
Y YL++S + +++V + E T++ ++ + RTI G L G RV+QV +
Sbjct: 600 YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 659
Query: 629 RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
R D G + Q + SE V+ S ADPYVL+ D S+ LL D
Sbjct: 660 GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 714
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
S ++ I S+ S +LY DK +T+ LS V
Sbjct: 715 SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 760
Query: 748 DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
+ ++ ES L IF +PN VFT + D +I S+
Sbjct: 761 ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 798
Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
+E++ + V ++ + P+L ++ ++ Y+ Y +T+
Sbjct: 799 EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 848
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
S + S S++ + N F + + + + GA + + + ++ G++
Sbjct: 849 STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 901
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
F+ G+ PC+ + + L ++ + + + C GF+YV + ++++C+ P
Sbjct: 902 FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 961
Query: 987 STYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1046
+ +D W +K I L + Y + Y + S V +L D E+ +
Sbjct: 962 THFDGSWATRK-IGLGEQVDIVEYSSSSETYVIGTSQKV------DFNLPEDDEIHPEWR 1014
Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1105
N +S + +++ V++L P W + ++++E + V+ + L + T
Sbjct: 1015 NEVISFLP-----QIDQGSVKLLSPRT----WSIIDSHTLRTAERIMCVKCLDLEVSEIT 1065
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISA 1160
E ++A+GTA +GED+AARG + +F D P+ + + +E+KGA+++
Sbjct: 1066 HERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTS 1125
Query: 1161 LASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDI 1214
L+ + QG L+ A G K I+ K G+ L +AF D YV L +K ++GD
Sbjct: 1126 LSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDA 1183
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
K I+F + E+ +L+L +KD G+L A +FL DG L ++V+D+ NI + Y P+
Sbjct: 1184 LKGIWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPED 1243
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTL--- 1330
S KG +LL R+ FH G + L + + S+ A P + + L+ L
Sbjct: 1244 PGSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTS 1303
Query: 1331 -DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
GSI I PL E +RRL +LQ +L++++ H GLNPR+FR S+G R +VD
Sbjct: 1304 ETGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGGR----GMVD 1359
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
+LL + L + + EIAH+ G +I ++L + G
Sbjct: 1360 GDLLHRWLDLGTQRKAEIAHRVGADIWEIRADLEAIGKG 1398
>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
echinatior]
Length = 1412
Score = 296 bits (757), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 203/673 (30%), Positives = 332/673 (49%), Gaps = 64/673 (9%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ T IN
Sbjct: 771 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQ--STPINEIPHP----- 823
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 824 ------DMQVREILMVALGHHGNRPMLLVRL-DSDLQIYQAYRY--PKGYLK-------- 866
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
L + + R S P E+ P RI + F NI+G+ G F+
Sbjct: 867 --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNRNITRICVMRYFSNIAGYNGVFICSD 919
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 920 YPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
WPV+KV PL+ TPH +TY E Y +I S +PL + +V
Sbjct: 980 APWPVRKV-PLRCTPHFVTYHLESKTYCVITSTA--EPLKSYYRFNGEDKV--------- 1027
Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1108
L + Y + ++ + + W+T I + E+ ++ V+L T+
Sbjct: 1028 ----LTKLYYLFQFSRIFMNLLFSPVSWETIPNTKIELDQWEHVTCLKNVSLAYEGTRSG 1083
Query: 1109 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALA 1162
+ + +GT Y GED+ +RGR+L+F P +N ++Y+KE KG I+A+
Sbjct: 1084 LKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAIT 1143
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+ G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D++KSI L
Sbjct: 1144 QVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIADVYKSISLLR 1202
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
++E+ L+L+++DF + + E+LID S L +V+D + N+ +F Y P+ ES GQK
Sbjct: 1203 FQEEYRTLSLVSRDFRPAEVYTIEYLIDNSNLGFIVADGESNLALFMYQPESRESLGGQK 1262
Query: 1283 LLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
L+ +A+FH+G + F R++ T ++ G+DK R ++ +LDGS+G I P+
Sbjct: 1263 LIRKADFHLGQKINTFFRIKCRITDPANDKKQFSGADK--RHVTMYASLDGSLGYILPVP 1320
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLP 1400
E T+RRL LQ LV + H+AGLNP+++R + + ++ P I+D +L+ Y LP
Sbjct: 1321 EKTYRRLLMLQNVLVTHICHIAGLNPKAYRHTYKSYVRNQGNPARGIIDGDLVWRYLFLP 1380
Query: 1401 LEEQLEIAHQTGT 1413
E+ ++A + GT
Sbjct: 1381 NNEKADLAKKIGT 1393
Score = 290 bits (741), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 214/665 (32%), Positives = 342/665 (51%), Gaps = 76/665 (11%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV ANVI ++ + + ++ K + ET+ LE + Y LHGN+ S+
Sbjct: 30 LVVAGANVIRVFRLIPDVDMTRREKYT-ETRP-------PKMKLECLTQYTLHGNIMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 82 AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
P+V+VDP+GRC +L++G ++++L + S + D D S S I SS+
Sbjct: 135 HHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPS--LDDGDLLDSAKLTSTNKTPILSSY 192
Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+I L+ L+ M +V D F+HGY EP ++IL+E T++GR++ + TC + A+S++
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
+ HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS ++LN+ A S +
Sbjct: 253 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADSSTN 312
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + L+ + ++ D ++S K+G+L +L++ D R V+ K SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM--LSSGLKEEFGDIEADAPSTKRLR 468
LTS + ++ FLGSRLG+SLL++FT ++ L+ + + P+ K +
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENENEETPAKKTKQ 432
Query: 469 R-----SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
+SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S G
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490
Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
N D + +G K ++ +LPGC+ +WTV
Sbjct: 491 AFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFQLPGCEDMWTVIGIV- 549
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
N D R ++ HA+LI+S E TMVL+T + EV +S + QG T+ AGNL
Sbjct: 550 ---NNDEIRT---EEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLG 602
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
R ++QV + G R+L G Q + ++ S ADPYV L DG
Sbjct: 603 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVALLSEDG 652
Query: 679 SIRLL 683
+ LL
Sbjct: 653 QVMLL 657
>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
Length = 1402
Score = 295 bits (756), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 359/1479 (24%), Positives = 627/1479 (42%), Gaps = 216/1479 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + +++++ + GS ++ +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++ +AK+S++E+D H + TS+H +E + +H+
Sbjct: 81 GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133
Query: 173 GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
+ A P + VDP RC VL +G + + IL Q G LV
Sbjct: 134 WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192
Query: 215 -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
DE +G F SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 193 HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T + + + S ++ + + S LP+D +K++A+P P+GG L++G N
Sbjct: 253 ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312
Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
+H + A+ +N +A S +S + L+ + L +N LL G
Sbjct: 313 ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372
Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ +L+ DGR V + L K PS +G F GS DS+
Sbjct: 373 KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
L+ G S S+ K D + SSD +D + E LY
Sbjct: 429 LI------GWSRPSTRPK----DPPVQGAGDDNIAELSSDEEEDDEDIYEDDLYATPVPT 478
Query: 488 GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
G+ + + S + T ++F + D L N+GP++D + G + D S +N E
Sbjct: 479 GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 538
Query: 541 LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
LV G G W+V+ K + + S
Sbjct: 539 LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 598
Query: 575 YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
Y YL++S + +++V + E T++ ++ + RTI G L G RV+QV +
Sbjct: 599 YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 658
Query: 629 RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
R D G + Q + SE V+ S ADPYVL+ D S+ LL D
Sbjct: 659 GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 713
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
S ++ I S+ S +LY DK +T+ LS V
Sbjct: 714 SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 759
Query: 748 DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
+ ++ ES L IF +PN VFT + D +I S+
Sbjct: 760 ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 797
Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
+E++ + V ++ + P+L ++ ++ Y+ Y +T+
Sbjct: 798 EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 847
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
S + S S++ + N F + + + + GA + + + ++ G++
Sbjct: 848 STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 900
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
F+ G+ PC+ + + L ++ + + + C GF+YV + ++++C+ P
Sbjct: 901 FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 960
Query: 987 STYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1046
+ +D W +K I L + Y + Y + S V +L D E+ +
Sbjct: 961 THFDGSWATRK-IGLGEQVDIVEYSSSSETYVIGTSQKV------DFNLPEDDEIHPEWR 1013
Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1105
N +S + +++ V++L P W + ++++E + V+ + L + T
Sbjct: 1014 NEVISFLP-----QIDQGSVKLLSPRT----WSIIDSHTLRTAERIMCVKCLDLEVSEIT 1064
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISA 1160
E ++A+GTA +GED+AARG + +F D P+ + + +E+KGA+++
Sbjct: 1065 HERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTS 1124
Query: 1161 LASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDI 1214
L+ + QG L+ A G K I+ K G+ L +AF D YV L +K ++GD
Sbjct: 1125 LSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDA 1182
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
K I+F + E+ +L+L +KD G+L A +FL DG L ++V+D+ NI + Y P+
Sbjct: 1183 LKGIWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPED 1242
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTL--- 1330
S KG +LL R+ FH G + L + + S+ A P + + L+ L
Sbjct: 1243 PGSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTS 1302
Query: 1331 -DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
GSI I PL E +RRL +LQ +L++++ H GLNPR+FR S+G R +VD
Sbjct: 1303 ETGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGGR----GMVD 1358
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
+LL + L + + EIAH+ G +I ++L + G
Sbjct: 1359 GDLLHRWLDLGTQRKAEIAHRVGADIWEIRADLEAIGKG 1397
>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
Length = 1450
Score = 295 bits (755), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 361/1462 (24%), Positives = 624/1462 (42%), Gaps = 214/1462 (14%)
Query: 76 EGSKESKNSGETKRRVLMDGISAASLEL--VCHYRLHGNV---ESLAILSQGGADNSRRR 130
EG E GE + +S + + + +RLHG V E + ILS
Sbjct: 89 EGEVEMDTQGEGFVNMASKPLSMTTYQFHFIREHRLHGIVTGLEPVKILSS----TEDSL 144
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQG 189
D ++++F+DAK+++LE+ +H L S+H +E +P+ L + + G L +VDP
Sbjct: 145 DRLLVSFKDAKLALLEWSPELHDLVTVSIHTYERAPQMTFLDPSKFT---GQL-RVDPLS 200
Query: 190 RCGGVLVYG--LQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
RC + + L ++ SQ LV + T +S + N D +++V
Sbjct: 201 RCAALSLPCDCLAILPFYHSQVDLDLVDADQTVSRDIPYSPSF-ILDLFNQVDHRIRNVI 259
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
DF F+ G+ P + +L + + TW GR+ TC + ++ +P+I S NLPHD
Sbjct: 260 DFAFLPGFNNPTLAVLFQTQHTWTGRLKEFKDTCNLFIFTLDLVTHMYPIITSVENLPHD 319
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYAVSLDSSQELPRSSFSVE--- 363
+ +L S +GGV+++ N++ Y Q S L +N +A + ++P E
Sbjct: 320 CFAMLPCDSSLGGVVIISCNSLIYVDQASRKTVLPVNGWAARV---SDMPMQQLRPEEMN 376
Query: 364 ----LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI 415
L+ AHAT++ + + T+ G ++ + +V DGR RL L S+T L ++
Sbjct: 377 RDLHLEGAHATFVDSRTFFIITRDGLVLPVEIVMDGRTALRLALHPAMSQTTTPALVRNV 436
Query: 416 TTIGNS--------LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF------GDIEAD- 460
S + F+GS +G S+L++ T ++EE GDI A
Sbjct: 437 AFRSASGDQAPRSQILFVGSTVGPSVLLRVTW----------VEEEIQKDKQQGDIPAAV 486
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS---FAVRDSLVNIGPLKD 517
A + + D + V E + +G + +++A +T S ++ DSL GP+
Sbjct: 487 ADNPMAVDFDDEDDIYGDVAKETQTTHGQPTAASQAAVETKSVIHLSLCDSLSAYGPINS 546
Query: 518 FSYGLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTV-YHKS 557
++ L N D +ATG ++ + L + + G +GIW + +S
Sbjct: 547 MAFALTRNGDRPTAELVAATGYARLGGFTLFQRDVPTRSKRKLHAVGGARGIWCIPVRQS 606
Query: 558 SRGHNADSSR-MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TI 612
+ + ++ SR + E +I+S +A T + D + R TI
Sbjct: 607 LKVNGSERSRNLLPGSSEVVDTVIVSTDANPSPGLTR--FAAKSSRNDIAITARRTETTI 664
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
A F R +I V R+L+ D S + ++ + I+DP++L
Sbjct: 665 GAAPFFQRTAIIHVTTDLIRVLE-----PDCSERQCIRDMDGSNKRPKIRFCCISDPFIL 719
Query: 673 LGMSDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKG------PEPWLRK 724
+ D S+ L VGD + + TP E + + C G E
Sbjct: 720 VIREDESLGLFVGDAERGRIRRKDMTPMG-EKVSRYSAGCFFLDQSGIFELHMSESSPTT 778
Query: 725 TSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGR 784
++D G G D +G + V+C G +EI+ +P VF+
Sbjct: 779 GTSDDKQRMGTGSLESAVDA---QRGTQWLVLCRPQGVVEIWTLPKLALVFSTSSLKDLP 835
Query: 785 THIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAI 844
+ + D++ AL S E+ + ++ +I ++ ++ + P L +
Sbjct: 836 SVVSDSFDPPAL--------SLPEDPPRKPQEADIELLQFAQIG-----ELYPHPHLIVM 882
Query: 845 LTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV----------------------SNV 882
L G + YQA + K D P ST R+ ++ S+V
Sbjct: 883 LRCGQLAIYQAVAVD------KDDFPESTVRTSTLKIKFIKMGTRSFEPRQLEPAEKSSV 936
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RE 941
A + R LR S P E K +S G F++G PCW + ++
Sbjct: 937 IAEQRRALR-SLVPFIVSPNSE-------------KRVS---GVFVTGDEPCWIVATDKD 979
Query: 942 RLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQ--GILKICQLPSGSTYDNYWPVQKV 998
L++H C V +FT + F+ T + G + +P + + K
Sbjct: 980 GLKIHS--CSFQTVNSFTSCSVWDSKCDFLMHTDEAFGPCLLGWIPEFNLGTDM--PSKT 1035
Query: 999 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL-HR 1057
+ + T +T+ A L ++ S V P + D+E G+++ + +++ H
Sbjct: 1036 VTVGRTYTNVTFDAASGL--MVASSVVPNPFT-----IFDEE-GNKLWEPDAPNINYPHS 1087
Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGT 1116
+ E L G +Q +E + V L +T+ + + +GT
Sbjct: 1088 VMSALELFHSDLSCVMDGYEFQ--------PNEFVTALDCVQLETQSTESGTKEFIVVGT 1139
Query: 1117 AYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVY------SKELKGAISALASLQGHLLI 1170
+GED+A +G +F +P+ + + E KG ++AL + G+L+
Sbjct: 1140 TVNRGEDLAVKGVTYVFEIVEIVPDPEGGLARQFKLRLLCKDEAKGPVTALCGMNGYLVS 1199
Query: 1171 ASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
+ G KI + E L G+AF D +YV SL +KN +++GD KS++ ++++E +
Sbjct: 1200 SMGQKIFVRALDLDERLVGVAFLDV-GVYVTSLRTIKNLLIIGDAVKSVWLVAFQEDPFK 1258
Query: 1230 LNLLAKDFGSLDCFATEFLI--DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
L ++AK+ LD +FL DG + VSDE+ I++ Y ES GQ LL R
Sbjct: 1259 LVIVAKEVQRLDVMTADFLFASDGD-FYIAVSDEEGIIRLLEYDTSDPESHSGQYLLRRT 1317
Query: 1288 EFH--VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD-ELT 1344
E+H V +H T ++A S G P + L+ +DGS+ + P+D + +
Sbjct: 1318 EYHAQVESHTTV-----LIARRSQNDGLVPQA------RLISAAVDGSMYALTPVDADES 1366
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
+RLQ LQ +L ++ HVAGLNPR+FR S+G A RP I+D LL+ +E LP+ Q
Sbjct: 1367 AKRLQLLQGQLTRNMQHVAGLNPRAFRAVRSDGVA-RPLTKGILDGNLLAGFEQLPIPRQ 1425
Query: 1405 LEIAHQTGTTRSQILSNLNDLA 1426
EIA GT R +L + +L+
Sbjct: 1426 NEIARPIGTDRLAVLRDRRELS 1447
>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
Length = 1396
Score = 295 bits (755), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 340/1480 (22%), Positives = 617/1480 (41%), Gaps = 234/1480 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS------------------- 97
N+V N + IY VR + SK + K+ M+G+
Sbjct: 38 NVVTARGNTLSIYEVREETATSKSPTEAKSQKKDDAMEGVKEERQTPVVQVRSLSKKTYP 97
Query: 98 ---------AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFD 148
+ LV +RLHG V L + + D ++++F+DAKI++LE+
Sbjct: 98 DSDSHSQPLSTKFHLVREHRLHGVVTGLQAVKIISSLEDHL-DRLLVSFKDAKIALLEWS 156
Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
+ L S+H +E + + +F ++VDPQ RC + + IL Q
Sbjct: 157 TATQDLLTVSIHTYERAIQM-VATDISAFTSE--LRVDPQSRCAALSLPKDAFAILPPCQ 213
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHE 265
+ D S ++NL + +++V DF F+ G+ P + +L+E
Sbjct: 214 VSDSVCRD-----------VPYSPSFILNLPSEVESGIRNVIDFTFLPGFSNPTVAVLYE 262
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
TW GR++ + T ++ ++ +++P+I A LP D +LA PS GGV+VV
Sbjct: 263 TYQTWTGRLNEQKDTVKMAFFTLDIVNRRYPVIGLATGLPCDCLSVLACPS-TGGVMVVA 321
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELP------RSSFSVELDAAHATWLQNDVALL 379
+N+I Y QS + N + S LP + ++EL+ + + ++ + A +
Sbjct: 322 SNSIIYVDQSGRKVVLPVNAWIPRMSDIALPTNLTPEEQARTLELEGSRSIFVDDKTAFI 381
Query: 380 STKTGDLVLLTVVYDGRVVQRLDL-----SKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
K G + + +V GRVV +L L T PS+L I N +GS GDS
Sbjct: 382 ILKDGTIYPVELVTAGRVVSKLALGTPLAKTTIPSILRR----INNDYLLVGSASGDS-- 435
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL--QDM-VNGEELSLYGSAS 491
++LS+ EE D + D + +S AL QD+ ++ ++ +YG +
Sbjct: 436 ---------ALLSTSWVEEVIDDDVDMEAN-----TSVAALEQQDIEMDDDDDDIYGPSI 481
Query: 492 NNTESAQK------------TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGI 533
T ++QK + D+L GP+ D ++ + N D +ATG
Sbjct: 482 IKTGTSQKESAAPMSKKTRSVLRLSFCDALPAYGPIADLTFTVGKNGDRPVAELVTATGS 541
Query: 534 SKQSNYELVE--LP-----------GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYL 579
+ L + LP G +G+W++ +SS A+ H L
Sbjct: 542 GHLGGFTLFQKDLPLRKKKKLPIISGARGVWSLPIRRSSSAAVAE-----------HDTL 590
Query: 580 IISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
IIS +A R V T L+ V+ V G TI AG F R ++ V R
Sbjct: 591 IISTDANPSPGFSRLAVRATKGDLSVVSR-----VNGMTIGAGPFFQRTAILHVMTNAIR 645
Query: 633 IL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
+L DG+ + +D+ + + S SI DPYVL+ D +I L +G+ +
Sbjct: 646 VLEPDGNERQIIKDME---------GNVPRAKIKSCSICDPYVLIFREDDTIGLFIGETT 696
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGP 746
+ + + + ++ + D + + DA T + +D +
Sbjct: 697 RGKIRRKDMSPMGEKSSRYTAGGFFTDTASVFRVYHQNADANTETTIPMHSVVDASSKSQ 756
Query: 747 LDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
+ V+ G +EI+ +P VF+ + + + D+ AL +
Sbjct: 757 ------WLVLVRPQGVVEIWTLPKLTLVFSTTLLATLQNVLTDSQEPPALSPPQDPPRKP 810
Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
E + + ++ + +P L +L G + Y+A+ N +
Sbjct: 811 QE-------------LDIEQILLTNLGQSDPKPHLLVLLRSGHLAIYEAFA----TNQAP 853
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR------ITIFKNI 920
+P R+ S+ +++ ++ + + +ET G ++ F
Sbjct: 854 IVEPPLKPRA------SSLQIQFVKIASKAFEMQRTDETEKGILAEQKKALRTFVPFACA 907
Query: 921 SGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
G F +G RP W + + ++++P ++ AF+ + F+ + +G
Sbjct: 908 GAPAGVFFTGDRPHWIVATDKGGVQMYPS-GHAAVYAFSACTLWERSTEFLIYSEEG-QT 965
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITY--FAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
+C+ + + + + +P++ P Y + +IV+ L+
Sbjct: 966 LCE------WITEYEIGRPLPMRHIPRGRAYSNIVYEPASSMIVAAASLRARFASF---- 1015
Query: 1038 DQEVGHQI---DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
E G+QI D ++ TVE + ++ P+ W T ++E T
Sbjct: 1016 -DEDGNQIWAPDGPGITEP------TVECSTLELISPEV----WATVDGYEFATNEFVNT 1064
Query: 1095 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKE 1153
+ V L +T+ + +A+GT+ V+GED+A +G +F + N Y +
Sbjct: 1065 MECVPLETVSTEAGVKHFIAVGTSIVRGEDLAVKGATYIFEVVEVVPDQSNGPKRWYRLK 1124
Query: 1154 L------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVK 1206
L KG ++AL + +L+ + G KI + + E L G+AF D +YV SL +K
Sbjct: 1125 LRCRDDAKGPVTALCGINNYLVSSMGQKIFVRAFDLDERLVGVAFMDV-GVYVTSLRALK 1183
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
N +L+GD+ + I F++++E +L L +D + +F L++V +DE +
Sbjct: 1184 NLLLIGDVVRGIQFVAFQEDPYKLVTLGRDVSRMCATTVDFFFAEEALAIVTTDENGVMS 1243
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1326
++ Y P+ +S G+ LL + EF++ T F ++A + P L+
Sbjct: 1244 MYNYDPEAPDSHDGRLLLKQTEFNLH---TDFRTSTLIARRTKDDPIIPQG------ILI 1294
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
FG DG++ C+ P+ + +RLQ LQ +L ++ HVAGLNP++ R N RP
Sbjct: 1295 FGGTDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPKALRIVR-NEHVSRPLSKG 1353
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
I+D L++++E LP+ Q E+ Q GT R+ IL + L+
Sbjct: 1354 ILDGNLIAYFEHLPITRQDEMTRQIGTERATILRDWMSLS 1393
>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1486
Score = 293 bits (750), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 358/1428 (25%), Positives = 594/1428 (41%), Gaps = 207/1428 (14%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 95 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVDRNT 154
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 155 LETISIHYYEKEELVGSPWVAPLHHYPTLLLADPASRCAALKFSERNLAILPFKQPDEDM 214
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 215 DMDNWDEELDGPRPKKDLSGAIANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 274
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 275 YRDPTIGVLSSTKTASNSLGHRDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 332
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 333 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 392
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---IGNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T+ +G S F
Sbjct: 393 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRVGRSTVF 452
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + K DI+ D
Sbjct: 453 VGSEEGDSVLLGWTRRQGQTQKR---KSRIQDIDLDLDLDDEDLEDDD-----------D 498
Query: 485 SLYGSASNNTESA--------QKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------- 529
LYG S + E A +F + D L++I P++ +YG + S
Sbjct: 499 DLYGEESTSPEQAISAAKAVKSGELNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSE 558
Query: 530 ----------ATGISKQSNYELV------------ELPGCKGIWTVYHKSS--RGHNADS 565
A G K S ++ E P +G WTV K + D
Sbjct: 559 GVRSDLQLVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDK 618
Query: 566 SRMAA-YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
M+ YD ++H ++I++ E + TA +T + G T+ AG
Sbjct: 619 GPMSNDYDTSGQHHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGT 678
Query: 617 LFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
+ R++QV + R DG ++Q + + E+G+ V + SIADP++LL
Sbjct: 679 MGKDCRILQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIR 733
Query: 676 SDGSIRLLVGDPSTCTV-SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
D S+ + P + V+ + + K ++ C LY D +TG
Sbjct: 734 DDFSVFVAEMSPKLLELDEVEKEDQMLTGTKWLAGC-LYTD----------------TTG 776
Query: 735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMRE 794
V A + A G D +I + SG L I+ +P+ V + +S Y+
Sbjct: 777 V-FADEAAGKGTKD--NILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPP 825
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
L + ++ +GT KE++ + V +L H P+L + + YQ
Sbjct: 826 GLS-----ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQ 873
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----P 910
Y + + + P S S + ++ N F++ P + ++ H A P
Sbjct: 874 PYRVK-----ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPVEDDELHNAQRFLP 921
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
+R T NISG+ FL GS P + + + L + A + H C HGFI
Sbjct: 922 MRRCT---NISGYSTVFLPGSSPSFILKTAKSSPRVLGLQGSGVQAMSSFHTEGCEHGFI 978
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1030
Y + GI ++ Q+P+ S++ K IP+ + Y Y +V +P
Sbjct: 979 YADTNGIARVTQIPTDSSFAELGLSVKKIPVGVDTQSVVYHPPTQAY--VVGCNNAEPFE 1036
Query: 1031 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
L D + + N++ + V+ +++L +G W T+ M+ E
Sbjct: 1037 ----LPKDDDYHKEWARENITFKPM-----VDRGMLKLL----SGITWTVIDTVEMEPCE 1083
Query: 1091 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEV 1149
L V + L + +T E + L+A+GTA +GED+ RGRV +F P T
Sbjct: 1084 TVLCVETLNLEVSESTNERKQLIAVGTALTKGEDLPTRGRVYVFDIADVIPEPGKPET-- 1141
Query: 1150 YSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPP 1196
SK+LK GA++AL+ + QG +L+A G K ++ K GT L +AF D
Sbjct: 1142 -SKKLKLVAKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN- 1198
Query: 1197 LYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
YV S+ + L+ D K ++F + E+ ++ L K ++ +FL DG L
Sbjct: 1199 CYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKEL 1258
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQML--ATSSDRT 1311
+V SD +I I + P+ +S +G LL R F+ GA H T L L + T+S +
Sbjct: 1259 YIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPTTTSPNS 1318
Query: 1312 GAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
+ G + + LL + G + + PL E +RRL SL +L +++PH AGLNP+ +R
Sbjct: 1319 NSEVGENPPH--ILLLASPTGLLATLRPLQENAYRRLSSLAIQLTNALPHPAGLNPKGYR 1376
Query: 1372 QFHSNGKA--HRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTG 1412
+ A PG D+ IVD ++L + L ++ EIA + G
Sbjct: 1377 LPSPSASASMQLPGVDAGIGRNIVDGKILERFMELGTGKRQEIAGRAG 1424
>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
protein 1
Length = 1456
Score = 293 bits (749), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 344/1418 (24%), Positives = 592/1418 (41%), Gaps = 188/1418 (13%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + ++ + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ ++ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+ +
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSVFI 742
Query: 683 LVGDPSTCTVSVQTPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG 741
P + I +S K ++ C LY D ++ + VG+
Sbjct: 743 AEMSPKLLELEEVEKEDQILTSTKWLAGC-LYTD----------TSGVFADETVGKGT-- 789
Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
+ +I + SG L I+ +P+ V + +S Y+ L
Sbjct: 790 -------KDNILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPPGLS---- 830
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
+ ++ +GT KE++ + V +L H P+L + + YQ Y +
Sbjct: 831 -ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-- 880
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIF 917
+ + P S S + ++ N F++ P + ++ PH A P +R +
Sbjct: 881 ---ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS-- 928
Query: 918 KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
NISG+ FL GS P + + + L + A + H C HGFIY + GI
Sbjct: 929 -NISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGI 987
Query: 978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
++ Q+P+ S+Y K IP+ + Y Y +V ++P L
Sbjct: 988 ARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPK 1041
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
D + + N++ + V+ +++L +G W T+ M+ E L V
Sbjct: 1042 DDDYHKEWARENITFKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVET 1092
Query: 1098 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK- 1155
+ L + +T E + L+A+GTA ++GED+ RGRV +F P T SK+LK
Sbjct: 1093 LNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKL 1149
Query: 1156 --------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLN 1203
GA++AL+ + QG +L+A G K ++ K GT L +AF D YV S+
Sbjct: 1150 VAKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVK 1207
Query: 1204 IVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1261
+ L+ D K ++F + E+ ++ L K ++ +FL DG L +V SD
Sbjct: 1208 ELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDA 1267
Query: 1262 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
+I I + P+ +S +G LL R F+ GAH L + A + + + S++ +
Sbjct: 1268 DGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENS 1326
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA-- 1379
LL + G + + PL E +RRL SL +L + +PH AGLNP+ +R + A
Sbjct: 1327 PHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASM 1386
Query: 1380 HRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTG 1412
PG D+ IVD ++L + L ++ E+A + G
Sbjct: 1387 QLPGVDAGIGRNIVDGKILERFLELGTGKRQEMAGRAG 1424
>gi|301628217|ref|XP_002943254.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Xenopus (Silurana) tropicalis]
Length = 628
Score = 292 bits (748), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 204/626 (32%), Positives = 318/626 (50%), Gaps = 74/626 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S + + E K LEL+ + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNPNCESSSKGEKGSEVKGH-------KEKLELMASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F++AK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG Q+++L ++ GLVG+ G +
Sbjct: 135 NVHNPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHDGLVGE--------GQKSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
K HP+IWS NLP+D + LAVP PIGGV++ N++ Y +QS ++LN+
Sbjct: 247 IMQKVHPVIWSLTNLPYDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVSLNSLTNG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
S P+ V LD + AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTSFPLKPQEGLRVTLDCSQATFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ +T + FLGSRLG+SLL+++T S + + D P K+
Sbjct: 367 ASVLTTSMTPMEPGYLFLGSRLGNSLLLRYTEKVQDSPAGPSKDPD----KQDEPPNKKK 422
Query: 468 RRSSSDALQ-----DMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
R SS A +MV+ +E+ +YGS + + T+SF V DS++NIGP S G
Sbjct: 423 RVDSSLARPGGSKGNMVDEIDEIEVYGSEM-QSGTQLSTYSFEVCDSILNIGPCATASMG 481
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 482 EPAFLSEEFQESPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVISN 541
Query: 557 SSRGHNADSSRM------AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
+ A D H +LI+S + TM+L+T + E+ S + Q
Sbjct: 542 HKKEEQEGEKEGETPPVEAEEDTNRHGFLILSRDDSTMILQTGQEIMELDTS-GFATQDP 600
Query: 611 TIAAGNLFGRRRVIQVFERGARILDG 636
T+ AGN+ + ++QV RG R+L+G
Sbjct: 601 TVYAGNIGDNKYIVQVSPRGIRLLEG 626
>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
24927]
Length = 1407
Score = 287 bits (735), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 350/1525 (22%), Positives = 618/1525 (40%), Gaps = 297/1525 (19%)
Query: 57 NLVVTAANVIEIYVVRVQEEGS-----KESKNSGETKRRVLM-----DGISAAS------ 100
NLVV ++++I+ + E+ E+K+ G + RRV D + S
Sbjct: 31 NLVVAKTSLLQIFRLVEYEDAEGEFALDEAKDEGGSDRRVFEGRDHEDSFTVESGMHLQR 90
Query: 101 --------LELVCHYRLHGNVESLAIL----SQGGADNSRRRDSIILAFEDAKISVLEFD 148
L+LV Y L+G+V S+ + S+ G D ++++F+ AKIS+LE+D
Sbjct: 91 ETIEKTTKLDLVAQYHLYGSVTSMVKIRIPTSKSGGD------CLLVSFDSAKISLLEWD 144
Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGV-LVYGL 199
+ H + S+H +E E+ R PL DP+ RC + L
Sbjct: 145 PAAHSISTISLHYYEGDEF-----------RSPLTPEFPINYLISDPKSRCAAFKFNHDL 193
Query: 200 QMII-LKASQGGSGLVGDEDTF-----------------------GSGGGFSARIESSHV 235
I+ + ++ + D D+F G G S V
Sbjct: 194 VAILPFRQTEDEDLEIPDNDSFTYDLEDDDDAEKPKKDVEMKDNTGEGKPSDTPYHPSFV 253
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
++ LD ++ + D +F+H Y EP I+++ + G + + +++ +
Sbjct: 254 LSASQLDESVERIIDIVFLHEYREPTFGIVYQPQQGSVGMLERRKDPTHFIVVTLDLDQR 313
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSS 352
I SA NLP D +K +A+P PIGG L++G + I H A+A+N+YA +
Sbjct: 314 ASTSIMSAKNLPFDIWKAVALPPPIGGTLLLGEHEIVHVDQAGKMSAVAVNSYAQQYSAF 373
Query: 353 QELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-- 408
+S + L++ A L N+ L+ T GD +L+ +GR + L + +
Sbjct: 374 NMTDQSDLELNLESCSAISLPNENGDVLIVTIAGDFAILSFKAEGRSISSLSVRRIQSKD 433
Query: 409 -----SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
S + +GN FFLGS D++L + + LS S
Sbjct: 434 GYPFTSAPCETLVEVGNRRFFLGSLDSDAMLWGYKRKGEKTSLSQK-------------S 480
Query: 464 TKRLRRSSSDALQDMVNGEELSLYG------------SASNNTESAQKTFSFAVRDSLVN 511
+L R ++ + +E LYG S+ N + + F D L N
Sbjct: 481 EVKLERDDAEDNV-EDDDDEDDLYGESTVTPITPRKASSGNIGRGSSGEYVFRRHDRLQN 539
Query: 512 IGPLKDFSYG--------------------LRINADASATGISKQSNYELV-------EL 544
+GP + ++G L G N + +
Sbjct: 540 VGPCRQMAFGRPAMLPEKLKLHQGVLPELELMATTGRGVEGAVTVFNTSICPRVSATFDF 599
Query: 545 PGCKGIWTVYHKSSRGHNA--DSSRMAAYDDE------YHAYLIISLEARTMV------- 589
C+ +W V+ K + + SS Y+++ Y YL S + T+V
Sbjct: 600 KDCQRLWAVHSKQVKKGQSMIPSSVSKGYEEQIGATEDYSTYLFASNTSETLVYKVGTKF 659
Query: 590 --LETADL-LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
LE D+ TEV ++++ G R+ QV E ++ D Q +
Sbjct: 660 EPLEGTDIETTEVCPTLEF---------GTFQDGLRIAQVCETNVKVYDSEL--QLIQII 708
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKK 705
+N E G + ++S S ADPY+LL D SI T + ++ PA I+ +K
Sbjct: 709 STNDEDPDGGPH--IVSASFADPYMLLICGDSSILACQCHERTLELDRIELPATIKDTK- 765
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY---ESGA 762
T+ L T E G V+C+ E G
Sbjct: 766 --------------------YTNGCLYTSSSEV--------FGLGTKSQVLCFLLTEEGT 797
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG-QGRKENIHS 821
L++F +PNF T++ F D ++ S E ++ I
Sbjct: 798 LQVFTLPNFELKATLEHF-----------------DMSLQLVSPDETALRFHTARDEIEE 840
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L A P+L I+ Y+ ++ G
Sbjct: 841 IIVADLGDNISKA----PYLIVKTKRDDIIIYEPFISNG--------------------- 875
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
+ ++ N + P + + +++P G P +I ++ G+ F++G P + +
Sbjct: 876 ICFKKIYN---TVLPTVSLSEQKSPSG-PLVKI---DDLGGYSVAFMAGDTPTFITKSSK 928
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
L +L G + + + + GF+Y+ S+G ++C P S ++ W Q+ IPL
Sbjct: 929 TLPKLYKLQGGMVRSLSPFNTKETERGFLYIDSKGTARVCHFPEVSM-EHTWLSQR-IPL 986
Query: 1002 KATPHQITYFAEKNLYPLIV---SVPVLKPLN-QVLSLLIDQEVGHQIDNHNLSSVDLHR 1057
+ TP +TY+ KN+Y + V S P + + Q+ L+D+ + +++ +L +
Sbjct: 987 ERTPTSLTYYDPKNVYVVSVLSTSKPEVDDEDFQMEEGLVDETLLPELETGHLVMISPVT 1046
Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA 1117
T + YE + E P+ +A + ++ SE TKE + L+A+GT
Sbjct: 1047 WTTTDRYEFPVHEV-----PFVVKA-VELEISE-------------VTKERKVLIAVGTG 1087
Query: 1118 YVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
++GE+ ARG V +F G+ + + + +E+KG +S LA + G+LLI
Sbjct: 1088 LLRGENSPARGAVYVFDVIDVVPEIGKPETGKKFKL--ISREEVKGVVSTLAGMDGYLLI 1145
Query: 1171 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
G K ++ K G+ L +AF D V+ + K ++ GD+ K + F+ + E+
Sbjct: 1146 THGQKCMIRGLKEDGSLLP-VAFMDMNTHTTVAKTLEK-MVMFGDVLKGVSFVGFSEEPY 1203
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
++ L KD L A +FL G+ VV+D Q NI + Y P+ +S G +LL + E
Sbjct: 1204 KMILFGKDPRQLSITAGDFLPAGTACYFVVADAQSNIHVLQYDPENPKSIHGNRLLPKGE 1263
Query: 1289 FHVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1347
+ G V L + + ++ D+ F +F T+ G G ++ + E +RR
Sbjct: 1264 IYCGHEVKSICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGVFGTVSSITESMYRR 1323
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1407
L +Q ++ ++ H+AGLNPR++R + P +I+D +LL + ML + E+
Sbjct: 1324 LNVIQGQITNTGEHIAGLNPRAYRAAKFRNTSSEPM-RAILDGKLLVRWLMLGAGRRKEL 1382
Query: 1408 AHQTGTTRSQILSNLNDLALGTSFL 1432
A + GT+ + +L L T+F
Sbjct: 1383 AGRAGTSEEMLREDLWFLQDATAFF 1407
>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
bisporus H97]
Length = 1413
Score = 287 bits (734), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 342/1451 (23%), Positives = 611/1451 (42%), Gaps = 173/1451 (11%)
Query: 57 NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD-------GIS 97
N+VV +N++ I+ VR + + E + G+T+R V MD I+
Sbjct: 49 NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108
Query: 98 AASLE-----------LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
++++ + +RLHG V E + I+ A + D ++++F+DAKI+
Sbjct: 109 KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D+IH L S+H +E +P+ + L R L +VDP RC + + +
Sbjct: 165 LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V ++D S S +++L + ++++V DF+F+ G+ P
Sbjct: 221 ILPFYQTQAELDVMEQDQSQSK---DVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ IL + + TW GR+ T + ++ + +I S LP+DA+ LL + I
Sbjct: 278 TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337
Query: 319 GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
GGV+V+ N++ Y QS+ +L +N +A + P ++ + L+ + + +
Sbjct: 338 GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDAALKLHLEGCRSAMVDD 397
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
L K G + + ++ DG+ V +L ++ P++ + I T+ + F+GS +G
Sbjct: 398 KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ G K + D + D D + G+
Sbjct: 455 PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505
Query: 491 SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
+N +KT ++RD L GP+ ++ L +N + +ATG + L
Sbjct: 506 ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565
Query: 542 VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
+ LP K +Y SRG + R + H + LI+S + +
Sbjct: 566 FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625
Query: 596 LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
+ + S++ G TI A F R ++ V R+L DG+ + +
Sbjct: 626 VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
+ SIADP+VL+ D SI + V D +P +SS+ ++ C
Sbjct: 679 KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737
Query: 711 TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
G L + + D + +T + GA + ++ G LEI+ +
Sbjct: 738 FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786
Query: 769 PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
P + F+ S ++ + DT+ A Q + + ++
Sbjct: 787 PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833
Query: 829 MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
+ P L L G + Y+A + +N D P +TS + ++A
Sbjct: 834 LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
R + + +T + + G F +G RP W + R ++V+P
Sbjct: 892 IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
+ AFT F+ T G + + +P + +P+++ P
Sbjct: 952 S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPD-------FQFDGPLPMRSIPRG 1003
Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI---DNHNLSSVDLHRTYTVEEY 1064
Y L+ S+ V Q D++ G+ I D N+SS +V+
Sbjct: 1004 RAY--SNVLFDPSTSLIVAASSLQSTFTSFDED-GNNIWEPDAPNISSP------SVDCS 1054
Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGED 1123
+ ++ PD W T ++E + +VTL T+ + +A+GT +GED
Sbjct: 1055 ALELIAPDI----WATMDGFEFATNEYINDMTIVTLETAATETGTKDFIAVGTTIDRGED 1110
Query: 1124 VAARGRVLLFSTGRNADNPQNLVTEVYSKEL--------KGAISALASLQGHLLIASGPK 1175
+A +G +F P V++ +L KG ++A+ L +L+ + G K
Sbjct: 1111 LAVKGATYIFEIAEVV--PDQAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQK 1168
Query: 1176 IILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1234
I + + E L G+AF D +YV SL +KN +L+GD KS+ F++++E +L LL+
Sbjct: 1169 IFVRAFDSDERLVGVAFMDVG-VYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLS 1227
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1294
KD S+ +FL + L LV DE+ I+I+ Y P+ +S +G+ LL EFH G
Sbjct: 1228 KDIQSVCVTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFH-GQR 1286
Query: 1295 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1354
++ ++A + P S LL G+ DGS+ + ++E F+RL LQ +
Sbjct: 1287 --EYRTSVLVAHRIKEDQSIPNS------RLLTGSADGSLASLTIVEEEAFKRLGLLQGQ 1338
Query: 1355 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
L+ ++ H+A LNP++FR N +P I+D LL YE LP+ Q E Q G
Sbjct: 1339 LMRNIQHMAALNPKAFR-IVKNEYVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGAD 1397
Query: 1415 RSQILSNLNDL 1425
R +L + +L
Sbjct: 1398 RVNVLRDWIEL 1408
>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
972h-]
gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
Length = 1441
Score = 287 bits (734), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 337/1444 (23%), Positives = 599/1444 (41%), Gaps = 206/1444 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV ++ G + ++ L G++ D +I+ + AK+S LE+D S+H
Sbjct: 92 LRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTNSLH 148
Query: 161 CFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
+E +K + P + VDP C +L + M+ + L +E
Sbjct: 149 YYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDMEEAA 202
Query: 220 F-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
S S + S V+ LD + + D F++GY EP + IL+ E T +
Sbjct: 203 IENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVTLPL 262
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-HSQS 335
+ T + S +++ + +I + +LP+D Y +++P+P+GG L++G N + Y S
Sbjct: 263 RKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSAG 322
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDLVLLT 390
+ + +N+Y +S F++EL+ A L + +L +G L
Sbjct: 323 RTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFFYLD 382
Query: 391 VVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DG+ V+ L L + N L S IT G +L FLGS+ DS L++++ +
Sbjct: 383 FLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWSRRTT- 441
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
EE E D L ++ + DM++ E +
Sbjct: 442 -------NEEVRLDEGD----DTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLRL 490
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY---ELV------------------ 542
+ D L NIGP+ DF+ G A + Q N+ ELV
Sbjct: 491 EICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFRRNI 545
Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETA 593
+ GC+ +WTV S + N S A Y + E YL++S E + +
Sbjct: 546 FPLIAGEFQFDGCEALWTV-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAG 604
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSES 652
+ EV S D+ +T+ G+L R++Q+ R+ D + +TQ +F
Sbjct: 605 ETFDEVQHS-DFSKDSKTLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNF------- 656
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSI----------RLLVGDPSTCTVSVQTPAAIES 702
S+ V+S SI DP +++ G I RL+ D V+T A++ S
Sbjct: 657 ---SKKQIVVSTSICDPCIIVVFLGGGIALYKMDLKSQRLIKTDLQNRLSDVKT-ASLVS 712
Query: 703 SKKPVSSCTLY----------------HDKGPEPWL-----RKTSTDAWLSTGVGEAIDG 741
L+ +D E L KTS + + G +++
Sbjct: 713 PDSSALFAKLFTYNETLNAKGQIANGMNDSASETDLDIQPNHKTSNNDQM--GYDQSV-S 769
Query: 742 ADGGP--------------LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
AD P LDQ + + G L+++++ +F+ + D F
Sbjct: 770 ADDVPEVDNTIVTEKNVSNLDQESLEKHPILFALTDEGKLKVYNLADFSLLMECDVFDLP 829
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
T + ++ T N S S ++VEL + P LF
Sbjct: 830 PT------LFNGMESERTYFNKES-------------SQELVELLVADLGDDFKEPHLFL 870
Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
I Y+A+L+ NT K + ++ ++ V + +R TP DA +
Sbjct: 871 RSRLNEITVYKAFLYS---NTDKHKNLLAFAK---VPQETMTREFQANVG-TPRDAESTM 923
Query: 904 ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
E + ++T + + H F++G +P + + P + I++
Sbjct: 924 EKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSVAPF 983
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI 1020
H + G+IYV ++IC+ YDN WP +KV L + I Y K +Y +
Sbjct: 984 HAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKV-SLGKQINGIAYHPTKMVYAVG 1042
Query: 1021 VSVPVLKPL-----NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1075
+VP+ + N+ ++ D + + N S+DL T
Sbjct: 1043 SAVPIEFKVTDEDGNEPYAITDDNDY---LPMANTGSLDLVSPLT--------------- 1084
Query: 1076 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS 1134
W + Q E L+V +V L + TTK + +A+GT+ +GED+A RG LF
Sbjct: 1085 --WTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTYLFE 1142
Query: 1135 TGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNG 1188
P ++ + V +E+KG ++ + + G+LL G K+I+ + L G
Sbjct: 1143 IIDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQKVIVRALEDEDHLVG 1202
Query: 1189 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1248
++F D Y +S ++N +L GD+ +++ F+ + E+ ++ L +K +L+ A +FL
Sbjct: 1203 VSFIDLGS-YTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYRMTLFSKGQEALNVSAADFL 1261
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
+ G L VV+D N+++ Y P+ ES G++L++R +FH+G +T + +L
Sbjct: 1262 VQGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIGNVITA---MTILPKEK 1318
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
A G D + F+ + DG + + P+ + +RRL +Q L + V + GLNP+
Sbjct: 1319 KHQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNPK 1378
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
S+R S P I+D L+ ++ + + + E+AH+ G S I+++L +L
Sbjct: 1379 SYRLITSPSNLTNPT-RRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEA 1437
Query: 1429 TSFL 1432
S++
Sbjct: 1438 LSYM 1441
>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1413
Score = 286 bits (733), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 341/1451 (23%), Positives = 610/1451 (42%), Gaps = 173/1451 (11%)
Query: 57 NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD-------GIS 97
N+VV +N++ I+ VR + + E + G+T+R V MD I+
Sbjct: 49 NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108
Query: 98 AASLE-----------LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
++++ + +RLHG V E + I+ A + D ++++F+DAKI+
Sbjct: 109 KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D+IH L S+H +E +P+ + L R L +VDP RC + + +
Sbjct: 165 LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V ++D S +++L + ++++V DF+F+ G+ P
Sbjct: 221 ILPFYQTQAELDVMEQD---QSQAKDVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ IL + + TW GR+ T + ++ + +I S LP+DA+ LL + I
Sbjct: 278 TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337
Query: 319 GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
GGV+V+ N++ Y QS+ +L +N +A + P ++ + L+ + + +
Sbjct: 338 GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDATLKLHLEGCRSAMVDD 397
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
L K G + + ++ DG+ V +L ++ P++ + I T+ + F+GS +G
Sbjct: 398 KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ G K + D + D D + G+
Sbjct: 455 PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505
Query: 491 SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
+N +KT ++RD L GP+ ++ L +N + +ATG + L
Sbjct: 506 ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565
Query: 542 VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
+ LP K +Y SRG + R + H + LI+S + +
Sbjct: 566 FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625
Query: 596 LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
+ + S++ G TI A F R ++ V R+L DG+ + +
Sbjct: 626 VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
+ SIADP+VL+ D SI + V D +P +SS+ ++ C
Sbjct: 679 KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737
Query: 711 TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
G L + + D + +T + GA + ++ G LEI+ +
Sbjct: 738 FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786
Query: 769 PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
P + F+ S ++ + DT+ A Q + + ++
Sbjct: 787 PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833
Query: 829 MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
+ P L L G + Y+A + +N D P +TS + ++A
Sbjct: 834 LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
R + + +T + + G F +G RP W + R ++V+P
Sbjct: 892 IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
+ AFT F+ T G + + +P + +P+++ P
Sbjct: 952 S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPD-------FQFDGPLPMRSIPRG 1003
Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI---DNHNLSSVDLHRTYTVEEY 1064
Y L+ S+ V Q D++ G+ I D N+SS +V+
Sbjct: 1004 RAY--SNVLFDPSTSLIVAASSLQSTFTSFDED-GNNIWEPDAPNISSP------SVDCS 1054
Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGED 1123
+ ++ PD W T ++E + +VTL T+ + +A+GT +GED
Sbjct: 1055 ALELIAPDI----WATMDGFEFATNEYINDMTIVTLETAATETGTKDFIAVGTTIDRGED 1110
Query: 1124 VAARGRVLLFSTGRNADNPQNLVTEVYSKEL--------KGAISALASLQGHLLIASGPK 1175
+A +G +F P V++ +L KG ++A+ L +L+ + G K
Sbjct: 1111 LAVKGATYIFEIAEVV--PDQAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQK 1168
Query: 1176 IILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1234
I + + E L G+AF D +YV SL +KN +L+GD KS+ F++++E +L LL+
Sbjct: 1169 IFVRAFDSDERLVGVAFMDVG-VYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLS 1227
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1294
KD S+ +FL + L LV DE+ I+I+ Y P+ +S +G+ LL EFH G
Sbjct: 1228 KDIQSVCVTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFH-GQR 1286
Query: 1295 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1354
++ ++A + P S LL G+ DGS+ + ++E F+RL LQ +
Sbjct: 1287 --EYRTSVLVAHRIKEDQSIPNS------RLLTGSADGSLASLTIVEEEAFKRLGLLQGQ 1338
Query: 1355 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
L+ ++ H+A LNP++FR N +P I+D LL YE LP+ Q E Q G
Sbjct: 1339 LMRNIQHMAALNPKAFR-IVKNEYVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGAD 1397
Query: 1415 RSQILSNLNDL 1425
R +L + +L
Sbjct: 1398 RVNVLRDWIEL 1408
>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
SS1]
Length = 1431
Score = 286 bits (732), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 346/1405 (24%), Positives = 606/1405 (43%), Gaps = 182/1405 (12%)
Query: 103 LVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC 161
LV +RLHG V L A+ + ++ + D ++++F+DAKI++LE+ D+IH + S+H
Sbjct: 123 LVREHRLHGTVTGLEAVRTVHSLED--KLDRLLVSFKDAKIALLEWSDAIHDVMTVSIHT 180
Query: 162 FE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK--ASQGGSGLVGDED 218
+E +P+ + L RG L +VDP RC + + + IL SQ L+ E
Sbjct: 181 YERAPQLMALD---SPLFRGEL-RVDPLSRCAALSLPKDSLAILPFYQSQAELDLMEQES 236
Query: 219 TFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ +S S V++L D+D +++V DF F+ G+ P + +L + + TW GR+
Sbjct: 237 SQARDVPYSP----SFVLDLANDVDQRIRNVIDFAFLPGFNNPTVAVLCQYQQTWTGRLK 292
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ- 334
T + ++ +PLI + LP+D L + IGGV ++ +N I + Q
Sbjct: 293 EYKDTVGLFIFTLDLVTNNYPLITAVDGLPYDCLSLTPCSTAIGGVFILASNAIIFVDQA 352
Query: 335 SASCALALNNY---AVSLDSSQELPRSSF-SVELDAAHATWLQNDVALLSTKTGDLVLLT 390
S L +N + L P+ +++L+ A T++ + + K G + +
Sbjct: 353 SRRVILPVNGWPPRTSDLTMPSLTPQEQLRNLQLEGARFTFVDDKTLFVILKDGTVHPVE 412
Query: 391 VVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+V DG+ V RL ++ T P+V + + + F+GS +G S+L++ T+
Sbjct: 413 LVLDGKTVSRLSMADALARTTIPAV----VARVRDDYLFVGSMVGPSVLLR------TAH 462
Query: 446 LSSGLKEEFGDIEAD-----APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK- 499
+ +KEE D++A AP+ D NGE+ S G+ + +S +K
Sbjct: 463 VEEVIKEEDVDMDAGPATVVAPADTMDLDDDDDLYGPSGNGEQPSANGATNGTVDSVKKR 522
Query: 500 -TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------------ISKQSNY 539
++ D+L G + D ++GL N D +ATG + +S
Sbjct: 523 TVVRLSLCDALPAHGAISDMAFGLARNGDRVVPELIAATGSGELGGFHLFQRDMPTRSKR 582
Query: 540 ELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLL 596
+L + G +G+W++ ++ + R ++ +D +IIS +A + A
Sbjct: 583 KLHAIGGARGMWSLAVRQAMKVSGGTLERPSSQNDS----VIISTDANPSPGLSRIATRS 638
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVF---ERGARIL--DGS--YMTQDLSFGPSN 649
++ + G T+ A F ++ + R+L DG+ + +DL
Sbjct: 639 AHSDIAITTRIPGTTLGAAPFFQGTAILHILFNVTNAIRVLEPDGTERQIIKDLE----- 693
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
+ + S SI DP+VL+ D +I L +G+ + + + + +
Sbjct: 694 ----GTAPRPKIKSCSICDPFVLIIREDDTIGLFIGELERGKIRRKDMSPMGDKTSRYVA 749
Query: 710 CTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI-------YSVVCYESGA 762
+ D T L T V E + QG + + ++ G
Sbjct: 750 GGFFTD-----------TSGLLQTFVNEQAPAENVTSTLQGAMNAGNKSQWLILVRPQGV 798
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
+E++ +P F+ + + D+Y AL S ++ + ++ +I +
Sbjct: 799 VELWTLPKLTLAFSTTLLATLDPILTDSYDGPAL--------SLPQDPPRKPQELDIDQI 850
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
+ L R RP L +L G + Y+A P S + S +L V V
Sbjct: 851 VIAPLGESR-----PRPHLIVLLRSGQLAVYEAVAIPPPPEPLPS----TRSSTLLVKFV 901
Query: 883 S-ASRLRNLRFSRTPLDAYTREE----------TPHGAPCQRITIFKNISGHQGFFLSGS 931
AS+ +++ + E+ AP Q + G F +G
Sbjct: 902 KVASKAFDIQHPEEEQKSVLAEQKRISRLLVPFVTSPAPGQTFS---------GVFFTGD 952
Query: 932 RPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
RP W + + ++V P + AFT F+ + +G + +P D
Sbjct: 953 RPSWILSTDKGGVKVFPS-GHSVVQAFTTSSLWESRGDFLLYSEEGPSLVEWMPD-VQLD 1010
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
+ P + V ++ P+ F LIV+ + N+ S D V + D+ N+
Sbjct: 1011 GHLPARSVP--RSRPYSNVVFDAST--SLIVAASSFQ--NRFASYDEDGNVVWEPDSPNI 1064
Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-E 1109
SS L T+E ++ PD W T +E + + L +T+ +
Sbjct: 1065 SS-PLCECSTLE-----LISPDG----WITMDGYEFAPNEFVNCIVSIPLETMSTESGMK 1114
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALAS 1163
+A+GT +GED+A +G V +F +P V + +L KG +S L
Sbjct: 1115 DFIAVGTTINRGEDLAVKGAVYIFEIVEVVPDPSTHVKRWWRLKLLCRDDAKGPVSFLCG 1174
Query: 1164 LQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+ G+L+ + G KI + + E L G+AF D +YV SL VKN +++GD KS++F++
Sbjct: 1175 INGYLVSSMGQKIFVRAFDLDERLVGVAFLDV-GVYVTSLRAVKNLLVIGDAVKSVWFVA 1233
Query: 1223 WKEQGAQLNLLAKDFGSLDCF--ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
++E +L +L KD L C A F DG LS+V DE+ ++++ Y P ES G
Sbjct: 1234 FQEDPYKLVVLGKD-PQLCCITRADLFFADGQ-LSIVTCDEEGIVRLYAYDPHDPESKSG 1291
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
Q LL R EFH + R ML + G + + L+ G++DGS+ + +
Sbjct: 1292 QHLLRRTEFHGQSE----YRSSMLVARRPKN----GDPEIPQARLVCGSVDGSLSTLTYV 1343
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
DE +RL LQ +L+ +V HVA LNP++FR N RP I+D LL+ +E LP
Sbjct: 1344 DEAASKRLHLLQGQLIRTVQHVAALNPKAFRMVR-NEYVSRPLSKGILDGNLLATFEDLP 1402
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDL 1425
+ Q E+ Q GT R+ +L + L
Sbjct: 1403 IARQNEVTRQIGTDRATVLKDWASL 1427
>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1406
Score = 285 bits (730), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 337/1475 (22%), Positives = 602/1475 (40%), Gaps = 237/1475 (16%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSGETKR-------RVLMD---------- 94
N+VV +N++ I+ VR +Q + E + + +R V MD
Sbjct: 51 NVVVARSNLLRIFEVREEPCPIQNQADDERERRSKVRRGTEAVEGEVAMDEQGDGFINIA 110
Query: 95 --------GISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFEDAKIS 143
+ V + LHG +E + I+S R D ++++F+DAKI+
Sbjct: 111 KSQKCPTHTPTVTRFYFVREHHLHGIVTGIEGVKIMSS----LEDRLDRLLISFKDAKIA 166
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D++H L S+H +E +P+ + + S R L + DP RC + + +
Sbjct: 167 LLEWSDAVHDLITVSIHTYERAPQLMSID---SSLFRTEL-RTDPISRCAALSLPRHALA 222
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V D+D S +++L D ++++V DF F+ G+ P
Sbjct: 223 ILPFYQSQAELEVMDQD---QSQAKDVPYSPSFILDLPAQVDQNIRNVIDFAFLPGFNNP 279
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ +L + + TW GR+ T + ++ + +P+I S LPH+ LL + +
Sbjct: 280 TIAVLFQTQQTWTGRLREFKDTVRLVIFTLDIVTQNYPIITSVEGLPHECLALLPCGTSL 339
Query: 319 GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFSVE-------LDAAHAT 370
GGV+++ +N I Y QS+ L +N + + ++P S + E L+ + A
Sbjct: 340 GGVVIITSNAIIYTDQSSKRVVLPVNGWVSRI---SDIPLPSLTPEEQLRNICLEGSRAV 396
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLG 426
++ + + K G + L +V DG+ V +L +S P + + I ++ L F +G
Sbjct: 397 FVDDRNLFVILKDGTVYPLEIVVDGKTVSKLTMS---PPLAQTSIPSVLRKLDDDHFLVG 453
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFG---DIEADAPSTKRLRRSSSDALQDMVNGEE 483
S +G S+L++ ++ ++EE D+EA AP+T + D N
Sbjct: 454 SSVGPSVLLK----------AAHIEEEVAEDHDMEA-APATVVYDADDMEFDDDDGNLPR 502
Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQS 537
++ + ++RDSL GP+ D ++ L N D +ATG
Sbjct: 503 VA-------QPMAKPTVIHLSLRDSLPAYGPISDMTFSLAKNGDRPVPELVAATGSGFLG 555
Query: 538 NYELVE--LP-----------GCKGIWTV------------YHKSSRGHNADSSRMAAYD 572
+ L + LP G +G+W++ Y K+ A++ +
Sbjct: 556 GFTLFQRDLPVRTKRKLHVIGGARGLWSLPIRQPVKASGISYEKAVNPFQAENDSLIIST 615
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
D + +S + V+ TA + G TI A F R V+ V R
Sbjct: 616 D-INPSPGLSRAGKNDVMITAR------------IPGTTIGAAPFFQRTTVLHVMTNALR 662
Query: 633 ILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
+L+ G + +D+ + + SI+DP+VL+ D SI L +G+
Sbjct: 663 VLEPGMQIIKDMD---------GNMPRPRIRACSISDPFVLILREDDSIGLFIGETERGK 713
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
+ + + + SC G E ++T +++ + A++ G
Sbjct: 714 IRRKDMSPMGDK----VSCFYTDTTGLLESNFENSTTPVGVTSTLSAAVNAGSKGQ---- 765
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ ++ G +E++ +P F+ D S + +VD++ A
Sbjct: 766 --WLILVRPQGIVELWTLPKLTLGFSADGLTSLQNVLVDSHDPPA-------------PS 810
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
Q V ++ + RP L L G + Y+
Sbjct: 811 LPQDPPRKPQEFDVEQILVAPIGESSPRPHLCVFLRSGQLTIYEVLPLG----------- 859
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGH------- 923
T+ +L + +++ ++ S + EE G ++ I++
Sbjct: 860 -RTTEALPKVRPAHVKIKFVKISSMAFEIQRPEEGEKGIIAEQKRIYRMFVPFVTSASPG 918
Query: 924 ---QGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
G F +G RP W + ++++P + AFT F+ T +
Sbjct: 919 VTFSGVFFTGDRPNWIFGTDKGGVQIYPS-GHAVVNAFTPCSLFESKGDFLMYTEEA--S 975
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVL---SLL 1036
+ + YD +PL++ P Y S V P +L S L
Sbjct: 976 VSKWLPDFHYDG------PLPLRSVPRGRAY-----------SSLVFDPSTSLLVAASSL 1018
Query: 1037 IDQEVGHQIDNHNLSSVDLHR--TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
+ + D++ + + + + ++ PD W T ++E
Sbjct: 1019 QAKFASYDDDDNKIWEPETPNIGNPMCDTSTLELISPDM----WITMDGFEFATNEYIND 1074
Query: 1095 VRVVTLFNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKE 1153
V VTL T+ ++ +A+GT +GED+AARG ++ +P Y
Sbjct: 1075 VACVTLETAGTEVGSKDFIAVGTTIDRGEDLAARGATYIYEIVEVVPDPAISPKRWYKLR 1134
Query: 1154 L------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVK 1206
L KG ++A+ G+L+ + G KI + + E L G+AF D +YV SL +K
Sbjct: 1135 LRCRDDAKGPVTAVCGFHGYLVSSMGQKIFVRAFDSDERLVGVAFMDVG-VYVTSLRTLK 1193
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
N +L+GD KS+ F++++E +L LL KD + +F LSLV DE+ ++
Sbjct: 1194 NLLLVGDAVKSLSFIAFQEDPYKLVLLGKDTQHVCVTNADFFFTDGELSLVTGDEEGIMR 1253
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1326
++ Y P+ +S G+ LL R EFH + + T + R P + L+
Sbjct: 1254 MYEYNPQDPDSKDGRYLLLRTEFHGQSE------YRTSTTIARRLKDDPSIPQAK---LI 1304
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
G DG + + P++E F+RLQ LQ +L ++ HVAGLNP++FR N +P
Sbjct: 1305 IGGTDGCLSSLTPVEEHAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVR-NDFVSKPLSKG 1363
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
I+D LL+HYE LP+ Q E+ Q GT R +L +
Sbjct: 1364 ILDGNLLAHYESLPIIRQNEMTRQIGTDRVTLLRD 1398
>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
B]
Length = 1440
Score = 285 bits (728), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 358/1488 (24%), Positives = 625/1488 (42%), Gaps = 227/1488 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDGISAASLE- 102
N+VV ++++ I+ VR +E S+ E +RR V MDG L
Sbjct: 49 NVVVARSSLLRIFEVR-EEPAPISSQKEDERERRASVRKGTEAVEGEVEMDGSGEGFLNM 107
Query: 103 ---------------------LVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFE 138
L+ +RLHG +E + I++ R D ++++F+
Sbjct: 108 GSVKSTAQNGSVQPPTINRFYLIREHRLHGIVTGIEGVRIVTS----LEDRLDRLLVSFK 163
Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
DAKI++LE+ D++H L S+H +E +P+ + L +S P ++ DP RC +L+
Sbjct: 164 DAKIALLEWSDAVHDLVTVSIHTYERAPQLMAL----DSSLFRPTLRADPLSRCAALLLP 219
Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
+ IL Q + L V ++DT S +++L D +++V DF+F+
Sbjct: 220 RDSIAILPFYQSQAELDVVEQDT---SQLRDVPYSPSFIVDLSAEVDDRIRNVIDFVFLP 276
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
G+ P + +L +++ TW GR+ T + ++ + +P+I S LP+D + + A
Sbjct: 277 GFNNPTIAVLFQKQQTWTGRLREYKDTVSLYIFTLDLVTRNYPVITSTEGLPYDCFAVAA 336
Query: 314 VPSPIGGVLVVGANTIHYHSQSA-SCALALNNY-------AVSLDSSQELPRSSFSVELD 365
+ +GGV+++ +N I Y QS+ AL +N + V S+QE R + L+
Sbjct: 337 CSTALGGVVILASNAIIYVDQSSRRVALPVNGWPPRVSDMPVQALSAQEQLR---DLRLE 393
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGN 420
+H ++ + + K G + + +V DG+ V +L +S T P+V + +
Sbjct: 394 GSHFVFVDDRTLFIILKDGTVYPVELVLDGKSVSKLTMSSAVARTTIPTV----VRRVQT 449
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
F+GS +G S+L++ T+ + + +E D+E T + D+ M
Sbjct: 450 DHLFIGSTVGPSVLLK------TARVEEDIADE--DVEMSVAPTAVV-----DSTDTMDL 496
Query: 481 GEELSLYGSASNNTE------------SAQKT-FSFAVRDSLVNIGPLKDFSYGLRINAD 527
+E LYGS T S ++T ++ DSL GP+ D ++ L N D
Sbjct: 497 DDEDDLYGSTKETTHRVDGLVNGAADASKKRTVVHLSLCDSLPAHGPIADMTFALAKNGD 556
Query: 528 ------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRM 568
+ATG + L + LP G +G+W++ + + N +
Sbjct: 557 RAVPELVAATGSGTLGGFTLFQRDLPTRVKRKLHAIGGGRGMWSLPVRQAVKVNGSTYEK 616
Query: 569 AAYDDEYHAY---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
A + +H+ +IIS +A + A ++ + G TI A F +
Sbjct: 617 PA--NPFHSVNDSVIISTDANPSPGLSRIASRNQNGDITITTRIPGTTIGAAPFFQGTAI 674
Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+ V ++ + D S + + + SI DP+VL+ D +I L
Sbjct: 675 LHVMYNVTNVI--RVLEPDGSERQIIKDVDGNVARPKIRACSICDPFVLIIREDDTIGLF 732
Query: 684 VGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDG 741
+G+P + + + + + + + ++ C G + L + +T ++
Sbjct: 733 IGEPERGKIRRKDMSPMGDKTSRYLTGCFFTDTTGTFQTHLNPLAAGTEAATSTLQS--A 790
Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
+ G Q + ++C G LEI+ + F+ S + +VDTY L
Sbjct: 791 INAGSRSQ---WLILCRPQGTLEIWTLSKLTLAFSTTLIPSLESVVVDTYDVPHL----- 842
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
S ++ + ++ +I + V L RP+L L G + Y+ P
Sbjct: 843 ---SLPQDPPRKPQELDIEQIVVAPLG-----ESSPRPYLTVFLRSGQLAVYETIPVAPP 894
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
DP+ SRS ++ +RF + A+ ++ + K IS
Sbjct: 895 A------DPLPNSRSCTIL---------VRFRKVLSKAFDIQQQNEEVEKSVLAEQKRIS 939
Query: 922 ------------GH--QGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIV-AFTVLHNVNC 965
G G F +G RPCW + + ++V P S+V AFT
Sbjct: 940 RLLIPFVTSPNPGQTLSGVFFTGDRPCWILSTDKGGVKVFPS--GHSVVHAFTASSVWES 997
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
F+ + +G + +P G D + P + +P + Y +L V
Sbjct: 998 KSDFLLYSEEGPSLLEWIP-GVQLDGHLP-SRTVPRNKAYSNVVYDPSTSLI-----VAA 1050
Query: 1026 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIP 1085
++ S D + + D N+ S+ T T+E +L PD W T
Sbjct: 1051 SSSQSRFASYDEDGNIVWEPDASNI-SLPFCETSTLE-----LLSPDG----WVTLDGYE 1100
Query: 1086 MQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
+E + VTL ++T+ + + +GT +GED+A +G +F +P
Sbjct: 1101 FAPNEFVNCLDCVTLETSSTESGTKDYIVVGTTINRGEDLAVKGAAYVFEIIEVVPDPTA 1160
Query: 1145 LVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPL 1197
+ + +L KG ++A+ + G+L+ + G KI + + E L G+AF D +
Sbjct: 1161 QMKRWHRLKLHCRDDAKGPVTAMCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLDV-GV 1219
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
YV SL VKN +++ D KS++F++++E +L +L KD L +F +S++
Sbjct: 1220 YVTSLCAVKNLLVISDAVKSVWFVAFQEDPYKLVILGKDPYPLYVTKADFFFAEGRVSII 1279
Query: 1258 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH--VGAHVTKFL--RLQMLATSSDRTGA 1313
DE ++I Y P ES GQ LL R EFH V + L RL+ + R
Sbjct: 1280 SCDEDGVMRILEYDPHDPESKNGQHLLRRTEFHGQVEYRTSAILARRLKGVDIPQSR--- 1336
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
L+ G DGS+ + ++E +RL LQ +L +V HVAGLNPR FR
Sbjct: 1337 -----------LICGLTDGSLITMTYVEEAASKRLHLLQGQLTRNVQHVAGLNPRGFRIV 1385
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
N RP I+D LL YE LP+ Q E+ Q GT R+ IL +
Sbjct: 1386 R-NDYVSRPLTRGILDGNLLMAYEDLPIVRQDEVTRQIGTDRTTILKD 1432
>gi|431908147|gb|ELK11750.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 671
Score = 285 bits (728), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 212/661 (32%), Positives = 328/661 (49%), Gaps = 114/661 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 309
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT+V DG R V+ K
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLVTDGLRSVRAFHFDKAA 369
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC----GSGTSMLSSGLKEEFGDIEADAPS 463
SVLTS + T+ FLGSRLG+SLL+++T +++ + KEE P
Sbjct: 370 ASVLTSSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPASTVREAADKEE--------PP 421
Query: 464 TKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
+K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP
Sbjct: 422 SKKKRVDSTVGWSGGKSVAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPC 477
Query: 516 K-------------------------DFSYGLRINADASATGISKQSNYELV-------- 542
+ + GL + + S + + E+V
Sbjct: 478 ANAAMGEPAFLSEEVPVWEVQGGGGVECTVGLWPHPSLAQFQNSPEPDLEIVMCSGYGKN 537
Query: 543 ------------------ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDDE 574
ELPGC +WTV ++ +G + S+ A D
Sbjct: 538 GALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEQEETPKGEAVEPEPSAPDADDDGR 597
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L
Sbjct: 598 RHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLL 656
Query: 635 D 635
+
Sbjct: 657 E 657
>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1393
Score = 284 bits (727), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 337/1402 (24%), Positives = 590/1402 (42%), Gaps = 209/1402 (14%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
P S +G+ FLGS DS+L+ G S+ SSG K+ +
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E D+ +S D +D + E+ + G + ++F D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
GPLKD ++G + S ELV +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560
Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
C +WT +S H ++ + A + E H Y+++S +A + E +++ +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613
Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
+ F + TI G L G+ RV+Q+ R DG DL P E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
S E +S S+ DPYV + D ++ LL D S V+ I +SK +SC
Sbjct: 669 DTS--EERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
LY DK TG+ +I A L Q + + + L I+ +P+
Sbjct: 725 LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
+ + G + E K S T +E + + V +L
Sbjct: 768 KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
WS S P+L + Y+ ++ T +P + L +N+ R+
Sbjct: 809 WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
+ D + EE P + I NISG F G P + + L
Sbjct: 858 -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGI------LKICQLPSGSTYDNYWP--VQKVIPLKA 1003
G + + C GFI + S+ + L C L + +Y+P +Q+ IP+
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKVLCFILLCLTYCILSFHTGCHSYYPWTIQQ-IPIGE 972
Query: 1004 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE 1063
+ Y + +Y + S L D E+ + N S V+
Sbjct: 973 QVDHLAYSSSSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQR 1021
Query: 1064 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1122
++++ P W T+ +E+ + V+ ++L + T E + ++ +GTA+ +GE
Sbjct: 1022 SSLKVVSPKT----W----TVIDSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGE 1073
Query: 1123 DVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPK 1175
D+A+RG V +F + +P+ + V + +KGA++AL+ + QG L++A G K
Sbjct: 1074 DIASRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQK 1133
Query: 1176 IILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGA 1228
I+ K G+ L +AF D +++VK ++ D K ++F + E+
Sbjct: 1134 CIVRGLKEDGSLLP-VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPY 1188
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
+++L AKD L+ A +FL DG+ L ++V+D N+ + Y P+ +S G +LLSR++
Sbjct: 1189 KMSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSK 1248
Query: 1289 FHVGAHVTKFLRLQMLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
FH G ++ L + SS++ A K R +L + +GS+G + + E +
Sbjct: 1249 FHTGNFISTLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEES 1308
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
+RRL +LQ +L +++ H GLNPR+FR S+G A R ++D +LL + + + +
Sbjct: 1309 YRRLSALQSQLTNTIEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRK 1364
Query: 1405 LEIAHQTGTTRSQILSNLNDLA 1426
+EIA + G +I ++ ++
Sbjct: 1365 VEIASRVGANEWEIKADFEAIS 1386
>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Anolis carolinensis]
Length = 1294
Score = 283 bits (725), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 213/680 (31%), Positives = 336/680 (49%), Gaps = 89/680 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + +S S E K LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDSESTTKSDRSSEGKSH-------KEKLELVAAFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + DE G G + S++I
Sbjct: 135 NVHIPKVRVDPDGRCAVMLIYGTRLVVLPFRRD---TLTDEHEGVVGEGQKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F++GY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DIRELDEKLLNIIDMQFLYGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSLNSLTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTVFPLRIQEGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT+ + T+ FLGSRLG+SLL+++T +++ K+ E KR
Sbjct: 366 AASVLTTCMITMDPGYLFLGSRLGNSLLLRYTEKLQEPPVNAA-KDATEKTEEPPVKKKR 424
Query: 467 LRRSSS-----DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ + ++ A QD V+ E+ +YGS + + + T+SF V DS++NIGP + + G
Sbjct: 425 VEQQANWAGGKSAPQDEVD--EIEVYGSEAQSG-TQLSTYSFEVCDSILNIGPCANAAMG 481
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 482 EPAFLSEEFQNSLEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 541
Query: 557 SSRGHNADSSRMAAY----------DDEYHAYLIISLEARTMVLETADLLTEVTESVDY- 605
D+ +A D + H +LI+S E TMV + +D
Sbjct: 542 QKAEQEEDAQGESAEKEPSPPEPPDDGKRHGFLILSREDSTMVNPANGPTGQEIMELDTS 601
Query: 606 -FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
T AGN+ R ++QV G R+L+G L F P + S ++
Sbjct: 602 GLAPRSTQDAGNIGENRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQC 651
Query: 665 SIADPYVLLGMSDGSIRLLV 684
++ADPYV++ S+G + + V
Sbjct: 652 AVADPYVVIMSSEGQVTMFV 671
Score = 168 bits (426), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 140/560 (25%), Positives = 243/560 (43%), Gaps = 90/560 (16%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ E+G +EI+ +P + VF V F G+ +VD+ + +E + EE
Sbjct: 786 WCVLVRENGTMEIYQLPEWRLVFLVKNFPMGQRVLVDSSFGQPASQAE----AKKEEVIR 841
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
Q + + +V L ++ SRP+L + D +L Y+A+ + S+
Sbjct: 842 QTEMPLVKEVLLVALGNRQ-----SRPYLL-VHVDQELLIYEAF-----NHDSQLGQTNL 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREE--TPHGAPCQRITIFKNISGHQGFFLSG 930
R V + R + R S+ ++ EE P G R F++I G+ G F+ G
Sbjct: 891 KVRFKKVPHNINFREKKPRPSKKKTESAGGEEASVPRGR-VARFRYFEDIYGYSGVFICG 949
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + VTS+G L++ + +
Sbjct: 950 PSPHWLL----------------------------------VTSRGALRLHPMTIDGPIE 975
Query: 991 NYWPVQKVIPLKATPHQITYFAEKN----LYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1046
++ P V P YF + ++ +P + + + I++ V +
Sbjct: 976 SFAPFHNV----NCPKGFLYFNRQGTGGGIHNACSRIPRMTGEDDMEFETIERGVLKCVP 1031
Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1106
DL ++ ++ ++ E+ ++ V+L + T
Sbjct: 1032 GEGFGHPDLILSFKID-----------------------LEEWEHVTCMKTVSLKSEETV 1068
Query: 1107 EN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISA 1160
+ +A+GT +QGE+V RGR+L+ P +T+ +Y KE KG ++A
Sbjct: 1069 SGLKGYIAVGTCLMQGEEVTCRGRILIMDIIEVVPEPGQPLTKNKFKVLYEKEQKGPVTA 1128
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
L G+L+ A G KI L +L G+AF D LY+ + VKNFIL D+ KSI
Sbjct: 1129 LCHCNGYLVSAIGQKIFLWSLKDNDLTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISL 1187
Query: 1221 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
L ++E+ L+L+++D L+ + +F++D L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 1188 LRYQEESKTLSLVSRDAKPLEVYCVDFMVDSCQLGFLVSDRDRNLLVYMYLPEAKESFGG 1247
Query: 1281 QKLLSRAEFHVGAHVTKFLR 1300
+LL RA+FHVGAHV F R
Sbjct: 1248 MRLLRRADFHVGAHVNAFWR 1267
>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
musculus]
Length = 1311
Score = 283 bits (724), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 196/601 (32%), Positives = 310/601 (51%), Gaps = 68/601 (11%)
Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
+RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G P V+VDP
Sbjct: 11 KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPRVRVDPD 67
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HV 246
GRC +L+YG ++++L + + +E G G + S++I++R LD K ++
Sbjct: 68 GRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNI 124
Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K HP+IWS +LP
Sbjct: 125 IDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPF 184
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELD 365
D + LAVP PIGGV++ N++ Y +QS +ALN+ + + + LD
Sbjct: 185 DCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLD 244
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFF 424
A A ++ D ++S K G++ +LT++ DG R V+ K SVLT+ + T+ F
Sbjct: 245 CAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 304
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-----SSDALQDMV 479
LGSRLG+SLL+++T SS E D E KR+ + QD V
Sbjct: 305 LGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVGWTGGKTVPQDEV 362
Query: 480 NGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------L 522
+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G L
Sbjct: 363 D--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDL 418
Query: 523 RI------NADASATGISKQSNYELV---ELPGCKGIWTVYH----------KSSRGHNA 563
I + + + + K ++V ELPGC +WTV K+
Sbjct: 419 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQE 478
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
S+ A D H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R +
Sbjct: 479 PSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYI 537
Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+QV G R+L+G L F P + + ++ ++ADPYV++ ++G + +
Sbjct: 538 VQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMF 587
Query: 684 V 684
+
Sbjct: 588 L 588
Score = 280 bits (717), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 202/679 (29%), Positives = 317/679 (46%), Gaps = 89/679 (13%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 703 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 758
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 759 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 808
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 809 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 868
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 869 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 928
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 929 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 980
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENE 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 981 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET---- 1032
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHL 1168
V G LKG ++A L QG
Sbjct: 1033 ---------VSG--------------------------------LKGYVAAGTCLMQGEE 1051
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
+ G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1052 VTCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESK 1109
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+
Sbjct: 1110 TLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRAD 1169
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDEL 1343
FHVGAHV F R + GAA G K N+ F TLDG IG + P+ E
Sbjct: 1170 FHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEK 1222
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E
Sbjct: 1223 TYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTME 1282
Query: 1404 QLEIAHQTGTTRSQILSNL 1422
+ E+A + GTT IL +L
Sbjct: 1283 RSELAKKIGTTPDIILDDL 1301
>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1418
Score = 283 bits (723), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 343/1435 (23%), Positives = 584/1435 (40%), Gaps = 217/1435 (15%)
Query: 99 ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
+ L LV Y L G V SLA + D D++++AF DAK+S++E+D + H L S
Sbjct: 81 SKLVLVAEYPLEGTVLSLARIK--ALDTKSGGDALLIAFRDAKMSLVEWDPANHALSTIS 138
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-GDE 217
+H +E E + + DP RC + + IL Q G LV GD+
Sbjct: 139 IHYYEGEELHGAPWDADLGHYHNFLAADPSSRCAALKFGARHLAILPFRQLGDDLVEGDD 198
Query: 218 ---------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+ +G +SS ++L +D + H F+H Y EP
Sbjct: 199 YDPDFDEPMDAPAAKEKATNGDVAQTPYKSSFALSLPQIDPALTHPVHLDFLHEYREPTF 258
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
I+ + A + + + ++ K + S LP+D +K++ +P P+GG
Sbjct: 259 GIISANKAAAASLLYERRDLLTYTVFTLDLEEKASTALLSVAGLPYDTHKVIPLPLPVGG 318
Query: 321 VLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVA 377
L++GAN IH + A+A+N++A S +S ++ L+ A L +N
Sbjct: 319 ALLLGANQFIHVDQAGKTSAVAVNDFAKQCSSFPMSDQSELAMRLEGASIELLSPENGDL 378
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLG 430
L+ K G L +++ DGR V L + K + S T++G + F+GS G
Sbjct: 379 LVVLKDGSLAVISFKLDGRSVSGLSIRKISEEKGGHVVPTAASCTTSLGRNRMFIGSEDG 438
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
DS+L+ +T + LS K ++ AD D +G + SA
Sbjct: 439 DSVLLGWT--KKAAQLSR--KRSHAEMLADDAELSFDEEDLEDDDDLYGDGPSTAKTASA 494
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN-YELV------- 542
S+ S ++F + D ++++ P+KD + D + + + ++ +LV
Sbjct: 495 SSEA-SDPSNYTFRIHDIMLSLAPIKDVALASHKVTDTAIGTLERAADQLDLVVSTGRGA 553
Query: 543 -------------------ELPGCKGIWTVYHK--SSRGHNADSSRMA----AYDDEYHA 577
E + +W+V+ K + +G A S+ A A D +Y
Sbjct: 554 AGGLALMRREIDPVILRKGEFSNARAVWSVHAKKPAPKGMVAAGSQDAEAKLAADVDYDQ 613
Query: 578 YLIISL------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+LI+S E + TA E + TI G + G R++QV +
Sbjct: 614 FLIVSRSNGDGGEESAIFNITATGFEETNKGDFEREDAATINVGTIAGGTRIVQVLKAEI 673
Query: 632 RILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC 690
R D + Q L P E+GS ++S S ADPY+L+ D S+ +L D +
Sbjct: 674 RSYDSELGLDQIL---PMEDENGS---ELRIISASFADPYILVIRDDSSVIVLQADANGE 727
Query: 691 TVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
+ + S+K WLS + ++ +
Sbjct: 728 MEEIDRGDTLLSTK-------------------------WLSGCIHQSQSTGEKA----- 757
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ + G L IF++P+ + V + ++ L T SSS
Sbjct: 758 --LAYLLSAEGGLHIFELPDLSKPVYVAASLG--------FLPPTLTADFTPRRSSS--- 804
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
K + + V EL + + P+L + ++ YQ Y F E
Sbjct: 805 -----KAALTEVIVAELG----DSTYKTPYLIVRTSSNDLVIYQPYHFPAHEVVKP---- 851
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
+L + RL FS P A E+T G TI N+ G+ F++G
Sbjct: 852 --FFENLRWLKIPQPRLPE--FSEEP--ALESEDTGIGKESILTTI-ANVGGYSAVFMAG 904
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
+ P + + L ++ S+ + H C+ GF Y+ + G L++CQLP G Y
Sbjct: 905 TSPSFILKESSSLPRVIKMRTKSVKNLSSFHRAECDRGFAYINADGNLRVCQLPRGYRYG 964
Query: 990 DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
D W V+K I + + Y K++ L++ V KP L + E H+ N
Sbjct: 965 DAGWAVKK-ISINQDVQAMCYHPPKDV--LVLGVGDKKPFT-----LPEDEHHHEWLEEN 1016
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTL-FNTTT 1105
++ + VE+ +++L+ Q+ A I +++ E LT++V+ L + T
Sbjct: 1017 ITFKPM-----VEQGMIKVLDT-------QSLAVIDTYELEAFEVVLTIKVLNLEVSENT 1064
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISA 1160
E + L+A+GT +++GED+ +RG + +F P T + +E++G+++A
Sbjct: 1065 HERKQLVAVGTGFIRGEDLPSRGCIYVFEVINVVPEPGRPETNRRLKLIAKEEVRGSVTA 1124
Query: 1161 LASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIH 1215
+ + QG LL+A G K ++ K GT L +AF D V+ + +L+GD
Sbjct: 1125 ITDVGSQGFLLMAQGQKCMVRGLKEDGTLLP-VAFMDMQCYVTVAKELNGSGMLLMGDAA 1183
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
K +F+ + E ++ L K ++ A +FL L L+V+D N+ Y P
Sbjct: 1184 KGAWFVGYTEDPYKMILFGKSRSKMEVMAADFLPHDKQLYLMVADGDCNLHALQYDPDHP 1243
Query: 1276 ESWKGQKLLSR---------------------------AEFHVGAHVTKFLRLQMLATSS 1308
+S GQ+LL + A+ H HV+ + A
Sbjct: 1244 KSLSGQRLLHKSTFHTGHFTTTMTLLPSSLSPTVSPSSADEHANGHVSPSPSPENDAMDI 1303
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
D AP + +L T GS+ + PL E +RRL +LQ L+ ++ H GLNPR
Sbjct: 1304 D---PAPAGTVQH---ILLTTQTGSLALLTPLSEQQYRRLGALQTYLIGALEHWCGLNPR 1357
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
++R S G R IVD LL+ + L + + E A + G + S+L
Sbjct: 1358 AYRAVESEGFGSR----GIVDGALLARWCELGSQRRAEGAAKVGVEEWVVRSDLE 1408
>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pongo abelii]
Length = 1388
Score = 282 bits (722), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 208/704 (29%), Positives = 332/704 (47%), Gaps = 90/704 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 731 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 787 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
L N+ +RF + P + RE T GA + R
Sbjct: 830 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFR 879
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG + +F HNVNC GF+Y
Sbjct: 880 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNR 939
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATP-HQITYFAEKNLYPLIVSV--PVLKPLNQ 1031
Q ++ PS + P L P H +Y + S P +
Sbjct: 940 QEPQRLSGSPSRTXXXXPTPPG----LLGLPGHWCVTPTNPQVYAVATSTNTPCAR---- 991
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSS 1089
I + G + + + + + E + ++++ P W+ A I +Q
Sbjct: 992 -----IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEW 1042
Query: 1090 ENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1148
E+ ++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1043 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 1102
Query: 1149 -----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1203
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ +
Sbjct: 1103 NKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMI 1161
Query: 1204 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1263
VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +
Sbjct: 1162 SVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDR 1221
Query: 1264 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT--- 1320
N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K
Sbjct: 1222 NLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGLSKKSVV 1274
Query: 1321 --NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
N+ ++ G IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + +
Sbjct: 1275 WENKHITWLVSVRGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRR 1334
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1335 TLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1378
Score = 172 bits (437), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 102/269 (37%), Positives = 152/269 (56%), Gaps = 29/269 (10%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
T K HP+IWS +LP D + LAVP PI
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPI 275
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 167/347 (48%), Gaps = 56/347 (16%)
Query: 379 LSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+S K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++
Sbjct: 285 ISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY 344
Query: 438 TCG----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASN 492
T +++ + KEE + +T + QD V+ E+ +YGS A +
Sbjct: 345 TEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVD--EIEVYGSEAQS 402
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASA 530
T+ A T+SF V DS++NIGP + + G L I + +
Sbjct: 403 GTQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGAL 460
Query: 531 TGISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHA 577
+ + K ++V ELPGC +WTV + G + S A DD H
Sbjct: 461 SVLQKSIRPQVVTTFELPGCYDMWTVIAPLRKEEEDNPKGEGTEQEPSTPEADDDGRRHG 520
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 521 FLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG- 578
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 579 --VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616
>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
Length = 1468
Score = 282 bits (722), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 205/694 (29%), Positives = 335/694 (48%), Gaps = 71/694 (10%)
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
+G L I+ +P+F F +F IVD DS G K I
Sbjct: 811 TGILRIYSLPDFKEHFACPQFSIAPDLIVD--------DS--------------GVKSRI 848
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ + E+ M P L I+ Y+A+ + + + S + V
Sbjct: 849 PTNNIQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYV 908
Query: 880 SNVSASRLRNLRFSRTPLDAYTREET--------------PHGAPCQR--ITIFKNISGH 923
S S+S + R +D + +T QR + F +++G+
Sbjct: 909 SRKSSSHESKPKKKRGIIDEFEIPDTDLNEEEEDLKLSTKKMDKKIQRKLLIPFTDVAGY 968
Query: 924 QGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQ 982
G F++G++P W M + +RVHP + IV FT HNVNC HGFI V S+ +++ +
Sbjct: 969 AGVFVAGAQPAWLMCSCKSFVRVHPMKTEHEIVGFTQFHNVNCQHGFITVDSKSTIQLSR 1028
Query: 983 LPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
L + G YD W +QKV+ L T H+I Y +Y ++VS V + + ID
Sbjct: 1029 LRTEGINYDLDWVIQKVL-LGQTVHKIQYHPVMRVYAVLVSSSVPTRMKNDDNQYID--- 1084
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
G + D +E++ + ++ P W+ + + E ++ L
Sbjct: 1085 GKETDERGPGEF----LPEMEQFSMILVSP----VTWEIVDKVEFEEFEQCFSLECALLD 1136
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQ--NLVTEVYSKELK 1155
T+ + + IGT ++GED +G + ++ DNPQ + V ++++K
Sbjct: 1137 SKQTSTGRKYYMIIGTGTLKGEDTTMKGSIRMYDIIEVVPEPDNPQTNHKFKPVLTEDVK 1196
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
GA++A+ ++ GHL G K+I+ E L G+AF D +YV S++ +KNFIL+GD
Sbjct: 1197 GAVTAMCTVSGHLAACIGSKVIVWSLEDDERLVGVAFIDVQ-IYVTSMSSIKNFILIGDA 1255
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
KSI+FL ++ + A+L LL KD+ S D +F+ID +L L+V D +NI ++ YAP
Sbjct: 1256 QKSIWFLGFQLEPAKLTLLGKDYQSFDVGCVDFIIDDKSLYLIVGDTNENIDLYQYAPFN 1315
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
+S+ GQKL+ R +FHVG+ V +RL + + G + + R L GT +GSI
Sbjct: 1316 LQSFGGQKLMRRGDFHVGSQVQTMVRLPQIEKTEK------GFEYSRRHFCLCGTFNGSI 1369
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD---SIVDCE 1391
I+ + E TF+RL +L LV+++ HVAGLNPR+FR G R + +++D +
Sbjct: 1370 AVISSISEKTFKRLNTLYGHLVNNLQHVAGLNPRAFRLI--KGPKQRMSTNRTKAVLDGD 1427
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
L+ + L +EEQ E Q GTT ++I+ +L D+
Sbjct: 1428 LIFEFAGLSIEEQKETTKQIGTTVTRIMEDLVDI 1461
Score = 179 bits (453), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 163/650 (25%), Positives = 277/650 (42%), Gaps = 97/650 (14%)
Query: 88 KRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
K+ ++ + LELV ++++G + ++ + DS++L F DAK+S+LE+
Sbjct: 87 KKGGMISDTTLGRLELVAQFKMNGIITTMGTVRTNSPRGREGCDSLLLGFSDAKMSLLEW 146
Query: 148 DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL---VKVDPQGRCGGVLVYGLQMIIL 204
S + + S+H +E E+ ++ F P + +DPQ RC Y ++ +L
Sbjct: 147 SSSTNSIITVSIHYYERDEF------KKEFLTNPYPSAIHIDPQQRCAVFNFYDNKLAVL 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
Q S + + G S +I+L LD +K+V D F+ Y EP + I
Sbjct: 201 PFRQ--SDKLDERQGEGEEDEEKWPYYPSFIIDLATLDSRIKNVIDMTFLSDYYEPTLAI 258
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
L + E TW GR+ T + +S+ T K +P+I+S LP+D +KL+A+P P+ G+L
Sbjct: 259 LFQPEQTWTGRLGNNKDTVSLVVISLDITAKIYPIIYSIDKLPYDCFKLVAMPKPVTGML 318
Query: 323 VVGANTIHYHSQ-SASCALALNNYAVSLDSSQELPRSSFS-------VELDAAHATWLQN 374
V+ AN+I + SQ S +A+N Y + + P + + L+ A A
Sbjct: 319 VIAANSILHVSQGSPGMGVAVNGYT---KKTTDFPGMIYEPSLIELGLSLEGAKALAFGG 375
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN---------------PSVLTSDITTIG 419
D L+ + G L+ V DG V + +S+ P +L S + +
Sbjct: 376 DRCLIFMQNGHWALVEVRRDGNKVVGMAISEIKHDLPVMEKKPPRFDTPPLLASVPSCVT 435
Query: 420 N----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
N FFLGSR+GDSLL+++ + D + + D +
Sbjct: 436 NVKAGEYFFLGSRVGDSLLIKYDANRVNHQSVAPPVFRVCDTMLNTGPIVDMAVGDVDTV 495
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
+ + +L L S+ + A F +I P F++ D+ A
Sbjct: 496 EQQEDWPQLELVSSSGHGKNGALCVFQ-------RHIYPQTSFAFH---QFDSQA----- 540
Query: 536 QSNYELVELPGCKGIWTVY-HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
IW++ K+ + N DD++ L IS T+VL D
Sbjct: 541 --------------IWSIKCRKNDQQQNE--------DDDFDKLLFISKSKSTLVLSAGD 578
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSES 652
L EV ++ +G TIA LF R++QV+ G +L +G + ++
Sbjct: 579 ELQEV--KTGFYTRGSTIAVSTLFDATRIVQVYATGVMVLTPEGKRI-----------QT 625
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-TVSVQTPAAIE 701
+ ++ SI DPY+LL + + I L GD ST + +Q P I+
Sbjct: 626 VPIPRGAKIVEASIHDPYILLTLDNNKILALQGDASTKDIIHIQLPNHIK 675
>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
JAM81]
Length = 1673
Score = 282 bits (721), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 220/775 (28%), Positives = 362/775 (46%), Gaps = 126/775 (16%)
Query: 753 YSVVCYESGALEIFDVPNFN--CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ V ++G L ++ +P+F C F + F + +D + + T N++ +E
Sbjct: 930 WCFVYTDTGHLLVYTLPDFKECCAFPL--FSTLPVLAMDVPLWRSRSIDSTFANTTGDE- 986
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
+ VV L S P+L + +G + Y+ +F P TS +DD
Sbjct: 987 --------FEEILVVNLGN---SKDRQTPYLVCLAANGDLAVYK--IFVCP--TSSNDDD 1031
Query: 871 VS--------TSRSLSVSNVSASRLRN---LRFSRTPLDAYTRE---------------E 904
S SR+ + + A L+ +R R P D TR+ +
Sbjct: 1032 TSFVNSGTFKQSRTPAELELDAQNLKKRLAIRLVRIPHDQITRDLQFYTDNEGDKIDLVQ 1091
Query: 905 TPHGAPC----QRITIFKNI--SG---HQGFFLSGSRPCWCMVFRER------------- 942
P P Q + F I SG + G ++GSRPCW MV +
Sbjct: 1092 EPQHQPTFLKRQHLKPFDAIGWSGGNMYSGVVVTGSRPCWIMVALQSRQQDLDVISFDNS 1151
Query: 943 -----------------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
LR HP DG + F LHNVN HGF+Y+ +G+ +ICQLP
Sbjct: 1152 VACSTKLPPVPLLGTNMLRFHPMPVDGPMKCFAPLHNVNVAHGFLYINWKGLFRICQLPP 1211
Query: 986 GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS------VPVLKPLNQVLSLLIDQ 1039
+D+ WPV KV P+ T H++ Y Y + S +P + + V + +ID+
Sbjct: 1212 QFNFDHDWPVCKV-PIHKTVHKVAYHYSSQTYAIATSTPERFDIPHAQYASAVAAAVIDE 1270
Query: 1040 EVGHQIDNHNLSSVDLHR---------TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
G ++ + + TV+ Y++ ++ + W+T +I + +E
Sbjct: 1271 --GDEMPDAERKVTGIRELSEIKPGMYEATVDRYKIELV----SSVTWETVDSIELSEAE 1324
Query: 1091 NALTVRVVTLFNTTTKENETL-LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1148
+ + V L + T + L LAIGT Y +GED+++RG++ L+ +P N T
Sbjct: 1325 TVMALEAVDLSSKETISGKKLYLAIGTGYSRGEDLSSRGKLHLYDVIEVVPDPNNPQTNR 1384
Query: 1149 ----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1204
V S++ + SA+ ++ +LL A GPKII+++ E+ G+AF D ++V SL+
Sbjct: 1385 KFKHVDSEDDRSPFSAICTVNDYLLAAIGPKIIMYQLEDGEITGVAFLDVN-VFVTSLSS 1443
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
VKN I + DI KS++F++++E+ A+L +L +D L +A LID + L+L+V+D KN
Sbjct: 1444 VKNLIQICDIQKSVWFVAFQEEPAKLAVLGRDVHPLQGYAANMLIDDNQLALLVADGDKN 1503
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+ YAP +S G++L+ + E H+G HV+KF+R++ R A S + A
Sbjct: 1504 LHTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKFIRMRRKPLL--RNDAIVFSKQYLNVA 1561
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS-------NG 1377
TLDG++ I P+ E F+RL L ++V S+ H+AGLNPR FRQ +G
Sbjct: 1562 ---ATLDGALEIITPVSERIFKRLYGLYSRMVTSIEHIAGLNPRGFRQAQHRVRPITLSG 1618
Query: 1378 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
PGP I+D +LL Y L +Q +A G+ +++ +L ++ G F
Sbjct: 1619 FIGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAIGSKDDRLMDDLLEVLTGLDFF 1673
Score = 197 bits (502), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 181/744 (24%), Positives = 330/744 (44%), Gaps = 152/744 (20%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
AA LEL +R+HGN+ SL ++ + S + D+++L+F++AK+S++E+ L
Sbjct: 87 AACLELAAQFRVHGNITSLGVVPM---NYSGKADALLLSFKEAKMSLVEYSQFTQKLVTV 143
Query: 158 SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
SMH FE E+ L S R P +KVDPQG C + +YG ++ IL Q G+ L+ D
Sbjct: 144 SMHYFEREEFKKLG----SIDRPPPEIKVDPQGYCAAMRIYGDRLAILPFKQDGADLLND 199
Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
+ S F I V+ DLD ++++ DF F+ GY P + I+++ E TW R+
Sbjct: 200 LNDANSKYPFRPSI----VLPFLDLDKSIRNIIDFTFLFGYAVPTIAIMYQTEQTWTARL 255
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--- 331
+ T I+ +S+ T + +P+++ LP++ L++VP+PIGG++V+ N I +
Sbjct: 256 GIRKDTVSIAVISLDTAEESYPVLYKIEKLPYNCTMLVSVPTPIGGLIVLSHNAIIFTDQ 315
Query: 332 -HSQSASCA----------LALNNYAVSLDSSQELP---------------RSSFSVELD 365
H+ +C + L Y + LD Q P ++ LD
Sbjct: 316 IHAPGIACIVNAYFDSETNIMLTPYELQLDMVQPRPPRPPSVFFAQNKYTDYKELAISLD 375
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRV----------VQRLDLSK---------- 405
+ ++ D+ LL + G+++ + ++ + V V+ L++
Sbjct: 376 GSRGMFISPDIFLLVLRDGEMIQVDLIGEEGVGRSWKRRKGGVKSFQLTRLGIRMTAPVH 435
Query: 406 -------TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE 458
+NP L+ +++ FLGSR G L + S + + L +F ++E
Sbjct: 436 LFPLADASNPLSLSGRNSSVPLGGSFLGSR-GSKLRYNYLFASSRTTDACLL--QFVEVE 492
Query: 459 ADAPSTKRLRRSSSDALQDMVNGE----ELSLYGSASNNTES------------------ 496
A S+ + +++ + + NGE + LYG ++ ++
Sbjct: 493 EFAKSSVSMNGAAN--MNNTDNGEDDELDKDLYGDSTTAKQTDTDMSALLSSDEHGHGEI 550
Query: 497 -AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD---------------ASATG-------- 532
+++T F + DS+ + PL+DF+ GL +ATG
Sbjct: 551 VSEQTLRFRLCDSVTVVSPLRDFAVGLPAETSEHRFSPKIGGCDLEIVAATGHGPHGHLA 610
Query: 533 -ISKQSNYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
+++ ++V ELP + +WT+ + D ++ D +H Y+I+S + T
Sbjct: 611 ILNRSVRPQIVTTFELPQIEEMWTI---RCAKFDKDYRLVSEPTDAFHKYVILSHSSGTS 667
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF--- 645
+L+ + TE+ ++ ++ G T+ G L ++QV G + D + D +
Sbjct: 668 ILKAGEAFTEMDDTT-FYQAGPTVGVGALLDETIIVQVHPNGVILFD--FSKYDFTIIDR 724
Query: 646 --------------GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
G E G ++ V+S S DPY +L M+ G I LL D +T
Sbjct: 725 LNTNRMHALYIFVEGTKLQEMRVGDDDIWVISCSFMDPYAMLLMNTGHIVLLSLDETTHQ 784
Query: 692 VSVQTPAAIESSKKPVSSCTLYHD 715
++ + E K+ VS+ +LY D
Sbjct: 785 ITQIS----EYKKRLVSTFSLYCD 804
>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
Length = 1460
Score = 281 bits (718), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 369/1518 (24%), Positives = 611/1518 (40%), Gaps = 270/1518 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
NLVV +++++++ +V + S +SG R DG+ A+
Sbjct: 28 NLVVAKSSLLQVFRTKVVSTELEASPDSGHRSRNAARYESRLANDDDGLEASFLGGDSLA 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDS 150
L LV L G V LA + A + DS+++A +DA++S++E+D
Sbjct: 88 LRTDRANVTKLVLVAETPLAGTVTGLARIKTPHARHGC--DSLLIALKDARLSLVEWDAE 145
Query: 151 IHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMIIL 204
H L S+H +E E + S PL ++ DP RC + + IL
Sbjct: 146 RHALATVSIHYYEQEEL------QGSPWAAPLSHYVNFLEADPGSRCAALKFGARNLAIL 199
Query: 205 KASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN-----------------LRDLD--MK 244
Q + +GD D G A+ +SS VI+ L +LD +
Sbjct: 200 PFRQADEDIDMGDWDG-ELDGPRPAKDQSSAVIDGASNIEDTPYSPSFVLRLSNLDPSLL 258
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
H F+H Y EP IL H T M+ L + K I S L
Sbjct: 259 HPVHLAFLHEYREPTFGILASTASASNSLGRKDHFTYMVFTLDLQQ--KASTTILSVGGL 316
Query: 305 PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
P D ++++ +P+P+GG L+VG+N IH + +A+N S + +S ++
Sbjct: 317 PQDLFRVVPLPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTRQCTSFGLVDQSELNLR 376
Query: 364 LDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT 417
L+ L D+ L+ G + L+T DGR V L+L + + S++ +TT
Sbjct: 377 LEGCVVDVLTADLGELLVILNDGRMALVTFRIDGRTVSGLELRMLPASSGGSIIPGRVTT 436
Query: 418 ---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+G + F G GDS+L FG + + + +R R+
Sbjct: 437 LSRVGRNAMFAGLEEGDSVL-------------------FGWAKKQSQAGRRRPRAKDAV 477
Query: 475 LQ------------DMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGPLK 516
LQ + + ++L A+ S+ + +F + D LV+I P++
Sbjct: 478 LQMDEEAGEEEEEEEDEDEDDLYGEEPAARQQPSSTASSLMTGDLTFRIHDRLVSIAPIQ 537
Query: 517 DFSYGLRINADAS-----------------ATGISKQSNYELV------------ELPGC 547
+YG + S A G K ++ + E P
Sbjct: 538 AMTYGQPVWLPGSEEERNSAGVHSDLQLVCAVGRDKSASLATINLAIAPKVIGRFEFPEA 597
Query: 548 KGIWTV-----YHKSSRGHNADSSRMAAYD--DEYHAYLII------SLEARTMVLETAD 594
+G WT+ KS +G A +S YD +Y ++I+ E + TA
Sbjct: 598 RGFWTMCAKKPIPKSLQGDKAGASLGNGYDTSGQYDKFMIVGKVDLDGYEKSDVYALTAA 657
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
+ + G TI AG + R+IQV + R DG + ++Q L P E
Sbjct: 658 GFESLGGTEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIL---PMQDEE- 713
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
+G+E V S S+ADP++L+ D S+ + D S + + S+ K ++ C LY
Sbjct: 714 TGAEPRAV-SASVADPFLLIIRDDSSVFIARIDSSNELEELDKDDPVLSTTKWLTGC-LY 771
Query: 714 HDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
D S + +G+ A + + SGAL I+ +P+
Sbjct: 772 AD----------SAGVFAEESMGKPASTAQC-------VLMFLLSASGALYIYRLPDLAR 814
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
V + +S Y+ L + + +GT KE + + V +L
Sbjct: 815 PIYVAEGLS--------YIPPGLS-----ADYAGRKGTA---KETLAEILVADLG----D 854
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
+ H P+L + + YQ + S+ + S +L V N +
Sbjct: 855 STHKSPYLILRHANDDLTLYQPF-------RSRKATEQAFSETLFFQKVP-----NTALA 902
Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDG 952
++P +A +E H + N+ G+ F+ G+ P + + + + RV P L
Sbjct: 903 KSPQEA-DEDEASHQPRFLSMRRCDNVGGYSTVFVPGASPSFIIASSKSMPRVMP-LQGS 960
Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFA 1012
++A + H C HGFIY S+ I ++CQ P G Y + IP+ + Y
Sbjct: 961 GVIAMSPFHTEGCEHGFIYADSRRIARVCQFPDGCIYAETGVAVRKIPIGEDIAAVAY-- 1018
Query: 1013 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1072
+P + S V ++ L D + + NLS TV+ +++L P
Sbjct: 1019 ----HPPMQSYVVGCNTSEPFELPKDDDYHKEWARENLSF-----KPTVDRGILKLLSPI 1069
Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
W + M+ E L V + L + T E + L+A+GTA +GED+ RGRV
Sbjct: 1070 T----WTVVDAVQMEPCETILCVETLNLEVSEFTNERKQLIAVGTALTKGEDLPTRGRVY 1125
Query: 1132 LFSTGRNADNPQNLVT----EVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KW 1181
++ P T ++ +KE +GA++AL+ + QG +L+A G K ++ K
Sbjct: 1126 VYDIADVIPEPGRPETGKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKE 1185
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1239
G+ L +AF D YV + + LL D K ++F + E+ ++ L K
Sbjct: 1186 DGSLLP-VAFMDM-NCYVTAAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKSSTK 1243
Query: 1240 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKF 1298
L+ +FL DG LS VVSD I I + P+ +S +G LL R F+ GA H TK
Sbjct: 1244 LEVLNADFLPDGKELSFVVSDADGYIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHATKS 1303
Query: 1299 LRL-------------------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
L L S ++ P + + + LL + G + + P
Sbjct: 1304 LLLPASTPADKEKNDGNAANAQAKAKASDNKQPREPAAQRPH--VLLLASPTGVLAALRP 1361
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLS 1394
L E +RRL SL +L +S+PH AGLNPR +R + + G D SIVD +L
Sbjct: 1362 LSESAYRRLSSLAAQLTNSLPHPAGLNPRGYRA--AGAECPPAGVDAGLGRSIVDGTVLE 1419
Query: 1395 HYEMLPLEEQLEIAHQTG 1412
+ L + ++E+A + G
Sbjct: 1420 RFAELGMARRVELAGRAG 1437
>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Nomascus leucogenys]
Length = 1177
Score = 280 bits (717), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 202/651 (31%), Positives = 321/651 (49%), Gaps = 115/651 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++ T++L
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKY--------------------------TEKL 400
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
+ + A+++ + EE S +K R++A
Sbjct: 401 QEPPASAVREAADKEE----------PPSKKK-----------------------RVDAT 427
Query: 528 ASATGISKQSNYELV----ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD- 573
+G ++S V ELPGC +WTV + G + S A DD
Sbjct: 428 VGWSGEGQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTPEADDDC 487
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+
Sbjct: 488 RRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRL 546
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L+G L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 547 LEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 587
Score = 165 bits (417), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 94/250 (37%), Positives = 137/250 (54%), Gaps = 26/250 (10%)
Query: 1192 YDAP------PL--------YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1237
YDAP PL Y V + NFIL D+ KSI L ++E+ L+L+++D
Sbjct: 925 YDAPWPVXKIPLRCTAHYVAYHVESKVCPNFILAADVMKSISLLRYQEESKTLSLVSRDA 984
Query: 1238 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1297
L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+FHVGAHV
Sbjct: 985 KPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNT 1044
Query: 1298 FLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1352
F R + GA G K N+ F TLDG IG + P+ E T+RRL LQ
Sbjct: 1045 FWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1097
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+ E+A + G
Sbjct: 1098 NALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIG 1157
Query: 1413 TTRSQILSNL 1422
TT IL +L
Sbjct: 1158 TTPDIILDDL 1167
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/290 (29%), Positives = 136/290 (46%), Gaps = 21/290 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 702 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 757
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 758 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 807
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 808 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 867
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 868 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 927
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYP-LIVSVPVLKPLNQVLSLLIDQE 1040
WPV K IPL+ T H + Y E + P I++ V+K +SLL QE
Sbjct: 928 PWPVXK-IPLRCTAHYVAYHVESKVCPNFILAADVMKS----ISLLRYQE 972
>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
Length = 1408
Score = 280 bits (717), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 339/1432 (23%), Positives = 604/1432 (42%), Gaps = 197/1432 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ + Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
++++C+ P + +D W +K I L + Y + Y + + V +
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARK-IGLGEQVDAVEYSSSSETYVIGTNQKV------DFN 1007
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
L D E+ + N +S + +++ V++L P W + ++++E +
Sbjct: 1008 LPEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMC 1058
Query: 1095 VRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTE 1148
V+ + L + T E + + +GTA +GED+AARG + +F + D P+ +
Sbjct: 1059 VKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIKVVPEVDRPETNRKLKL 1118
Query: 1149 VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNI 1204
+ +E+KGA+++L+ + QG L+ A G K I+ K G+ L +AF D YV L
Sbjct: 1119 IAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKE 1176
Query: 1205 VKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1262
+K ++GD K ++F + E+ +L+L +KD G+L A +FL DG+ L ++V+D+
Sbjct: 1177 LKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDD 1236
Query: 1263 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1322
NI + Y P+ S KG +LL R+ F G + L ATSS + G D +
Sbjct: 1237 CNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDL 1295
Query: 1323 FA------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
+ +L + GSI I P+ E ++RRL +LQ +L +++ H GLNPR+FR S+
Sbjct: 1296 DSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESD 1355
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
G R +VD +L+ + L + + EIA++ G +I ++L + G
Sbjct: 1356 GIGGR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1403
>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Amphimedon queenslandica]
Length = 1408
Score = 280 bits (716), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 177/530 (33%), Positives = 273/530 (51%), Gaps = 43/530 (8%)
Query: 917 FKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
F NI+G+ G F+ G P W M R L +HP DG + +F NVNC GF+Y +
Sbjct: 894 FSNIAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGPVQSFAPFDNVNCPSGFLYFNKE 953
Query: 976 GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1035
L+I LP+ +YD+YWPV+KV PLKATPH + Y E ++ +I S P Q +++
Sbjct: 954 SELRISVLPTQLSYDSYWPVRKV-PLKATPHFVGYHMESKVHVIIASTP------QPVTV 1006
Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
+ D G D D Y+ EE Y +++L P W+T IP E
Sbjct: 1007 IPDPN-GETEDALETVERDGRFVYSQEETYYLQLLSPTS----WET---IPHSKYEMEAH 1058
Query: 1095 VRVVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----Q 1143
V + + ETL + +GT GE+++A+G+VL+F P Q
Sbjct: 1059 YHVTDMKVMRLRSQETLSGRKEYIVVGTMATFGEELSAKGKVLIFDVSVVIPEPGKPFSQ 1118
Query: 1144 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSL 1202
+ +Y +E K ++ L + G +L A G KI + ++ +L +AF DA Y+ +
Sbjct: 1119 YRLKNLYDQEQKWPVTGLECVNGLILTAMGQKIFMWQFKDNKDLLAVAFIDAE-TYIHTA 1177
Query: 1203 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1262
+K FIL GD+ +SI L + E L+L+++D ++ F+T F+IDG L +VSD
Sbjct: 1178 QSIKGFILTGDVTRSIQLLHYNEDRRSLSLISQDPNPMEVFSTTFMIDGKALGFLVSDSD 1237
Query: 1263 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1322
+NI +F Y P+ S G L+ + HVG+ V FL ++ +T A G+ + +
Sbjct: 1238 RNITLFQYQPENPASSGGANLVRCGDIHVGSLVNVFLNIRC------KTSAGLGASREMK 1291
Query: 1323 FAL-------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
AL FGTLDG IGC+ P+ E +RRL LQ K+ + H+AGLNP++FR F +
Sbjct: 1292 IALADKRQCTFFGTLDGGIGCLLPIPEKVYRRLSMLQVKMTQGMRHMAGLNPKAFRTFQT 1351
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ +I+D LL Y L +E+ + + Q GTT +QI+ +L ++
Sbjct: 1352 RHQYLHNAQRNILDGTLLYQYLSLTAKEKFDFSKQIGTTVAQIMEDLKEI 1401
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 235/880 (26%), Positives = 376/880 (42%), Gaps = 207/880 (23%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ +C S HS + V V +
Sbjct: 2 YAVYREVHPPTGVEHCTSCHFVHSEKEQV---------------------------AVAS 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+++ I+ V ++ +N G+ K L + HGN++SL +
Sbjct: 35 TSLLRIFDV------AQLQRNDGKAK------------LVQCLEFSFHGNIQSLDKVRLR 76
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
+D RD ++L+F DAK+S++E++ +GL+ SMH FE E ++ G P+
Sbjct: 77 HSD----RDCLLLSFNDAKLSIVEYNPETNGLKTVSMHQFEDEE---IRGGILHNDSRPV 129
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
VKVDP+GRC +L++G + + Q L D S + I ++ I+LRDL
Sbjct: 130 VKVDPEGRCAVMLLFGSHLAVCPFQQD---LSIDTPLSPSPSLDTHDILPTYTISLRDLP 186
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ +KD F+ GY P ++ L E TWAGR+S + + M+ LS++T+ K H +IW+
Sbjct: 187 EPLPVIKDMTFIEGYTSPTLLFLSEVSPTWAGRISLRQDSMMLLGLSLNTSDKSHTVIWT 246
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALN---NYAVSLDSSQELP 356
NLP D+ L VP P+GGVLV GANT+ Y +QS+ L+LN +Y E
Sbjct: 247 LKNLPFDSSYLHPVPKPLGGVLVFGANTLIYLNQSSPPYGLSLNSITDYTTRFLLKNE-- 304
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSD 414
S + LD + + ++ N+ L+S ++GD+ ++T+ D R V+R+ K S+L+S
Sbjct: 305 -GSLGIRLDCSQSVFISNEQLLVSLQSGDIYIVTLFPDSGMRGVKRITFDKAASSILSSC 363
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
I +I FLGSRL +SLL+++ ++ + + E G A
Sbjct: 364 ICSIKPHFLFLGSRLANSLLLRY-----STTVKQNIVEPIG-----------------GA 401
Query: 475 LQDMVNGEELSLYGSASNNTESAQK----TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ D+ +++ +YG ++ + ++ +S V DSL+ IGP+ + G A S
Sbjct: 402 ILDL---DDIEVYGESAVSQSTSSSSLLTNYSLEVCDSLLCIGPVVKATIGE--PAFLSE 456
Query: 531 TGISKQS-NYELV--------------------------ELPGCKGIWTV---------- 553
+ K + ELV ELPGC +WTV
Sbjct: 457 EFVDKSDLDLELVLCSGHGKNGALSVLQRTIRPQVVTTFELPGCIDMWTVKSEGEEEEKG 516
Query: 554 ----YHKSSRGHNADSSRMAAYDDE-YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+ G D SR H YLI+S TMVL+T +TE+ +S + Q
Sbjct: 517 EETKEEGQNEGGEKDQSREKEEKGSGQHDYLILSRSDSTMVLQTGQEITELDQS-GFATQ 575
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSV 664
T+ AGN+ ++Q R+L G Y+ D+ G V V
Sbjct: 576 SATVFAGNV--GSFIVQATRTDIRLLKGIKQLCYVALDMGGG--------------VKCV 619
Query: 665 SIADPYVLLGMSDGSIRLL--------VGDPS--------------------TCTVSVQ- 695
+ PYV++ + +G I LL + PS T S+Q
Sbjct: 620 DVCSPYVIVLLMEGEIGLLKLVDESLVLSWPSLGNNTPVNHISAYTDTSGLFDVTSSLQF 679
Query: 696 ----------TPAAIESSKKPVSSCTLYHDK-----GPEPWLRKTSTDAWLSTGVGEAID 740
P A K+P S +L +D+ GP K + + +
Sbjct: 680 EGDGSEKEEEVPIAPPPVKRPHLSSSLLYDEDELLYGPVKTEVKEENASPMEASLAAE-- 737
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
+ P + ++C E GALEI+ VP F VF V F
Sbjct: 738 -PEAPPPITPTHWCLLCKEDGALEIYSVPEFQFVFAVRNF 776
>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 1408
Score = 280 bits (715), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 339/1432 (23%), Positives = 604/1432 (42%), Gaps = 197/1432 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQICTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ ++ Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLILYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTTLSKP---L 894
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 895 RVLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
++++C+ P + +D W +K I L + Y + Y + + V +
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARK-IGLGEQVDAVEYSSSSETYVIGTNQKV------DFN 1007
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
L D E+ + N +S + +++ V++L P W + ++++E +
Sbjct: 1008 LPEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMC 1058
Query: 1095 VRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTE 1148
V+ + L + T E + + +GTA +GED+AARG + +F D P+ +
Sbjct: 1059 VKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKL 1118
Query: 1149 VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNI 1204
+ +E+KGA+++L+ + QG L+ A G K I+ K G+ L +AF D YV L
Sbjct: 1119 IAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKE 1176
Query: 1205 VKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1262
+K ++GD K ++F + E+ +L+L +KD G+L A +FL DG+ L ++V+D+
Sbjct: 1177 LKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDD 1236
Query: 1263 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1322
NI + Y P+ S KG +LL R+ F G + L ATSS + G D +
Sbjct: 1237 CNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDL 1295
Query: 1323 FA------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
+ +L + GSI I P+ E ++RRL +LQ +L +++ H GLNPR+FR S+
Sbjct: 1296 DSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESD 1355
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
G R +VD +L+ + L + + EIA++ G +I ++L + G
Sbjct: 1356 GIGGR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1403
>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
owczarzaki ATCC 30864]
Length = 1725
Score = 279 bits (714), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 178/520 (34%), Positives = 284/520 (54%), Gaps = 39/520 (7%)
Query: 920 ISGHQ---GFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ GHQ G F+ G RP W ++ R+ LR H L DGS+ AF+ +N C GF+Y T+
Sbjct: 1218 LGGHQLCSGVFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFVYFTT 1277
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
QG L+ CQL + +DN WPV++V PL+AT H I Y Y L+ S P KP +
Sbjct: 1278 QGTLRFCQLAPTTNHDNPWPVRRV-PLRATAHYIGYHEVFRTYVLVTSHP--KPYFNLPR 1334
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
L D+ ++ T + + ++++ P W++ + + + E +
Sbjct: 1335 LTNDETYTPVPYTPKPRAI----PATFDTFSLQLISPVT----WESIHSFDLPAFERVTS 1386
Query: 1095 VRVVTLFNTTTKENETLL----AIGTAYVQGEDVAARGRVLLFS---TGRNADNPQN--L 1145
V + + T++E T L IGT ++GEDV GR+++F + PQ
Sbjct: 1387 VDIAAI---TSQETVTGLKDYVVIGTTVIEGEDVTCHGRIIVFEIIDVVPEVNRPQTNRK 1443
Query: 1146 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNI 1204
+ + +E KGAI+AL+ + GHL+ G KII+ ++ + ++G+AF D +VVS++
Sbjct: 1444 LKYLMEREQKGAITALSHVCGHLVSCIGQKIIIWQFASDDTMDGVAFIDTQ-TFVVSVSA 1502
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
+KNFIL+GD++ S++ L + E L +A+DF + +T+FL+DGS+L + +D +N
Sbjct: 1503 IKNFILVGDLNNSVFLLRFNETTKHLGFIARDFDHMSVASTQFLVDGSSLGFLATDSHQN 1562
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNR 1322
+ +F Y P ES GQ+LL + +FHVG+HV + LR+ + L S DR GA+ R
Sbjct: 1563 LVVFAYNPLNRESNNGQRLLRQLDFHVGSHVQQVLRMVPRSLPVSVDR-GAS-----VKR 1616
Query: 1323 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
L TL+GS+ +AP+ E TFRRL+ LQ++LV + AGLNP +R + K
Sbjct: 1617 HIDLLATLEGSLNALAPIGETTFRRLEWLQRQLV-GLQQRAGLNPIGYRAYRFPRKMTTT 1675
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+++D ELLS + L L EQ E+A Q T ++ ++
Sbjct: 1676 RAGNVIDGELLSRFLYLGLAEQRELARQRRNTPEDLIDDI 1715
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 135/261 (51%), Gaps = 23/261 (8%)
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKHHTCMISA 285
R+ S+ I L +L + HV D F+ GY EP + +L E +W GR + TC + A
Sbjct: 294 RLRPSYEIKLTELQRHIHHVIDIEFLTGYFEPTLALLFEPNAPSWTGRTVQRKDTCSMVA 353
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNN 344
LSI+T+ HP++WS LP ++ +++AVP P+ G ++V + I + SQS+ + ++LN
Sbjct: 354 LSINTSSHSHPVVWSVDKLPFNSMRVMAVPRPVCGTVIVTPDAILHLSQSSPTVGVSLNE 413
Query: 345 Y-AVSLDSSQELPR------SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV 397
++S + +P SS + +L + L T+ G++ + T++ +GR
Sbjct: 414 LSSMSTELRLGIPENKHPDGSSVVYNMQEGRCCFLTPETLLAVTEGGEMFVATLLTEGRT 473
Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLF-FLGSRLGDSLLVQF----TCGSGTSMLSSGLKE 452
V R+ + SVL +T++ N + F+GSR DS+L++ T + L+S +
Sbjct: 474 VVRIRIEPAGASVLPCCMTSLYNGQYCFIGSRASDSVLLRVMNNATAAADKRRLASAALD 533
Query: 453 EFGDIEADAPSTKRLRRSSSD 473
+F + KR R S ++
Sbjct: 534 DFS-------ANKRSRSSDTN 547
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 120/253 (47%), Gaps = 30/253 (11%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHS--RADYVPQIPLIQTEELDSELPSKRGIGPVPNL-- 58
FA ++ H PT + +C T++ R V + L++ +D+ S G G L
Sbjct: 2 FAYFRQQHPPTAVEHCVEASFTNAAERQLVVARANLLEVYRIDAATAS--GSGWRSELSS 59
Query: 59 --VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAA---------SLELVCHY 107
+TA +++ R G + S + + + +A LELV +
Sbjct: 60 GSALTAQTAGAMHLGRAAGYGGNDGGRSDDAATEINTRSLHSAPATPPALQHKLELVASF 119
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
L GNVES+ + +RDS++LAF++AK++V+++D + L+ S+H +E
Sbjct: 120 NLSGNVESIGVARLAHC----KRDSLLLAFKEAKVAVVDYDPATLDLKTISLHMYED--- 172
Query: 168 LHLKRGRESFARG----PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
+ ++ GR++ A P+++VDP +C LVYG ++IIL Q + ++D + S
Sbjct: 173 IEMRGGRDATALQAVWPPVIRVDPMRQCAAFLVYGTKLIILPFRQESH--LDEDDDYQSA 230
Query: 224 GGFSARIESSHVI 236
+A + S I
Sbjct: 231 QAPAASVPPSAQI 243
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 100/198 (50%), Gaps = 27/198 (13%)
Query: 537 SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET-ADL 595
S++E EL G +G+W+V+ S + + +++ D H+ L+ S + T+V T +
Sbjct: 735 SSFE--ELTGGRGLWSVF---STALDPSLAALSSLDGASHSLLVASRDDSTLVFTTTGEE 789
Query: 596 LTEVTESVDYFVQGRTIAAGNLF---GRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
L ++ ES +F G TIA GN+F G+ ++ VF G R++DG + Q+L +S
Sbjct: 790 LEQIAES-GFFTAGATIAIGNVFAANGKILIVDVFAHGIRLVDGVNLRQELLLAQLSSV- 847
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSC 710
S ++ SIA+ VL +DG++ + GD S T AA +PV +
Sbjct: 848 ------SEIIHASIAESSVLALHADGAVSFVQFTGDTQELVASTATVAA----GQPVVAV 897
Query: 711 TLYHDKG----PEPWLRK 724
+LY D+ PE L++
Sbjct: 898 SLYADRSGLFVPEAVLQR 915
>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
Length = 1406
Score = 278 bits (711), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 334/1479 (22%), Positives = 629/1479 (42%), Gaps = 216/1479 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD + S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNPEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSYMT 640
++ ES+ + V G TI+ G R+RV+QV + R D T
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISIGTQESRKRVVQVLKNEVRSYDFGKFT 664
Query: 641 -----QDLSFGPSNSES-------GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
++ + G S + ++ +S S+AD + + D ++ L D S
Sbjct: 665 PSRCRRNFADGTDLSLTQIYPIWDDDTNDERMAVSASLADSCLAILRDDSTLLFLQADDS 724
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
V + S K SC LY DK TG+ +ID P+
Sbjct: 725 GDLDEVVFGEDVASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV- 765
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
+ D++ + L + C+ + G ++ + S+ I++
Sbjct: 766 KNDMFLFLLSHDCKLFV------KCLLWSSFALRGWHLMLSKSSGLSRPRSKAAIDN--- 816
Query: 809 EGTGQGRKENIHSMKVVELAM----QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+G + + S+ ++E + + WSA P+L I+C+ EG
Sbjct: 817 ----RGDRRFVASVNLIEAIVADLGETWSAS---PYL--------IVCHH---IEG---- 854
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
+ ++ S+ N R P + + + + + I +ISG
Sbjct: 855 --------------IHSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLS 900
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
F+ G+ + + +L + + + L C+ GFIY+ SQ ++ C+LP
Sbjct: 901 AVFMPGASAGFIIRTSASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLP 960
Query: 985 SGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGH 1043
+ +D W +++V + H + Y +Y VL + L D E+
Sbjct: 961 PMTRFDYQWTLKRVHLGEQVDH-LAYSTSSGMY-------VLGTCHATDFKLPEDDELHP 1012
Query: 1044 QIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FN 1102
+ N + ++ + ++++ P+ W + + + E + ++ ++L +
Sbjct: 1013 EWRNEDCLAISFFPS--ARGSFIKLVSPNT----WSIIDSFSLGADEYVMAIKNISLEVS 1066
Query: 1103 TTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGA 1157
T E + ++ +GTA+ +GED+ +RG + +F + +P + T+ + + +KGA
Sbjct: 1067 ENTHERKDMIVVGTAFARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGA 1126
Query: 1158 ISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILL 1211
++AL+ + QG +L+A G K ++ K G+ L +AF D YV + +K +L
Sbjct: 1127 VTALSEIGGQGFVLVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCIL 1184
Query: 1212 GDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1271
GD K ++F + E+ +++L AKD L+ A EFL DG L +VV+D NI + Y
Sbjct: 1185 GDAVKGVWFAGYSEEPYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYD 1244
Query: 1272 PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LF 1327
P+ +S G +LLSR++FH+G + L SS++ ++ G D N+ L L
Sbjct: 1245 PEDPKSSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLM 1304
Query: 1328 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
T +GS+G I + E ++RRL +LQ +L +++ H GLNPR+FR S+G A R +
Sbjct: 1305 TTQNGSLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GM 1360
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
+D LL + + + + EIA + G +I ++L ++
Sbjct: 1361 LDGNLLFKWIDMSKQRKTEIAGRVGAREWEIKADLEAIS 1399
>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
Length = 1382
Score = 278 bits (710), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 359/1467 (24%), Positives = 608/1467 (41%), Gaps = 224/1467 (15%)
Query: 57 NLVVTAANVIEIYVV-------RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
NLVV ++++IY + V E G + + N KR L+L Y L
Sbjct: 28 NLVVIKTSLLQIYNLVTETVTPSVLENGQRANDNE---KRN------ETTKLQLFAEYDL 78
Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
HG V + S+ NSR D+++L+F +AK+S++E++ I + S+H +E +
Sbjct: 79 HGTVTDI---SRINILNSRSGGDALLLSFRNAKLSLIEWNPEIQNISTVSIHYYEKEDIT 135
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-------DTF 220
+ + VDP RC VL +G++ + IL Q G LV DE D F
Sbjct: 136 LSPWAPDLSQCDSHLTVDPSSRCA-VLNFGVRNLAILPFHQAGDDLVMDEYDPDLDMDDF 194
Query: 221 GSGGGFSARIES----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
++ +S S V+ L LD + H F+H Y EP I
Sbjct: 195 TGQDKNTSHTDSKKGTEKDHTHQTPYAASFVLPLTALDPTLIHPIGLTFLHEYREPTFGI 254
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
L+ T A + + + S ++ + + S LP D ++A+P+P+GG L
Sbjct: 255 LYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALPAPVGGAL 314
Query: 323 VVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
++G+N IH + A+A+N +A + S + +S + L+ + + + LL
Sbjct: 315 LIGSNELIHVDQSGKASAVAVNEFAKQVSSFPMIDQSDLGLRLENSVVEVINKECGDILL 374
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT--------TIGNSLFFLGSRLGD 431
+ TG+LVL+ DGR V + P+ D+ ++G+ F+GS D
Sbjct: 375 TLSTGELVLVHFKIDGRSVSGPVVCPV-PTNSGGDVVGATASCSISLGSGKVFIGSEDTD 433
Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGEELSLYGS 489
SLL+ S S S E+ D + + + S A ++ VN +
Sbjct: 434 SLLLDCYVSSAVSKKSKDHGEDQFDEDMNDEDDDDMYEDDLYSSAPKEAVNK-------A 486
Query: 490 ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
SN SA + +SF V D L ++ L+ + G + D+ A +S QS +EL EL G
Sbjct: 487 VSNG--SASEDYSFRVLDKLPSLASLRSVTVGKPASRDSDAGNVS-QSVHEL-ELAAAYG 542
Query: 550 ---------IWTVYH----KSSRGHNADS------SRMAAYDDEYHAYLIISLEARTMVL 590
+ H + G ADS S + +D E+ + V
Sbjct: 543 SGRNGGVALLQRALHLDGISTMNGETADSVWNINTSTKSGRNDPSEG------ESPSYVF 596
Query: 591 ETADLLTEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARIL 634
T T+ E++ Y V G T+ G L G RV+QV R+
Sbjct: 597 LTKSNSTDNEETLVYAVNGSNLEPFSAPDVNPNGDPTVDIGTLAGNSRVVQVLTGEVRVY 656
Query: 635 DGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D + M Q P E G E V S S ADPY+L+ D S+ LL D S
Sbjct: 657 DTNLGMAQ---IYPVWDED-EGDERFAV-STSFADPYLLIIRDDSSVLLLHSDESGDLDE 711
Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
+ P I SS+ + C LY DK V E D A G+ Y
Sbjct: 712 LSKPETI-SSQSWLCGC-LYTDK----------------HNVFE--DNA------TGNTY 745
Query: 754 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 813
+ + L +F +P V + D + I SS +
Sbjct: 746 MFLLNQECKLFMFRLPTRELVSVTEGV-----------------DYVSSILSSDQPAKRL 788
Query: 814 GRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 873
+E I + V +L + P+L ++ Y+ PV
Sbjct: 789 NSRETIAELLVADLG----EISTASPYLIIRSATDDLIIYK---------------PVRE 829
Query: 874 SRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
+ + V+ ++ N + P++A + +R+ +I G+ +SG+
Sbjct: 830 NSKDEKTGVTLKYIKESNHFLPKVPIEAAATDTQQRMPGLRRLA---DIGGYAAVLMSGA 886
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + L + SI + + C G IYV ++ +++ C+L + D
Sbjct: 887 SPSLVVRTSKSLPRVFSIQSDSIRGISGFDSAGCEKGLIYVDNEHVVRTCRLHDNTQLDF 946
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WP++K IPL ++ Y A Y + V+ ++ L D + H +
Sbjct: 947 SWPIRK-IPLN---EEVDYLA----YSTVSGTYVVGTTHEQDFKLPDNDELHP----EWA 994
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1110
+ D+ V + +++L P W+ + ++E + + L + T E +
Sbjct: 995 NEDISLRPKVAQGSIKLLNPKT----WKVIDSYTFNAAERITAIENINLEISEKTSERKD 1050
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL- 1164
++ +GT + +GED+AARG V +F +P NL ++ +E ++GA++A++ +
Sbjct: 1051 MIVVGTTFAKGEDIAARGNVYVFDVINVVPDPDEPGTNLKLKLIGEESVRGALTAVSGIG 1110
Query: 1165 -QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIY 1219
QG L++A G K ++ K G+ L +AF D YV + +K L+GD K ++
Sbjct: 1111 GQGFLIVAQGQKCMVRGLKDDGSLL-PVAFIDVQ-CYVSVIKELKGTGMCLIGDALKGLW 1168
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
F + E+ ++ L KD L+ +FL DG L ++V+D N+ + Y P+ +S
Sbjct: 1169 FTGYSEEPYKMTLFGKDLDELEVVTADFLPDGKKLYILVADSDCNLHVLQYDPEDPKSSN 1228
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF----ALLFGTLDGSIG 1335
G +LL+R +FH+G + L A SS+ S + + L T G +
Sbjct: 1229 GDRLLNRCKFHMGHFASTITLLPRTAVSSELAVMNSDSMDIDSYIPLHQALITTQSGLMA 1288
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ L E ++RRL +LQ +L +++ H GLNPR++R S+G R ++D +LL
Sbjct: 1289 LVTSLSEESYRRLSALQSQLSNTLEHPCGLNPRAYRAVESDGVVGR----GMIDGKLLMR 1344
Query: 1396 YEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ L +LEIA + G +I ++L
Sbjct: 1345 WLDLSRPRKLEIAGRVGADEWEIRADL 1371
>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 2 [Cavia porcellus]
Length = 1387
Score = 278 bits (710), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 201/679 (29%), Positives = 316/679 (46%), Gaps = 89/679 (13%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 779 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 834
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 835 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 884
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 885 VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 944
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 945 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1004
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
WPV+K IPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1005 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIE 1056
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENE 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1057 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET---- 1108
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHL 1168
V G LKG ++A L QG
Sbjct: 1109 ---------VSG--------------------------------LKGYVAAGTCLMQGEE 1127
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
+ G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1128 VTCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESK 1185
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+
Sbjct: 1186 TLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRAD 1245
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDEL 1343
FHVGAHV F R + GA G K N+ F TLDG IG + P+ E
Sbjct: 1246 FHVGAHVNTFWR-------TPCRGATEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEK 1298
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E
Sbjct: 1299 TYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTME 1358
Query: 1404 QLEIAHQTGTTRSQILSNL 1422
+ E+A + GTT IL +L
Sbjct: 1359 RGELAKKIGTTPDIILDDL 1377
Score = 271 bits (693), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 209/676 (30%), Positives = 332/676 (49%), Gaps = 88/676 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + + A + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + I ++SV+E+D H L+ S+H FE PE L+ G
Sbjct: 83 ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479
Query: 522 --------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH--- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 480 EPAFLSEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVR 539
Query: 556 ------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+ G + S A DD H +LI+S E TM+L+T + E+ S + Q
Sbjct: 540 KEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQ 598
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
G T+ AGN+ R ++QV G R+L+G L F P + + ++ ++AD
Sbjct: 599 GPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVAD 648
Query: 669 PYVLLGMSDGSIRLLV 684
PYV++ ++G + + +
Sbjct: 649 PYVVIMSAEGHVTMFL 664
>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
SS1]
Length = 1433
Score = 277 bits (709), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 358/1492 (23%), Positives = 627/1492 (42%), Gaps = 235/1492 (15%)
Query: 57 NLVVTAANVIEIYVVR-------VQEEGSKES-----KNSGETKRRVLMD---------- 94
N+VV ++++ I+ VR Q+E KE K + + V MD
Sbjct: 42 NVVVARSSLLRIFEVREEPAPVSTQKEVEKERRAAVRKGTEAVEGEVEMDTSGEGFVNMG 101
Query: 95 ---GISAAS-------LELVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKIS 143
G++ A+ LV +RLHG V L A+ + D+ + D ++++F+DAKI+
Sbjct: 102 TSAGLNGAAHPPTVNRFYLVREHRLHGTVTGLEAVRTVHSLDD--KLDRLLVSFKDAKIA 159
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ S+H + S+H +E +P+ + + R L + DP RC + + +
Sbjct: 160 LLEWSLSLHDVITVSIHTYERAPQLIAID---SPLFRSEL-RADPLSRCAALSLPKDSLA 215
Query: 203 ILK--ASQGGSGLVGDEDTFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIE 257
IL SQ ++ E + +S S +++L D+D +++V DF F+ G+
Sbjct: 216 ILPFYQSQAELDILEQEASQARDVPYSP----SFILDLANDVDKRIRNVIDFTFLPGFHN 271
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + +L + + TW GR+ T + ++ +P+I + LP+D + L +
Sbjct: 272 PTVAVLCQYQQTWTGRLKEYKDTVGLYIFTLDFVTNNYPVITAVDGLPYDCFALTPCSTA 331
Query: 318 IGGVLVVGANTIHYHSQSA-SCALALNNYAVSLD-------SSQELPRSSFSVELDAAHA 369
IGGV+++ +N + + QS L +N + + ++QE R ++L+ A
Sbjct: 332 IGGVVILASNAVLFVDQSGRRVILPVNGWPPRVSDLPMPPLTAQEQTR---DLQLEGARF 388
Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFF 424
++ + L K G + + ++ DGR V +L +S T P+V + IG+ F
Sbjct: 389 VFVDDKKLFLILKDGTVYPIELIQDGRTVSKLTMSDALARTTIPAV----VKRIGDDHIF 444
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE---ADAPST--------------KRL 467
+GS +G S+L++ ++ ++EE D + A+ P+T L
Sbjct: 445 IGSIVGPSVLLK----------TARVEEEIHDEDVAMAEGPATVVDTSKTVDMMDDDDDL 494
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S+ A Q NG A +N + + ++ D++ GP+ D ++GL N D
Sbjct: 495 YGPSTIADQPAANGTA----NGAVDNVRT-RTVVHLSLCDAIPAHGPISDMTFGLSRNGD 549
Query: 528 ------ASATGISKQSNYELVE--LP-----------GCKGIW------------TVYHK 556
+ATG ++ L + +P G +G+W T + +
Sbjct: 550 RLVPELVAATGSGHLGSFSLFQRDMPTRFKRKLHAIGGGRGMWSLPVRQQVKTGGTTFER 609
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEART--MVLETADLLTEVTESVDYFVQGRTIAA 614
S +AD+ + D + + + R+ + + VT F QG I
Sbjct: 610 PSNPFHADNDTVIISTDANPSPGLSRIATRSSHSDITITTRIPGVTLGAAPFFQGTAILH 669
Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
+F VI+V E DG+ S + + + S SI DP++L+
Sbjct: 670 -VMFNVTNVIRVLEP-----DGTERQ-------SIKDLDGNAARPRIKSCSICDPFILII 716
Query: 675 MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
D +I L +G+ + + + + + Y D L +T +A
Sbjct: 717 REDDTIGLFIGEIERGKIRRKDMSPMGEKTSKYLAGYFYTDTS---GLFQTFLNA---EA 770
Query: 735 VGEAIDGADGGPLDQGDI--YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
GEA G ++ G+ + + G +EI+ +P F+ + I D+
Sbjct: 771 PGEAATSTLQGAMNAGNKTHWLTLVRPQGVVEIWTLPKLTLAFSTTTLATLDPVISDSLE 830
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
AL Q + V +L + H RP L +L G +
Sbjct: 831 PPALS-------------LPQDPPRKPQELDVDQLVIAPLGESHPRPHLIVLLRSGQLAI 877
Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL-RFSRTPLDAYTREETPHGAPC 911
Y+A P DP+ +RSL++ L NL + D EE
Sbjct: 878 YEAVAASPPA------DPLPPTRSLTL-------LVNLVKVKSKAFDIQHTEEEQKSVLA 924
Query: 912 QRITIFKNI----------SGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
++ I + + + G F +G RP W + + +RV P + AFT
Sbjct: 925 EQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIVSTDKGGVRVFPS-GHNVVHAFTTC 983
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI 1020
F+ + +G + +P D + P + V ++ P+ F + LI
Sbjct: 984 SLWESRGDFLLYSEEGPSLVEWMPD-IILDAHLPARSVP--RSRPYSHVVFDASS--SLI 1038
Query: 1021 VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1080
V+ +N+ S D + + D+ N+S T T+E ++ PD W T
Sbjct: 1039 VAASSF--MNRFASYDEDGNIVWEPDSPNISFPHCE-TSTLE-----LISPDG----WIT 1086
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1139
++E V V L +T+ + +A+GT +GED+A +G V +F
Sbjct: 1087 MDGYEFAANEFVSCVVSVPLETVSTESGMKDFIAVGTTINRGEDLAVKGAVYIFEIVEVV 1146
Query: 1140 DNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFY 1192
+ + + +L KG +S L + G+L+ + G KI + + E L G+AF
Sbjct: 1147 PDASLNIKRWWRLKLLCRDDAKGPVSFLCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFL 1206
Query: 1193 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE---FLI 1249
D +YV SL VKN +++GD KS++F++++E +L +L KD C T F
Sbjct: 1207 DV-GVYVTSLRAVKNLLVIGDAVKSVWFVAFQEDPYKLVILGKD--PHHCCVTRADLFFA 1263
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
DG LS+V DE+ ++++ Y P ES GQ LL R EFH R +L
Sbjct: 1264 DGH-LSIVTCDEEGVVRLYAYDPHDPESKGGQHLLRRTEFHGQTE----YRSSLLVARRP 1318
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
+ G + + L+ G++DGS+ + +DE F+RL LQ +L+ +V HVA LNP++
Sbjct: 1319 KA----GDPEIPQARLICGSVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKA 1374
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
FR N RP ++D LL+ +E LP+ Q E+ Q GT R+ +L +
Sbjct: 1375 FRMVR-NEYVSRPLSKGVLDGNLLATFEDLPIGRQNEVTRQIGTDRATVLKD 1425
>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1439
Score = 277 bits (709), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 338/1408 (24%), Positives = 600/1408 (42%), Gaps = 177/1408 (12%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ A L LV +RLHG V L + + N D ++++FEDAKI+VLE+ + H L
Sbjct: 118 TVARLRLVREHRLHGMVTGLGRIKILSSLNDGL-DRLLISFEDAKIAVLEWSEEQHDLLT 176
Query: 157 TSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
S+H +E +P+ + L S G L +VDP RC + + I+ Q
Sbjct: 177 VSIHTYERAPQLMSLN---ASLFHGWL-RVDPISRCAALALPCDAFAIIPFHQTLE---- 228
Query: 216 DEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
A S +++L D + +V D F+ G+ P + +L + TW G
Sbjct: 229 -----------EAPYAPSFILDLTSEVDQRIHNVVDMSFLPGFNNPTVAVLFQPTQTWTG 277
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
R++ T + ++ + +P+I S NLP+D + A + +GGV+V+ +N+I +
Sbjct: 278 RLTEYKDTMKLLVFTLDAVTRNYPVITSVDNLPYDCLSVHACSAAVGGVIVITSNSIIHV 337
Query: 333 SQSAS-CALALNNYAVSLDSSQELP----RSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
SQS+ AL++N +A + P ++ ++ L+ + ++ + L K G +
Sbjct: 338 SQSSRRVALSVNGWASRVTDMSLAPVQAEYATRNLALEGSRLAFVDDRTFFLFLKDGTVY 397
Query: 388 LLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+ + DG VV + + S + + +T + F+GS G S+L++ T
Sbjct: 398 PVELSLDGAVVSTISMGHALAQSAIPAVVTPVTQEHIFVGSTAGTSVLLKIT-------- 449
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDM-------VNGEELSLYGSASNNTESAQK 499
++EE D +DA + + + S + D + + SL +N T + K
Sbjct: 450 --SVEEEVEDNASDAVAAAVVDTADSMVMDDDDDIYGVSMKTDAQSLSNGHANGTHLSVK 507
Query: 500 TFS---FAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP--- 545
S ++ DSL G + D S+ L N + +ATG + L + LP
Sbjct: 508 KRSVTHLSLSDSLPGYGSISDMSFSLAKNGEKVVPELVAATGSGSMGGFTLFQRDLPART 567
Query: 546 --------GCKGIW------------TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
G +G+W T Y ++ AD+ + D A +
Sbjct: 568 KRKLHAIGGGRGMWSLSLRPTVKVNGTSYERAVNPFQADNDTVVVSTDANPAPGLSRFSH 627
Query: 586 RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQ 641
RT E S+ V G+TI A F R ++ V R+L DGS + +
Sbjct: 628 RTPRTEI---------SITTRVPGQTIGAAPFFQRTAILHVMSNAIRVLEPDGSERQVIK 678
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI- 700
DL + SI DP+VL+ D +I L +G+ + + + +
Sbjct: 679 DLD---------GNMARPKIRHCSICDPFVLIVREDDTIGLFIGESERGKIRRKDMSPMG 729
Query: 701 ESSKKPVSSCTLYHDKGPEPWLRKTSTDAW---LSTGVGEAIDGADG-----------GP 746
+ + + ++ C + G + + ++ +T + + AD G
Sbjct: 730 DKTSRYLTGCFFTDNAGVFDLRSQANGNSGADKTATSTLQGVVNADSRSQWLLLVRPQGV 789
Query: 747 LDQGDIYSVV-CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
L+ D+ + C +I+ +P + VF+V + + D+ AL
Sbjct: 790 LEASDLSPIPGCRRLNEKQIWTLPKLSIVFSVRLASTLDWVLADSGDGPAL--------- 840
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
S G R + + V + + +P L L G + YQA P S
Sbjct: 841 -SMPGESPRRPQE---LDVEQAVIAPLGETAPQPHLLLFLRSGQLAIYQAI----PMQAS 892
Query: 866 KSDDPVS-TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
D+ +S S + + V+ R + ++ +T +
Sbjct: 893 SVDESLSRPSLGVRFAKVATRVFEIQRQDDSEKSILAEQKKISRVLIPFLTSPSPTTTFS 952
Query: 925 GFFLSGSRPCWCMVF-RERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQ 982
G F +G PCW + R +R+HP S+V AFT F+ + +G +
Sbjct: 953 GVFFTGDHPCWILKPDRSGIRIHPS--GHSVVHAFTSCSLWESKGDFLLYSDEGPSLLEW 1010
Query: 983 LPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1042
+P + + P + IP + ++T+ A L IV+ L+ + + D +
Sbjct: 1011 MPD-TDVETELP-SRSIPQPRSYSKVTFDASTGL---IVAAAHLE--AEFATYDEDNNIV 1063
Query: 1043 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN 1102
+ D+ N+S T+E ++ PD W T ++E +V V L
Sbjct: 1064 WEPDSANVS-FPRSSCSTLE-----LISPDE----WITMDGFEFANNEFVTSVESVPLET 1113
Query: 1103 TTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-------EL 1154
++T+ ++ +A+GT +GED+A RG +F P+N + K +
Sbjct: 1114 SSTESGSKDFIAVGTTIDRGEDLAVRGTTYVFEIVEVVP-PENSSLSRWWKLRLRCRDDA 1172
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
KG ++AL ++ G+L+ + G KI + + E L G+AF D +YV +L VKN +++GD
Sbjct: 1173 KGPVTALCAMDGYLVSSMGQKIFVRAFDMDERLVGVAFLDVG-VYVTTLRAVKNLLVIGD 1231
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
KS++F+ ++E +L +LAKDF ++ +F+ ++S++ +DE ++++ Y P+
Sbjct: 1232 AAKSVWFVGFQEDPYKLVILAKDFQTVCVTTADFIFTEDSMSILTNDENGVMRLYQYDPQ 1291
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+S GQ+L+ R EF H T Q + R G + + ++ G++DGS
Sbjct: 1292 DPDSRNGQQLMCRTEFD--THTT----CQTSIVFARRVGEGEEA-ALPQAKVVAGSIDGS 1344
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+ + +DE F+RLQ LQ +L ++ HVAGLNP++FR N +P I+D LL
Sbjct: 1345 LAALTCMDEPAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVR-NDYVSKPLSKGILDGNLL 1403
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
S Y LP+ Q EI Q T R+ +L +
Sbjct: 1404 SSYLELPIPRQEEITKQIATERAAVLRD 1431
>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
[Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
nidulans FGSC A4]
Length = 1339
Score = 275 bits (702), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 352/1457 (24%), Positives = 603/1457 (41%), Gaps = 235/1457 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++I+ +R S ++ +T+ R L L Y+L G V +
Sbjct: 28 NLIVARTSLLQIFSLR------DVSLSALDTEVRPAQHRQETCKLVLEREYQLPGTVTDI 81
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ ++ G D ++++AF DAK+S++E+D +GL S+H +E +
Sbjct: 82 CRVKILKTKSGGD------AVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPW 135
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE- 231
+ G ++ DP RC + I+ Q G LV D+ FGS + R+E
Sbjct: 136 ASDLSTCGSILSADPGSRCAIFQFGARSLAIIPFHQPGDDLVMDD--FGSEPDYENRVEG 193
Query: 232 --------------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
SS V+ L LD + H F++ Y EP IL+ + T
Sbjct: 194 NSRSHEAKDKDAAEYQTPYASSFVLPLTALDPSVIHPISLAFLYEYREPTFGILYSQVAT 253
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
+ + + +++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 254 SHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVALPPPVGGSLLIGSNEL 313
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
+H + A+ +N ++ S +S ++ L+ +D LL+ TG
Sbjct: 314 VHIDQAGKTNAVGVNEFSRQASSFSMTDQSDLALRLENCVVERFSDDNGDLLLALSTGVF 373
Query: 387 VLLTVVYDGRVVQRLD---LSKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCG 440
L++ DGR V + LS + L S ++ +GN F GS DS+L+
Sbjct: 374 ALVSFKLDGRSVSGISVRPLSGPSKEFLASTASSSAFLGNGKVFFGSESADSVLL----- 428
Query: 441 SGTSMLSSGLKEEF-GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
G S SS K+ F G D S DA +D + + N S
Sbjct: 429 -GWSSASSATKKSFSGSTSND--------ESEDDAYEDDLYSSAPAAMTDNPQNQPSNSS 479
Query: 500 TFSFA---VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVY 554
+F + D L + GP++D G A + T K ELV G G +
Sbjct: 480 VAAFGDLRIHDRLSSPGPIRDIVLGRSSEASSRDT---KDGVLELVAAQGSDEGGTMVIM 536
Query: 555 HK--------SSRGHNADS----SRMAAYDDEYHAYLIISL-------EARTMVLETADL 595
+ S A+S S + +D+ Y+I+S E+ VLE D
Sbjct: 537 KREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVILSKQEKPDKEESEVFVLE--DK 594
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
L +T T+ G L + RVIQV R D + D
Sbjct: 595 LRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVRSYDAVWDEDD------------- 641
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S+ ++ ++ DPY+ + D ++ LL D S + TL D
Sbjct: 642 SDERVAVNATLVDPYLAIIRDDSTLLLLQADDS----------------GDLDEVTLSED 685
Query: 716 KGPEPWLRKT--STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
+ WL S +A T +I + + L ++ +P+F
Sbjct: 686 VVSQKWLSACFYSDNAGFFTAPFASI--------------LFLLNQDHQLYVYRLPDF-A 730
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
V +V + V I+ T E K S T +EN+ + VVEL
Sbjct: 731 VISVIEGVGCLPPILST---EPPKRSTT--------------RENVLQIAVVELG----D 769
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
++ S PFL + ++ Y+ + E T R L +N + + N
Sbjct: 770 SYSSLPFLILRTENDDLVVYKPFFTNSKELTGL--------RFLKEANHTLPKTPNTT-- 819
Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLC 950
D E P + I NI+G F+ G P +FR P +L
Sbjct: 820 ----DELQSEMKP-------LRILPNIAGCSSIFMPG--PSAGFIFRAS-TTSPHFIRLR 865
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
G I + + GF Y+ S G L + +LP G+ W + + +P+ ++TY
Sbjct: 866 GGFIKGLGCFDS--PDKGFAYLDSHG-LHLAKLPEGTQLGYPW-IMRTVPIGQQIDKLTY 921
Query: 1011 FAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
+ + Y VL + L D E+ + N +S + V + ++++
Sbjct: 922 VSASDTY-------VLGTCQRCEFRLPEDDELHPEWRNEEISFLP-----EVNQSSLKVV 969
Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARG 1128
P W + P++ +E+ + ++ ++L + T E ++ +GT+ +GED+ +RG
Sbjct: 970 SPKT----WSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRG 1025
Query: 1129 RVLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH-- 1179
+ +F +P+ N ++ KE +KGA++AL+ + QG L+ A G K ++
Sbjct: 1026 CIYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGL 1085
Query: 1180 KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1237
K G+ L +AF D +V + +K + GD K ++F + E+ +++L AKD
Sbjct: 1086 KEDGSLLP-VAFMDMQ-CFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDL 1143
Query: 1238 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1297
L+ A +FL DG+ L +VV+D N+ + Y P+ S G KLL+R++FH G +
Sbjct: 1144 DYLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFAST 1203
Query: 1298 FLRLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
L SS+R A GSDK + +L + +GSIG + + E ++RRL +L
Sbjct: 1204 VTLLPRTLVSSER--AMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSAL 1261
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
Q +L +++ H GLNPR++R S+ A R ++D LL Y + + + EIA +
Sbjct: 1262 QSQLTNTLEHPCGLNPRAYRAVESDASAGR----GMLDSNLLLQYLDMSKQRKAEIAGRV 1317
Query: 1412 GTTRSQILSNLNDLALG 1428
G T +I ++L ++ G
Sbjct: 1318 GATEWEIRADLEAISGG 1334
>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
[Aspergillus kawachii IFO 4308]
Length = 1372
Score = 272 bits (695), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 336/1465 (22%), Positives = 619/1465 (42%), Gaps = 222/1465 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVTSNAEGADAQQELTKLLLEK----------EYSLSGTVTGL 76
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NSR +++++AF +AK+S++E+D + S+H +E + +
Sbjct: 77 CRVK---VLNSRSGGEAVLVAFRNAKLSLIEWDPERRSISTISIHYYERDDLTRSPWVPD 133
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS------------ 222
G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 134 LKNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGMSTDHDL 190
Query: 223 GGG---------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
GGG + S V+ L LD + H F++ Y EP IL+ + T +
Sbjct: 191 GGGPDKAKEGIAYQTPYAPSFVLPLTALDPSILHPISLAFLYEYREPTFGILYSQVATSS 250
Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
+ + + ++ + ++ S LP D ++++A+P P+GG L++G+N +H
Sbjct: 251 ALLPERKDVVFYTVFTLDLEQQASTILLSVSRLPSDLFRVVALPPPVGGALLIGSNELVH 310
Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVL 388
+ A+ +N ++ + S +S ++ L+ L + LL TG++ +
Sbjct: 311 IDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLSTGEMAI 370
Query: 389 LTVVYDGRVVQRLD---------LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
+ DGR V + L+ N + S T IG+ FLGS GDS+L+ ++C
Sbjct: 371 MKFKLDGRSVSGISVHLLPAHAGLTSMNSAAAAS--TFIGDGKIFLGSEDGDSVLLGYSC 428
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNTESA 497
S +S ++ D AD + S D +D + + +L G + SA
Sbjct: 429 SSSSSKKHRLQAKQAIDDSADMSEEDQ---SEDDVYEDDLYSTSPDTTLTGRRPSGESSA 485
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WT 552
+ F + D L+NIGPL+D + G ++ + G S +++ +G
Sbjct: 486 FGLYDFRMHDKLINIGPLRDITIGRKLPTNQEKGGDRTNSTSPELQIVASQGSHKSGGLV 545
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLLTEV 599
V + H S + + D + A L EA R V+ T ++
Sbjct: 546 VMAREIDPHVVASISLESVDSIWTASLTWEEEAVSRTSENIGQRSQRCYVIATEAKASDR 605
Query: 600 TESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-MTQD 642
ES+ + V G TI G R+RV+QV + R D +TQ
Sbjct: 606 EESLIFVVDGHDLKPFRAPDFNPNEDVTINIGTQESRKRVVQVLKNEVRSYDIDLGLTQI 665
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
++ ++ +S S+AD + + D ++ L D S V + S
Sbjct: 666 YPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVLGEDVAS 720
Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 721 GK--WISCCLYSDK----------------TGLFSSIDRTLSEPV-KNDMFLFLLSHDSK 761
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
L ++ V + + ++ + + G + ++ SSE G +EN+
Sbjct: 762 LFVYRVRD-QKLLSIIEGLDGLSPLL-----------------SSEPPKRSGTRENLVEA 803
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
V +L + WSA P+L + ++ Y+ ++ + T + + +
Sbjct: 804 IVADLG-ETWSAS---PYLILRSENDDLIIYKPFV-------------IPTGPTGEIHTL 846
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
S+ N D + + + + + I +ISG F+ G+
Sbjct: 847 KFSKENNSVLPMISPDVDSTQPSGSDYRVRPLRILPDISGLSAVFMPGAS---------- 896
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG----ILKICQLPSGSTYDNYWPVQKV 998
F + + + H F+ + + ++ CQLP + +D W ++KV
Sbjct: 897 ------------AGFVLRTSASAPH-FLRLRGESPRCSTVRFCQLPPMTRFDYQWTLKKV 943
Query: 999 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHR 1057
+ H + Y +Y VL + L D E+ + N +S R
Sbjct: 944 HLGEQVDH-LAYSTSSGMY-------VLGTCHATDFKLPDDDELHPEWRNEAISFFPSAR 995
Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGT 1116
+ +++ P+ W + + + E + ++ ++L + T E + L+ +GT
Sbjct: 996 GSFI-----KLVSPNT----WSIIDSYSLGTDEYVMAIKNISLEISENTHERKDLIVVGT 1046
Query: 1117 AYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLL 1169
A+ +GED+ +RG + +F + +P + T+ + + +KGA++AL+ + QG +L
Sbjct: 1047 AFARGEDIPSRGCIYVFEVVQVVPDPDDPETDRKLKLIGKESVKGAVTALSEIGGQGFVL 1106
Query: 1170 IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKE 1225
+A G K ++ K G+ L +AF D YV + +K +LGD K I+F + E
Sbjct: 1107 VAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGIWFAGYSE 1164
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+ +++L AKD L+ A EFL DG L +VV+D NI + Y P+ +S G KLLS
Sbjct: 1165 EPYKMSLFAKDLDYLEVSAAEFLPDGRRLFIVVADSDCNIHVLQYDPEDPKSSNGDKLLS 1224
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRT-GAAPGSDKTNRFAL---LFGTLDGSIGCIAPLD 1341
R++FH G + L SS++ + D N+ AL L T +GS+G I +
Sbjct: 1225 RSKFHTGNFASTLTLLPRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMP 1284
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E ++RRL +LQ +L +++ H GLNPR+FR S+G A R ++D LL + +
Sbjct: 1285 EESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSK 1340
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDLA 1426
+ + EIA + G +I ++L ++
Sbjct: 1341 QRKTEIAGRVGAREWEIKADLEAIS 1365
>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
Length = 2830
Score = 270 bits (690), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 253/984 (25%), Positives = 419/984 (42%), Gaps = 182/984 (18%)
Query: 575 YHAYLIISL-EARTMVLETADLLTEVTES--VDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+HAYL+I++ RTMVL D L +VT S ++ V T+AAGNLF ++Q G
Sbjct: 1889 FHAYLLITMGRVRTMVLRCTDGLDDVTNSPECEFLVNQPTLAAGNLFHNAVIVQACPMGL 1948
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-------------IADPYVLLGMSDG 678
R+L+G + Q+L + ++ S ADPYVL+G+SDG
Sbjct: 1949 RVLEGMTLVQELRVSDFQASRPKTAQYSFCCRTKHPIAHRAMGPIPQAADPYVLVGLSDG 2008
Query: 679 SIRLLVGDPSTCTVSVQTPAA-------IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
+ LL GDP + T+ V T AA S ++ +++ L+ D+ +W+
Sbjct: 2009 TAVLLEGDPLSLTLGVATAAAEQLMAVPARSRQQRLAAACLHRDE-----------TSWM 2057
Query: 732 STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS-------GR 784
++ + I+ +C SG LE + +P+ VF + G
Sbjct: 2058 ASATAAEAASS----GSSFSIFLWICRLSGRLECYSLPSMRLVFHSSGLAAAEEVLRMGP 2113
Query: 785 THIVDTY--MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS----- 837
+ D Y +E E++ G G G E+ VVEL ++ + S
Sbjct: 2114 AVMYDVYDLFGGGGGGAEAELDG----GGGSGIMED----PVVELRVESFLGGGSPAVPD 2165
Query: 838 --RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA----------- 884
RP L + G ++ YQ L P ++ + P + + S
Sbjct: 2166 CERPVLLVMAASGNLVAYQIALRRLPLDSLSHEAPAAMGAAAGSSGGGGGIGGGAALGPR 2225
Query: 885 -SRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
+R +L ++ +++R + ++ + + + G F++GSRP W + R L
Sbjct: 2226 MARFDHLAYTDPSSKSHSRTDI------RKYPVASQGTSYSGVFVAGSRPLWLVASRGGL 2279
Query: 944 RVHPQLCDGSIVAFTVLHNVNCNHGFI-YVTSQGILKICQLPSGSTYDNYWPVQKVIPLK 1002
HP +G++ A T HN NC GFI +S+G+LK+CQLP + D W V + +PL+
Sbjct: 2280 VPHPMFAEGAVAAMTPFHNANCPLGFISACSSRGLLKVCQLPPHTRLDTPW-VTRRVPLR 2338
Query: 1003 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1062
TPH++ +F + L I S V+ D ++ R E
Sbjct: 2339 VTPHKLAWFRDAGLMAAITSRVVVSRPRPPEEPGGDAHAAAAYAAAAAAAAGRGRE---E 2395
Query: 1063 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----- 1117
+E+R+LEP+ G W + P E AL ++V+ L N TT + + LLA+GT
Sbjct: 2396 AWELRLLEPNGCGRLWLSPLLPP---GEQALCLKVIYLQNATTGDTDALLAVGTGSPMGQ 2452
Query: 1118 ---------------------------------------YVQGEDVAARGRVLLFST--- 1135
GED GR+LL++
Sbjct: 2453 LGGGNWRFRLPRGRVAGSGGLVVHRQCEREGAGRGCRGERPPGEDYPCLGRILLYTISAE 2512
Query: 1136 -----GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW--------- 1181
G N + V V ++++ A++++ + LL+ G +I +++W
Sbjct: 2513 VVDLGGGNLTRRWSAVL-VATRDMASAVTSVQEFKSQLLVTCGSRIEMYEWRGPAAGASG 2571
Query: 1182 -----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1236
G L AF+D P L V SL VK+++L D + +YF+ + + L ++KD
Sbjct: 2572 GGGGGPGGRLEKRAFFDLPSL-VTSLVAVKDYLLAADASQGLYFVRYSDSARVLEFMSKD 2630
Query: 1237 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQI--FYYAPKMS-ESWKGQKLLSRAEFHVGA 1293
F D +I+ L+ + +D N+ + FY + + E W GQ+L HV
Sbjct: 2631 FDHRDVLTAGVVINEPKLAFLAADAAGNLALSEFYGSRNTNPEFWAGQRLAPLGLMHVAR 2690
Query: 1294 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQ 1352
++ + ++M P SD NR ALL G +G + IAP+ D +RL +LQ
Sbjct: 2691 RLSCCVSIKM-----------PTSDGKNRHALLCGAAEGGLSYIAPVPDAEMTQRLLALQ 2739
Query: 1353 KKLVDSVPHVAGLNPRSFRQFH-------SNGKAHR----PGPDSIVDCELLSHYEMLPL 1401
+ +PHVAGLNPR+FR G++H P + ++D +LL + +L
Sbjct: 2740 NHMSRRLPHVAGLNPRAFRHRFCRIPKSLGGGQSHHAPPAPASNGLLDGQLLLGFPLLSR 2799
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDL 1425
+ Q + A G T QI+S+L +
Sbjct: 2800 QHQGQAAEALGVTVRQIMSDLRAI 2823
Score = 104 bits (259), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 120/215 (55%), Gaps = 25/215 (11%)
Query: 228 ARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
A + + +++NL + + ++ V+D +F+HGY EPV+++LHE + TW G + + TC ++A+
Sbjct: 1305 ATLGNGYLLNLNKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWVGMLRERKDTCCLAAI 1364
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA 346
SIS LK+H ++W +LP+D +KLLAVP VLV+ N + SQ++ A ALN+ A
Sbjct: 1365 SISLRLKRHTILWKLASLPYDCFKLLAVPY-RPAVLVISPNLLLLCSQASQHAAALNSNA 1423
Query: 347 VS--------LDSSQELP---------RSSFSVELDAA-----HATWLQN-DVALLSTKT 383
+ LD S+E P + + +V D A +AT + + +V ++
Sbjct: 1424 LPGEVPPPLILDPSREPPAATAARLAAQYALNVHPDCAPAAGRNATLMADLEVVAAGLQS 1483
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
G L+ + + ++G QR+ + +T + S + I
Sbjct: 1484 GTLLAVHLQFEGPADQRITVVRTGGGPIASAMVGI 1518
Score = 95.5 bits (236), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 8/188 (4%)
Query: 56 PNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD-GISAASLELVCHYRLHGNVE 114
PNL+V N +E++ +R + + + G A LELV Y LHG VE
Sbjct: 1078 PNLIVVRTNRLEVHSLRSSAVATNAAAATATAAATASAAVGSGGARLELVVSYHLHGVVE 1137
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
SLA+LS G +S RRD+++LAF + K+SV+E++ H LR +S+H FE + + GR
Sbjct: 1138 SLAVLSGG---SSSRRDALLLAFREGKLSVVEWNPRTHSLRTSSLHYFEGDPGVQ-REGR 1193
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG---LVGDEDTFGSGGGFSARIE 231
+ P V DP GRC + Q+ +L A + +G V D G G G RI
Sbjct: 1194 IAVPLPPRVVTDPAGRCAAMSFCFSQLALLPALEVKAGAWQCVDDGGVMGVGRGERERIG 1253
Query: 232 SSHVINLR 239
H+ R
Sbjct: 1254 GVHINERR 1261
>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
Length = 1397
Score = 269 bits (687), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 350/1484 (23%), Positives = 618/1484 (41%), Gaps = 222/1484 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + G+ ++ D + A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSAPA---GQPDQKGRHDRLQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E +
Sbjct: 85 ERVR---ISNSKSGGDAILVSSRNAKLSLIEWDPQKHGITTISIHYYEGEESHMSPWVPD 141
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-DTFGSGGGFSARIES- 232
+ + VDP G C + +G+ + IL Q G LV D+ D +G + +
Sbjct: 142 LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDAIPNGDDTTDAVNDA 200
Query: 233 ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKPAPGNAVHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSMSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ K + + LP D +K++ +P PIGG L++G N +H
Sbjct: 261 IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDIFKVVPLPPPIGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + +S + L+ L + LL G + +LT
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCMVEQLGSGAGDVLLILSDGRMAILTF 380
Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS--- 441
DGR V + L ++ S++ S + ++G + F GS GDS+L+ ++ S
Sbjct: 381 KVDGRSVAGISLHFVAEQSGGSIIKSRPSCSASLGRNKLFYGSEEGDSILLGWSKHSSAT 440
Query: 442 -------------GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
GT+ LS +++ D + +++ + +VNG+ G
Sbjct: 441 KKPSKAAGGGNEDGTANLSDEEEQDDDDDDMYEDDLYSANPTTTQQEKQVVNGD-----G 495
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
+A+ F+ D L ++GP +D + G + + S +EL +
Sbjct: 496 AAN---------FTLRAHDRLWSLGPYRDITLGRPPKSKSKDRQDSVPEISAPLELVAAR 546
Query: 549 GI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA------------RTMVLE 591
G TV + DS +M DD Y + I ++ R ++L
Sbjct: 547 GFGKAGGLTVLKREIDPFTIDSLKM---DDVYGVWSIRVIDPKSKDAGLSRSYDRYLLLA 603
Query: 592 TADLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILD 635
A + ESV Y V + TI G L RV+QV R D
Sbjct: 604 KAKG-DDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGSRVVQVLRTEIRSYD 662
Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVS 693
+ + P E S E TV+ S A+PY+L D S+ +L D + V
Sbjct: 663 CNLGLAQIY--PVWDEDTS--EERTVIQASFAEPYLLTIRDDNSLLILQADKNGDLDEVE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
+Q AA S K VS C LY DK + S+D D +I
Sbjct: 719 IQGSAA---SAKWVSGC-LYEDK-----TKIFSSDL-------------DTEHAATPNIL 756
Query: 754 SVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ G L IF +PN + VD L S SSS
Sbjct: 757 LFLLDSDGNLSIFRLPNITEPLCRVDNL--------------NLLPSNLPYESSSRRPV- 801
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+E + + V +L A H P++ ++ Y+ Y G SK
Sbjct: 802 --NRETLTELLVADLG----DAIHKSPYMILRTKHDDLVLYEPYRITGENGRSKLQ--FI 853
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
+ + V ++ N +R+P +P + + ++ G++ F+SG
Sbjct: 854 KAVNHVVMGPRTNQPMNKDINRSP------------SPSKLLRALSDVCGYKTVFMSGQN 901
Query: 933 PCWCM---VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
PC+ + + R + +L ++ + T H C GF YV ++++ +LPS + +
Sbjct: 902 PCFILKSAIARPNVL---RLRGKAVQSLTGFHIAACERGFAYVDEDNVIRMSRLPSNTRF 958
Query: 990 DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
D+ W +K IPL I Y + Y + S + L D E + N
Sbjct: 959 DSAWATRK-IPLGEQVDCIVYSSASESYVIGTST------KEDFKLPEDDESHTEWRNEF 1011
Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKEN 1108
++ + ++ V++LEP W ++ +E ++++ L + TT E
Sbjct: 1012 ITFLP-----QLDRGTVKLLEPKN----WSAIDIYEVEPAERITCIKIIRLEISETTHER 1062
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLF---STGRNADNPQ-NLVTEVYSKE-LKGAISALAS 1163
+ ++ +G+A +GED+ +G + +F + D+P+ N +++++E +KGA++A++
Sbjct: 1063 KDMVVVGSAVAKGEDIVPKGCIRVFEIIDVVPDPDHPEKNKKLKLFAREEVKGAVTAVSG 1122
Query: 1164 L--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKS 1217
+ QG L++A G K ++ K G+ L IAF D YV L +K ++GD K
Sbjct: 1123 IGGQGFLIVAQGQKCMVRGLKEDGSLLP-IAFKDTQ-CYVNVLKELKGTGMCIIGDAFKG 1180
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
++F + E+ +L+L K+ +L +FL DG+ L ++V+D+ N+ + Y P+ S
Sbjct: 1181 LWFTGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSS 1240
Query: 1278 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT---------NRFALLFG 1328
KG +LL R+ FH G F L T ++P + +++ +L
Sbjct: 1241 SKGDRLLHRSVFHTG----HFASTMTLLPHGSHTLSSPVDEDAMDTDLPPPPSKYQVLIT 1296
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
GSIG I+PL+E ++RRL +LQ +LV+++ H GLNPR +R S+G + G ++
Sbjct: 1297 FQTGSIGVISPLNEDSYRRLLALQSQLVNALEHPCGLNPRGYRAVESDGMGGQRG---MI 1353
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
D LL + + + + EIA + G I +L L G ++L
Sbjct: 1354 DGNLLLRWLDMGAQRKAEIAGRVGADVGAIRIDLEKLHGGLAYL 1397
>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1460
Score = 268 bits (684), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 364/1506 (24%), Positives = 601/1506 (39%), Gaps = 246/1506 (16%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD--GISAA------- 99
NLVV +++++++ + +Q G+ + +++ + R+ D G+ A+
Sbjct: 28 NLVVAKSSLLQVFRTKTVTTEIDTLQTNGASKGRSAARYENRLANDDDGLEASFLGGDSL 87
Query: 100 ----------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
L LV L G V L+ + + + +S++LAF DAK+S++E+D
Sbjct: 88 GFRADRTTNTKLVLVYETPLAGTVIGLSKIKTSTSRSGC--ESLLLAFRDAKLSLVEWDA 145
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
+ L S+H +E E + S PL + DP RC + + I
Sbjct: 146 ERNALGTVSIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALKFAARNLAI 199
Query: 204 LKASQGGSGL-VGDEDTFGSG------------GGFSARIES-----SHVINLRDLD--M 243
L Q + +GD D G ++ IE S V+ L +LD +
Sbjct: 200 LPFRQVDEDIDMGDWDEELDGPRPQKDVSNAAVSNGASNIEDTPYSPSFVLRLSNLDPSL 259
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
H F+H Y EP IL H+T M+ L + K I S
Sbjct: 260 LHPVHLAFLHEYREPTFGILASTSSASNALGRKDHYTYMVFTLDLQQ--KASTTILSVSG 317
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
LP D Y+++ +P+P+GG L+VG N IH +A+N S +S ++
Sbjct: 318 LPQDLYRVVPLPAPVGGALLVGCNELIHIDQSGKPNGVAVNPMTKQCTSFGLADQSDLNI 377
Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDIT 416
L+ L D+ L+ G +VL+T DGR V L+L P +++ I+
Sbjct: 378 RLEGCIIDVLTPDLGEFLMILNDGRMVLITFRIDGRTVSGLELRLVPPASGGTIIPGRIS 437
Query: 417 T---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
T IG ++ F GS GDSL+ +T + + + + D
Sbjct: 438 TLSRIGKNVMFAGSEEGDSLVFGWT-----KKQTQAGRRKSKPRDDDFYMDDYEEEEEEV 492
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------------ 521
D+ E S + S + SF + D L++I P++ +YG
Sbjct: 493 DEDDLYGEETTSHHQPVSAASSLLSGDLSFRIHDRLISIAPIQSMTYGQPVWMPGSEEER 552
Query: 522 --LRINAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNAD 564
+ ++AD A G K S + E +G WT+ K +
Sbjct: 553 NSIGVHADLQLVCAVGRDKSSCLATMNLAIQPKVIGQFEFSEARGFWTMCAKKPIPKSLQ 612
Query: 565 SSRMAA------YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGR 610
S + + YD +Y ++I++ E + TA + + G
Sbjct: 613 SDKGVSVLGGNDYDTGGQYDRFMIVAKVDLDGYEKSDVYALTAAGFEGLCGTEFDPAAGI 672
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
TI AG + R++Q+ + R DG + + P E +G+E V + SIADPY
Sbjct: 673 TIEAGTMGKGSRIVQILKSEVRSYDGDFGLSQIV--PMMDEE-TGAEPRAV-TASIADPY 728
Query: 671 VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
+L+ D S + D S ++ + S K +S C LY+D
Sbjct: 729 LLIIRDDSSAFIAGIDSSNELEELRKEDKVLVSSKWLSGC-LYND--------------- 772
Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVD 789
ST + P I + SGAL I+ +P+ + ++ D
Sbjct: 773 -STAIFAEETAKSSKPTQS--ILLFLLSSSGALYIYRLPDLSKPIYVTDGLA-------- 821
Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
Y+ AL T +GT KE I + V +L H P+L ++
Sbjct: 822 -YIPPALSSDFT-----VRKGT---PKEAITEIMVADLG----DTTHKSPYLILRHSNDD 868
Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
+ YQ Y ++ + T + S + +L N F+R P + +++ P
Sbjct: 869 LTIYQPYRYK-----------LGTGQVFS-KTLFFQKLPNPSFARAP-EETEQDDVPPQP 915
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNVNCNHG 968
+ NI+G+ FL G P + + + + RV P L ++A + H C+HG
Sbjct: 916 RLLSMRRCNNIAGYSTVFLPGHSPSFILKSAKSMPRVVP-LQGAGVIAMSPFHTEGCDHG 974
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
FIY S I ++ Q+P +Y K +P+ + Y + Y +V +P
Sbjct: 975 FIYADSHNIARVTQIPEDWSYAELGLAVKKVPIGEDIAAVAYHPPQQCY--VVGCNASEP 1032
Query: 1029 LNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1087
E+ D H + +L T++ ++++ P W T+ ++
Sbjct: 1033 F----------ELPKDDDYHKEWARENLVFKPTLDRGLLKLISPIT----WTVIDTVQLE 1078
Query: 1088 SSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
E L V + L + +T E L+A+GTA +GED+ RGRV ++ P
Sbjct: 1079 PCETVLCVETLNLEVSESTNERRQLIAVGTALTKGEDLPTRGRVHVYDIADVIPEPGKPE 1138
Query: 1147 TEVYSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYD 1193
T SK+LK GA++AL+ + QG +L+A G K ++ K GT L +AF D
Sbjct: 1139 T---SKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMD 1194
Query: 1194 APPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1251
YV + + L+ D K ++F+ + E+ ++ L K L+ +FL DG
Sbjct: 1195 MS-CYVTAAKELPGTGLCLMADAFKGVWFVGYTEEPYKMMLFGKSSTKLEVLTADFLPDG 1253
Query: 1252 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRL-QMLATSSD 1309
L +V D +I I + P+ +S +G LL R F+ GAH TK L L L T +
Sbjct: 1254 KELFIVACDADGHIHILQFDPEHPKSLQGHLLLHRTSFNTGAHNPTKSLLLPSTLPTDTP 1313
Query: 1310 RT------------------GAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
T AP LL + G I + PL E ++RRL SL
Sbjct: 1314 STIDGSNPNTNNTNGTPNASNLAPYDATERPHILLLCSPTGLIAALRPLSESSYRRLSSL 1373
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS-----IVDCELLSHYEMLPLEEQLE 1406
+LV+S+PH AGLNP+ +R + G D+ IVD +L + L + + E
Sbjct: 1374 AAQLVNSLPHAAGLNPKGYRM--PSADCPPAGVDASVGRNIVDGTVLERFTELGMARRAE 1431
Query: 1407 IAHQTG 1412
+A + G
Sbjct: 1432 LAGRAG 1437
>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
Length = 1383
Score = 267 bits (683), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 332/1465 (22%), Positives = 619/1465 (42%), Gaps = 211/1465 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD + S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
++ ES+ + V G TI+ G R+RV+QV + R D +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
TQ ++ ++ +S S+AD + + D ++ L D S V
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719
Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+ S K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
L ++ V + + ++ + G + ++ SSE G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
V +L + WSA P+L ++ Y+ ++ VST +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S+ N R P + + + + + I +ISG F+ G+ + +
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFII-- 903
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
S F L N + ++ C+LP + +D W +++V
Sbjct: 904 ---------RTSASAPHFLRLRGEN--------SRSSTVRFCKLPPMTRFDYQWTLKRVH 946
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHR- 1057
+ H + Y +Y VL + L D E+ + N +S R
Sbjct: 947 LGEQVDH-LAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARG 998
Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGT 1116
++ ++ + D + + + + E + ++ ++L + T E + ++ +GT
Sbjct: 999 SFIKLVWDHHLQRQDSVILIFHLH-SFSLGADEYVMAIKNISLEVSENTHERKDMIVVGT 1057
Query: 1117 AYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLL 1169
A+ +GED+ +RG + +F + +P + T+ + + +KGA++AL+ + QG +L
Sbjct: 1058 AFARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVL 1117
Query: 1170 IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKE 1225
+A G K ++ K G+ L +AF D YV + +K +LGD K ++F + E
Sbjct: 1118 VAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGVWFAGYSE 1175
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+ +++L AKD L+ A EFL DG L +VV+D NI + Y P+ +S G +LLS
Sbjct: 1176 EPYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLS 1235
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFGTLDGSIGCIAPLD 1341
R++FH+G + L SS++ ++ G D N+ L L T +GS+G I +
Sbjct: 1236 RSKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIP 1295
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E ++RRL +LQ +L +++ H GLNPR+FR S+G A R ++D LL + +
Sbjct: 1296 EESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSK 1351
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDLA 1426
+ + EIA + G +I ++L ++
Sbjct: 1352 QRKTEIAGRVGAREWEIKADLEAIS 1376
>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
Length = 1453
Score = 266 bits (681), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 182/622 (29%), Positives = 329/622 (52%), Gaps = 69/622 (11%)
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
++++VE++++ ++S+P+L G ++ Y+++ E + K + R LS
Sbjct: 876 ENLEIVEISLE--ILNNSQPYLLLKNRIGDLIVYKSFKKENGDLRFKKYNHNFILRDLSN 933
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S + D Y + + I K S + G F+ G +P W
Sbjct: 934 NSKSINS-----------DGYRK---------KSIVNIKLSSKNNGVFIGGQKPVWIFNE 973
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKV 998
+ +R+H DG+IV+ HN +C +GF+Y T + +KI L ++N + +++V
Sbjct: 974 KGYIRLHSMDFDGAIVSLKPFHNADCPNGFLYYTEDKQHIKIGYLNGLMNFENEYAIRRV 1033
Query: 999 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1058
P+K + H+I Y E Y ++VS P +V +++ + +
Sbjct: 1034 -PIKLSAHKIAYHNELKCYVVVVSFP---------------QVTQELEEDSKKPI----- 1072
Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT--TKENETLLAIG 1115
T E+++++I++P W+ + +Q E L +++V+L F + T +++ L IG
Sbjct: 1073 LTDEKFQIKIIDP-TIDWSWRFIDSFSLQDRETVLAMKIVSLKFKESDETIKSKPFLVIG 1131
Query: 1116 TAYVQGEDVAARGRVLLFS--TGRNADNPQNLVTE----VYSKELKGAISALASLQGHLL 1169
TA+ GED +GRVL+F + + +L T+ +Y KE KG ++AL+S+ G LL
Sbjct: 1132 TAFTFGEDTQCKGRVLVFEIVSHKTQFESDDLGTKRLNLLYEKEQKGPVTALSSVSGLLL 1191
Query: 1170 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
+ GPK+ ++++ +L ++F+DA +Y+ S++ +K +I++GD++KS+YFL W G Q
Sbjct: 1192 MTIGPKLTVNQFLTGQLVTLSFHDAQ-IYICSISTIKTYIVIGDMYKSVYFLQW--NGKQ 1248
Query: 1230 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1289
L L+KD+ SL+ F+TEF+++ TLS++VSD KNI +F + P S +GQ LL +A+F
Sbjct: 1249 LVPLSKDYQSLNIFSTEFIVNQQTLSILVSDLDKNILLFSFDPADPTSRQGQMLLCKADF 1308
Query: 1290 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1349
H+G+++ KF+R M + +D+ + FGTLDGS+ + PLDE ++
Sbjct: 1309 HIGSNIEKFVRTPMKFNIQSSSNGNNNNDQ----LVFFGTLDGSLNVLRPLDERMYQLFY 1364
Query: 1350 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS-------IVDCELLSHYEMLPLE 1402
LQ KL +P AGLN + +R F S + P + I+D +LLS + L +
Sbjct: 1365 HLQSKLY-YLPQPAGLNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGDLLSKFVKLNQK 1423
Query: 1403 EQLEIAHQTGTTRSQILSNLND 1424
E+ +A G+ +IL+ L +
Sbjct: 1424 ERRLLASSVGSNTDEILTALKN 1445
Score = 248 bits (633), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 198/738 (26%), Positives = 331/738 (44%), Gaps = 119/738 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS-AASLELVCHYRLHGNVES 115
NLV++ N +++Y + K KN T ++ + + SLEL+ +L G +ES
Sbjct: 31 NLVLSKNNTLQVYKI-------KYVKNENTTTQQKQIKKVEIKPSLELLIELKLFGTIES 83
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+A + G + +DS++L F DAKISVL+++ I I S+H +E+ E+ K GR
Sbjct: 84 MASVRYPGEN----KDSLLLTFRDAKISVLDYNIDIMDFEIRSLHFYENDEF---KNGRI 136
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
F P++K+D Q RC +L+Y +++L Q S L +++ ++
Sbjct: 137 HFKHPPILKIDTQQRCATMLLYDRNIVVLPFKQISSILDDEDEEEKDEEDEKENDNANQD 196
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
D +F F++GY EP ++ LHE TW R++ K T ++A+SI+ + K
Sbjct: 197 YTEEFDDDDDDNNFCFLYGYYEPTILFLHEPSQTWTSRIAVKRLTSQLTAISINFSTKLA 256
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA-VSLDSSQE 354
+IW N+P++ +L++VP P+ G LV+ N + + +Q++ LA+N YA + + E
Sbjct: 257 SIIWHTSNMPYNCDQLVSVPEPLSGALVITPNIMFHVNQTSKYGLAVNEYANIDIGDKFE 316
Query: 355 LPRS---SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
P + LD ++ +L+ D + S K G+L++ ++ DGR VQR+ +SK SVL
Sbjct: 317 FPLDETLNLVFTLDRSNFVFLEADKFIGSLKGGELLIFHLISDGRTVQRIHVSKAGGSVL 376
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
+ + + ++L FLGSRLGDSLL+Q+T S T +E + E + K+ + S
Sbjct: 377 ATCMCVVSDNLLFLGSRLGDSLLLQYTEKSIT--------DESLEHENFSNPYKKQKTSE 428
Query: 472 SDAL-----------QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ L D V EE L+ N +S Q + D ++N+GP+ D
Sbjct: 429 QEKLLNQQQQQQKDEMDEVLDEEDELFKEKKNQLKSYQ----LGICDQILNVGPVGDMVI 484
Query: 521 GLRINADASATGISKQSNY--ELVELPGCKG----------------------------- 549
G +N + Y +EL C G
Sbjct: 485 GQALNPTYDLNTLPSDPAYMPRFLELVTCSGYGKNGSISILQNSVKPEIVGAFDSEGVVN 544
Query: 550 -IWTVYHKSSRGHNADSSR------------------------MAAYDDEYHAYLIISLE 584
WTVY+K+S D +++Y YL IS+
Sbjct: 545 SFWTVYNKASSSIKEDEEEKLIGKKRTINEIIKEEQQYEQQQQKQPIEEDYLDYLYISMS 604
Query: 585 ARTMVLETADLLTEVTESVDYFVQG----RTIAAGNLFGRRRVIQVFERGARIL-DGSYM 639
T T L T +E +G RT+ GNLF +RR++ + E ++L D + +
Sbjct: 605 NGT----TNILDTTSSEEGKLTFKGEFEYRTLDMGNLFNKRRIVLINENSIKLLNDYNNI 660
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
Q++ + S I DPYVL+ SD SI+L D ++ +
Sbjct: 661 VQEIKLS------------KPIKSTFIQDPYVLVHYSDNSIQLFKCDYKLLKLNQFNFSL 708
Query: 700 IESSKKPVSSCTLYHDKG 717
+ V + +L+ DK
Sbjct: 709 NHGDEGKVLTSSLFFDKN 726
>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1417
Score = 266 bits (680), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 324/1389 (23%), Positives = 594/1389 (42%), Gaps = 161/1389 (11%)
Query: 103 LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
V +RLHG V ES+ I+S D ++++F+DAKI++LE+ D+++ L S+
Sbjct: 120 FVREHRLHGTVTGMESIRIVSS----QEDGLDRLLVSFKDAKIALLEWSDAVNDLLTVSI 175
Query: 160 HCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGVLVYGLQMIILKASQGGSGL- 213
H +E +P+ + L+ PL ++ DP RC +++ + IL Q + L
Sbjct: 176 HTYERAPQMMALE--------APLFHSQLRTDPLSRCAALMLPKDSLAILPFYQSQADLD 227
Query: 214 VGDEDTFGSGGGFSARIESSHVINLR-DLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
+ ++DT S S V+++ D+D +KHV D +F+ G+ P + +L + TW
Sbjct: 228 IMEQDTQTSCRDIP--YSPSFVLDMTTDVDERIKHVIDLVFLPGFNSPTIAVLFQNTQTW 285
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
R+ T + ++ + P++ + NLP+D L+ + +GGV++V AN +
Sbjct: 286 TSRLREYKDTVGLIIFTLDLVTRNCPVLTAVDNLPYDCLYLVPCSAQLGGVVIVSANALI 345
Query: 331 YHSQ-SASCALALNNYAVSLDSSQELPR-----SSFSVELDAAHATWLQNDVALLSTKTG 384
Y +Q S L +N + + S LP+ S +++L+ ++A ++ ++ + G
Sbjct: 346 YVAQTSRRVILPVNGWQARV-SDHPLPQLTEEEKSRNLKLEGSYAVFVDDNKLFVLLSDG 404
Query: 385 DLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ + V DGR V RL + S + + + + + + F+GS G S+L++ T
Sbjct: 405 TVYPMEVHADGRTVSRLTMGSALAQTTIPAIVRRVTDENLFIGSTAGPSVLLK------T 458
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
S + +KEE +++ AP+ + D D +GE A T
Sbjct: 459 SHVEEDVKEEDVEMDT-APAAVVDEANEMDLDDD--DGELCHWVHFAKKRT-----VVHL 510
Query: 504 AVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP---------- 545
++ DS+ GP+ D ++ L D +ATG + L + LP
Sbjct: 511 SLCDSIPAYGPVSDMTFSLTRVGDRPVAELVAATGSGGLGGFTLFQRDLPSRVKRKLHAV 570
Query: 546 -GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTE 601
G +G+W++ ++ R + + R + + +IIS +A + A ++
Sbjct: 571 GGGRGMWSLAVRQAVRVNGSTYERPSNPHHGGNDAVIISTDANPSPGLSRIASRSSKSDI 630
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQDLSFGPSNSESGSGSE 657
+ + G T+ A + F ++ V R+L DG+ + +DL
Sbjct: 631 QITTRIPGTTVGAASFFQGTAILHVMSNAIRVLEPDGTERQIIKDLD---------GSVP 681
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDK 716
+ S+ DP++++ D S+ L +G+P + + + + E + K ++ C + D
Sbjct: 682 RPKIRYCSMCDPFIMVIREDDSLGLFIGEPERGKIRRKDMSPMGEKTSKYIAGC-FFMDT 740
Query: 717 GPEPWLRKTSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
R + A V + + G Q + ++ G LE++ +P VF
Sbjct: 741 TGIFQSRVNAAAAAADKNVTSTLQTVMNAGTRTQ---WLLLVRPQGVLEVWSLPKLALVF 797
Query: 776 TVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
+ + + +VD+ AL Q + V ++A+
Sbjct: 798 STSHVSALESVLVDSGDSPAL-------------SLPQDPPRKPQDLDVEQIAIAPLGES 844
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
S+ +L L G Y+A P S P + + +L V V +
Sbjct: 845 SSKLYLLVFLRCGLFAVYEAL----PAPASTDPPPPTRTSTLCVKFV--------KVVTR 892
Query: 896 PLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLSGSRPCWCM-VFRERLR 944
D EE ++ I + + G FL+G RPCW + + ++
Sbjct: 893 AFDIQQSEEVEKSVLAEQKRISRQLIPFVTSPTPGRAFSGVFLTGDRPCWILSTDKGGVK 952
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKAT 1004
+ P + AFT F+ + +G I +P ++ + P + IP
Sbjct: 953 IMPS-GHQVVHAFTACSLWESKGDFLLYSDEGPSLIEWVPE-IQFEGHLP-SRSIP---R 1006
Query: 1005 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1064
P ++ + L+V+ L+ + S D+ V + D N+S E
Sbjct: 1007 PRPYSHVVFEPTTTLLVAASSLQ--STFTSYDEDRNVVWEPDEPNMS------LPVCETS 1058
Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGED 1123
+ ++ PD W T +E + +TL +T+ + +A+ T +GED
Sbjct: 1059 ALELISPDT----WTTMDGYEFAQNEFVTCMECITLETLSTETGTKDFVAVSTTINRGED 1114
Query: 1124 VAARGRVLLFSTGRNADNPQNLVTEVYS------KELKGAISALASLQGHLLIASGPKII 1177
+A +G V +F +P Y E KG ++AL + +L+ + G KI
Sbjct: 1115 LAVKGAVYIFEVVEVVPDPAMGQKRWYRLKLHCRDEAKGPVTALCGMDNYLVSSMGQKIF 1174
Query: 1178 LHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1236
+ E L G+AF D +YV SL VKN +++GD K ++ ++++E +L +LAKD
Sbjct: 1175 VRALDLDERLVGVAFLDVS-VYVTSLRAVKNLLVIGDALKGVWLVAFQEDPYKLVVLAKD 1233
Query: 1237 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1296
+ + + SL+ DE+ +++ Y P ES GQ+LL R EFH T
Sbjct: 1234 YYPIPVACADLFFADGKASLISCDEEGVLRLSEYDPHDPESRHGQRLLCRTEFH---GQT 1290
Query: 1297 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1356
++ ++A R G ++ + L+ G DGS+ + +D+ +RL LQ +L
Sbjct: 1291 EYRTSHLIA----RRGKGLDAE-IPQAKLICGHTDGSLTSLTYVDDAVSKRLHLLQGQLA 1345
Query: 1357 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1416
+V HVAGLNP++FR N + RP I+D LL+ +E LP+ Q+E+ Q T R+
Sbjct: 1346 RNVQHVAGLNPKAFRVVR-NDRVARPLTKGILDGNLLAAFEDLPVPRQVEVTRQIATERT 1404
Query: 1417 QILSNLNDL 1425
+L + DL
Sbjct: 1405 TVLKDWLDL 1413
>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
CBS 112818]
Length = 1398
Score = 264 bits (675), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 356/1475 (24%), Positives = 605/1475 (41%), Gaps = 203/1475 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ + + D+I+++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVRISNSKSGG--DAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141
Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
P + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 142 LGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSDA 200
Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ + + + LP D +K++ +P P+GG L++G N +H
Sbjct: 261 IDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380
Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSRP 436
Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
S T+ S K G E+ A D D + ++L AS E
Sbjct: 437 SSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLHSVNPASIRQEKQVVN 494
Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
+ F+F D L ++GP +D + G + + S + +EL +G
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKSG 554
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
TV + + DS +M DD Y + I ++ + T + + D LL +
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTRLSRSYDKYLLLAKAKGDD 611
Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
ESV Y V + T+ G L RV+QV R D Y
Sbjct: 612 KEESVVYSVGSSGLDSIDAPEFNPNEDCTVDIGTLATGTRVVQVLRTEIRSHD--YNLGL 669
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
P E S E TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 670 AQIYPVWDEDTS--EERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726
Query: 701 ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYES 760
S K VS C LY DK ++ + + GP +I +
Sbjct: 727 --SGKWVSGC-LYEDK--------------MNIFFSDFDIENEAGP----NILLFLLDVD 765
Query: 761 GALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
G L IF +PN + + VD L S SSS +E +
Sbjct: 766 GNLSIFRLPNISEPLCRVDNL--------------NLLPSNLPYESSSRRPV---NRETL 808
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ + +L A H P++ ++ Y+ Y G S R L
Sbjct: 809 TELLIADLG----DAIHKSPYMILRTKHDDLVLYEPYRIAGESGHSGL-------RFLKA 857
Query: 880 SN--VSASRLR---NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
N V R N +R+P + C+ + ++ G++ F+SG PC
Sbjct: 858 VNHVVMGPRTDQGVNHDINRSP------------SSCKLLRALPDVCGYKTVFMSGHNPC 905
Query: 935 WCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
+ ++ R H G V + + H C GF YV ++++ +LPS + +D+ W
Sbjct: 906 F-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGW 964
Query: 994 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
+K I L I Y + Y + S + L D E + N ++ +
Sbjct: 965 ATRK-IALGEQVDSIVYSSASECYVIGTSA------KEDFKLPEDDESHTEWRNEFITFL 1017
Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLL 1112
+E V++LEP W T + ++ +E + V+ L + T E + ++
Sbjct: 1018 P-----QLERGTVKLLEPKN----WSTIDSHELKPAERITCIEVIRLEISELTHERKDMV 1068
Query: 1113 AIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-LKGAISALASL--Q 1165
+G++ V+GED+ +G + +F P ++ ++++KE +KGA++AL+ + Q
Sbjct: 1069 VVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQ 1128
Query: 1166 GHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFL 1221
G L++A G K ++ K G+ L +AF D YV L +K ++GD K ++F+
Sbjct: 1129 GFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVNVLKELKGTGMCIIGDAFKGLWFI 1186
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+ E+ +L+L K+ +L +FL DG+ L ++V+D+ N+ + Y P+ S KG
Sbjct: 1187 GYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGD 1246
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSS----DRTGAAPGSDKTNRFALLFGTLDGSIGCI 1337
+LL R+ FH G + L A + D S +++ +L GSI I
Sbjct: 1247 RLLHRSVFHTGHFASTMTLLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQTGSIAVI 1306
Query: 1338 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
PL E ++RRL +LQ +LV+++ H LNPR +R S+G + G ++D LL +
Sbjct: 1307 TPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGGQRG---MIDGNLLLRWL 1363
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
+ + + EIA + G I ++L L G ++L
Sbjct: 1364 DMGAQRKAEIAGRVGADVGAIRTDLEKLHGGLAYL 1398
>gi|55725165|emb|CAH89449.1| hypothetical protein [Pongo abelii]
Length = 565
Score = 264 bits (674), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 182/536 (33%), Positives = 282/536 (52%), Gaps = 65/536 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 538
>gi|414587797|tpg|DAA38368.1| TPA: hypothetical protein ZEAMMB73_143443 [Zea mays]
Length = 153
Score = 259 bits (663), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 127/159 (79%), Positives = 139/159 (87%), Gaps = 6/159 (3%)
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
M ESWKGQKLLSRAEFHVGAHV+KFLRLQML T G A S+KTNRFAL+FGTLDG
Sbjct: 1 MVESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ----GLA--SEKTNRFALVFGTLDGG 54
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
IGCIAP+DELTFRRLQSLQ+KLVD++PHV GLNPRSFR F SNGKAHRPGPD+I+D ELL
Sbjct: 55 IGCIAPVDELTFRRLQSLQRKLVDAIPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELL 114
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
SHYEM+ LEEQLEIA Q GTTRSQILSN +D +LGTSFL
Sbjct: 115 SHYEMMSLEEQLEIAQQIGTTRSQILSNFSDFSLGTSFL 153
>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 813
Score = 259 bits (663), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 225/824 (27%), Positives = 373/824 (45%), Gaps = 90/824 (10%)
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD--- 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 10 SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKDLSG 69
Query: 716 ----KG----------------------PEPWLRKTSTDAWLSTGVGEAI------DGAD 743
KG EP ++ + L G A D A
Sbjct: 70 LFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMADLAK 129
Query: 744 GGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
D + VV +SG LEI+ +P+ V+ V+ +G + D
Sbjct: 130 QSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM 189
Query: 792 MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
E + S T +S ++ +S +EL++ + RP L + T +L
Sbjct: 190 --EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVELL 246
Query: 852 CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP- 910
YQ +F P+ K R L N+ + ++ D E+ P
Sbjct: 247 IYQ--VFRYPKGHLKI-----RFRKLDQLNLLDQQPTHIELDEN--DEQEEIESYQMQPK 297
Query: 911 -CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN +G
Sbjct: 298 YVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNG 357
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
F+Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E +Y LI +P
Sbjct: 358 FLYFDTTYELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE--EP 414
Query: 1029 LNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RATI 1084
+ + D+E+ + S D Y + ++E+ ++ P+ W+ A+I
Sbjct: 415 MTKYYRFNGEDKELSEE-------SRDERFIYPIGSQFEMVLISPET----WEIVPDASI 463
Query: 1085 PMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ 1143
+ E+ ++V L + T + L IGT + ED+ +RG + ++ P
Sbjct: 464 TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEPG 523
Query: 1144 NLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
+T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D +Y
Sbjct: 524 KPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN-IY 582
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
V + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L +V
Sbjct: 583 VHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGFLV 642
Query: 1259 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
+D ++NI ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 643 TDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL--- 699
Query: 1319 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S+ K
Sbjct: 700 YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKK 759
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 760 QGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 803
>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit A
[Leptosphaeria maculans JN3]
gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit A
[Leptosphaeria maculans JN3]
Length = 1402
Score = 259 bits (662), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 338/1426 (23%), Positives = 582/1426 (40%), Gaps = 229/1426 (16%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V EG E N+ E + A L LV
Sbjct: 28 NLVVAKNSLLQIFEIKSTTTEVTPEGGDEVDNAAANLDTEAADVQFQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+ L G V SLA + A N++ R +++++AF DAK+S++E+D + L S+H +E+P
Sbjct: 88 FPLAGTVISLARIK---ALNTKSRGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144
Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGG-------SG 212
+ W + +F + DP RC + + IL Q S
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQSDLVEDDYDSD 199
Query: 213 LVGDEDT------FGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
L G DT SGG + + SS V+ L +LD + H F+H Y EP I
Sbjct: 200 LDGPRDTKPDQAEAPSGGETTHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHQYREPTFGI 259
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
+ ++ + S ++ K + S LP+D +++ +P PIGG L
Sbjct: 260 IAASRAAAPSLLANRKDILTYSVFTLDLEQKASTTLLSVTGLPYDISRVVPLPHPIGGAL 319
Query: 323 VVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
++G N IH + +A+N +A + S +S ++ L+ + L D +L
Sbjct: 320 LLGNNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALHLEGCNVELLSQDTGDVVL 379
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLTSDITTIGNSL---FFLGSRLGDS 432
G L+++T +GR V + + VL + + N + F+GS G+S
Sbjct: 380 VLNNGRLLIMTFTLEGRTVSGMTIQTVAADHGGHVLKAGSSCTSNLVRGRLFIGSEDGES 439
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG--SA 490
+L+ G S ++ L+ ++ D T D L D + + +A
Sbjct: 440 VLL------GWSSATASLRRRHSNVGLDGDGTSEEEEEDIDDLDDDLYNDTAPAVQKITA 493
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATG 532
+ + + T+SF + D+L +I P++D S G A + TG
Sbjct: 494 AASEPTPPGTYSFRIHDTLPSIAPIRDAVLHPGKVTDSLNRGEIMLSTGR--GAAGAITG 551
Query: 533 ISKQ---SNYELVELPGCKGIWTVYHKSS------RGHNADSSRMAAYDDEYHAYLIISL 583
+ ++ + ELP GIW V+ + D+ + D +Y YL++S
Sbjct: 552 LDRELHPVSLAASELPSTHGIWAVHARKQAPGGVVTAFGEDTEANMSTDVDYDQYLVVSK 611
Query: 584 EAR-----TMVLET-ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+ T+V E + L+E + +G T+ G L +V+QV R D S
Sbjct: 612 TSEDGSESTVVYEVHGNELSETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEVRTYD-S 670
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP 697
+ D + E+G+ V++ S ADPY+L+ L D S +
Sbjct: 671 ELNMDQILPMEDEETGN---ELRVINASFADPYLLV---------LREDSSVKILKASGD 718
Query: 698 AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVC 757
+E + R S+ WLS + ++ + I++ +
Sbjct: 719 GELEDLEA-----------------RGLSSTKWLSASLFKSATFTE--------IFAFLL 753
Query: 758 YESGALEIFDV--PNFNC-VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
G L IF + P C V F+ + R A K + TEI
Sbjct: 754 TPEGGLHIFAMSEPEKPCYVAEALGFLPPLLTVDFVPRRSAAKATITEI----------- 802
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
LA P L + ++ Y+A+ F
Sbjct: 803 ------------LAADLGDVTTRSPHLIIRTSSDDLVIYKAFHFP--------------- 835
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRP 933
S S +++ LR ++ ++ + Y + A + + ++ G+ F G+ P
Sbjct: 836 -SRSAADLWTKNLRWIKLAQQHVPRYVEDAGSEDAGVESTLLALDDVCGYSTVFQRGASP 894
Query: 934 CWCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
+ +F+E P+ L + T H +C GF YV S L+I QLPS + +
Sbjct: 895 SF--IFKEA-SSSPRVIGLSGKPVKGLTTFHTSSCERGFAYVDSTDTLRISQLPSRTHFG 951
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQIDN 1047
+ + +P+ A + + Y LY + P VL P E H
Sbjct: 952 HLGWATRRLPMDAEVYALAYHP-AGLYVVGTGQPEDFVLDP----------SETYH---- 996
Query: 1048 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1106
+ L D+ +VE +++++ G W T E L ++ + L + TT
Sbjct: 997 YELPKEDISFKPSVERGVIKLIDE----GTWSIIDTHVFDPQEVVLCIKALNLEVSETTH 1052
Query: 1107 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1161
+ + L+A+GT+ V GED+A +G + +F P T + E+KGA+SA+
Sbjct: 1053 QRKDLIAVGTSIVHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAI 1112
Query: 1162 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1215
+ L QG L++A G K ++ K GT L +AF D YV +L + N +L+GD +
Sbjct: 1113 SELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CYVTTLKTLPNTGMLLMGDAY 1170
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+ ++F + E+ +++L + +L+ A EFL L ++V+D NIQ+ + P+
Sbjct: 1171 RGVWFTGYTEEPYKMSLFGRSKHNLEAMAVEFLPFNGELHIIVADADMNIQVLQFDPENP 1230
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGA----APGSDKTNRFAL--- 1325
+S +G +LL +A FH G T LQ + S+ G AP S + L
Sbjct: 1231 KS-EGSRLLHKATFHTGHFPTTTHLLQSHLQMPESASTFGTTDTFAPDSTPSAPLPLHQV 1289
Query: 1326 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
L + G++ I PL E ++RRL +L L++++ GLNP +FR
Sbjct: 1290 LITSQSGTLALITPLSESSYRRLSNLAAYLINTLESPCGLNPVAFR 1335
>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
Length = 1458
Score = 258 bits (660), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 359/1512 (23%), Positives = 624/1512 (41%), Gaps = 248/1512 (16%)
Query: 57 NLVVTAANVIEIYVVRVQE-------EGSKESKN---------SGETKRRVLMDGISAAS 100
NLVV +N++ I+ VR + E +E K GE DG S
Sbjct: 40 NLVVARSNLLRIFEVREEPCAVPHGVEDERERKGGIRRGTEAVEGELAMDAQGDGFINVS 99
Query: 101 ----------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
L LV ++LHG V L+ + + A + D ++++F+DAKI++
Sbjct: 100 KGMAMKSDVEHPKTTRLYLVREHKLHGMVTGLSGV-RIIASLEDKLDRLLVSFKDAKIAL 158
Query: 145 LEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVK----VDPQGRCGGVLVYGL 199
LE+ D++H L S+H +E +P+ L PL K VDPQ RC + +
Sbjct: 159 LEWSDAVHDLVPVSIHTYERAPQLTSLT--------APLFKSQLRVDPQSRCAALGLPNH 210
Query: 200 QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYI 256
+ IL F S +++L + ++++V DF F+ G+
Sbjct: 211 ALAILP--------------FLDDAVSDVPYSPSFILDLAVSVNPNIRNVADFCFLPGFN 256
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+P + ++ E TW GR+ T + ++ +P+I S LP D+ L VP+
Sbjct: 257 KPTLAVMFEPLQTWMGRIGEYKDTVKLVIFTLDIKTSSYPIITSVDGLPMDSLGL--VPA 314
Query: 317 PIGGVLVVGANTIHYHSQSAS--CALALNNYA--VSLDSSQELPRSSFSVELDAAHATWL 372
GGV++ N++ Y QS+S A+ +N +A ++ LP ++ L+ + +
Sbjct: 315 -FGGVVITTPNSLIYIDQSSSRQIAVPVNGWASRITDLPLLPLPSPDLNLTLEGSKTVVV 373
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGS 427
+ G + + V+ DG+ V +L + K T PSV+ S
Sbjct: 374 DEKTLFVILANGIIYPIEVMADGKTVTKLQVGKPLAQATIPSVVES-------------- 419
Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-----GE 482
LGD L + +L + EE D E + + K + L D + +
Sbjct: 420 -LGDGHLFVGSTVGVGVVLKTAWVEEEVDDEEEGTNAKVVEDDIDMDLYDDDDDLYGDSK 478
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG---- 532
+ + +T+ + ++RD+L GP+ ++ L D +ATG
Sbjct: 479 NKTQVTAEVKDTKKYRSVLHLSLRDTLPAYGPISSLTFSLATEGDKPVPELVTATGSGIL 538
Query: 533 ---------ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----- 578
+ ++ +++ + G +G+W++ + S SS A + HA
Sbjct: 539 GGFTLFQRDLPTRTKKKILAVGGTRGLWSLPIRQSVKKGGSSSSTTAIE---HAKTERDT 595
Query: 579 LIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
LI+S +A R TEV ++ V G T+ A F R ++ V
Sbjct: 596 LILSTDATPSPGVSRIATRAPPGGKTEV--NITTRVPGTTVGAAPFFQRTAILVVMTNSI 653
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSE------NSTVLSVSIADPYVLLGMSDGSIRLLVG 685
++L+ P +E + + + S SI DP+VL+ D S+ L +G
Sbjct: 654 KVLE-----------PDGTERQTIQDMDGKLLRPKIRSCSICDPFVLIIREDDSLGLFIG 702
Query: 686 DPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGA 742
+ + + + + E + K ++ C G +TS +T + + G+
Sbjct: 703 ETERGKIRRKDMSPMGEKTSKYLAGCFFTDTSGLFGQQFETSVPVEGATATLQNVVSGGS 762
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
G Q + ++ G +EI+ +P F+V S +VD++ + AL S
Sbjct: 763 TSGGKPQHTQWLLLVRPQGVMEIWTLPKLTLAFSVSAVPSLFNVLVDSHDKPAL--SVPN 820
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
+ G+ E + +V E R LF L +G + Y+A P
Sbjct: 821 PGDPPQRKPGEFDVEQVCVSRVGE-------DGRGRVCLFVFLRNGQLTIYEAL----PL 869
Query: 863 NTSKSDDPVSTSRSLSV--------SNVSASRLR----NLRFSRTPLDAYTREETPHGA- 909
+T+ S S ++ V V A + R N++F + A+ + GA
Sbjct: 870 STTASQPAASVDGAMDVDASSTEPQQQVEAEKKRSQTLNIKFVKISSIAFEIQRHEDGAE 929
Query: 910 ------------------PCQRITIFKNIS----------GHQGFFLSGSRPCWCM-VFR 940
QR+ + + + G F +G +P W + +
Sbjct: 930 GGERGSGERASGILAEHKKIQRLFVPFTVKPKTSDGTPSPTYSGVFFTGDKPNWIIGTDK 989
Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIP 1000
++++P + +F+ F+ T G I LP TY + P + IP
Sbjct: 990 GGVQIYPS-GHNVVHSFSACSLWEERGEFLVYTEDGPCLIEWLPD-FTYSHPLPARS-IP 1046
Query: 1001 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT 1060
+ + F LIV+ + Q D++ G ++ + VD T T
Sbjct: 1047 -RGRGYSNVVFDPSTC--LIVAASSM----QARFASYDED-GVRVWEKDGPGVDDPITDT 1098
Query: 1061 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYV 1119
+ ++ P+ W T ++E + +VTL T+ ++ +A+GT
Sbjct: 1099 ---SALELISPNS----WITMDGFEFATNEYINDISIVTLETAATETGSKDFIAVGTTID 1151
Query: 1120 QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASG 1173
+GED+AA+G +F +P T Y L KG ++A+ QG+L+ + G
Sbjct: 1152 RGEDLAAKGAAYIFEIVEVVPDPAISPTRWYKLRLRCRDDAKGPVTAVCGFQGYLVSSMG 1211
Query: 1174 PKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
KI + + E L G+AF D +YV SL ++KN +L+GD KS+ F++++E +L L
Sbjct: 1212 QKIFVRAFDSDERLVGVAFMDVG-IYVTSLRVLKNLLLIGDAVKSVMFVAFQEDPYKLVL 1270
Query: 1233 LAKDFGSLDCFATEFLIDGS-TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH- 1290
LAKD +F + L+L+V DE+ ++I+ Y P +S G+ LL R E+H
Sbjct: 1271 LAKDVHLHSVTRADFFFNADGDLALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYHG 1330
Query: 1291 -VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1349
V H + T + R P +++ LL G+ DGS+ + P+DE F+RLQ
Sbjct: 1331 QVPYHTS--------TTIARRDKEDPSIPQSH---LLIGSADGSLSSLVPVDEYAFKRLQ 1379
Query: 1350 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1409
LQ +L ++ HVAGLNP++FR N +P I+D +LL+ YE LP+ Q E+
Sbjct: 1380 LLQGQLTRNIQHVAGLNPKAFR-IVKNDYVSKPLSKGILDGQLLAQYESLPIPRQNEMTK 1438
Query: 1410 QTGTTRSQILSN 1421
Q GT R +L +
Sbjct: 1439 QIGTERGVVLRD 1450
>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 1443
Score = 258 bits (659), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 341/1512 (22%), Positives = 614/1512 (40%), Gaps = 232/1512 (15%)
Query: 57 NLVVTAANVIEIYVVRVQ------EEGSKESKNSGETKRRVLMDGISAA----------- 99
NL+V ++++++ ++ EEG+ SK + + + DG+ A+
Sbjct: 28 NLIVAKTSLLQVFTTKITSIELGIEEGA--SKQNDKWDPSLDNDGLDASFIGADSLLRPD 85
Query: 100 -----SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
L LV Y L G + SLA + + + +++++ F DAK+S++E+D + G+
Sbjct: 86 RARRTKLVLVAEYTLSGTITSLARIKTLSSKSGG--EALLVGFRDAKLSLVEWDPARPGI 143
Query: 155 RITSMHCFESPEWLHLKRG------RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
S+H +E E L+R +ES + DP RC + G + I+ Q
Sbjct: 144 STISIHYYEQDE---LQRSPWAPNLKESVN---YLIADPGSRCAALKFGGRNLGIIPFKQ 197
Query: 209 GGSGLVGDE----------------DTFGSGGGFSARIESSHVINLRDLDMKHVKD--FI 250
+ D+ S S V+ L LD +
Sbjct: 198 DDEDVNMDDWDEEIDGPRPADKVITKATNSSNDKETPYGPSFVLRLATLDPNLINPIHLA 257
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F++ Y EP IL ++ + + + H T M+ L + + I S LP+D
Sbjct: 258 FLYEYREPTFGILSSSQMPASSLLFERRDHLTYMVFTLDLQQ--RASTTIMSVTGLPYDL 315
Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
++++ + +P+GG L++G N IH + +A+N +A S + +S + L+ +
Sbjct: 316 FEVVPLDAPVGGALLIGTNELIHIDQAGKANGVAVNVFAKQCTSFGLVDQSGLDMRLEGS 375
Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---I 418
L Q+ ++ +TG++ +L+ DGR V L + + + SV+ + ++T I
Sbjct: 376 KIEQLSIQSGEMIIFLQTGEIAILSFHMDGRSVSSLSVRRVSAEAGGSVIPARVSTLSHI 435
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
G + F+GS DS+++ + S S S K IE ++ D D
Sbjct: 436 GQNTLFVGSACADSMVLGW---SRKSNQVSRRKPRVEVIEDADDASLDELDDEDDDADDD 492
Query: 479 VNGEELSLYGSASN------NTESAQKTFSFAVRDSLVNIGPLKDFSYG---LRINADAS 529
+ GE S+ A+N S + F V DSLVNI P+ + ++G L N D
Sbjct: 493 LYGEGPSIIQDATNGVAKSDTVNSKAGDYVFQVHDSLVNIAPIVNITFGNASLSQNEDEK 552
Query: 530 ATGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGH 561
+ + ELV E P +GIWT+ K + +G
Sbjct: 553 LDSVGVRGYLELVASVGKQRAGALAVIHQNIQPKVIGRFEFPEARGIWTMSAKRPAEKGL 612
Query: 562 NADSSRMA-----AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QG 609
A + + A D +Y +I+S +A + ET+D+ + + + G
Sbjct: 613 EAKKEKSSTSGDYAIDAQYDRLMIVS-KALSDGTETSDVYALTSANFEALTGTEFEPAAG 671
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
TI AG L RVIQV + R DG+ + Q L + +G+E ++S S AD
Sbjct: 672 STIEAGTLGNGNRVIQVLKSEVRSYDGNLGLAQILPM----YDDDTGAE-PKIVSASFAD 726
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
PY+LL D SI + D + ++ + K ++ C LY D
Sbjct: 727 PYLLLFRDDSSIFVAQSDENNELEEIEREDDALLATKWLTGC-LYAD------------- 772
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHI 787
S GV + G +++ ++ + GAL I+ +P+ N V+ +
Sbjct: 773 ---SRGVFAPVQSDKGQKVEE-NVMMFLLSAGGALHIYALPDLSNAVYVAEGLC------ 822
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-PFLFAILT 846
++ L + S++ E + EL + +R P+L +
Sbjct: 823 ---FVPPVLSAAYAARRSAARE-------------TITELVVADLGDETARSPYLILRPS 866
Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
+ Y+ P +TS S S S + ++ N +R P + ET
Sbjct: 867 TDDLTIYE------PFHTS------SESSGGLASTLQFLKIHNPHLARNP--DVSAAETA 912
Query: 907 HGAPCQR---ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
G R + + N+ G+ FL G P + M + L + + H
Sbjct: 913 DGIQETRDEPMRVISNLGGYCTVFLPGGSPSFIMKSAKSTPKVISLQGLGVRGMSSFHTE 972
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C+ GFIY G+ ++ QLP +T+ + I L H + Y Y S
Sbjct: 973 GCDRGFIYTDVDGLARVSQLPKDTTFAELGVSLQKIELGQEIHGVAYHPPTECYVAATST 1032
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY--TVEEYEVRILEPDRAGGPWQTR 1081
+ E+ + DNH+ T+ T+E+ +R++ P W
Sbjct: 1033 EA------------EFELPKEDDNHHPQWAKEQITFKPTMEQGRLRLINPVN----WTVV 1076
Query: 1082 ATIPMQSSENALTVRVVTLFNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
+ + E + ++ + L + T E + L+A+GT +GED+A +GR+ ++
Sbjct: 1077 DEVELDPFEVIMCIKTLILETSEITNERKQLIAVGTGISKGEDLAIKGRIHVYDVINVVP 1136
Query: 1141 NPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIA 1190
P T K + +GAI+ ++ + QG ++++ G K ++ K GT L +A
Sbjct: 1137 EPDRPETNKRLKLIATEDIARGAITCISEIGTQGFMIVSQGQKCMVRGLKEDGTLLP-VA 1195
Query: 1191 FYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1248
F D Y+ S+ +K + D K ++ + E+ ++ L K +++ + L
Sbjct: 1196 FMDMN-CYITSIKELKGTGICVFSDAVKGVWVAGYTEEPYKMMLFGKSAKNMEIMQADLL 1254
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
DG L +V +D N+ I + P+ +S +G LL R+ F +G H+ + L L +
Sbjct: 1255 PDGKELYIVAADSDCNLHIMQFDPEHPKSLQGHLLLHRSTFALGGHLPTSMTL--LPRTK 1312
Query: 1309 DRTGAAPGSDKTNRFA--------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1360
T P D + A +L + G I + PL E +RRL +L L++++
Sbjct: 1313 SATLLPPSPDAMDTAADATIPEHEILITSSTGCISLLTPLSEAQYRRLSTLTSHLINTLY 1372
Query: 1361 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1420
H GLNPR++R + G +++D +L + L + + E+A + G ++
Sbjct: 1373 HACGLNPRAYR-VDKDAPEGMVGSRTVIDGNILMRWMELGSQRRAEVAGRVGVDVLEVRE 1431
Query: 1421 NLNDLALGTSFL 1432
+L L G +L
Sbjct: 1432 DLASLMGGLGYL 1443
>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus heterostrophus
C5]
Length = 1385
Score = 257 bits (657), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 329/1414 (23%), Positives = 583/1414 (41%), Gaps = 222/1414 (15%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++++ ++ V G E++N+ E L S A L LV
Sbjct: 28 NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYSLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ------------ 208
W + +F + DP RC + + IL Q
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200
Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
G ++ T + G + SS V+ L +LD + H F+H Y EP I+
Sbjct: 201 DGPKESKPEQQT--ASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 258
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T ++ + S ++ K + S LP+D +++ +PSPIGG L+VG
Sbjct: 259 SRDTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVG 318
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
+N IH + +A+N +A + S +S ++ L+ L ++ L+
Sbjct: 319 SNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALRLEGCSVELLSHEAGDVLVVLN 378
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
G L++LT DGR V + + S + +G F+GS GDS+++
Sbjct: 379 NGRLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVML 438
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
+T +S L+ + + D D D+ N ++ +A+ +
Sbjct: 439 GWTS------TASHLRRKQSNANIDTDEDMSDEEDMEDMEDDLYNDTAPAVQKITAAASE 492
Query: 495 ESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATGISKQ 536
+A T++F + D L +I P+K+ S G A AS T ++++
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEVMLSTGR--GAAASITALNRE 550
Query: 537 SNYELV---ELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR- 586
+ V +LP +G W V+ + + D A + +Y YL++S
Sbjct: 551 LHPVTVATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGED 610
Query: 587 ----TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
T+V E + LTE + +G T+ G L +V+QV R D S +
Sbjct: 611 GTESTVVYEVNGNELTETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEIRTYD-SELNM 669
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
D + ESG+ V++ S ADPY+L+ D S+++ A +
Sbjct: 670 DQILPMEDEESGN---EVNVINASFADPYLLVLREDSSVKIFR-------------ATGD 713
Query: 702 SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
+ V + L S WLS + ++ ++++ + G
Sbjct: 714 GELEDVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEG 752
Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKEN 818
L +F V + V + +S ++ Y+ R A+K + TEI
Sbjct: 753 GLRVFAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI--------------- 797
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
LA A P L + ++ Y+A+ S S S
Sbjct: 798 --------LAADLGDATTKSPHLIVRTSSDNLVIYKAF----------------HSPSRS 833
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCM 937
+++ LR ++ S+ + YT + + + + +I G+ F G+ P +
Sbjct: 834 AADLWTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLTLSDIGGYSTVFQRGTTPAF-- 891
Query: 938 VFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
+F+E P+ L + + T H +C GF Y+ S L+I QLP + Y +
Sbjct: 892 IFKESSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGW 950
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+ +P+ A H + Y + LY + P L+ E H + L D
Sbjct: 951 ATRRMPMDAEIHALAYHS-SGLYIVGTGQPEEYQLDP-------SETYH----YELPKED 998
Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLA 1113
+ T+E +++L+ W T + E L+++ + L + T + + L+A
Sbjct: 999 MSFKPTIERGIIKLLDEKT----WTIIDTHVLDPQEVVLSIKTLNLEVSENTHQRKDLVA 1054
Query: 1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QG 1166
+GTA + GED+A +G + +F P T + E+KGA+SA++ L QG
Sbjct: 1055 VGTAILHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQG 1114
Query: 1167 HLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLS 1222
+++A G K ++ K GT L +AF D YV L + + + D ++ ++F
Sbjct: 1115 FMIMAQGQKCMVRGLKEDGTLLP-VAFMDM-QCYVSDLKNLPGTGMLAMSDAYRGVWFTG 1172
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
+ E+ +++L A+ SL+ A +F+ L L+V+D N+Q+ + P +S G +
Sbjct: 1173 YTEEPYRMSLFARSKHSLEAIAIDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSR 1232
Query: 1283 LLSRAEFHVGAHVTKFL-----RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1337
LL ++ FH G H L RL+M ++SD P +L + G++ +
Sbjct: 1233 LLHKSTFHTG-HFPATLHVVHSRLKM-PSASDFAATQP------LHQILCTSQSGTLALV 1284
Query: 1338 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
PL E T+RRL +L L +++ AGLNPR+FR
Sbjct: 1285 TPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFR 1318
>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
Length = 1495
Score = 257 bits (657), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 347/1450 (23%), Positives = 591/1450 (40%), Gaps = 215/1450 (14%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S L LV Y L G V SLA + +++ +S++L+F+DAK+S++E+D HGL
Sbjct: 147 STTKLVLVGEYALAGTVTSLARIKI--SESKSGGESLLLSFKDAKLSLVEWDPERHGLST 204
Query: 157 TSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS 211
S+H +E E W ++ + DP+GRC + + IL QG
Sbjct: 205 VSIHYYEQEEIGGSPWDPYLSNCFNY-----LTADPRGRCAALKFGARNLAILPFRQGDE 259
Query: 212 GLVGDE--------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
D+ T + G S V+ L LD + H F++ Y
Sbjct: 260 DTTMDDWDEELDGPRPTTAIITSENKGHEDTPYAPSFVLRLSSLDPTLIHTVHLAFLYEY 319
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL + + + ++ K I LP+D +K++ +P
Sbjct: 320 REPTFGILSSTLSPSSSLLDERKDQLSYMVFTLDLNQKASTTILVVTGLPYDLFKVIPLP 379
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
SPIGG L+VG N IH + +A+N A S S + +SS + L+ L
Sbjct: 380 SPIGGALLVGGNELIHIDQSGKANGVAVNALAKSCTSFGLVDQSSLQMRLEGCAVEQLSA 439
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFL 425
D L+ TG+L +L+ DGR V L+L + PS S + I ++ F+
Sbjct: 440 DNGEMLIILNTGELAVLSFRMDGRSVSGLNLRRV-PSESGICMGAQASCTSLINHNSMFI 498
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE-- 483
GS DS+++ ++ S + + + + D +D + GE
Sbjct: 499 GSEDTDSIVLGWSRKSKQAGRRRS-QPTIDAGDDADVDGTDEDQEDEDEDEDDLYGESTA 557
Query: 484 -LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----RINADASATGISKQSN 538
+ L G + + S ++F + DSLVNI PL+D + R + D AT IS +SN
Sbjct: 558 AIPLKGEVAADANSKAGDYAFRIHDSLVNIAPLRDVTLSKPETPREDEDEEAT-ISTRSN 616
Query: 539 YELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
+ELV E P +GIWT+ K + + A
Sbjct: 617 FELVGVTGRNTSGSLAFLRREIEPNVIGRFEFPEARGIWTLCAKRPLIKGLEPEKSEAIL 676
Query: 573 D-------EYHAYLIISL-------EARTMVLETADLLTEVTESVDYF-VQGRTIAAGNL 617
D ++ +I+S E+ VL +A E ++ G TI G +
Sbjct: 677 DPESELGAQFDRLMIVSKSTEDTPEESSVYVLTSAGF--EALADTEFEPAAGATIKCGTV 734
Query: 618 FGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
RV+Q+ + R DG + Q L + E+G+ +++ SI DPYVLL
Sbjct: 735 GNGMRVVQILKSEVRSYDGDLGLAQILPM--FDDETGA---EPKIVAASIVDPYVLLIRD 789
Query: 677 DGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG 736
D SI + D ++ + K +S C LY+D STG+
Sbjct: 790 DASIFVASCDSDNDLEEIERGDDSLLTNKWLSGC-LYND----------------STGMF 832
Query: 737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--R 793
++G + I S++ E GAL ++ +P+ + ++ + T I Y R
Sbjct: 833 AETALSNGTVSKKSVIMSLLNSE-GALFMYALPDLSKPIYQANGVSFIPTTISPDYATRR 891
Query: 794 EALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCY 853
+ ++ TE+ L A P+L ++ + Y
Sbjct: 892 STVAETLTEV-----------------------LLADLGDATSKSPYLIFRASNDDLTIY 928
Query: 854 QAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR 913
+ F+ P S+ P S+SL + + T + A E G+P +
Sbjct: 929 EP--FQVP-----SEAPRPLSKSLHFQKIHNPHVAKTANPETEV-AADAESAKRGSPMRA 980
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
I N+ G FL G P + + + L + + + H C+ GFIYV
Sbjct: 981 IA---NVGGLSSVFLPGDSPSFVVKSSKSTPRVVGLRGHGVRSLSGFHTEGCDRGFIYVD 1037
Query: 974 SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL--IVSVPVLKPLNQ 1031
S+GI ++ QL + + + + + +TY K++Y + +V P P +
Sbjct: 1038 SKGIARVSQLEPETNVTDIGLTLRKVKIGEEVQAVTYHPPKDVYVIGTVVKEPFELPKDD 1097
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---------VRILEPDRAGGPWQTRA 1082
D HR + E+ +++L P W
Sbjct: 1098 ----------------------DYHREWAKEDITFKPLTGRGFLKLLNPSN----WSVID 1131
Query: 1083 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---STGRN 1138
+ + S E + ++ + L + T E + L+ +GTA +GED+A RGRV ++ +
Sbjct: 1132 KVELDSHEIIMCIKTLNLEVSENTHERKQLITVGTAISKGEDLAIRGRVYVYEVITVVPF 1191
Query: 1139 ADNPQ-NLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAF 1191
D P+ N ++ +KE +GAI+ ++ + QG +++A G K ++ K GT L +AF
Sbjct: 1192 PDRPETNKKLKLIAKEEIPRGAITGISEIGTQGFMIVAQGQKSMVRGLKEDGTLLP-VAF 1250
Query: 1192 YDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
D YV ++ + L D K ++F + E+ ++ + K ++ + L
Sbjct: 1251 IDMN-TYVTTVKSLPGTGMCLFADAIKGVWFAGYSEEPYKMTIFGKQSQGMEVITADLLP 1309
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
G L ++V+D N+ + + P+ +S GQ LL R F +G H+ + L L T++
Sbjct: 1310 IGDELYIIVADSDCNLHVLQFDPEHPKSLHGQLLLQRTTFSLGGHMPTTMTLLPLTTTTQ 1369
Query: 1310 RTGAA---PGSDKTNRFALLFGTL-DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
A S+ TN + L TL G + + PL E +RRL +L L + + H GL
Sbjct: 1370 TPTPAVTSTASEPTNPASGLLMTLSSGVVAILTPLSEQQYRRLNALSNHLSNLLYHPGGL 1429
Query: 1366 NPRSFRQFHSNGKA---HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
NP++ R ++ +A RP IVD +L + L +++ E+A + G I +L
Sbjct: 1430 NPKAHRISNTAPEAVIGGRP----IVDGSVLWRWLELGSQKRAEVAGRVGVDGETIREDL 1485
Query: 1423 NDLALGTSFL 1432
++A G +L
Sbjct: 1486 QEIAAGLGYL 1495
>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
Length = 1405
Score = 257 bits (656), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 332/1424 (23%), Positives = 585/1424 (41%), Gaps = 222/1424 (15%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++++ ++ V G E++N+ E L S A L LV
Sbjct: 28 NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ-------GGSGL 213
W + +F + DP RC + + IL Q S
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200
Query: 214 VGDEDTF----GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
G +++ + G + SS V+ L +LD + H F+H Y EP I+
Sbjct: 201 DGPKESKLEQQAASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAASR 260
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T ++ + S ++ K + S LP+D +++ +PSPIGG L+VG+N
Sbjct: 261 DTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVGSN 320
Query: 328 -TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTG 384
IH + +A+N +A + S +S ++ L+ L ++ L+ G
Sbjct: 321 EIIHVDQGGKTSGVAVNEFAKTCTSFPLSDQSDMALRLEGCSVELLSHEAGDVLIVLNNG 380
Query: 385 DLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L++LT DGR V + + S + +G F+GS GDS+++ +
Sbjct: 381 RLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVMLGW 440
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTES 496
T +S L+ + + D D D+ N ++ +A+ + +
Sbjct: 441 TS------TASHLRRKQSNANIDTDEDMSDEEDMDDMEDDLYNDTAPAVQKITAAASEPT 494
Query: 497 AQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL--- 541
A T++F + D L +I P+K+ + G + ++ A N EL
Sbjct: 495 APGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEIMLSTGRGAAAAITALNRELHPV 554
Query: 542 ----VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR----- 586
+LP +G W V+ + + D A + +Y YL++S
Sbjct: 555 TAATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGEDGTES 614
Query: 587 TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
T+V E + LTE + +G T+ G L +V+QV R D S + D
Sbjct: 615 TVVYEVNGNELTETDKGDFEREEGSTLFVGILAAGTKVVQVMRTEIRTYD-SELNMDQIL 673
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ ESG+ V++ S ADPY+L+ D S+++ A + +
Sbjct: 674 PMEDEESGN---ELNVINASFADPYLLVLREDSSVKIFR-------------ATGDGELE 717
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
V + L S WLS + ++ ++++ + G L +
Sbjct: 718 DVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEGGLRV 756
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKENIHSM 822
F V + V + +S ++ Y+ R A+K + TEI
Sbjct: 757 FAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI------------------- 797
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
LA A P L + I+ Y+A+ S S S +++
Sbjct: 798 ----LAADLGDATTKSPHLIIRTSSDNIVIYKAF----------------HSPSRSAADL 837
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRE 941
LR ++ S+ + YT + + + + +I G+ F G+ P + +F+E
Sbjct: 838 WTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLALSDIGGYSTVFQRGTTPAF--IFKE 895
Query: 942 RLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
P+ L + + T H +C GF Y+ S L+I QLP + Y + +
Sbjct: 896 SSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRR 954
Query: 999 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1058
+P+ A H + Y + LY + P L+ E H + L D+
Sbjct: 955 MPMDAEIHALAYHS-SGLYIIGAGQPEEYQLDP-------SETYH----YELPKEDMSFK 1002
Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1117
T+E +++L+ W T + E L+++ + L + T + + L+A+GTA
Sbjct: 1003 PTIERGIIQLLDEKT----WAIIDTHVLDPQEVVLSIKTLNLEVSENTHQRKDLIAVGTA 1058
Query: 1118 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1170
+ GED+A +G + +F P T + E+KGA+SA++ L QG +++
Sbjct: 1059 ILHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFMIM 1118
Query: 1171 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQ 1226
A G K ++ K GT L +AF D YV L + + + D ++ ++F + E+
Sbjct: 1119 AQGQKCMVRGLKEDGTLLP-VAFMDM-QCYVSDLKNLPGTGMLAMSDAYRGVWFTGYTEE 1176
Query: 1227 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
+++L A+ SL+ A +F+ L L+V+D N+Q+ + P +S G +LL +
Sbjct: 1177 PYRMSLFARSKHSLEAIAVDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSRLLHK 1236
Query: 1287 AEFHVGAHVTKFL-----RLQMLATSSDRTGA-------------APGSDKTNRF-ALLF 1327
+ FH G H L RL+M ++SD GA +P T +L
Sbjct: 1237 STFHTG-HFPATLHVVHSRLKM-PSASDFAGANNTENGDFEMDTSSPDDKATQPLHQILC 1294
Query: 1328 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
T G++ + PL E T+RRL +L L +++ AGLNPR+FR
Sbjct: 1295 TTQSGTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFR 1338
>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
Length = 1437
Score = 256 bits (655), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 334/1472 (22%), Positives = 615/1472 (41%), Gaps = 194/1472 (13%)
Query: 57 NLVVTAANVIEIYVVRVQE------------------------EGSKESKNSGE------ 86
N+VV +N++ I+ VR + EG E SGE
Sbjct: 45 NVVVARSNLLRIFEVREEPAPFSTQKEDERDRRASMRKGTEAVEGEVEMDASGEGFVNMG 104
Query: 87 ----TKRRVLMDGISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFED 139
T + ++ + L+ +RLHG +E + I++ ++S D ++++F+D
Sbjct: 105 SVKSTGQNGILHQPTVNRFYLIREHRLHGIVTGIEGVRIIT--SIEDSF--DRLLVSFKD 160
Query: 140 AKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGV 194
AKI++LE+ +++H L S+H +E +P+ + + PL ++ DP RC +
Sbjct: 161 AKIALLEWSEAMHDLITVSIHTYERAPQLMAID--------APLFRSQLRADPLSRCAAL 212
Query: 195 LVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIF 251
+ + IL Q + L D + S +++L D +++V DF+F
Sbjct: 213 SLPKDSIAILPFYQSQAEL--DIMEHETSQARDVPYSPSFILDLSADVDTRIRNVIDFVF 270
Query: 252 VHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKL 311
+ G+ P + +L + + TW GR+ T + ++ + +P+I + LPHD + +
Sbjct: 271 LPGFNSPTIAVLFQYQQTWTGRLKEYKDTVGLILFTLDLVTRHYPVITAIDGLPHDCFAM 330
Query: 312 LAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFS-------VE 363
+ +GGV+V+ +N+I Y Q+ L ++ + L +LP S S ++
Sbjct: 331 APCSTALGGVVVLASNSIIYVDQATRRVILPVSGW---LPRISDLPIPSLSHQDQQRDLQ 387
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDITTIGNSL 422
L+ + ++ + + K G + + ++ DG+ V RL ++ + + S + + +
Sbjct: 388 LEGSQFVFVDDRTLYVVLKDGTVYPVEIIVDGKTVSRLSMAPPVARTTMPSLVRKMQDDY 447
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD--APSTKRLRRSSSDALQDMVN 480
F+GS +G S+L++ T G E + A AP+ D
Sbjct: 448 LFVGSIIGPSVLLKTT---RVEEDIEGDDVEMASVPATVVAPNNAMDLDDDDDLYGGSAV 504
Query: 481 GEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASAT 531
E+ + G N + + K + DSL GP+ D ++ L N D +AT
Sbjct: 505 IEQPHMNGITQNGSTAISKKRTVVQLSFCDSLPAYGPIADMTFTLAKNGDRAVPELVAAT 564
Query: 532 GISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
G + L++ LP G +GIW++ + + N + A + YHA
Sbjct: 565 GSGMLGGFTLLQRDLPTRTKRKMHAIGGGRGIWSLLVRQAVKVNGSTYERPA--NPYHAE 622
Query: 579 ---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
++IS +A + A + + + G TI A F ++ V +
Sbjct: 623 NDSIVISTDANPSPGLSRIASRNAQGDIQITTRIPGTTIGAAPFFQGTAILHVMINVTNV 682
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
+ + D + + + SI DP+VL+ D SI L +G+ +
Sbjct: 683 I--RVLEPDGTERQVIKDWDGNIPRPKIRFCSICDPFVLIIRDDDSIGLFIGESERGKIR 740
Query: 694 VQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
+ + + E + + ++ C + D + + + A + + G Q
Sbjct: 741 RKDMSPMGEKTSRYLAGC-FFTDTSGIFQVHQNAQAAGIEGATSTLQSVMNAGNRTQ--- 796
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++C G +EI+ +P F+ + + D Y AL S ++
Sbjct: 797 WLILCRPQGVIEIWTLPKLGLAFSTTHAAGLESVLTDLYDPPAL--------SVPQDPPR 848
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+ ++ +I + V L RP L L G + Y+ + P P +
Sbjct: 849 KPQELDIEQLLVAPLG-----ESSPRPHLMLFLRSGQLAVYEVHSTPVPAEPL----PAA 899
Query: 873 TSRSLSVSNVSA-SRLRNLRFSRTPLDAYTREETPHGAPCQRIT-----IFKNISGHQ-- 924
S +L V V SR N++ S + E+ +RI+ + S Q
Sbjct: 900 RSSTLLVKFVKVLSRAFNIQHSDEVEKSVLAEQ-------KRISHLLIPFATSPSPGQTF 952
Query: 925 -GFFLSGSRPCWCMVF-RERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKIC 981
G FL+G RP W + + ++V P S+V AFT + F+ + +G +
Sbjct: 953 SGVFLTGDRPSWLLCTDKGGVKVLPS--GHSVVHAFTASSVWESKNDFLLYSEEGPSLME 1010
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE- 1040
LP D + +P ++ P +Y S V P ++ Q+
Sbjct: 1011 WLPD-VQLDGH------LPSRSVPRPRSY-----------SNVVYDPSTSLIVAASSQQS 1052
Query: 1041 --VGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ D + + D + ++ + E + ++ P+ W T +E +
Sbjct: 1053 KFASYDEDGNIVWEPDTNISFPSCECSALELISPEG----WVTMDGYEFAQNEFVNCLDC 1108
Query: 1098 VTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-- 1154
+TL +T+ + +A+GT +GED+A +G V +F + + + +Y +L
Sbjct: 1109 ITLETMSTETGTKDFIAVGTTINRGEDLAVKGAVYIFEIVEVVPDTNSGLKRLYRLKLQC 1168
Query: 1155 ----KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFI 1209
KG ++AL + +L+ + G KI + + E L G+AF D ++V SL VKN +
Sbjct: 1169 RDDAKGPVTALCGMDNYLVSSMGQKIFVRAFDLDERLVGVAFLDVG-VFVTSLRSVKNLL 1227
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD KS++F++++E +L +L KD + + +SL+V DE I++
Sbjct: 1228 VIGDAVKSVWFVAFQEDPYKLVILGKDPYHTCVTCADLFFAENRVSLLVCDEDGVIRLLE 1287
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
Y P ES GQ LL R EFH T++ ++A D+ P + L+ G+
Sbjct: 1288 YDPHDPESRGGQHLLRRTEFH---GQTEYRTSVLIARRKDKDIDIPQA------KLVCGS 1338
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
DGS+ ++E F+ L LQ +L +V HVAGLNPR+FR N RP I+D
Sbjct: 1339 TDGSLVSFTFVEEAAFKGLHLLQGQLTRNVQHVAGLNPRAFRIVR-NDYVSRPLSKGILD 1397
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
LL+ +E LP+ Q E+ Q GT R+ +L +
Sbjct: 1398 GNLLTTFEELPIARQNEMTRQIGTERATVLKD 1429
>gi|281205270|gb|EFA79463.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
Length = 1395
Score = 255 bits (652), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 209/734 (28%), Positives = 340/734 (46%), Gaps = 109/734 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV+ +++++Y +R ++ + +++ D + LEL +L +ESL
Sbjct: 31 NLVIAKTSLLQVYTIRYDRIEQQQQQQQQTNEQQSQQDTLKPW-LELNLELQLFSIIESL 89
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G D DS+IL+F DAK+S+++++ + L I S+H FE LK GR++
Sbjct: 90 NCVRLPGDD----IDSLILSFRDAKVSIVKYNKATEKLDIRSLHYFEGNS--ELKGGRKT 143
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG-------------------DE 217
F PL++VD Q RC +L+Y + +L + S L DE
Sbjct: 144 FRTPPLIRVDYQQRCAVMLLYDRHLAVLPFPRSFSILDDEEEEEEEEAAVVADQQQQHDE 203
Query: 218 D-----------TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
+ S + S+VI+L L +++VKDF F+H Y EP ++ LHE
Sbjct: 204 NEQQQPQDDQQQQQTSEKNKKKKQSESYVISLNSLGIENVKDFCFLHTYYEPTLLFLHEP 263
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
TW R+S K T +++A+S++ +Q P+IWS +LP++ +L+ VP P+GG +V+
Sbjct: 264 SQTWTSRISSKKFTNVLTAVSLNIAQRQQPVIWSIEHLPYNCERLVPVPDPLGGAMVLTP 323
Query: 327 NTIHYHSQSASCALALNNYA-VSLDSSQELPRSSFSVE----LDAAHATWLQNDVALLST 381
N + Y +QS+ L N YA + + P S S LD A+ +L D L S
Sbjct: 324 NILFYFNQSSRYGLECNEYAQIDTGDQFQFPIDSSSTNLVFTLDCANFIFL-GDRLLGSL 382
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--- 438
K G+L++ ++ DGR VQR+ ++K SVL+S + ++L FLGSRLGDSLL+Q+T
Sbjct: 383 KGGELLIFHLISDGRNVQRISITKAGASVLSSTSCVLTDNLLFLGSRLGDSLLLQYTEKI 442
Query: 439 ----CGSGTSMLSSGLKE----EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
LS+ K+ E D+ D + S +D + +E ++
Sbjct: 443 IDVDSSDNVENLSNPYKKKKTSEVFDLFDDEERNSKTGASDADGNGQSLFDDEDDIF--- 499
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN-ADASATGISKQSNYELV------- 542
N+ ++ K++ + D + NIGP+ D G+ + A S +Q + ELV
Sbjct: 500 -NDKKNQLKSYRLNICDHITNIGPVSDLITGVSYDHASVSNDESFEQRSLELVACSGHGK 558
Query: 543 -------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
ELPG + WT+Y+ + S + +
Sbjct: 559 NGALTILQYGVRPELNTSFELPGVRQSWTLYYDDPLAASQSGSSASNAAASAASKKRQHE 618
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
E T+V +T L EV + TI N+FGRRR+ V + G ++L G S +TQ+
Sbjct: 619 EDSTLVFQTGGQLKEVAK-----FDHATITVANMFGRRRIALVHQNGIKLLSGHSNITQE 673
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS-TCTVSVQTPAAIE 701
+ +V I DPYVL+ DG+I L G+ T + + P
Sbjct: 674 IKL-------------KSVKMAYIVDPYVLILHKDGTISLYQGNTGITQLLEYELPQP-- 718
Query: 702 SSKKPVSSCTLYHD 715
K V SC+++HD
Sbjct: 719 --KDGVMSCSMFHD 730
Score = 193 bits (491), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 135/446 (30%), Positives = 224/446 (50%), Gaps = 59/446 (13%)
Query: 823 KVVELAMQRW-SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
K+VE+ + ++ HS P+L + G IL Y+A K D + ++ L
Sbjct: 872 KIVEIVIHYLHNSPHSSPYLMILNEFGDILIYKAI---------KYKDSMDNTKEL---- 918
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPC 934
+R ++ + L + RE + P ++I F NI GH+G F+ G R
Sbjct: 919 -----IRFIKHTDQNLHSKQREYSYGIDPSSESSFYIRKIVAFDNIGGHKGVFMCGKRSL 973
Query: 935 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
W + LR HP + +FT HN+NC++GFIY T +G+L+I QL + ++N W
Sbjct: 974 WFFCEKNYLRAHPMNFKDPVTSFTCFHNINCSYGFIYFTEKGVLRINQLSNMMNFENEWA 1033
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
++K IPL+ T H+I++ E Y L++S P Q D
Sbjct: 1034 IRK-IPLRMTCHKISFHQEFKCYVLVISYP----------------QAPQSDEEEEEKEK 1076
Query: 1055 LHRTYTVEE-YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL--FNTTTKENETL 1111
+ +EE ++V++++P W + M E L ++V L + + +
Sbjct: 1077 SKKPLILEEKFQVKLIDPSMN---WSIVDSFSMSEKETVLCAKIVHLKYADVDGIKLKPY 1133
Query: 1112 LAIGTAYVQGEDVAARGRVLLF------STGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
L +GTAY GED +GR+L+F + + + +Y K+ KG ++ALA L
Sbjct: 1134 LCVGTAYTHGEDTVCKGRILVFEIISHREVQDDTGEEKKRLNLLYEKDQKGPVTALAGLN 1193
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
G LL++ GPK+I++ ++ L GIAFYD +++VSL+ VKN+IL+GD++KS+ F K+
Sbjct: 1194 GLLLMSIGPKLIVNNFSSGSLVGIAFYDT-QIFIVSLSTVKNYILVGDMYKSVSFFKLKD 1252
Query: 1226 QGAQLNLLAKDFGSLDCFATE--FLI 1249
Q QL LL KD+ ++ F+ + FL+
Sbjct: 1253 Q-KQLILLGKDYEEMNTFSNQVHFLV 1277
>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
C-169]
Length = 1137
Score = 254 bits (649), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 197/628 (31%), Positives = 294/628 (46%), Gaps = 81/628 (12%)
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
PFL +L DGT L Y+A F P V RL + P
Sbjct: 553 PFLLLLLADGTFLAYRA--FHTPRG-----------------RVCFKRLSLPAHAHCPPQ 593
Query: 899 AYTREETPHGAPCQRITIFKNISGHQ-----GFFLSGSRPCWCMVFRERLRVHPQLCDGS 953
+ T AP +T F + + G F+SG RP W + R L H +G
Sbjct: 594 DRRSKTT---APSSSMTRFDGLGESKEHVNSGMFVSGERPLWLVASRGTLVAHAMDVEGR 650
Query: 954 IVAFTVLHNVNCNHGFIYV----TSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
+ T HN+NC GFI LKICQLP + D WP+QK I ++ATPH++
Sbjct: 651 VSGMTPFHNINCPLGFITACMAENDGETLKICQLPMRTRLDTPWPLQK-IAVRATPHRLA 709
Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS---VDLHRTYTVEEY-- 1064
Y+AE LY L+VS PV P + QE D H S D + E
Sbjct: 710 YYAEARLYVLLVSRPV--PYRE------HQEEASDGDPHASYSYICADAAAKASGTELGG 761
Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV 1124
EVR+LEP R +QT A + E +V L N T E + +GTA GED
Sbjct: 762 EVRLLEPGR----YQTVARHALDPGEEPCSVAADWLRNAQTGALEPYITVGTALNYGEDY 817
Query: 1125 AARGRVLLFSTGRNADNPQN--------LVTEVYSKELKGAISALASLQGHLLIASGPKI 1176
GR+LLF R + + +T V++ + LA + G L+ A G +
Sbjct: 818 PCSGRILLFKATRTSTSGAEQADPTISWQLTLVHASGFSRPVQGLAVMDGRLVAAVGNNM 877
Query: 1177 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG--AQLNLLA 1234
+ + G+ L+ I+F+ A L++ S+ +K FILLGD+HK + F+ ++ L L+
Sbjct: 878 QVMELRGSSLHMISFFHA-QLFITSVATIKTFILLGDVHKGLTFVYADKKANYTALTQLS 936
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY--APKMSESWKGQKLLSRAEFHVG 1292
KD+ +D A EFL++G L L+ D +N+++F Y + +W+G+KLL HVG
Sbjct: 937 KDYNDVDVEAAEFLVNGKKLFLLACDAAQNLRLFAYDGGKEQQATWQGKKLLPLGAIHVG 996
Query: 1293 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSL 1351
++ L ++ T + TG A +FG+ GSI +AP D L L +L
Sbjct: 997 QNICSSLSHRI--TPASATG-------VQLRAAVFGSAAGSIASLAPTWDGLPAEELLAL 1047
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG---------PDSIVDCELLSHYEMLPLE 1402
Q+++V +VP VAGLNP SFR+ + +G G D ++D + L+ ++ LPL
Sbjct: 1048 QREMVLAVPQVAGLNPVSFRRRYKHGVKALAGGQSFEAPVSDDRVLDLDQLNRFQWLPLT 1107
Query: 1403 EQLEIAHQTGTTRSQILSNLNDLALGTS 1430
EQ+ +A + +R Q+L L ++ + S
Sbjct: 1108 EQVALAAKCNLSRQQVLHALREMVMAIS 1135
Score = 171 bits (433), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 148/477 (31%), Positives = 227/477 (47%), Gaps = 68/477 (14%)
Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
S + +S+++ L L + V+D +F+H Y EPV+++LHE + +W G++ T ++
Sbjct: 18 ALSTTVGNSYMLKLAKLGISEVRDAVFLHRYSEPVLLVLHETKPSWGGQLRNSKDTMEVT 77
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A S++ K+H +WS NLP DA+KL+ VP GG LV+ N + Y SQ A+ A A
Sbjct: 78 AFSLNVAHKRHTRLWSIGNLPSDAFKLIEVPG--GGGLVICQNLLIYVSQEAAAAAASGA 135
Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
F ++L WL ++ LL +G L+L+ V +G +RL +S
Sbjct: 136 PRA----------EGFELDLTDCSGAWLADNSLLLGLASGQLILVNVQLEGS--KRLKVS 183
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
K + S + +G L FLGS + +SLL++ G ++L G +E+ EADA
Sbjct: 184 KAQGAPPPSCMCRLGPELLFLGSWVANSLLIR-AVPEGQTLLLGGPEEQAS--EADATHA 240
Query: 465 KRLRRSSSDALQDMVN--GEELSLY-----GSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ R DA D+ N +E+SL A +T A K +S V DSLV+IG ++D
Sbjct: 241 SKRPRLDPDA-ADLGNEDEDEVSLIYRTDAQPALPSTTGASK-YSLQVVDSLVSIGIVQD 298
Query: 518 FSYGLRINADASATGISKQSN-------------------------YEL---VELPGCKG 549
G + A ++K EL V LPG
Sbjct: 299 LVTG-EASTSAPQEWVAKTERGPPKLLAAVGSDKFGAVAVLRSSLVPELVTEVPLPGVDQ 357
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES-VDYFVQ 608
+W V H G D S + YHA+L ++ ++ T VL T + L E S VD+ +
Sbjct: 358 MWAV-HFQPEGLPVDDSLL------YHAFLFLNEKSGTKVLRTGEELDETDSSQVDFILS 410
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
RT+ AGNL G R++QV RG +L GS QDL + G N+T+++ S
Sbjct: 411 SRTVFAGNLLGNSRIVQVHARGVVLLSGSSRVQDLPV-----QDLIGVSNTTIVAAS 462
>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1439
Score = 253 bits (647), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 351/1471 (23%), Positives = 583/1471 (39%), Gaps = 247/1471 (16%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
Y R+ ++ ES G V D L LV Y + G V LA + S+ G
Sbjct: 66 YDRRLNDDDGLESSFLGGDGMLVRADRAVNTKLVLVAEYPIFGVVTGLARIKIQHSKSGG 125
Query: 125 DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
+ ++++A A++S+++++ H L S+H +E E + S GPL
Sbjct: 126 E------ALLIATRVARLSLVQWNSEKHALEDISIHYYEKEEL------QGSPFDGPLAN 173
Query: 183 ----VKVDPQGRCGGVLVYGLQMI-ILKASQG--------------GSGLVGDEDTFGSG 223
+ DP RC L +G + I L Q G + T +
Sbjct: 174 YRTHLAADPGSRCAA-LSFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPPTTAAT 232
Query: 224 GGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G S +S+V+ L LD + H F+H Y EP I+ +
Sbjct: 233 NGTSNIADVPYSTSYVLPLPQLDPSLLHPVYLAFLHEYREPTFGIISSTQRRSNTLPRKD 292
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
H + + L + + I S NLP D +K++A+P P+GG L+VG N IH
Sbjct: 293 HFSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVVALPGPVGGALLVGTNELIHIDQSGK 350
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYD 394
+A+N + + +S + L+ H + +N L+ G L ++T D
Sbjct: 351 PNGVAVNAFTKETTNFPLADQSDLDLRLEHCHIELMSAENGELLMVLSDGRLAIITFKID 410
Query: 395 GRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
GR V + + V S I+ + ++FF+GS DSL++ +T
Sbjct: 411 GRTVSGVSVKPVAAEVGGNIVQCSVSTISKLSRNVFFVGSTGSDSLVLGWT--------- 461
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---------------LYGSASN 492
A + +R R D+ + + E++ + A+
Sbjct: 462 ----------RKQAQNARRKTRLVDDSFEYDLEDEDMDDGDDDDLYGETTTTMIQPGATA 511
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLR-INAD------------------------ 527
N S +F V DSL++I P+KD + G + N D
Sbjct: 512 NGVSKGGDLTFRVHDSLLSIAPVKDMTSGKQAFNPDSEEANNSVGVVADLQLACVVGRGN 571
Query: 528 ASATGISKQSNYELV----ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHAYLII 581
A A I Q+ V E P +G WT V + D AA E+ A
Sbjct: 572 AGAVAILNQNIQPKVIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVGSEFDAS--- 628
Query: 582 SLEARTMVLETADLLTEVTESVDYFV-----------------QGRTIAAGNLFGRRRVI 624
S+ + M++ DL + E+ D + G T+ AG + R+I
Sbjct: 629 SIYDKFMIVSKVDL--DGYETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRII 686
Query: 625 QVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
QV + R DG ++Q L + E+G+ V+S SIADPY+LL D SI +
Sbjct: 687 QVLKSEVRCYDGDLGLSQILPM--LDEETGA---EPRVVSASIADPYLLLVRDDSSIMVA 741
Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGAD 743
D + V+ S K ++ C LY D +TG +
Sbjct: 742 QIDNNCELEEVEKQDDAILSTKWLAGC-LYAD----------------TTGRFAPVQTDK 784
Query: 744 GGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEI 803
G P Q +I+ + +GAL I+ +P+ + V +G T++
Sbjct: 785 GTPEGQ-NIFMFLLSAAGALYIYALPDLSKPVYV---AAGLTYVPPLL----------SA 830
Query: 804 NSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPEN 863
+ + GT Q E + + V +L + P+L + + Y+ E +
Sbjct: 831 DYAVRRGTVQ---ETLTELLVADLG----DTTTTSPYLILRHANDDLTIYEPIRLESQDK 883
Query: 864 TSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNIS 921
T V S++L ++ N +++P++ E E P P + NI+
Sbjct: 884 T------VGLSKTLHFQKIT-----NPALAKSPVEVADDEANEQPRFVPLRPC---PNIN 929
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G+ FL G+ P + + + L + + H C GFIY S+G ++
Sbjct: 930 GYSTVFLPGASPSFIIKSSKSSPKVIGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVT 989
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
QLP+ + + + IP+ I Y Y + SV L E+
Sbjct: 990 QLPADTNFTELGVAVRKIPIGDNVGLIAYHPPMETYAVACSV------------LERFEL 1037
Query: 1042 GHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
D H + + +Y E ++++ P W T+ ++ E A+ ++ + L
Sbjct: 1038 PKDDDYHKEWAKEATTSYPQTERGIIKLMSPTT----WSVIDTVELEPHEVAMCMKTLHL 1093
Query: 1101 -FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-- 1153
+ TKE L+ IGTA +GED+ RGR+L++ P N ++ +KE
Sbjct: 1094 EVSEETKERRMLITIGTAINRGEDLPIRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEI 1153
Query: 1154 LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--N 1207
+GA++ L + QG +L+A G K ++ K GT L +AF D YV ++ V+
Sbjct: 1154 PRGAVTGLCEVGSQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDM-NCYVTAVREVRGTG 1211
Query: 1208 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1267
+ L+ D K ++F+ + E+ ++ L K G + +F+I G L +VV D+ I +
Sbjct: 1212 YCLMTDAFKGVWFVGYAEEPYKMMLFGKSTGKFEVLTADFIIAGDELHIVVCDKDGVIHV 1271
Query: 1268 FYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1326
+ P+ +S +G LL+RA F H T L L S+ T A T LL
Sbjct: 1272 MQFDPEHPKSLQGHLLLNRASFSAAPNHPTTTLSLPRTPASTATTSATKNPPTT----LL 1327
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD- 1385
+ G++ + PL E +RRL SL + ++PH A NP++ R + A PG D
Sbjct: 1328 LASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAAATNPKAHRLQPLD--ARTPGVDT 1385
Query: 1386 ----SIVDCELLSHYEMLPLEEQLEIAHQTG 1412
SIVD LL+ + L + E+A + G
Sbjct: 1386 SAGRSIVDGALLARWNELGAGRRSEVAGKGG 1416
>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
P131]
Length = 1475
Score = 247 bits (630), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 341/1516 (22%), Positives = 605/1516 (39%), Gaps = 234/1516 (15%)
Query: 57 NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
NLVV +++++I+ R+ + +G+ +S + L D G+ A+
Sbjct: 51 NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 110
Query: 100 ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
L LV + L G + LA + S D +++AF+DAK+S++E
Sbjct: 111 RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 170
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
+D L S+H +E E + S PL + DP RC L +G +
Sbjct: 171 WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 223
Query: 201 MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
+ + + G +G +D T + G +E S V+ L
Sbjct: 224 SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 283
Query: 240 DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+LD + H F++ Y EP IL +T + ++ K H + ++ K
Sbjct: 284 NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 341
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
I S LP D +++A+P+P+GG L+VG+N IH + +A+N S S
Sbjct: 342 ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 401
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNPSV---- 410
+S + L+ L + D L T+V+ DGR V L + P
Sbjct: 402 QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 461
Query: 411 ---LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S +T +G + F GS GDS++ + S K + D + D
Sbjct: 462 LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQV---SKRKPKIQDPDLDLDIDYDD 518
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
D D+ E + ++++ E+ F V D +V+I P++D ++G
Sbjct: 519 LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDRMVSIAPIRDVTFGKPPPPT 578
Query: 523 ---RINADASAT----------GISKQSNYELV------------ELPGCKGIWTV---- 553
R D +A G K S+ ++ E P +G+WT+
Sbjct: 579 DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 638
Query: 554 -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
K + N + AA + +Y Y+I++ E ET+D+ +
Sbjct: 639 PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 697
Query: 607 -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
G TI AG + ++IQV + R DG +TQ + + E+G
Sbjct: 698 TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 752
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
S SIADPY+L+ D S + + + ++ I SS K + C LY D
Sbjct: 753 ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 806
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
S G A P I + +GAL I+ +P+ +
Sbjct: 807 -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 847
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L +++ G R E I + V +L + + H
Sbjct: 848 -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 895
Query: 841 LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
IL ++ + Y+ Y ++D S ++ L + +L N ++ P +
Sbjct: 896 --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRLR-----KLPNPAVAKAP-E 939
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
A E+ P + + NI+G+ F+ G P + + + + P+ L +
Sbjct: 940 ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 996
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
A + H C GFIY S G+ ++ Q+P +++ K +PL I Y +
Sbjct: 997 AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGIAYHSPTG 1056
Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1075
+Y L S + L D + + N+S + VE ++++ P
Sbjct: 1057 VYVLTCS------YWEPFELPKDDDYHCEWAKENISFKPM-----VERSVLKVINPIN-- 1103
Query: 1076 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1133
W T + E A+ +R + L + +T E L+ +GTA +GED+ RG + +F
Sbjct: 1104 --WSDIWTEEFEQHEVAMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFD 1161
Query: 1134 ------STGRNADNPQNLVTEVYSKEL-KGAISALASL--QGHLLIASGPKIILHKWT-G 1183
GR + + + +V +E+ +GA+++L+ + QG +++A G K ++
Sbjct: 1162 LASVVPQKGRPETDKK--LKQVAKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQED 1219
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
+L +AF D YV + + ++ D K ++F + E ++ L K +L+
Sbjct: 1220 GKLPPVAFMDMN-CYVTCVKELAGTGLCVMADAFKGVWFCGYTEGPYKMMLFGKSSTNLE 1278
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1301
C + L DG L +V +D N+ + + P+ +S +G LL+R F GAH + +
Sbjct: 1279 CMNVDLLPDGKDLLIVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ--KS 1336
Query: 1302 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1361
+L T+ R S R +L + G + + PL + T+ RL +L L+ SVPH
Sbjct: 1337 LLLPTTDPRPSTNQPSSDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPH 1396
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1416
A LNP+++R ++ + D ++VD LL+ + L + E+A + G + +
Sbjct: 1397 HAALNPKAYRLPPTSTRNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAGFSGA 1456
Query: 1417 QILSNLNDLALGTSFL 1432
+ + + LG S +
Sbjct: 1457 AEVRSELEAVLGWSIM 1472
>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
Length = 1452
Score = 246 bits (629), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 341/1516 (22%), Positives = 605/1516 (39%), Gaps = 234/1516 (15%)
Query: 57 NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
NLVV +++++I+ R+ + +G+ +S + L D G+ A+
Sbjct: 28 NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 87
Query: 100 ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
L LV + L G + LA + S D +++AF+DAK+S++E
Sbjct: 88 RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 147
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
+D L S+H +E E + S PL + DP RC L +G +
Sbjct: 148 WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 200
Query: 201 MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
+ + + G +G +D T + G +E S V+ L
Sbjct: 201 SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 260
Query: 240 DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+LD + H F++ Y EP IL +T + ++ K H + ++ K
Sbjct: 261 NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 318
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
I S LP D +++A+P+P+GG L+VG+N IH + +A+N S S
Sbjct: 319 ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 378
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNPSV---- 410
+S + L+ L + D L T+V+ DGR V L + P
Sbjct: 379 QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 438
Query: 411 ---LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S +T +G + F GS GDS++ + S K + D + D
Sbjct: 439 LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQV---SKRKPKIQDPDLDLDIDYDD 495
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
D D+ E + ++++ E+ F V D +V+I P++D ++G
Sbjct: 496 LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDLMVSIAPIRDVTFGKPPPPT 555
Query: 523 ---RINADASAT----------GISKQSNYELV------------ELPGCKGIWTV---- 553
R D +A G K S+ ++ E P +G+WT+
Sbjct: 556 DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 615
Query: 554 -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
K + N + AA + +Y Y+I++ E ET+D+ +
Sbjct: 616 PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 674
Query: 607 -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
G TI AG + ++IQV + R DG +TQ + + E+G
Sbjct: 675 TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 729
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
S SIADPY+L+ D S + + + ++ I SS K + C LY D
Sbjct: 730 ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 783
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
S G A P I + +GAL I+ +P+ +
Sbjct: 784 -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 824
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L +++ G R E I + V +L + + H
Sbjct: 825 -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 872
Query: 841 LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
IL ++ + Y+ Y ++D S ++ L + +L N ++ P +
Sbjct: 873 --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRLR-----KLPNPAVAKAP-E 916
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
A E+ P + + NI+G+ F+ G P + + + + P+ L +
Sbjct: 917 ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 973
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
A + H C GFIY S G+ ++ Q+P +++ K +PL I Y +
Sbjct: 974 AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGIAYHSPTG 1033
Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1075
+Y L S + L D + + N+S + VE ++++ P
Sbjct: 1034 VYVLTCS------YWEPFELPKDDDYHCEWAKENISFKPM-----VERSVLKVINPIN-- 1080
Query: 1076 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1133
W T + E A+ +R + L + +T E L+ +GTA +GED+ RG + +F
Sbjct: 1081 --WSDIWTEEFEQHEVAMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFD 1138
Query: 1134 ------STGRNADNPQNLVTEVYSKEL-KGAISALASL--QGHLLIASGPKIILHKWT-G 1183
GR + + + +V +E+ +GA+++L+ + QG +++A G K ++
Sbjct: 1139 LASVVPQKGRPETDKK--LKQVAKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQED 1196
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
+L +AF D YV + + ++ D K ++F + E ++ L K +L+
Sbjct: 1197 GKLPPVAFMDMN-CYVTCVKELAGTGLCVMADAFKGVWFCGYTEGPYKMMLFGKSSTNLE 1255
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1301
C + L DG L +V +D N+ + + P+ +S +G LL+R F GAH + +
Sbjct: 1256 CMNVDLLPDGKDLLIVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ--KS 1313
Query: 1302 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1361
+L T+ R S R +L + G + + PL + T+ RL +L L+ SVPH
Sbjct: 1314 LLLPTTDPRPSTNQPSSDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPH 1373
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1416
A LNP+++R ++ + D ++VD LL+ + L + E+A + G + +
Sbjct: 1374 HAALNPKAYRLPPTSTRNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAGFSGA 1433
Query: 1417 QILSNLNDLALGTSFL 1432
+ + + LG S +
Sbjct: 1434 AEVRSELEAVLGWSIM 1449
>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
Length = 2290
Score = 244 bits (623), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 168/544 (30%), Positives = 263/544 (48%), Gaps = 69/544 (12%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD--GSIVAFTVLHNVNCNHGFI 970
R+ G +G ++ +P + R R+HP D + + +N+ C G +
Sbjct: 1068 RLMPLGGAGGLEGVLIAARQPAVVLFGRGLPRIHPWKLDRGEGVRSAARFNNLQCKDGIV 1127
Query: 971 YVT------SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP 1024
+ ++G+LKIC +P G + D WP+ + + T H + + A + L+VS
Sbjct: 1128 CIADKGRDRAKGVLKICNIPEGISGDTPWPL-RTKHVGMTVHHVAFHAATGCHVLVVSSQ 1186
Query: 1025 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT---YTVEEYEVRILEPDRAGGPWQTR 1081
+I++ L T E+YEV++ P
Sbjct: 1187 ------------------QEIEDERKPEGTLEGAIPPLTEEKYEVQLRAP--YSMELLDS 1226
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENE-TLLAIGTAYVQGEDVAAR--GRVLLFST--- 1135
+ E AL ++VV L NT K++ +A+GT + GE +R GR+ +F
Sbjct: 1227 YEFDFANGEKALCLQVVHLKNTRVKDSLLPFVAVGTGFQNGESETSRATGRIYVFEVTTV 1286
Query: 1136 -------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP--------KIILHK 1180
GR + + + T +++K +SAL L+G+LL+A GP K+ +++
Sbjct: 1287 VGEEGYEGRTSFKIKKIFTSADIQDIKAPVSALCQLEGYLLVAQGPNPGMIGGSKLYVYE 1346
Query: 1181 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1240
W +L G AF+DA LY+ +L VK FI+ GDI S++ L W+E L LLAKD L
Sbjct: 1347 WVDEKLVGRAFFDAH-LYITTLKTVKFFIVFGDIRHSVHLLRWREDIRMLQLLAKDALPL 1405
Query: 1241 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1300
+A EF++ GS L+ SDEQKN+Q+F + P E ++ Q+L+ RA+ HVG+H+ KF+R
Sbjct: 1406 SVYAAEFVVMGSNFGLLASDEQKNVQVFVFNPNSPE-YRRQQLICRADLHVGSHINKFIR 1464
Query: 1301 --LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1358
L T RT A + TLDG IG I P+ E ++RRL +LQ LV +
Sbjct: 1465 WPLPFRPTLGVRTAAH------------YTTLDGGIGAIIPIPEQSYRRLLALQNLLVTA 1512
Query: 1359 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+PH AGLNPRS+R + R + +D LL Y L L Q++++ TR I
Sbjct: 1513 MPHYAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLLGRYLHLDLALQMQLSSALNQTREAI 1572
Query: 1419 LSNL 1422
L +L
Sbjct: 1573 LGDL 1576
Score = 239 bits (609), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 142/412 (34%), Positives = 222/412 (53%), Gaps = 38/412 (9%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---GISAASLELVCHYRLHGNV 113
NL V +E+YV++ +E+ ++ N + ++ D G A+L+ V Y L+GNV
Sbjct: 31 NLAVVKGTQLELYVLKEEEKKHSKTCNGKQNGQKAAGDSGHGHGGATLQCVGRYDLNGNV 90
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
ES+A + G R RD + L F DAK+S+LE+D+SI + S+H FE E +++G
Sbjct: 91 ESMAFVRLPG----RNRDHLFLVFRDAKLSILEYDNSIDDIVNVSLHLFEDDE---IRKG 143
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------------- 220
R SF R PL++VDP RC +LVY +M+++ GS L D++
Sbjct: 144 RVSFGRAPLLRVDPLQRCAALLVYESKMVVIPFKHKGSDLEEDDEILTQPNKKFKSESAS 203
Query: 221 -------GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
G+ I ++V++L + +KHV DF F+ GY EP + LHE TWAGR
Sbjct: 204 SNTVTRLGAPSDNKLGILPTYVVDLDEAGIKHVVDFTFLDGYYEPTISFLHENSRTWAGR 263
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
++ + T MI+ +S++ + ++ P+IWSA LPH++ ++A+P+P GGV+VV +N + Y +
Sbjct: 264 LAVSNFTGMITTVSLNISQRRQPIIWSASKLPHNSRHIVALPAPAGGVVVVSSNALIYRN 323
Query: 334 QSASCALALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
CAL LN YA++ D + + D H L+ L S TG+ ++ V
Sbjct: 324 HEQKCALKLNEYAIAAGDGGNRFDTAGDIICFDTVHPVRLEGYQMLFSLVTGESYIMGVQ 383
Query: 393 Y--DGRVVQRLDLS----KTNPS-VLTSDITTIGNSLFFLGSRLGDSLLVQF 437
DG ++ L L K +PS S + +G+S FLGSRLGDS LV+
Sbjct: 384 LDTDGNTIKALTLDLVDVKLSPSGGFASIMCRVGDSYLFLGSRLGDSSLVKM 435
Score = 76.6 bits (187), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 135/314 (42%), Gaps = 77/314 (24%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------------ 542
+ F + D+L NIGP+ G R++A G K+ + ELV
Sbjct: 587 YRFELCDTLTNIGPI-----GSRLDA-----GAVKKDSVELVTASGGLQYGKLGVLQRSL 636
Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE----YHAYLIISL--EARTM 588
LP + +WTV+ +++ + D ++E HAY++IS + T+
Sbjct: 637 NPVVMTAVPLPDAQAVWTVFGPTAKAADEDMEEDGNEEEEQSAGMHAYMVISQGNDKGTI 696
Query: 589 VLETADLLT-EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
VL+ +L + E VD+ V +T+ GN+FG +R++QV +L+G Q+L
Sbjct: 697 VLKGRELEEFDEDEQVDFEVDAKTVCVGNIFGNQRIVQVTPWNVYVLNGPRKEQELPV-- 754
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
+G+G + +++ I DPY+ L + DG + LLVGD S+ V+ + +
Sbjct: 755 ---VAGNGLQ---IVAAYIRDPYIALILQDGRLNLLVGDASSMQVNY-----VSHEIHNI 803
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
++ + D P+ GEA D Q D+ +G +++
Sbjct: 804 TAACFFLDPIPD----------------GEANDDP-----QQRDVMLAAAPRNGHFQLYT 842
Query: 768 VPNFNCVFTVDKFV 781
+P+ V+ FV
Sbjct: 843 LPSLELVYDAADFV 856
>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
Length = 1260
Score = 244 bits (622), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 202/687 (29%), Positives = 324/687 (47%), Gaps = 84/687 (12%)
Query: 57 NLVVTAANVIEIY--VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
NLVV ANV+ ++ + E+ ++ G+ + LE + Y L G V
Sbjct: 29 NLVVAGANVLRVFRLIPNTDEKMLRKESADGQPPK---------MKLECLASYNLFGKVM 79
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A +S G+ +D+I+++F AK+S++E+D L+ S+H FE L G
Sbjct: 80 SIAAVSLPGSS----QDTILMSFAHAKLSLIEYDPVSDNLKTLSLHNFEVVSIL--DEGI 133
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE-SS 233
S + P ++VDP+GRC +L++ + IL F + + SS
Sbjct: 134 GSNHKIPEIRVDPEGRCAALLIFRNTLAILP--------------FRKDSAHDSNVTLSS 179
Query: 234 HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I L DL+ + +V D F+HGY EP ++IL+E T+ GR++ + TC + A+S++T
Sbjct: 180 YIIKLTDLEERVDNVIDVQFLHGYYEPTLIILYEPVGTFPGRIAVRQDTCNMVAVSLNTQ 239
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLD 350
+ HP+IWS +LP D +LL VP P+GG L++ N++ Y +QS +++N+ A
Sbjct: 240 QRVHPIIWSLNSLPFDCSQLLPVPKPLGGALIMAVNSVIYVNQSVPPYGVSVNSIADHCT 299
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPS 409
S P + LD A A +LQ D +LS K G+L +LT+ D R V++ L K S
Sbjct: 300 SFPLKPYEGSRIGLDCARAAFLQYDRVVLSLKGGELYVLTLFADSMRSVRKFHLEKAAAS 359
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
VLT+ + N L FLGSRLG+SLL+ F + + A P ++
Sbjct: 360 VLTTCLCICDNYL-FLGSRLGNSLLLAFQ--------TKDYNQYATPFAAKKPKMEQFSL 410
Query: 470 SSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L D ++ EE+ LYG +T+S ++ F V DSL+NIGP + G
Sbjct: 411 LFDQEL-DHLDEEEIDNYLYGEDHESTDSKAISYQFEVCDSLLNIGPCGQMAVG---EPA 466
Query: 528 ASATGISKQSNYELVELP-----GCKGIWTVYHKSSRGHNADSSRMAAYDDEY------- 575
++ T K+S VE+ G G V ++ + + + D +
Sbjct: 467 STCTDFDKKSPDPDVEIVTTSGYGKNGAICVLQRTMKPQVVTTFELPEVSDMFTVFASRN 526
Query: 576 ------HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
H YL++S TMVL+T + E+ +S + V TI A NL R ++QV
Sbjct: 527 NEDAIMHTYLLLSRADSTMVLQTGQEINEMDQS-GFSVTSPTILAANLGNNRFIVQVCPT 585
Query: 630 GARILDGS-YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
R+LD + + Q+L + + S S +DPYV + +G I LL
Sbjct: 586 SVRLLDATATVIQELVM----------DSDFLITSASASDPYVAVLTENGRIGLLTFVEG 635
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ ++ + S PV LY D
Sbjct: 636 S---QLEMIFPVLSKNSPVVCVCLYRD 659
Score = 230 bits (587), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 152/496 (30%), Positives = 251/496 (50%), Gaps = 43/496 (8%)
Query: 824 VVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVS 883
+VE+ + H RP L +D +L Y+A PV S+ + +
Sbjct: 791 IVEMGIFGLGHLHRRPLLMIRTSDFGVLLYEAI----------PALPVYDSKQKNELKIR 840
Query: 884 ASRLRN---LRFSRTPLDAYTREE------TPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
+L + LR ++T Y R+ P+ + F NI+G+ G F+ G P
Sbjct: 841 FRKLNHSLLLRETKT----YVRKGGQSVVLEPYAWKTNQFKYFSNIAGYTGVFIGGPYPH 896
Query: 935 WC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
W M R LR+HP DGSI F HNVNC GFIY+ + L+IC LP+ YD W
Sbjct: 897 WLFMTSRGELRLHPMSIDGSIKCFACFHNVNCAQGFIYLNRKDELRICLLPTLFNYDAPW 956
Query: 994 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
PV+KV PL+ TPH + Y E Y I++ + +P N++ D + +L
Sbjct: 957 PVRKV-PLRCTPHYLIYHVETKTY--ILATSLAEPTNRIYRFNGDDK------ELSLEER 1007
Query: 1054 DLHRTYT-VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETL 1111
D Y VE++ ++++ P TR + + E+ ++ V+L + +
Sbjct: 1008 DDRFPYPHVEKFAIQLISPVTWEAVPNTR--MDLDDWEHVTCLKTVSLEYEGHASGLKDY 1065
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1166
LA+ T Y GED+ +RGR+ + P +N + +Y+K+ KG ++A++S+ G
Sbjct: 1066 LAVSTNYNYGEDIISRGRIFILDLIEVVPEPGQPLTKNKIKTLYAKDQKGPVAAISSVCG 1125
Query: 1167 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1226
+L+ A G KI L + +L GIAF D +Y+ L +K+FIL D++KS+ L ++E+
Sbjct: 1126 YLVAAIGQKIYLWQLKNDDLVGIAFIDTE-IYIHQLLNIKSFILAADVYKSVSILRFQEE 1184
Query: 1227 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
L ++A+D+ L+ A ++ ID + L +VSD +KN+ ++ Y P+ ES G +L+ +
Sbjct: 1185 YRTLCIVARDYQPLEVMAVDYYIDNTQLGFLVSDAEKNLILYMYQPEARESQGGHRLIRK 1244
Query: 1287 AEFHVGAHVTKFLRLQ 1302
A+FHVG V+ R++
Sbjct: 1245 ADFHVGQVVSTMFRIK 1260
>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
Length = 1445
Score = 242 bits (618), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 349/1504 (23%), Positives = 609/1504 (40%), Gaps = 257/1504 (17%)
Query: 57 NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
NL+V+ ++++I+ V+ + +K + N+GET R + D G+ +A
Sbjct: 28 NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSTSNAGETYDRRINDDDGLESAFLGGDGML 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
L LV Y +HG + LA + +SR +++++ A++S+L++D
Sbjct: 88 MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
HG+ S+H +E EW + S GPL ++ DPQ RC L +GL+ I
Sbjct: 145 EKHGVEDISIHFYEKEEW------QGSPMDGPLRQHATILQADPQSRCAA-LKFGLRKIA 197
Query: 204 L-------------KASQGGSGLVGDEDTFGSGGGFSARIES----------SHVINLRD 240
+ G E+ + + S S V+ L
Sbjct: 198 FLPFRQIDGDIDMDDWDEEVDGPRPQEEPPAAAAVHGSSSNSSSLAPVPYTPSFVLALPQ 257
Query: 241 LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
LD + H F F+H Y EP + I+ H T + + + L
Sbjct: 258 LDPEILHPVHFAFLHEYREPTLGIISSTNRRLKMEPQKDHFTFKVFTVDL--------LQ 309
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPR 357
+++N K++A+P P+GG L++G N IH + +A+N YA + +
Sbjct: 310 KASLN------KVIALPKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKMTKFPLADQ 363
Query: 358 SSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVL 411
S + L+ + +N LL T+ G++ ++T DGR V + + ++ VL
Sbjct: 364 SELKLRLEHCEVELMSPENGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVATENGGDVL 423
Query: 412 ---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ +T + + F G+ GDS ++ G + + K+ E+
Sbjct: 424 PFRAACLTKVTKNSMFYGTIGGDSKVI----GWSRQHVQTARKKARLLDESLDYDLDDDE 479
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
D D + GE ++ + F V DSL+++ P+ D +YG
Sbjct: 480 ADDDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMTYGKTAFFPG 539
Query: 522 ---------LRINAD-ASATGISKQSNYELV------------ELPGCKGIWTV-----Y 554
+R D A G + + L+ + P +G WT
Sbjct: 540 SEDAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFDFPEARGFWTTRVQKTI 599
Query: 555 HKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTESVDYF--- 606
KS +G + +A +D +Y ++I++ + ET+D+ +
Sbjct: 600 AKSLQGDKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGAGFEALSGT 656
Query: 607 ----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
G TI AG + R+IQV R DG ++Q L + E+G+ V
Sbjct: 657 EFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETGA---EPRV 711
Query: 662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK----G 717
+S SI DPY+LL D SI + + S K +S C LY D
Sbjct: 712 ISASIVDPYLLLLREDSSILVAQITNHNELEELDKEDETIVSTKWLSGC-LYKDSRGLFA 770
Query: 718 PEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
P + TST + + AI G+++ +C ++ +PN + V
Sbjct: 771 PVQTDKGTSTSESVFLFLLNAI----------GELHVRIC-------VYALPNLSKSIYV 813
Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
+G ++I S + ++ GT E + + V +L ++ H
Sbjct: 814 ---AAGLSYI----------PSLLSADYTARRGTS---PETLTEILVADLGDSTSASAH- 856
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
L + + Y+ + G E K D + SL VS S L +++P+
Sbjct: 857 ---LILRHANDDMTIYEPFRIGGQEE--KED----LANSLFFKKVSNSHL-----AKSPV 902
Query: 898 DAYTREETPHGAPCQRITIFK---NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
+A E R+ + NI G+ FL G+ P + + + L +
Sbjct: 903 EAAEDEAVQE----NRVIPLRACDNIGGYSTVFLPGASPSFILKSSKSTPKVIGLQGLGV 958
Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
+ H C GFIY S+G ++ Q P + + +P+ + +
Sbjct: 959 NGMSSFHTEGCERGFIYADSKGCARVTQFPDAANVAELGVSVRKVPIDTAVSHVAWHPNM 1018
Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY-EVRILEPDR 1073
+Y V+ L+P E+ D H + + ++E+ +++ P
Sbjct: 1019 EVY--AVASSKLEPF----------ELPKDDDYHKEWAKEECPMPPMKEHGSIKLYSPIT 1066
Query: 1074 AGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
W ++ E A+ ++ + L + TKE L A+GTA ++GED+ RGR+L+
Sbjct: 1067 ----WNVIDEFELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGRILV 1122
Query: 1133 FSTGRNADNPQNLVTE-----VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWT 1182
F P T+ + +E+ +GA+++L + QG +L+A G K ++ K
Sbjct: 1123 FDVVHVIPQPDRPETDRKLKLIAKEEIPRGAVTSLCEVGTQGLMLVAQGQKCMVRGLKED 1182
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1240
GT L +AF D YVV+++ ++N + L+ D + ++F+ + E+ ++ L K L
Sbjct: 1183 GTLLP-VAFLDMS-TYVVAVHELRNTGYCLMADANMGVWFVGYSEEPYRMTLFGKSGTQL 1240
Query: 1241 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFL 1299
C +FL+ G+ LS+V SDE + I + P+ S +G LL+RA F V H L
Sbjct: 1241 KCLTADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWATL 1300
Query: 1300 RL------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
L L S TGAA ++T LL + G+I + P+ E +RRL SL
Sbjct: 1301 VLPRTTTRPYLPQSEPATGAAGSQNRTQ--TLLLASASGAIASLNPITEHAYRRLTSLTT 1358
Query: 1354 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIA 1408
L +++PH AG+NP++ R +G A P D +IVD LL+ + L ++ E A
Sbjct: 1359 SLANALPHAAGMNPKAHRLPPQDGAARPPAVDVSAGRTIVDGALLARWNELGARQRAEAA 1418
Query: 1409 HQTG 1412
+ G
Sbjct: 1419 GKGG 1422
>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit 1
[Botryotinia fuckeliana]
Length = 1447
Score = 240 bits (613), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 336/1519 (22%), Positives = 606/1519 (39%), Gaps = 242/1519 (15%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
NLVV +++++I+ + + E+ S +K+ RV D G+
Sbjct: 28 NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87
Query: 98 -------AASLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
L LV Y L G V SL I S+ G + +I++ F+DAK+S++E
Sbjct: 88 QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQGGSGLVGDEDTFGSG--------------GGFSARIESSHVINLRDLDMK-----HVK 247
Q + D D G G SS V+ L LD H++
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKAAAEDGQLDTPYGSSFVLRLSSLDPSIIFPIHLE 261
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLP 305
F++ Y EP IL + + + H T M+ L + K I S LP
Sbjct: 262 ---FLYEYREPTFGILSSTMAPSSALLQERRDHLTYMVFTLDMHQ--KASTTILSVGGLP 316
Query: 306 HDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
+D ++++ + P+GG L+VG N IH + +A+N +A L ++ + L
Sbjct: 317 YDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTGFSLLDQADLDLRL 376
Query: 365 DAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT---SDI 415
+ L +N L+ +GD+ +L+ DGR V L + + + ++LT S +
Sbjct: 377 EGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGAILTGAASCV 436
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
+++G F+GS + DS+++ + SG + + E D +
Sbjct: 437 SSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVDE---AMFDEEDLEDD 493
Query: 476 QDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSYG---LRINADASA 530
D + G+ ++ + +N T S KT ++F + DS+VNI P+ + ++G L + D
Sbjct: 494 DDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAFGEAALSLGKDEEL 553
Query: 531 TGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGHN 562
QS +LV +LP +GIWT+ K + +G
Sbjct: 554 KSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIWTMSAKRPAPKGLQ 613
Query: 563 ADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLTEVTESVDYF-VQG 609
+ + D +Y +I+S +A + E+A D E ++ G
Sbjct: 614 VNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGFEALTGTEFEPAAG 673
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
TI AG L RV+Q+ + R DG + Q L + E+G+ ++S S AD
Sbjct: 674 STIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA---EPKIISASFAD 728
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
P++LL D SI + D ++ I S K ++ C LY D +D
Sbjct: 729 PFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD------YSGAFSD 781
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK---FVSGRT 785
+ S GE ++ + GAL I+ +P+ + V + FV
Sbjct: 782 SK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPVYVAEGICFVPPVL 827
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+ A +++ TEI L + P+L
Sbjct: 828 SADYAARKSAARETLTEI-----------------------LVANLGDSVSQSPYLILRP 864
Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
++ + Y+ + + S S L S + +++N ++ P + EE
Sbjct: 865 SNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHLTQAP--DVSAEEQ 910
Query: 906 PHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
GA P + I+ N+ G+ F+ G P + + + L + + +
Sbjct: 911 VDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKVLSLQGTGVRSLSS 967
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL 1019
H C+ GFIY +++GI ++ Q P +T+ + + I + H + Y Y +
Sbjct: 968 FHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDVHAVAYHPPLQTYVI 1027
Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL-SSVDLHRTYTVEEYEVRILEPDRAGGPW 1078
S D E+ D+ ++ ++E+ ++++ P W
Sbjct: 1028 GTST------------FTDFELPKDDDHRKTWQEENIALKPSIEKSFLKLVSPVN----W 1071
Query: 1079 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---S 1134
I ++ E ++ + L + T E + L+ +GTA +GED+A GR+ ++ +
Sbjct: 1072 SVIDAIELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDLATTGRLYVYDVVT 1131
Query: 1135 TGRNADNPQN------LVTEVYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGT 1184
D P+ + +E+ ++ G ++ L+ + QG +L+A G K ++ K GT
Sbjct: 1132 VVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLKEDGT 1191
Query: 1185 ELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
L +AF D YV S+ + ++ D K ++F + E+ ++ L K ++
Sbjct: 1192 NLP-VAFMDMN-CYVTSVKELPGTGLCVMADALKGVWFAGYTEEPYRMLLFGKSAAKMEV 1249
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH-------V 1295
+ L DG L +V +D N+ I Y P+ +S +G LL R F +GAH +
Sbjct: 1250 LCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLL 1309
Query: 1296 TKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
L L T+ + + T + LL + G++ ++PL E +RR +L
Sbjct: 1310 PTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLLLTSRTGTLALLSPLTESQYRRFGTLVS 1369
Query: 1354 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1413
L +++ H GLNPR++R + G +I+D +L + L + + E+A + G
Sbjct: 1370 HLTNTLYHPCGLNPRAYR-IDRDANEGIVGGRTIIDGGVLGRWMELGSQRRGEVAGRVGV 1428
Query: 1414 TRSQILSNLNDLALGTSFL 1432
++ L+ L G F+
Sbjct: 1429 DVLELRDELSGLRGGLGFI 1447
>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
clavigera kw1407]
Length = 1461
Score = 240 bits (612), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 341/1435 (23%), Positives = 570/1435 (39%), Gaps = 215/1435 (14%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S + L LV + L G V LA + G + +++++A +DA++S+LE+D + L
Sbjct: 100 SISKLVLVAEFPLAGTVTGLARIKIPGTKSGG--EAVLVALKDARLSLLEWDPDQNDLTT 157
Query: 157 TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
S+H +E E + DP RC + + IL Q V
Sbjct: 158 ISIHYYEQEELQGAPWAAPLSDYANFLVADPGSRCAALKFGARNLAILPFRQADEEDVDM 217
Query: 217 ED-------------------TFGSGGGFS-ARIESSHVINLRDLD--MKHVKDFIFVHG 254
+D G G G S V+ L +LD + H F+H
Sbjct: 218 DDWDEELDGPRPAKDPSSAAVVSGPGDGIEDTPFAPSFVLRLSNLDTTLLHPVHLAFLHE 277
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y EP IL T A V + ++ K I S NLP D ++++ +
Sbjct: 278 YREPTFGILSSSVSTSA--VIGRRDKLSYLVFTLDLQQKASTTILSVANLPQDLFRVVPI 335
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
PSPIGG ++VGAN IH + +A+N + S +S ++ L+ L
Sbjct: 336 PSPIGGAILVGANELIHIDQSGRANGVAVNPFTKQSTSFGLADQSDLALRLEGCTVDVLS 395
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLS----KTNPSVLTSDITT---IGNSLFF 424
+ L+ G L +LT+ DGR V L + + V+ S IT IG + F
Sbjct: 396 AEAGELLIVLHDGQLAVLTIRVDGRTVSGLSVKMVRREAGGDVIQSGITCLSRIGRQMLF 455
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVNGEE 483
GS DS+++ ++ G + G D+ AD R + D + +
Sbjct: 456 AGSDQADSVVLGWSRKQGQTARRKPRANRAGLDLGADEEYFDDEREEGEELDDDEDDDDL 515
Query: 484 LSLYGSAS------NNTESAQKTFSFAVRDSLVNIGPLKDFSYG-------LRINADAS- 529
SA+ N T SF + D L++I P++D G L +D +
Sbjct: 516 YGDGPSAAQTLGIDNTTGRGGDDLSFRIHDRLLSIAPIRDMVIGKPALVGELAKRSDQAT 575
Query: 530 ---------ATGISKQSNYELV------------ELPGCKGIWTV-----YHKSSRGHNA 563
A G + L+ E + +WTV ++ +G
Sbjct: 576 IHSELNLVCAVGSGRAGALALLSREINPDPLGAFEFAEAQALWTVSSSKPIPRTIQGEKG 635
Query: 564 DSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQGRTIAA 614
++ Y+ + Y+I++ E ET+D+ + G T+ A
Sbjct: 636 GATVGEDYESPAMHDKYMIVAKEDDDG-FETSDVYAVTASGFETLKGTEFEPAAGFTVQA 694
Query: 615 GNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
G + RR+IQV + R DG ++Q L + +G+E VL SIADPY+LL
Sbjct: 695 GTMGRNRRIIQVLKSEVRCYDGDLGLSQILPM----VDEDTGAE-PRVLFASIADPYLLL 749
Query: 674 GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
D S+ + + ++ +S K V+ C LYHD KTS A+L +
Sbjct: 750 IRDDASVLVAEMNKDFELEELERDDGSLASTKWVAGC-LYHDTAS--VFSKTSILAFLLS 806
Query: 734 GVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS--GRTHIVDTY 791
SG I+ +P+ V + ++ R + D
Sbjct: 807 A-------------------------SGTFYIYALPDLKQPVYVAEGLNYVPRLFLPDHT 841
Query: 792 MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
+R + KE + + V +L A P+L + +
Sbjct: 842 VRRGMA------------------KEPLTEILVADLG----DAVSKAPYLIVRHANDDLT 879
Query: 852 CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGA 909
YQ P+ T SL + S L+ N F+++P+ + + ++
Sbjct: 880 IYQ---------------PLRTPSSLGSLSESLRFLKVPNPVFAKSPV-SISSDDASSQL 923
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCN 966
+ + +NI G+ FL GS + + + + P+ L ++ + + H +
Sbjct: 924 RAMPLRVCENIGGYSTVFLPGSSASFVL---KSAKSQPRVVSLQGTAVRSLSPFHTESSE 980
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
FIYV +G ++C +P+G + + L + + Y Y + S
Sbjct: 981 RSFIYVDVEGSGRVCSMPAGWNLTELGVCARKVALDTDANALAYHPPTGTYAVGTSA--- 1037
Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
+ + + ++ H+ D + S+ R E + ++ P G W T T+ M
Sbjct: 1038 -----LEAFELPKDDPHRADWNKESTA--FRPL-AERGRLLLMSP----GSWSTIDTVEM 1085
Query: 1087 QSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP--- 1142
+ E + V+ + L + T E + L+A+GTA +GED+A RGRV +F P
Sbjct: 1086 EPYEVVMCVKTLNLEVSEATNERKQLVAVGTAISRGEDLAIRGRVYVFDVVSVIPEPGRP 1145
Query: 1143 -QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAP 1195
N ++ +KE +GA++A++ + QG +L+A G K ++ K GT L +AF D
Sbjct: 1146 ETNRKLKLIAKEDIPRGAVTAVSEIGTQGLMLVAQGQKCLVRGLKEDGTLLP-VAFMDMN 1204
Query: 1196 PLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
YV S + ++ D K ++F + E+ ++ L K L + L DG
Sbjct: 1205 -CYVTSAKELPGTGLCVMSDAFKGVWFTGYTEEPYKMILFGKSNTRLHALNVDLLPDGKE 1263
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF-LRLQMLATSSDRTG 1312
L +VV+D N+ + + P+ +S +G LL RA F GAH + L L T +DR
Sbjct: 1264 LFIVVTDADGNLHVMQFDPEHPKSLQGHILLHRATFCTGAHFSTLSLLLPSTFTPADRPT 1323
Query: 1313 A---------APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1363
A P + + + LL G+ G + + PL E +RRL SL +L S+ A
Sbjct: 1324 ANGETNGASSQPEAQQHQQHQLLLGSPTGLLASLVPLSESEYRRLSSLAGQLATSLTQTA 1383
Query: 1364 GLNPRSFRQFHSNGKAH-RPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1412
GLNP+ +R + A PG D S+VD LL+ + L + EIA + G
Sbjct: 1384 GLNPKGYRMTAGSAAATLAPGVDAAVGRSVVDGALLARWTELGSGRKGEIAGRVG 1438
>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 1434
Score = 239 bits (610), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 201/700 (28%), Positives = 341/700 (48%), Gaps = 64/700 (9%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREA--LKDSE---TEINSSSEE 809
V+ E+G L I+ +P V+ V K +H+ D + E L D ++I S++
Sbjct: 773 VMARENGNLYIYSIPEMQLVYMVKKL----SHLPDVAIDEMNYLGDESVVASDIASNTLN 828
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
+ E I +VE+ + + RP LF ++ D + Y+ ++++ N
Sbjct: 829 EALVAKPEEI----IVEVLLTGMGMNQGRPMLFVVV-DDMVSVYEMFMYD---NGVVEHL 880
Query: 870 PVSTSRSLSVSNVSASRLRNLRFS----RTPLDAYTREETPHGA---PCQRITIFKNISG 922
V R L + V+ R+ RF R P++A R+ + P +RI N
Sbjct: 881 AVRFKR-LPYTTVT----RSCRFQGNDGRAPVEA-ARDTVRYRTALHPFERIGNILN--- 931
Query: 923 HQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKIC 981
G F+ S PC ++ LR+HP +G I++FT +NV C +GFIY+T + ++I
Sbjct: 932 --GVFICSSYPCVFLMDSGILRMHPLNLEGPILSFTAFNNVLCPNGFIYLTEREWAMRIA 989
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
+LP+ D+ PV+K I T H I Y + N Y ++ S K N L +L++++
Sbjct: 990 KLPTDVELDSSLPVRK-IRTGRTIHNIVYLLQSNTYAVVGSE---KKPNNRLCVLVNED- 1044
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVT 1099
D H D +E Y+V++ P+ W+ A I M+ E V
Sbjct: 1045 -KSFDEH--EKADSFVLPELEVYDVKLYSPED----WKPVPNAEIKMEDFEVLTCCEEVV 1097
Query: 1100 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1153
L + T + LA+GTA GE+V RGR+++ P ++ + +Y KE
Sbjct: 1098 LRSEGTVSGVQNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKE 1157
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
KG +++L S G+LL G K+ + + L GI+F D Y+ L V+N L D
Sbjct: 1158 QKGPVTSLCSCNGYLLAGMGQKVFIWLFRDNNLQGISFLDMH-FYIHQLVGVRNLALACD 1216
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSL--DCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1271
I++S+ L ++E+ L+L ++D ++ A +FLID ++ ++SDE NI +F Y
Sbjct: 1217 IYRSVALLRYQEEYKALSLASRDMRAVVQPPMAAQFLIDNRQMAFIMSDEAANIAVFNYL 1276
Query: 1272 PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGTL 1330
P+ ES G++L+ R+E ++G +V F+R++ +S G + NR ++LF +L
Sbjct: 1277 PEALESSGGERLILRSEINIGTNVNSFMRVKGHISS----GFVENEHYSLNRQSVLFCSL 1332
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
DGS G + PL E FRRL LQ+ + V AGLN + R H ++VD
Sbjct: 1333 DGSFGFVRPLSEKVFRRLHMLQQLMSSLVAQAAGLNVKGSRAARPQRPNHYLNTRNMVDG 1392
Query: 1391 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
+++ Y L L ++ ++A + GT+R I+ +L +++ T+
Sbjct: 1393 DVVFQYLHLSLADKNDLARKLGTSRYHIIDDLTEISRLTT 1432
Score = 178 bits (451), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 172/656 (26%), Positives = 281/656 (42%), Gaps = 114/656 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + H RL V+SLA+ + S++L F+ AK+SV+ F + L+ S+H
Sbjct: 107 LECIIHVRLLAPVKSLAV---ARIPQNPSCSSLLLGFDTAKLSVVGFSAAERSLKTISLH 163
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G + P+++VDP RC +L+YG + +L L
Sbjct: 164 CFEEE---MLKDGYVTDLPSPVIRVDPAQRCAVMLIYGRYLAVLPFDDTSPHL------- 213
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
++ + L +D + ++ D F+ GY EP ++ L+E T AGR ++
Sbjct: 214 -----------HTYTVALSSIDPRLVNIIDIAFLDGYYEPTLLFLYEPAQTTAGRACVRY 262
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
T + +S++T + H +W NLP D ++L +P PIGG L++GAN + Y +QS
Sbjct: 263 DTVCMLGVSLNTKEQVHASVWQLNNLPMDCNQVLMIPRPIGGALIIGANELIYLNQSVPP 322
Query: 338 CALALNNYAVSLDSSQELPRSS---FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
C LN+ +D + P S ++ LD A + + ++ ++G L +LT+V D
Sbjct: 323 CGSLLNS---CMDGFTKFPLKSEKEMALTLDGCAACVISTNKVVVCARSGALFILTLVVD 379
Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
V+ ++ + +T F+GSR+GDSL +++ S L
Sbjct: 380 STNSVKSIEFKHEFDVSIPHTVTACSPGYLFVGSRVGDSLFIEYV---------SEL--- 427
Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ---KTFSFAVRDSLV 510
+ D P K+L+ + QD + E+L LYG A + S + F V D ++
Sbjct: 428 ---VPVDDPIEKKLK---VEVPQDDLEDEDLELYGKALPSVISQDVSVEKMRFRVLDRML 481
Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS---- 566
N+ P K + +G S+ N L E P ++ + GH DSS
Sbjct: 482 NVAPCKKMT-----------SGCSEGLNSYLQEQPRLDPVFD--RVCACGHGKDSSICIF 528
Query: 567 ---------------------RMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+ +D+ H Y+I S E ++ LET + L E+ V +
Sbjct: 529 QQSIRPDIITSSSIEGVIQYWAVGRREDDTHMYIIASKELGSLALETDNDLVELEAPV-F 587
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ----DLSFGPSNSESGSGSENSTV 661
TIAAG L +QV ++ Q L+F V
Sbjct: 588 ITSESTIAAGELADGGLSVQVTTSSIVVVAEGQQIQLIPLQLTF--------------PV 633
Query: 662 LSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
LS SI DP+V + +G + L L P +V P I +K P+++ +Y D
Sbjct: 634 LSASIVDPFVAICTQNGRLLLYELDNTPHVHLKAVDLPGNIIHNKSPITALCIYRD 689
>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
10762]
Length = 1437
Score = 239 bits (609), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 343/1492 (22%), Positives = 585/1492 (39%), Gaps = 264/1492 (17%)
Query: 52 IGP-VPNLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
IGP NLVV ++++++ V R+ + + + + R L L+ Y L
Sbjct: 22 IGPQADNLVVAKTSLLQVFEVKRISQAKDNGHHDHADAQSR----------LSLIGEYTL 71
Query: 110 HGNVESLAILS-----QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
G V +L+ ++ GGA +++ AF+DAK+S++E+D + + S+H +E
Sbjct: 72 SGTVTALSPITLPSSRTGGA-------ALVCAFKDAKLSLIEWDPEHYRISTISIHYYEG 124
Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------- 211
L G ++ VDP RC + Q+ IL Q G
Sbjct: 125 DNVLLPPFGAALSECESILTVDPGSRCAALKFGERQLAILPFRQQGDELADEAAEDADMA 184
Query: 212 --------GLVGDEDTFGSGGGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIE 257
G V + T + S +SS V+ L LD + H F+H Y E
Sbjct: 185 EAESEEQPGNVTLKRTSTTQALDSKDDITPYKSSFVLPLITLDPSLTHPVHLAFLHEYRE 244
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P IL + + + + ++ + + S LP D +K++A+P P
Sbjct: 245 PTFGILSAPQQPSLALLDERKDCLSYTVFTLDLEQRASTNLMSVSKLPSDLWKVIALPPP 304
Query: 318 IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
+GG L+VG N IH + A+A+N +A + S +++L+ L +
Sbjct: 305 VGGALLVGTNELIHIDQSGKTTAVAVNEFAKVASNFSMADHSDLNMKLEGCEIEMLDSST 364
Query: 377 --ALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSD----ITTIGNSLFFLGS 427
AL+ G L+ GR V L +S+ TN + + + ++ F+GS
Sbjct: 365 GNALIVLNDGSFATLSFKMLGRTVGGLTVSRVADTNGGNVNASAPSCVASMQQQKLFVGS 424
Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD---------------APSTKRLRRSSS 472
G S LV++ + T + G APS ++R++S
Sbjct: 425 EDGSSSLVRWAKDTPTLSRKRSHAQMLGQDAPMDDADDAEELDEDDLYAPSAVAVKRAAS 484
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RINAD--- 527
A+ A T++F + DSL ++ P+ + G R +
Sbjct: 485 ----------------VANAAAVDASTTYTFELEDSLNSLAPMNNVCLGRSPRTGKEKLE 528
Query: 528 -ASATGISKQS-----NYELV-------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
+ G K S N E++ ++ G K IW+V +S G S+ D
Sbjct: 529 LVAGIGRGKASSLAFMNREIIPNEIRSRDVAGAKDIWSVCARSREGDKVSSA------DT 582
Query: 575 YHAYLIISLEARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
Y L + T + AD + E+ E+ D+ G T+ G L ++Q
Sbjct: 583 YDNLLFVFDGESTKTYKYADSAEGSIIELDET-DFEGDGETVCVGTLANGSCIVQCRRTE 641
Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENST---VLSVSIADPYVLLGMSDGSIRLLVGDP 687
R T D G S S E +++ S DPY+L+ D S+++L D
Sbjct: 642 IR-------TYDHQLGLSQIIPMSDDETDAELKIVATSFCDPYLLVIQDDSSVQILQVDK 694
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
+ +P+ + E LR+ WL+ + G L
Sbjct: 695 -------------QGDVEPLDAA--------ESDLREGK---WLTGSL-------YAGEL 723
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
G + + + G L++F +P V++ ++ L S+
Sbjct: 724 SDGQSAAFLLGQEGGLQVFSLPETKLVYSAPTL---------PFLPPVL---------SA 765
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
+ +G K + + VV+L + + RP+L ++ Y+ + +
Sbjct: 766 DAPQRRGGKVTLTEVLVVDLGAEGVT----RPYLIVRTAMDDLILYEPFHY--------- 812
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREE----TPHGAPCQRITIFKNISGH 923
S + + A+ +LRF + P + + T G P Q I G
Sbjct: 813 --------SATTLDARATGFTDLRFRKVPFTYLPKYDEGLDTADGRPAQLQPAV--IGGR 862
Query: 924 QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
+L G P + + L L + +F+ LH C GF V G LK QL
Sbjct: 863 NALYLPGGTPSFLVKEATSLPKVLGLRARGVRSFSPLHRAGCQQGFALVDGDGKLKEYQL 922
Query: 984 PSGSTYDNYWPVQKVIPLKATPH---QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
P ++ W V + + L P Q+ Y ++ +Y V+ V L D +
Sbjct: 923 PGHVSFATGWSV-RTLTLGEPPQEVRQVAYHEQRGIY-------VVATCRDVDFTLHDLD 974
Query: 1041 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
+ D NL V +Y + +L + + ++ M +E +++++ L
Sbjct: 975 ERQRDDEPNLKP-------QVPQYTLHLL----SATSHKVIQSLEMPYAEIVTSLKIMPL 1023
Query: 1101 -FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQNLVTE--VYSKEL 1154
+ T E + +L +G A +GED A+G + +F D+P++ + +E
Sbjct: 1024 EVSEHTHEQKLMLVVGAAAQRGEDAPAKGLLTVFDIIDVVPEPDDPESGIRLHIAAREET 1083
Query: 1155 KGAISALASLQGHLL-IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFI 1209
KGAI+AL S G L+ A G KI++ K GT L +AF DA Y+VSL +
Sbjct: 1084 KGAITALESFSGGLVGTAQGQKIMVRGLKEDGTCLP-VAFLDAQ-TYMVSLKTMGRSGLS 1141
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
L GD K ++F W E+ +L LL K ++ + EFL L L+V D + ++ +
Sbjct: 1142 LAGDAWKGLWFGGWTEEPYRLTLLGKSRTKMEVVSAEFLPFDGQLYLLVVDGKMDLHVLQ 1201
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL---------QMLATSSDRTGAAPGSDKT 1320
Y P+ ++ GQ+LL ++ FH+G L L Q T+ D G G++ +
Sbjct: 1202 YDPENPKTVSGQRLLHKSTFHLGHWPVDMLLLPSDLAPFAQQAPLTNGDSNGHTNGTESS 1261
Query: 1321 NR---------FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
F +L G++G I P+DE T+RRL +LQ +L + H AGLNPR++R
Sbjct: 1262 AANAPAPAPSLFHVLTTFQSGAVGLITPVDEATYRRLGALQTQLTSVLEHAAGLNPRAYR 1321
Query: 1372 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
S R +VD L+ L + E+ + G + S+L
Sbjct: 1322 AVESESLGGR----GVVDGMLVQRIGELGAARRAEVLGRAGADAWGLRSDLE 1369
>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1628
Score = 238 bits (606), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 156/547 (28%), Positives = 276/547 (50%), Gaps = 80/547 (14%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG----------------SIV 955
+RI F +ISG +G F+ G +P W + LR+H ++
Sbjct: 1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181
Query: 956 AFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
FT +N++C GFIY + + ++KIC L + ++N +++ IP K + H+I Y +E
Sbjct: 1182 TFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRR-IPTKNSCHKIAYHSEA 1240
Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1074
Y +IVS P +V ++ + + T +++++++++P
Sbjct: 1241 KCYVVIVSFP---------------QVTQELQEDSKKPI-----LTDDKFQIKLIDP-TI 1279
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTT---TKENETLLAIGTAYVQGEDVAARGRVL 1131
W+ + +Q E L +++V+L T L IGTA+ GED +GRVL
Sbjct: 1280 DWNWKFIDSFSLQDRETVLAMKIVSLKFTEPDGITRARPFLVIGTAFTFGEDTQCKGRVL 1339
Query: 1132 LF------STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
+F + + + + + +Y KE KG ++AL+S+ G LL+ GPK+ ++++
Sbjct: 1340 VFEIVSHKTQFESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFYTGS 1399
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
L ++FYDA +Y+ S+ +KN+I++GD++KS+YFL WK+ LNLL+KD+ +L+ F+T
Sbjct: 1400 LVTLSFYDAQ-IYICSICTIKNYIVIGDMYKSVYFLQWKDNKT-LNLLSKDYQALNIFST 1457
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1305
EF+++ TLS++VSD KNI +F + P+ S GQ Q +
Sbjct: 1458 EFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSGQ------------------INQEIN 1499
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
++ P ++ ++FGTLDG + + PLDE + +Q KL +P AGL
Sbjct: 1500 GNNKNDNRLPKKEQ----LVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY-YLPQTAGL 1554
Query: 1366 NPRSFRQFHSNGKAHRPGPDS-------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
NP+ +R F S + P + I+D +L+S + L E+ I++ +T +I
Sbjct: 1555 NPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEI 1614
Query: 1419 LSNLNDL 1425
+ +L D+
Sbjct: 1615 IESLKDV 1621
Score = 199 bits (505), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 156/585 (26%), Positives = 266/585 (45%), Gaps = 128/585 (21%)
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
+++++++VKDF F+HGY EP ++ LHE TW R++ K TC ++A+S++ K I
Sbjct: 281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W+ N P++ L++VP P+GG LV+ AN + Y +Q++ LA+N YA S+D+S +
Sbjct: 341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399
Query: 359 SFS----------VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
F LD ++ +L++D + S K G+L++ ++ DGR VQR+ +SK
Sbjct: 400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT------SMLSSGLKEE-------FG 455
SVLTS I + N+L FLGSRLGDSLL+Q+T S T S+ K++
Sbjct: 460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYTEKSITDDQLEHENFSNPYKKQKTSEVFDLF 519
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELS-------------LYGSASNNTESAQKTFS 502
D ++ + +++ Q+ + ++ L+ N +S Q
Sbjct: 520 DENSETNNNNNSNNNNNKENQEKSSSSSIASKLLEEIEDEEDQLFKEKKNQLKSYQ---- 575
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY-----ELV--------------- 542
+ D ++NIGP+ D G I+ T Q Y ELV
Sbjct: 576 LGICDQIINIGPIGDIVVGQSIDPTYDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQ 635
Query: 543 -----------ELPGCKGIWTVY------------------HKSSRGHNADSSRMAAY-- 571
ELPG +WTVY K SR N ++ +
Sbjct: 636 NNIKPELVMAFELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQE 695
Query: 572 ----------------DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAA 614
D +H YL +SL + T++ ET L EV + +++
Sbjct: 696 DNEDNEEEEEEEKMQKDKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDI 750
Query: 615 GNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
GNLFGR+R++ +++ G ++++G + Q++ N + S I DP++LL
Sbjct: 751 GNLFGRKRIVVIYQGGIKLINGFDRVIQEIQI------------NEPIKSSYICDPFILL 798
Query: 674 GMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
+G+I++ G D + + + + + S +L+ D+
Sbjct: 799 QFHNGTIQIFKGIDEENQLIQFSINSISNNLNQSIFSSSLFFDRN 843
Score = 91.3 bits (225), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 89/168 (52%), Gaps = 18/168 (10%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGISAA-------SLELVC 105
NLV+ NV++IY +R ++ E +S+ + ++ I+ SLEL+
Sbjct: 32 NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91
Query: 106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+L GN+ES+A + ++ RDS+IL F DAKISVL++D + I S+H FE
Sbjct: 92 EKKLFGNIESMASVRYPNSE----RDSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147
Query: 166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
E+ K GR F PL+KVD Q RC +L+Y + +L + S L
Sbjct: 148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSIL 192
>gi|9794904|gb|AAF98386.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 507
Score = 237 bits (605), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 162/497 (32%), Positives = 258/497 (51%), Gaps = 49/497 (9%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRI 524
SL+N+ P+ G R+
Sbjct: 489 SLMNVAPINYMCAGERV 505
>gi|116182170|ref|XP_001220934.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
gi|88186010|gb|EAQ93478.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
Length = 1394
Score = 236 bits (601), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 337/1379 (24%), Positives = 546/1379 (39%), Gaps = 241/1379 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
NL V +++++I+ +V S+N+G R DG+ A+
Sbjct: 41 NLAVAKSSLLQIFRTKVIATELDTSQNNGHRTRNANRYESRLANDDDGLEASFLGGDSLA 100
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRR-RDSIILAFEDAKISVLEFDD 149
L LV + L G V L + N+R DS++LAF+DAK+S++E+D
Sbjct: 101 QRTDRANYTKLVLVAEFPLAGTVTGLVRIK---TPNARLGLDSLLLAFKDAKLSLVEWDT 157
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
H L S+H +E E + DP RC + + IL Q
Sbjct: 158 EHHTLSTVSIHYYEQEELQGSPWAAPLSHYANFLAADPGSRCAALKFGARNLAILPFKQA 217
Query: 210 GSGL-VGDEDTFGSGGGFSARIES----------------SHVINLRDLD--MKHVKDFI 250
+ +GD D G + + S S V+ L +LD + H
Sbjct: 218 DEDIDMGDWDEELDGPRPAKDLSSAVINGASNIEDTPYSPSFVLRLSNLDPSLLHPVHLA 277
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP IL H M+ L + K I S LP D ++
Sbjct: 278 FLHEYREPTFGILASTAAASNSLGRKDHFVYMVFTLDLQQ--KASTTILSVTGLPQDLFR 335
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
++ +P+P+GG L+VG+N IH +A+N S + +S ++ L+
Sbjct: 336 VVPLPAPVGGALLVGSNELIHIDQSGKPNGVAVNPMTKHCTSFGLVDQSDLNLRLEGCVI 395
Query: 370 TWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGN 420
L D+ L+ G + ++T+ DGR V L+L + + S++ ++T IG
Sbjct: 396 DVLAADLGELLIILNDGQMAVMTLRIDGRTVSGLELKILPASSGGSIVPGRVSTLSRIGR 455
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA------ 474
+ F G GDS+L FG + +R R+ +A
Sbjct: 456 NAMFAGLEEGDSVL-------------------FGWAKKQTQVGRRKPRTKDNAGDVDVE 496
Query: 475 ---LQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLR 523
+ +E LYG AS S V D L+N+GP++ +Y
Sbjct: 497 EDEDIEEEEEDEDDLYGEASAPQHQPVSAVSGLLSGEASLRVHDRLINLGPIQAMTYSQP 556
Query: 524 INADAS-----------------ATGISKQS-----NYEL-------VELPGCKGIWTVY 554
+ S A G K + N E+ E P +G WT+
Sbjct: 557 VWLPGSEEERNSAGVHSDLQLVCAVGREKSASLVTMNLEIQPKVIGRFEFPEARGFWTMC 616
Query: 555 HKSSRGHNADSSRMAAY-------DDEYHAYLIIS------LEARTMVLETADLLTEVTE 601
K S + + +Y ++I++ E + TA +
Sbjct: 617 AKKPIPKTLQSDKGGNFLGKDYDVSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLGG 676
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
+ G TI AG + R+IQV + R DG + ++Q + + E+G+
Sbjct: 677 TEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIVPM--LDEETGA---EPR 731
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
+S SIADP +L+ D S+ + D S ++ ++ K ++ C LY D
Sbjct: 732 AISASIADPLLLIIRDDSSVFVAQMDSSNELEELEKEDQTLATTKWLTGC-LYAD----- 785
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
+T A+ E + G G P I + SG+L I+ +P+ + V +
Sbjct: 786 -----TTGAF-----AEEVAGKGGKPAQA--ILVFLLSASGSLYIYRLPDLSKPVYVAEG 833
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+S Y+ L + S+ +GT KE + + V +LA R H+
Sbjct: 834 LS--------YIPPGLS-----ADYSARKGTA---KETVAEILVADLA-NRSQLRHAN-- 874
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
D TI YQ + + +TS D S++L +L N F+++P +A
Sbjct: 875 -----DDLTI--YQPFRY----STSAGAD---FSKTLFFQ-----KLPNAAFAKSPEEAD 915
Query: 901 TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTV 959
E T H + NI+G+ FL G+ P + + + RV P L ++A +
Sbjct: 916 EDEAT-HQPRMLSMRRCSNIAGYSTVFLPGASPSFIIKSSKSAPRVLP-LQGAGVIAMSP 973
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL 1019
H C +GFIY SQ + ++ QLP Y + IP+ + Y Y
Sbjct: 974 FHTEGCENGFIYADSQHMARVTQLPQDWNYAETGLAVRKIPIGEDIAAVAYHPPMQSY-- 1031
Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1079
+V L+P L D + + NLS TV+ ++++ P W
Sbjct: 1032 VVGCNTLEPFE----LPKDDDYHKEWARENLSF-----KPTVDRGILKLVSPIT----WT 1078
Query: 1080 TRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1138
++ M+ E L V ++L + T E + L+A+GTA ++GED+ RGRV ++
Sbjct: 1079 VVDSVQMEPCETVLCVATLSLEVSEFTNERKQLIAVGTALIKGEDLPTRGRVYVYDITEV 1138
Query: 1139 ADNPQNLVTEVYSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTE 1185
P T SK+LK GA++AL+ + QG +L+A G K ++ K GT
Sbjct: 1139 IPEPGRPET---SKKLKLIAKEEIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTL 1195
Query: 1186 LNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
L +AF D YV + + LL D K ++F + E+ ++ L K L+
Sbjct: 1196 LP-VAFMDMN-CYVTNAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKSSTKLEVL 1253
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRL 1301
+FL DG L +V D NI I + P+ +S +G LL R F+ GA H TK L L
Sbjct: 1254 NADFLPDGKDLFIVACDADGNIHILEFDPEHPKSLQGHLLLHRTTFNTGANHPTKSLLL 1312
>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 2479]
Length = 1358
Score = 235 bits (600), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 359/1467 (24%), Positives = 599/1467 (40%), Gaps = 237/1467 (16%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---------G 95
E+P + +G NLVV + ++ +R EE + + ++ MD
Sbjct: 39 EVPDVKVVG---NLVVAGGQDLRVFEIR--EESTPLPDDESAVPKQEDMDVGDSFFDSAP 93
Query: 96 ISAAS--------LELVCHYRLHGNVESLAILSQGGADNS-RRRDSIILAFEDAKISVLE 146
I A L L+ + LHG V LA L D+S D ++++FE AK S +
Sbjct: 94 IERAPVRYKTTRRLHLLTRHTLHGVVTGLAGLRT--IDSSVDGLDRLLVSFEHAKWSRGD 151
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+ S+H +E + + + + + P+++ DP R + + + +L
Sbjct: 152 -------IATVSLHTYERCQQM-INGNFQGYV--PMLRSDPLSRLAILTLPEDALAVLPI 201
Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
Q S L +D+ S ++K++KDF+F+ G+ P + +L
Sbjct: 202 VQEQSELDAMQDSVSSP------------------EIKNIKDFLFLPGFHSPTIALLFAP 243
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
TWAGR T + +I T+ +PLI S LP D+ L+A PS +GGV+VV
Sbjct: 244 MNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITSVTGLPSDSQYLVACPSEVGGVVVVT 303
Query: 326 ANTIHYHSQSAS-CALALN---NYAVSL--DSSQELPRSSFSVELDAAHATWLQNDVALL 379
A+ I + QS + ++N NY ++ DSS E S + LD +HA ++ + LL
Sbjct: 304 ASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE----SQKLALDNSHAQFVTENDMLL 359
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-SDITTIGNSLFFLGSRLGDSLLVQFT 438
+TG++ + DGR V + + + + +V S + G+ F+GS GDSLL
Sbjct: 360 VLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPSTLVPAGSDGIFVGSVEGDSLLAMVE 419
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE---ELSLYGSASNNTE 495
S +EE P TK+ D +++ G + S
Sbjct: 420 KARDQSA-----QEE--------PETKQQEMDVDDWDEEVATGPVTVSVKAQDVLSGIGR 466
Query: 496 SAQKTFSFAVRD-------SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
A F AV D LV IG S G +N I+K+ +E +L
Sbjct: 467 IADMEFGIAVTDLGTRTYPQLVCIG---GGSQGSTMNVFRRGIPITKRRLFE--QLRTAV 521
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
W + + + NA + ++ + + E T + + +V E + F +
Sbjct: 522 ATWFLPVERA---NAPKFKDIPESEQSTIAIAATQEGSTQIFALS--TRKVQERIAEFPE 576
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES-GSGSENS---TVLSV 664
IA G R R++ V +LD SN+ G+ E S +++
Sbjct: 577 P-AIATGTWLRRTRIVLVLPSQVLLLD------------SNANPVGTICEMSDAPPIVAA 623
Query: 665 SIADPYVLLGMSDGSIRLLVGD-----------PSTCTVSVQTPAAI------------- 700
SIADPYVL+ +DGS+ + VGD P + V A +
Sbjct: 624 SIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMPEGLALPVCQAAEVFTDTTGIYRTFEA 683
Query: 701 -----ESSKKPVSSC----TLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
E KPV + H G E R + +S V + +G
Sbjct: 684 TQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQDSKPAISADVATTESAFNAA---RGT 739
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
+ + +SG L+I +P+F+ V + + D+ + D +T EEG
Sbjct: 740 QWIALLAQSGELQIRSLPDFDLVLQSNG-------VYDS--EPSFTDDQTGELPELEEG- 789
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ + M + + RP + + G + Y+A P T + D
Sbjct: 790 -----DEVSQMLFCPIGTRTL-----RPHVIVLHRSGRLNIYEAQ----PRFTVDARD-- 833
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLS 929
+ RSL+V R R + T L + T T P P F +I G G F++
Sbjct: 834 QSRRSLAV------RFRKVH---TQLLSVTPSSTVKPAAIP------FTDIEGLTGAFIT 878
Query: 930 GSRPCWCMVFRERLRVHPQLCDG-SIVAFTVLHNVN-CNHGFIYVTSQGILKICQLPSGS 987
G RP W + HP G A+ + HG ++ + IC +P
Sbjct: 879 GERPHWIISSDS----HPIRAFGLKQAAYAFCKTTHQGGHGEYFLRIEDGSFICYMPPTL 934
Query: 988 TYDNYWPVQKVIPLKATPHQITYFAEKNLY--PLIVSVPVLKPLNQVLSLLIDQEVGHQI 1045
D P + ++ T + + Y +SVP + ++ +L+ E +
Sbjct: 935 NTDFAMPCDR-YKMERTYTHVAFDPPSCHYVAAAAMSVP-FQAYDEEGEILLGPEGPDLL 992
Query: 1046 DNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTT 1104
N SS++L ++ R+L+ +E L V VTL +++
Sbjct: 993 PPKNERSSIEL---FSAGSEPFRVLD------------GYDFDQNEEVLCVESVTLESSS 1037
Query: 1105 TKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP---QNLVTEVYSKE-LKGAIS 1159
+ +A+GT GED A G V +F N + K+ + +S
Sbjct: 1038 SPTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEVVGTKPGVSNWRLKYRCKDPTRNPVS 1097
Query: 1160 ALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
A+A++ G+++ ++GPKI+ L G+AF D +YV S+ + KN IL+GD KS+
Sbjct: 1098 AIANINGYIVHSNGPKILAKGLDYDDRLMGLAFLDVS-MYVTSIRVFKNLILVGDFVKSL 1156
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
F S +E + + +D L A +FL+ ++ + +D+ N+++ + P +S
Sbjct: 1157 IFASLQENPYKFVTIGRDLADLSLTAADFLVHEGQVTFITNDQHGNMRLVDFDPANPDSL 1216
Query: 1279 KGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1338
G+KLL++ EF G VT + T+ + AP S L++ T DG+I +
Sbjct: 1217 NGEKLLTQTEFGTGCPVTASCMIARRKTAEEE--FAPQSQ------LIYATADGAITSVV 1268
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
+ E F+RLQ +Q +LV + HVAGLNPR+FR N RP ++D LL+H+ +
Sbjct: 1269 AVKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVR-NDLVPRPLARGVLDGGLLAHFAL 1327
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNLNDL 1425
PL Q E+ Q GT + S+L L
Sbjct: 1328 QPLRRQREMMRQIGTDAVTVGSDLYTL 1354
>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
Length = 1547
Score = 231 bits (588), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 323/1339 (24%), Positives = 527/1339 (39%), Gaps = 208/1339 (15%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR------------DSIILAFEDAK 141
D + L LV + L G V LA + A+ + DS+++AF DA+
Sbjct: 93 DRANTTKLVLVAEFPLAGTVTGLARIRTPKANRNHDGGAGHAGHAGHGCDSLLIAFRDAR 152
Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVL 195
+S++E+D H L S+H +E E + S PL + DP RC +
Sbjct: 153 LSLVEWDAEQHTLSTISIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALK 206
Query: 196 VYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN----------------- 237
+ IL Q + +GD D G + S+ V+N
Sbjct: 207 FGARNLAILPFRQADEDIDMGDWDEELDGPRPAKDPSSNAVVNGASNIEDTPYSPSFVLR 266
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L +LD + H F+H Y EP IL H M+ L + K
Sbjct: 267 LSNLDPSLLHPVHLAFLHEYREPTFGILASATAPSNALGRKDHLVYMVFTLDLQQ--KAS 324
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQE 354
I S LP D ++++ +P+P+GG L+VG+N IH +A+N +
Sbjct: 325 TTILSVSGLPQDLFRVVPLPAPVGGALLVGSNELIHVDQSGKPNGVAVNPMTRQCTNFGL 384
Query: 355 LPRSSFSVELDAAHATWLQNDVALLST--KTGDLVLLTVVYDGRVVQRLDLSKTNPSV-- 410
+ +S ++ L+ L D+ L G ++T DGR V L++ S
Sbjct: 385 VDQSDLNLRLEGCAIDVLTPDLGELFVVLNDGRAAVVTFRIDGRTVSGLEIKMLPESAGG 444
Query: 411 -----LTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGLKEEFGDIEADAP 462
S ++ IG + F G GDSLL+ + +G L + GD++A+
Sbjct: 445 SLIPGRVSTLSRIGRNAVFAGREEGDSLLLGWAKRQAQTGRRRLRARDAAGSGDVDAEG- 503
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-------------FSFAVRDSL 509
L D + + + +E ESA + SF V D L
Sbjct: 504 --AELAEGDEDVVAEGEDEDEDEEDEDDLYGEESAPRQQPVSAASSFLSGDVSFRVHDRL 561
Query: 510 VNIGPLKDFSY----------------GLRINADASAT-GISKQSNYELV---------- 542
+++ P++ +Y G+R + + T G K + V
Sbjct: 562 LSVAPIQALTYSQPVYLAGSEEERNSAGVRSDLNLVCTVGRDKSAALATVNLAIQPRVIG 621
Query: 543 --ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYD--DEYHAYLIIS------LEART 587
E P +G WTV KS +G A +S YD +Y ++I++ E
Sbjct: 622 RFEFPEARGFWTVCAKKPVPKSLQGDKAGNSLSKDYDTAGQYDRFMIVAKVDLDGYEKSD 681
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+ TA + + G TI AG + R+IQ+ + R DG + + P
Sbjct: 682 VYALTAAGFEGLGGTEFDPAAGITIEAGTMGKGSRIIQILKSEVRCYDGDFGLSQIV--P 739
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
E +G+E V S SI DP++L+ D S + D S + +S K +
Sbjct: 740 MLDEE-TGAEPRAV-SASIVDPFLLIIRDDSSAFIAQVDSSNELEELDKEDPTLASTKWL 797
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
+ C LY D +T A+ G+ GG L Q + + SGAL I+
Sbjct: 798 TGC-LYAD----------TTGAFAEEAPGK------GGKLSQ-SVLMFLLSASGALHIYR 839
Query: 768 VPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVEL 827
+P+ + V + +S Y+ L + S+ +GT KE I + V +L
Sbjct: 840 LPDLSKPVYVAEGLS--------YIPPGLS-----ADYSARKGTA---KETIAEILVADL 883
Query: 828 AMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRL 887
H P L T+ + YQ + + NT + S++L +L
Sbjct: 884 G----DMTHKSPHLILRHTNDDLTLYQPFRY----NTGAG---LEFSKTLFF-----QKL 927
Query: 888 RNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
N F+++P +A E T H + N+ G+ FL G+ P + + + +
Sbjct: 928 PNTVFAKSPEEADDDEAT-HQPRFLSMRRCANVGGYSTVFLPGASPSFIIKSSKSVPKVL 986
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
L ++A + H C HGFIY S+ + ++ QLP +Y + IP+
Sbjct: 987 PLQGTGVIAMSPFHTEGCEHGFIYADSRDMARVAQLPQDWSYAELGLAVRKIPIGEDIAA 1046
Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVR 1067
Y Y + + P + L D + + NL+ TV+ ++
Sbjct: 1047 AAYHPPMQSYVVGCNTP------EPFELPKDDDYHKEWARENLAF-----KPTVDRGNLK 1095
Query: 1068 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1126
++ P W +I M+ E L V + L + T E + L+A+GTA +GED+
Sbjct: 1096 LVSPIT----WTVVDSIQMEPCETVLCVECLGLEVSEFTNERKQLIAVGTAITKGEDLPT 1151
Query: 1127 RGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISALASL--QGHLLIASGPK 1175
RGRV ++ P T SK+LK GA++AL+ + QG +L+A G K
Sbjct: 1152 RGRVYVYDIADVIPQPGRPET---SKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQK 1208
Query: 1176 IILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLN 1231
++ K G+ L +AF D YV + + L+ D K ++F + E+ ++
Sbjct: 1209 CMVRGLKEDGSLLP-VAFMDM-SCYVTAAKELPGTGLCLMADAFKGVWFTGYTEEPYKMM 1266
Query: 1232 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1291
L K L+ +FL DG L +VVSD +I I + P+ +S +G LL R F+
Sbjct: 1267 LFGKSATRLEVLNADFLPDGKELFIVVSDADGHIHILQFDPEHPKSLQGHLLLHRTTFNT 1326
Query: 1292 GAHVTKFLRLQMLATSSDR 1310
GAH L + T +D+
Sbjct: 1327 GAHQPTKSLLLPVTTPADQ 1345
Score = 48.1 bits (113), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 49/106 (46%), Gaps = 18/106 (16%)
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR------------- 1371
L+ G + + L E +RRL SL +L S+PH AGLNPR +R
Sbjct: 1418 LVLAAPTGVLAALRALPESAYRRLSSLAAQLAGSLPHAAGLNPRGYRLPDGVASSSSPWS 1477
Query: 1372 QFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1412
S+ A PG D +IVD LL + L + ++E+A + G
Sbjct: 1478 SSSSSFSAVVPGVDAGVGRTIVDGALLQRFTELGMARRVELAGRAG 1523
>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Piriformospora indica DSM 11827]
Length = 1324
Score = 229 bits (585), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 330/1445 (22%), Positives = 598/1445 (41%), Gaps = 235/1445 (16%)
Query: 55 VPNLVVTAANVIEIYVVR---VQEEGSKES--KNSGETKRRVLMDGISAASLELVCHYRL 109
V NLVV N + IY VR EE ES K+SG ++ + L LV + L
Sbjct: 36 VTNLVVGRNNRLRIYDVRRTIYTEETHVESDLKSSGPSRH--------SHRLCLVREHLL 87
Query: 110 HGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
HG + LA + D ++++F+D+K++++E+ ++++ + S+H +E L
Sbjct: 88 HGIIIGLAAVRTANPGLGSP-DRLLVSFQDSKLALMEWSNTLYDISTVSIHSYERSPLLL 146
Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSA 228
E A ++ DP RC +++ + +L Q + L V D G +
Sbjct: 147 NSDFTECRA---YLRTDPANRCAALVMPRDNIALLPWYQPQTELDVQD--------GIQS 195
Query: 229 RIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
E S+V N+ +D ++++ D +F+ G+ P + IL + + TW GR+
Sbjct: 196 IAEELPYSPSYVTNVSAMDERIRNILDLVFLPGFNVPTIAILFQEQRTWTGRLKENKDNT 255
Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCAL 340
+ +S+ + + +I + LP+D+ + + +GGVLVV AN+I H S L
Sbjct: 256 SLFFISLDLVSRSYQVIATIEKLPYDSLYMSPCHAKLGGVLVVTANSILHVDQASKITTL 315
Query: 341 ALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
++ +A + D+S + + L+ + ++ + +LS G + + + ++GR V
Sbjct: 316 PMSGWAARVSDTSHGFQDAVDDIHLEGSRMGYISDSQVILSLSNGKCLHIRIDHEGRTVW 375
Query: 400 RLDLSKT-----NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF 454
L T PSVL + + L FLGS GDS+L ++
Sbjct: 376 GLTAVHTFGISSPPSVLIAK-----DGLVFLGSTAGDSVLFEYA---------------- 414
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
QD+ + + L N +E+ +FS D+L + G
Sbjct: 415 ---------------------QDLSSHRDFML----PNASETIPTSFSLLPVDNLQDSGS 449
Query: 515 LKDFSY-GLRINADAS---ATGISKQSNYELV----------ELP---GCKGIWTV-YHK 556
S+ GLR + + + A G+ + V +LP G +GIW++ H+
Sbjct: 450 YTAASFFGLRGSEEPALIAANGLDDLGGFSTVHKTMPLRLRKKLPAIAGRQGIWSMRVHQ 509
Query: 557 SS-----RGHNA----------DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
+ GHN +SR+A + A L I++ R +L
Sbjct: 510 GNGIELPLGHNTLLSTDATPTPGASRIAT---KSQARLDINITTRIPML----------- 555
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 661
TIA F ++QV R+L T D S + + + + +
Sbjct: 556 ---------TIAVAPFFDGTHLLQVTSNSLRLL-----TTDGSEKQVIPDRDNSTARARI 601
Query: 662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-----K 716
+I DPYVL+ D ++ L VG+P+ + + + + K + T Y D K
Sbjct: 602 RHAAICDPYVLILREDDTLGLFVGEPTRGKLRRKDMSPLGDKKLCYWAATFYDDLTGRLK 661
Query: 717 GPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT 776
E +R+T VG ++G+ + +C +G LEI+ +P VF
Sbjct: 662 IDEDLMRETK-------AVG-----------NRGEKWLALCRSTGTLEIWSLPKLALVF- 702
Query: 777 VDKFVSGRTHIVDTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
V G + LK D + E++S+++ G + + + +L S H
Sbjct: 703 VSSISLGPS---------VLKHDQKKEVDSATKTELPVG-ATTLQQVIITDLGEIEPSPH 752
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVS-ASRLRNLRFSR 894
L + ++ YQ P K+ P RS+ +S R+ + +
Sbjct: 753 -----LIVLYDSNLLIVYQMV----PLEPDKAGLPQLDRRSVPSLRISFVKRMVHHLANP 803
Query: 895 TPLDAYTREETPHGAPCQRITIFKNISGH----QGFFLSGSRPCWCMVFRERLRVHPQLC 950
TP + T + + I F + G F++G P W + +H
Sbjct: 804 TPDENQTSGGSNEKRLPKTIVPFSVLDWEGNSIYGAFVTGDNPAWILSKNHSGLLHLPCG 863
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
++ +FT + + F+ T +G + Q G T+ +P K + T
Sbjct: 864 YEAVHSFTPCSMWDFSPTFLMSTEEGSC-LVQWTPGITFHGQYPCSKTRKGRTQ----TN 918
Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGH--QIDNHNLSSVDLHRTYTVEEYEVRI 1068
A N L+V+ N LL D+E + + D N+S L + + +
Sbjct: 919 IAYSNTTGLLVAASS----NDRDFLLFDEEGTNSWEPDGVNVSLPKLGAS------ALEL 968
Query: 1069 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAAR 1127
L+P+ W T ++E V V L +T+ N+ +A+GT+ +GED+A R
Sbjct: 969 LDPET----WVTIDGYEFAANEVVNIVESVKLETLSTQTGNKEFIAVGTSIHRGEDLAVR 1024
Query: 1128 GRVLLFSTGRNADNPQ------NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
G +F + + + + + E KG ++A+ + G+L+ + G KI + +
Sbjct: 1025 GGTYIFEIAEVIQDTEERGRRRHRLKLLCKDEAKGPVTAVCGMNGYLVSSMGQKIFVRAF 1084
Query: 1182 TGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1240
E L G+AF DA +YV S+ +KN +++ D K ++F++++E +L +L+K+
Sbjct: 1085 DLDERLVGVAFLDAG-VYVTSIRCLKNLLVITDAIKGVWFVAFQEDPFKLVILSKEVRPT 1143
Query: 1241 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1300
+F + + L+ D + +++ Y P ++ +G +LL EF +R
Sbjct: 1144 SIPQGDFFFAHNDMELLTIDLRGVLRLHSYDPTHVDTEEGARLLCSVEFQTHVEPVTIVR 1203
Query: 1301 LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1360
+ M SS ++ LL +DGS+ ++PLD F+RL LQ +LV
Sbjct: 1204 VAMEQPSS--------DSASDASRLLIPRVDGSLASLSPLDMDIFKRLYLLQAQLVRHTH 1255
Query: 1361 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1420
H+A LNP+++R + R ++D LL ++ L + Q IA+Q G T ++
Sbjct: 1256 HIAALNPKAYRAVQGSSTT-RTMSRRMLDFGLLVGFKKLSFDRQQGIANQIGETWETLIR 1314
Query: 1421 NLNDL 1425
+ L
Sbjct: 1315 DCTQL 1319
>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 8904]
Length = 1339
Score = 229 bits (585), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 315/1252 (25%), Positives = 519/1252 (41%), Gaps = 184/1252 (14%)
Query: 242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWS 300
++K++KDF+F+ G+ P + +L TWAGR T + +I T+ +PLI S
Sbjct: 200 EIKNIKDFLFLPGFHSPTIALLFAPMNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITS 259
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALN---NYAVSL--DSSQE 354
LP D+ L+A PS +GGV+VV A+ I + QS + ++N NY ++ DSS E
Sbjct: 260 VTGLPSDSQYLVACPSEVGGVVVVTASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE 319
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-S 413
S + LD +HA ++ + LL +TG++ + DGR V + + + + +V S
Sbjct: 320 ----SQKLALDNSHAQFVTENDMLLVLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPS 375
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ G+ F+GS GDSLL S +EE P TK+ D
Sbjct: 376 TLVPAGSDGIFVGSVEGDSLLAMVEKARDQSA-----QEE--------PETKQQEMDVDD 422
Query: 474 ALQDMVNGE---ELSLYGSASNNTESAQKTFSFAVRD-------SLVNIGPLKDFSYGLR 523
+++ G + S A F AV D LV IG S G
Sbjct: 423 WDEEVATGPVTVSVKAQDVLSGIGRIADMEFGIAVTDLGTRTYPQLVCIG---GGSQGST 479
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
+N I+K+ +E +L W + + + NA + ++ + +
Sbjct: 480 MNVFRRGIPITKRRLFE--QLRTAVATWFLPVERA---NAPKFKDIPESEQSTIAIAATQ 534
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
E T + + +V E + F + IA G R R++ V +LD
Sbjct: 535 EGSTQIFALS--TRKVQERIAEFPEP-AIATGTWLRRTRIVLVLPSQVLLLD-------- 583
Query: 644 SFGPSNSES-GSGSENS---TVLSVSIADPYVLLGMSDGSIRLLVGD-----------PS 688
SN+ G+ E S +++ SIADPYVL+ +DGS+ + VGD P
Sbjct: 584 ----SNANPVGTICEMSDAPPIVAASIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMPE 639
Query: 689 TCTVSVQTPAAI------------------ESSKKPVSSC----TLYHDKGPEPWLRKTS 726
+ V A + E KPV + H G E R
Sbjct: 640 GLALPVCQAAEVFTDTTGIYRTFEATQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQD 698
Query: 727 TDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH 786
+ +S V + +G + + +SG L+I +P+F+ V +
Sbjct: 699 SKPAISADVATTESAFNAA---RGTQWIALLAQSGELQIRSLPDFDLVLQSNG------- 748
Query: 787 IVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILT 846
+ D+ + D +T EEG + + M + + RP + +
Sbjct: 749 VYDS--EPSFTDDQTGELPELEEG------DEVSQMLFCPIGTRTL-----RPHVIVLHR 795
Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET- 905
G + Y+A P T + D + RSL+V R R + T L + T T
Sbjct: 796 SGRLNIYEAQ----PRFTVDARD--QSRRSLAV------RFRKVH---TQLLSVTPSSTV 840
Query: 906 -PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG-SIVAFTVLHNV 963
P P F +I G G F++G RP W + HP G A+
Sbjct: 841 KPAAIP------FTDIEGLTGAFITGERPHWIISSDS----HPIRAFGLKQAAYAFCKTT 890
Query: 964 N-CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY--PLI 1020
+ HG ++ + IC +P D P + ++ T + + Y
Sbjct: 891 HQGGHGEYFLRIEDGSFICYMPPTLNTDFAMPCDRY-KMERTYTHVAFDPPSCHYVAAAA 949
Query: 1021 VSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1079
+SVP + ++ +L+ E + N SS++L ++ R+L+
Sbjct: 950 MSVP-FQAYDEEGEILLGPEGPDLLPPKNERSSIEL---FSAGSEPFRVLD--------- 996
Query: 1080 TRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1138
+E L V VTL ++++ +A+GT GED A G V +F
Sbjct: 997 ---GYDFDQNEEVLCVESVTLESSSSPTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEV 1053
Query: 1139 ADNP---QNLVTEVYSKE-LKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYD 1193
N + K+ + +SA+A++ G+++ ++GPKI+ L G+AF D
Sbjct: 1054 VGTKPGVSNWRLKYRCKDPTRNPVSAIANINGYIVHSNGPKILAKGLDYDDRLMGLAFLD 1113
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
+YV S+ + KN IL+GD KS+ F S +E + + +D L A +FL+
Sbjct: 1114 VS-MYVTSIRVFKNLILVGDFVKSLIFASLQENPYKFVTIGRDLADLSLTAADFLVHEGQ 1172
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
++ + +D+ N+++ + P +S G+KLL++ EF G VT + T+ +
Sbjct: 1173 VTFITNDQHGNMRLVDFDPANPDSLNGEKLLTQTEFGTGCPVTASCMIARRKTAEEE--F 1230
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
AP S L++ T DG+I + + E F+RLQ +Q +LV + HVAGLNPR+FR
Sbjct: 1231 APQSQ------LIYATADGAITSVVAVKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTV 1284
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
N RP ++D LL+H+ + PL Q E+ Q GT + S+L L
Sbjct: 1285 R-NDLVPRPLARGVLDGGLLAHFALQPLRRQREMMRQIGTDAVTVGSDLYTL 1335
>gi|58702050|gb|AAH90169.1| LOC564406 protein, partial [Danio rerio]
Length = 416
Score = 229 bits (584), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/342 (37%), Positives = 203/342 (59%), Gaps = 14/342 (4%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
++LN+ + P+ + LD + A+++ +D ++S K G++ +LT++ DG R
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMR 355
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 356 SVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT 397
>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
[Phytophthora sojae]
Length = 1752
Score = 229 bits (583), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 171/601 (28%), Positives = 288/601 (47%), Gaps = 94/601 (15%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNH 967
+T F N++ G F G+ P W + R + P +C + +++FT H+ NC +
Sbjct: 1159 LTTFYNVNNMSGAFFRGAHPMWILGDRGQPTFIP-MCSAAPKVSVPVLSFTPFHHWNCPN 1217
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWP-----VQKVIPLKATPHQITYFA---------- 1012
GFIY S+G L++C+LPS T P V + AT H + Y
Sbjct: 1218 GFIYFHSRGALRVCELPSSKT-STILPSSGGFVLQKAEFGATLHHMLYLGNHGPGGVSEA 1276
Query: 1013 -EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS------------------- 1052
E Y ++ SV +KP + + + ++ + + NL +
Sbjct: 1277 LEAPTYAVVCSVK-MKPTDAERATEV-EDADEEKEPENLDANGNPVGSNVMAPTAEMFPD 1334
Query: 1053 --VDLHRTYTVEEYEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT---- 1104
+D E YE+R+++ + G W R + + E L+V+++ L++++
Sbjct: 1335 FEIDQMAHTEEEVYELRLVQTNEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKE 1393
Query: 1105 ---------TKENETLLAIGTAYV--QGEDVAARGRVLLFS---------TGRNADNPQN 1144
K+ L IGT +V GED + RGR+LL+ G + +
Sbjct: 1394 EVASTSAEWNKKKRPYLVIGTGWVGPHGEDESGRGRLLLYELDYAQYVDEEGGSTSSKLP 1453
Query: 1145 LVTEVYSKELK-GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1203
+ V+ KE + GAIS++ L ++L A G K+I++++ +L G AFYDA +++V+LN
Sbjct: 1454 KLRLVFIKEHRQGAISSVVQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MFIVTLN 1512
Query: 1204 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1263
+VK+F++ GD++KS++FL W+E QL LLAKD+ L ATEF + L+L+ D +
Sbjct: 1513 VVKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDE 1572
Query: 1264 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-----SDRTGAAPGSD 1318
N+ + +AP+ ES GQ+LL ++FH+G V R ++ + R AP S
Sbjct: 1573 NLHVMQFAPQDIESRGGQRLLRVSDFHLGVQVASMFRKRVDGPGGHVAVNGRGPRAPPSY 1632
Query: 1319 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
N + G +G +G + P+ E FRRL +LQ +V+++P LNPR FR +N +
Sbjct: 1633 YVN----VMGNSEGGVGALIPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQ 1688
Query: 1379 AHRPGPDS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1429
PD+ +D +L + L Q E+A GTT ++ NL ++ T
Sbjct: 1689 RRCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVIHNLLEVQHAT 1748
Query: 1430 S 1430
+
Sbjct: 1749 A 1749
Score = 159 bits (403), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 159/628 (25%), Positives = 262/628 (41%), Gaps = 143/628 (22%)
Query: 235 VINLRDLD-MKHVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+L+ M V D F+ GY+EP +++LHE + + GR++ T I+ +SI+
Sbjct: 277 LLRLRELEIMGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 336
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
+ HP IW+ NLP D +KL +P+GGV+V+ AN Y +Q+ LA N +A
Sbjct: 337 TRLHPKIWTVKNLPSDCFKLFPCRAPLGGVVVLSANAFLYFNQTQFHGLATNVFASKTVN 396
Query: 350 -------DSSQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
D+ E P + + L +L LL+ GD +L++ Y+ +
Sbjct: 397 QSVFPLSDAVYETPDHEMAQLHIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDTSSR 456
Query: 400 RL----DLSKTNPSVLTSDITTIG-----------NSLFFLGSRLGDSLLVQFTCGSGTS 444
L S + + L+ + G F+GSR GDS+L TS
Sbjct: 457 GLYGFGGASSSRNASLSLRMLRSGIQAHCLCVNEEKKTLFVGSRSGDSVLYALDQKKLTS 516
Query: 445 MLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA--------SN 492
K EE E + A ++ + ++L LYG+A S
Sbjct: 517 AGGEASKQQEDEEMLIKEEVVKEEVTAEVKAEPAEEEEEDEDDLFLYGAAPTKEEPTTSG 576
Query: 493 NTESAQKTFSFAVR----------------------DSLVNIGPLKDFSYGLRINADAS- 529
+TE+ T AV+ D L +IG + G+ NAD++
Sbjct: 577 STEAVNGTNGSAVKKEENGHAVEEESGPYDYVLHQIDVLPSIGQITSIELGIENNADSNE 636
Query: 530 -------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
+ G + ++ EL GC+ +WTV + R
Sbjct: 637 KREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR--- 693
Query: 571 YDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R++Q+F++G
Sbjct: 694 ---SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIVQIFKQG 750
Query: 631 ARIL------------DGS----------------------YMTQDLSFGPSNSESGSGS 656
AR++ DG+ TQ+++ G
Sbjct: 751 ARVMMEVPDEETSNGNDGAEKTAKPEDEEVDDEDDGPKVKLVCTQEITLEGDVECGGMNV 810
Query: 657 ENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAIES 702
+ +TV +SV + DPY+LL ++DGS+RLL+GD ++V P ++
Sbjct: 811 DTATVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDMELTVIDPEIDYLDGVTESNGTADA 870
Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
SK SS L++D W +AW
Sbjct: 871 SKHGSSSACLFYD-----WAGMFRENAW 893
>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
Length = 1377
Score = 228 bits (581), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 326/1430 (22%), Positives = 585/1430 (40%), Gaps = 224/1430 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ + Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
++++C+ P + +D W +K I L + Y + Y + + V +
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARK-IGLGEQVDAVEYSSSSETYVIGTNQKV------DFN 1007
Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
L D E+ + N +S + +++ V++L P W + ++++E +
Sbjct: 1008 LPEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMC 1058
Query: 1095 VRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTE 1148
V+ + L + T E + + +GTA +GED+AARG + +F + D P+ +
Sbjct: 1059 VKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIKVVPEVDRPETNRKLKL 1118
Query: 1149 VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNI 1204
+ +E+KGA+++L+ + QG L+ A G K I+ K G+ L +AF D YV L
Sbjct: 1119 IAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKE 1176
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
+K G I ++ A++ + + D +FL D + L ++V+D+
Sbjct: 1177 LKG----GAITNCLF-------SARMTVPSSD-------DADFLPDENRLYILVADDD-- 1216
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
S KG +LL R+ F G + L ATSS + G D + +
Sbjct: 1217 ---------YPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDLDS 1266
Query: 1325 ------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+L + GSI I P+ E ++RRL +LQ +L +++ H GLNPR+FR S+G
Sbjct: 1267 SGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDGI 1326
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
R +VD +L+ + L + + EIA++ G +I ++L + G
Sbjct: 1327 GGR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1372
>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
Length = 1431
Score = 228 bits (581), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 327/1387 (23%), Positives = 550/1387 (39%), Gaps = 181/1387 (13%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
R ++ ES G V D L L+ L G V LA + + +
Sbjct: 70 RANDDDGLESSFLGVESLIVRADPSHNTKLVLISEIPLAGTVIGLARVKI--KNTPSGGE 127
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++LA++ AK+ + E+D H L TS+H +E E L+ G V + DP
Sbjct: 128 ALLLAYKAAKMCLTEWDPQRHTLETTSIHYYEKDE---LQGAPWEMPFGDYVNYLEADPG 184
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
RC + IL +Q L D ED T G G G +
Sbjct: 185 SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPCPVKEEPPTTNGDGPGDHDLVKS 244
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
R S V+ L LD + H F+H Y EP IL + H T + L
Sbjct: 245 RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSPALGIKDHLTYKVFTL 304
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
+ + I S LP D ++++A+P+P+GG L+VG N IH +A+N+
Sbjct: 305 DLQQ--RASTTILSVTGLPQDLFRVIALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
A + S + +S + L+ L ND+ LL G L ++ DGR V ++ +
Sbjct: 363 AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIICFHIDGRTVSKISI 422
Query: 404 ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
++ +++ S ++ I G++ FLGS DS+++ ++ G K +
Sbjct: 423 RLVSAECGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476
Query: 457 IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
+ + D D + G + SL S + N S SF V+D+L++I P+
Sbjct: 477 LLDPDLALDVDDLDLDDDEDDDLYGNDSSLAKPSQTINGSSKPGEVSFRVQDTLLSIAPI 536
Query: 516 KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
+D + G D+ +SK EL + P
Sbjct: 537 RDVACGAPAFVPDSEEATLSKGVTAELELACAVGRGFSGSVAILNREIQPKVIGRFDFPE 596
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
+G WT+ K A + +Y Y+I++ E + TA +
Sbjct: 597 ARGFWTMCVKKPLSKGAAVASDYDTTAQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
++ G T+ AG + + R+IQV + R DG ++Q L P E +
Sbjct: 657 DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
T S SI DPY+LL D SI + + V P S K S C LY+D
Sbjct: 714 T--SASIVDPYLLLNRDDSSIFIAQIHSNNELEEVFKPDGTLKSTKWASGC-LYND---- 766
Query: 720 PWLRKTSTDAWLSTGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD 778
T + V + D AD I + +GAL ++ +P+ V
Sbjct: 767 -------TQGIFQSNVNKQKADAAD-------RIMMFLLSSAGALHVYALPD------VS 806
Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR 838
K + ++ EAL ++++ G KE+I + V +L A
Sbjct: 807 KPI---------FVAEALTSIPPFLSAAFVARKG-ASKESITEILVADLG----DAISQT 852
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P+L + Y+ P + D ++ L V+ S + P
Sbjct: 853 PYLIVRHASDDLTIYE------PVRCQEEGDAELSASLLFKKCVNTSLAKT-----APEV 901
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
+ E P P +R N++G+ FL G+ P + + L + +
Sbjct: 902 SEDDAEPPRFVPLRRCA---NVNGYGAVFLPGASPSFVLKSSHSEPRVIGLQGLGVRGMS 958
Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYP 1018
H C+ GFIYV +GI ++ QLPS +++ + K I L I+Y Y
Sbjct: 959 TFHTEGCDRGFIYVDVEGIARVTQLPSNASFTDLGVSVKKIALDGDVGMISYHHPTGTY- 1017
Query: 1019 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGP 1077
+V+ L+P E+ D H + + T ++++ P
Sbjct: 1018 -VVACTKLEPF----------ELPRDDDYHKEWAKETIKFPPTTARGILKLINPVT---- 1062
Query: 1078 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
W + ++ E+ +++ + L + TKE + L+A+GTA +GED+ RGRV +F
Sbjct: 1063 WTVIHELELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQVFDIV 1122
Query: 1137 RNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTEL 1186
P N ++ +KE +G ++AL+ + QG +L+A G K ++ K G+ L
Sbjct: 1123 TVIPEPGRPETNKRLKLIAKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKEDGSLL 1182
Query: 1187 NGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1244
+AF D +V S+ + ++ D+ K ++F + E+ +L K G L
Sbjct: 1183 P-VAFLDM-NCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGKSSGKLPLLV 1240
Query: 1245 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1304
+FL DG LS+V D + ++ I + P+ +S +G LL R F VT L
Sbjct: 1241 ADFLPDGEDLSMVAVDAEGDMHILEFNPEHPKSLQGHLLLHRTSF----AVTPNTPTSTL 1296
Query: 1305 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1364
+ + P + ++ LL G + ++PL E T+RRL S+ +L ++ G
Sbjct: 1297 LLPRTHSPSYPHASSSSHM-LLLACPSGQVAALSPLAESTYRRLLSVTNQLHPAIVAHCG 1355
Query: 1365 LNPRSFR 1371
L+ ++ R
Sbjct: 1356 LHTKAHR 1362
>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Hydra magnipapillata]
Length = 1122
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 193/664 (29%), Positives = 305/664 (45%), Gaps = 124/664 (18%)
Query: 57 NLVVTAANVIEIYVV----RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
NLV + +Y + V +G + SK ++D + LEL+ + L GN
Sbjct: 29 NLVTAGGQRLNVYRLCDADMVVSDGDQSSK---------IVDSVGKRRLELLASFTLFGN 79
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ ++ ++ G S RDS++LAF+ AK+S++EFD H L+ SMH FE+ E+ K
Sbjct: 80 IINMQVVRLG----SNVRDSLLLAFKHAKLSIVEFDPLSHDLKTDSMHYFENDEF---KG 132
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
G PLV+VDP+ RC +L+Y +++L + DE S G +
Sbjct: 133 GLSHNIYLPLVRVDPEQRCACMLIYNRHLVVLPFKHD---IKLDESEELSDGEHIKSVLP 189
Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I+L L+ + ++ + F+HGY +P ++ L E T GRV+ + T +SA+S++
Sbjct: 190 SYMIDLHSLEQPLLNITELQFLHGYHQPTLMFLFEPVQTSTGRVAVRQDTFCVSAISLNM 249
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
T K HP+IWS NLP D + L + PIGGVLV +N++ Y +QS + Y VSL+
Sbjct: 250 TEKVHPVIWSVTNLPFDCHMLRPIEKPIGGVLVFASNSLIYLNQS------IPPYGVSLN 303
Query: 351 SSQE----LP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
S E P + + L + + D +LS K G++ +L+++ DG R V+
Sbjct: 304 SITEGSTMFPLKIQEDVVITLAESSCDAIATDQFILSLKGGEIYVLSLLSDGLRTVRSFH 363
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
K SVL S + I + FLGSRLG+SLL+++T E D+
Sbjct: 364 FEKAAGSVLASCVCWIEHGFVFLGSRLGNSLLLRYT-------------------EKDSA 404
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
S +S ++ M G DSL+NIGP+ + G
Sbjct: 405 SIA--EKSKEAKVEKMYGGGVGGGIIVC----------------DSLLNIGPITKAALGE 446
Query: 523 RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
G S+Q + E+V ELPGC +WTV K
Sbjct: 447 PAFLSEEFFG-SRQIDLEMVCCSGYGKNGTLTVLQRSIRPQVVTTFELPGCVNMWTVCGK 505
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
SS+ + YH+YLI+S + TMVL+T +TE+ S + VQ TI A N
Sbjct: 506 SSKESV----------ENYHSYLILSRDDSTMVLKTGAEITELDNS-GFNVQQPTIFACN 554
Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
+ ++QV + +L+ + +S + + SI+DPYV++ S
Sbjct: 555 HLSNKYILQVCPQSIHLLEDTVQINSISL----------QDTIKITQCSISDPYVVMVDS 604
Query: 677 DGSI 680
G +
Sbjct: 605 TGQL 608
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 215/380 (56%), Gaps = 18/380 (4%)
Query: 1061 VEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTA 1117
+E + V ++ P W+T + +Q E+ ++V+ L + + L +GT
Sbjct: 753 IERFVVSLISP----TSWETVPNSRTVLQEFEHVTCMKVLLLHSELVDIGLKQYLVVGTT 808
Query: 1118 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIAS 1172
+ GED+A +GR+L+F P +T+ VY KE KG ++A+ + G+++ A
Sbjct: 809 FNYGEDLACKGRILIFDVLEVVPEPGQPLTKTKCKCVYDKEQKGPVTAICATSGYIIAAV 868
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
G KI K+ +L G+AF D+ ++ V+L ++N I+ DI +SI + ++ + L L
Sbjct: 869 GQKIYAFKYKDNDLVGVAFVDSQ-VFTVNLMAIRNVIVAADISRSISLVRFQVEHKSLAL 927
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1292
+++D +L+ + +EF IDGS + VVSD ++NI IF Y P+ ES+ G +LL +A+ ++G
Sbjct: 928 VSRDTKTLEAYTSEFFIDGSQVGFVVSDAERNIVIFSYQPEALESFGGHRLLQKADINIG 987
Query: 1293 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1352
+HV +R++++ D + S++ R ++ TLDGSIG + PL E FRRL LQ
Sbjct: 988 SHVNTMMRIKLI---QDEQSLSKSSEQ--RQLIILPTLDGSIGILFPLSEKPFRRLTMLQ 1042
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
KLVD +PH AGLNPR+FR + +I+D +LL Y L +E+ +IA + G
Sbjct: 1043 NKLVDCLPHKAGLNPRAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQERFDIAKKMG 1102
Query: 1413 TTRSQILSNLNDLALGTSFL 1432
TT QIL ++ D+ ++ L
Sbjct: 1103 TTSGQILDDMMDIERASNHL 1122
>gi|147799623|emb|CAN68460.1| hypothetical protein VITISV_027523 [Vitis vinifera]
Length = 558
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/175 (68%), Positives = 142/175 (81%), Gaps = 5/175 (2%)
Query: 424 FLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
F GS+LGDSLLVQFT S+ SS +++ GDIE B PS KR RRSSSDALQDMVNG++
Sbjct: 331 FEGSQLGDSLLVQFT-----SIPSSSVEKRVGDIEGBVPSAKRSRRSSSDALQDMVNGDK 385
Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE 543
L LYGSA N+TE++QKTFSF+V DSL+++GPLKDF+YGLRINAD ATGI KQ VE
Sbjct: 386 LPLYGSAPNSTETSQKTFSFSVNDSLIDVGPLKDFAYGLRINADLKATGIVKQKMITEVE 445
Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
LPGC+ IWTVYHK++RGHNADS++M DDEY AYLIIS E+RTMVLET +LL E
Sbjct: 446 LPGCERIWTVYHKNTRGHNADSTKMITKDDEYCAYLIISPESRTMVLETVELLGE 500
>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
Length = 1436
Score = 227 bits (578), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 337/1441 (23%), Positives = 563/1441 (39%), Gaps = 204/1441 (14%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
R ++ ES G V D + L LV L G V LA + GG
Sbjct: 68 RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124
Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
+++++A++ AK+ + +D L S+H +E E LH SF ++
Sbjct: 125 ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179
Query: 186 DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIES- 232
DP RC + IL Q L D+ +T G S +E
Sbjct: 180 DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETAAVANGDSDTVEPP 239
Query: 233 ---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S V+ L LD + H F F+H Y EP IL + H T + L
Sbjct: 240 YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLD 299
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + I S +LP D +K+LA+P+P+GG L++G N IH + +A+N+ A
Sbjct: 300 LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357
Query: 347 VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S ++ ++ L+ ++N LL G + +++ + DGR V L +
Sbjct: 358 RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSIK 417
Query: 405 ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ +VL S +T +G + FF+GS +GDS+++ +T G K D
Sbjct: 418 MVTDENGGNVLKSRASTASKLGKNTFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ + D D+ E + + + N SF + D+L++I P+KD
Sbjct: 475 DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534
Query: 518 FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
+ G + +D A G K + ++ E P +
Sbjct: 535 LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETADLLT----- 597
G WT+ K S A DEY A Y+I++ + ET+D+
Sbjct: 595 GFWTMSVKKPLPKALGGS--AGVGDEYEAFGQHDKYMIVA-KVDLDGYETSDVYALTGAG 651
Query: 598 -EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGS 654
E + ++ G T+ AG + + R+IQV + R DG +TQ L + E+G+
Sbjct: 652 FETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA 709
Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
V S SI DPY+LL D S+ L D + V+ A + K + C LY
Sbjct: 710 ---EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYA 765
Query: 715 DKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-C 773
D T + D G ++ I + +GAL ++ +P+ +
Sbjct: 766 D-----------------TEGAFQFNANDKGETEK--IMMFLLSSTGALHVYALPDLSKP 806
Query: 774 VFTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V+ + H+ D +R L KE + + V +L
Sbjct: 807 VYVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG---- 844
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLR 888
P+L + Y+ P+ R SN+SA+ ++
Sbjct: 845 DTISQSPYLILRNQTDDLTIYE---------------PIHHVRPGGESNLSAALSFKKMS 889
Query: 889 NLRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
N+ + TP + E P P +R NI+G+ FL GS P + + + +
Sbjct: 890 NVTLATTPAQTEDDDVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVI 946
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
L I + H C+ GFIY +GI ++ Q PS + + K +PL +
Sbjct: 947 GLQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRG 1006
Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEV 1066
I Y Y I +P E+ D H + L T+ +
Sbjct: 1007 IAYHQPTGAY--IAGCMTSEPF----------ELPKDDDYHKEWAKETLSFPPTMPRGVL 1054
Query: 1067 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1125
+++ P W I ++S E+ ++ + L + TKE L+A+GTA +GED+
Sbjct: 1055 KLISPIT----WTVIHDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLP 1110
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKII 1177
RGRV ++ P T K + +G ++A++ + QG +L+A G K +
Sbjct: 1111 IRGRVHVYDIVTVIPEPGKPETNRRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCM 1170
Query: 1178 LH--KWTGTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLA 1234
+ K G+ L +AF D + + + L+ D K ++F + E+ +L
Sbjct: 1171 VRGLKEDGSLLP-VAFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLG 1229
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1294
K G L +FL DG L++V +D ++ I + P+ +S +G LL R F V +
Sbjct: 1230 KSHGRLPVVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPN 1289
Query: 1295 -VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
+ L L S T P LL + G + + PL E +RRL S+
Sbjct: 1290 PPSTTLLLPRTTPPSHPTPQDP------PHVLLLASSSGHLSSLIPLPETAYRRLLSVTN 1343
Query: 1354 KLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
+L+ ++ GLN ++ R G +IVD +L+ + L ++ EIA +
Sbjct: 1344 QLLPALTPHGGLNAKAHRLPVGTRTVGVEAAGGRAIVDGAVLARWAELSAAKRAEIAGKG 1403
Query: 1412 G 1412
G
Sbjct: 1404 G 1404
>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
Length = 1361
Score = 227 bits (578), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 315/1394 (22%), Positives = 553/1394 (39%), Gaps = 250/1394 (17%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLR 155
S L LV Y L G + SL + NS+ D++++AF DAK+S++E+D ++H +
Sbjct: 49 SETKLVLVAEYNLAGTITSLGRVK---IPNSKSGGDAVLVAFRDAKLSLIEWDPALHSIS 105
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
S+H +E + + + + VDP RC + I+ Q L
Sbjct: 106 TLSIHYYEHHDLQSIPWQPDLSKCVSHLTVDPSSRCAAFNFGVSNLAIIPLHQVRDELAM 165
Query: 216 DE---------DTFGSGGGFSARIES-------SHVINLRDLD--MKHVKDFIFVHGYIE 257
DE + G + +S S V+ L LD + H D F+H Y +
Sbjct: 166 DEFDEVDGEVKERLSPDGQNENKHDSPDTPFKPSFVLPLTALDPGLLHPVDMAFLHEYRD 225
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + IL+ + + + + ++ K + S LP+D Y+++A+P P
Sbjct: 226 PTVGILYSTAARSSNMNHERRDVTIYAVYALDIGQKASTALQSVQKLPNDLYRVMALPPP 285
Query: 318 IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
+GG L++G N IH + A+A+N A S +++ ++L+ L N
Sbjct: 286 VGGALLIGGNELIHIDQSGKTIAIAVNELAKEASSFPMADHANYRLKLEGCQIEHLGNPS 345
Query: 377 A--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGS 427
L+ KTG+L LL+ DGR+V + L + ++ T +G++ F+GS
Sbjct: 346 GDMLVILKTGELALLSFRMDGRMVSSMALRRVGEGQSQGLALGASTCSTNLGSNRLFIGS 405
Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-----LQDMVNGE 482
DS+L+ G T+ L + R++ + DA ++ +
Sbjct: 406 EESDSILL--ATGRKTTQLRR--------------TNSRIQSQADDAGLFDDNEEDGIED 449
Query: 483 ELSLYGSASN----NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ-- 536
E LY ++ N + +F + D L +I P+ D + A + +++Q
Sbjct: 450 EDDLYAELADELNGNASTDVSGHNFRLLDRLPSIAPINDVALANVGKRRAEESEVTRQEL 509
Query: 537 ---------------------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
S ++ G+W + +RG A ++ +
Sbjct: 510 AVAYGRGHAGGLAFLSRKLEPSVTRQIKFERPIGVW-CFSSGNRGQQG------AEEENF 562
Query: 576 HAYLIISL-----EARTMVLETAD-LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
++IS RT +L D L + ES G I L IQV
Sbjct: 563 DDLVMISQTTDDGAGRTKLLRLIDGDLNSMGESEFDESAGAAIGVFKLEATNHTIQVLPT 622
Query: 630 GARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
R+ D + + F + E G + + + VS DPY+++ DGS+ LL D +
Sbjct: 623 ELRVYDAGFALSQI-FPIVDEEEG---QTARAVKVSFVDPYLVVVKDDGSMSLLKADKAG 678
Query: 690 CTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
V+ P + + + S TLY D TD T G
Sbjct: 679 ELDEVELPENLRAWS--ILSATLYQD-----------TDDMFQTSRFY------NGTATP 719
Query: 750 GDIYSVVCYESGALEIFDVPNFNC-VFTVDKFVSGRTHIV-DTYMREALKDSETEINSSS 807
G I +++ + G + +PN + VF D TH++ D + + ++
Sbjct: 720 GPILTILT-QDGHFCLLSLPNVSIQVFQCDSLPFLPTHLMQDLQLPKHWRN--------- 769
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
K+++ + + +L ++ +P+L G ++ Y+++
Sbjct: 770 --------KDDLGEVLLADLG----NSTDRQPYLVVRNLVGDVIIYESFAMP-------- 809
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
D + + R V +A L + EE + Q + N++GH F
Sbjct: 810 -DVLGSFRFKKVFTKAAGELED------------GEEVGQPSTLQPMQAVTNVAGHASVF 856
Query: 928 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 987
+ G +P M + +L + + +H C G + V + +K C +P +
Sbjct: 857 IPGRQPLLIMREASTMPRVYELNPTKLKSMNSVHTGTCRQGLVLVDADDEIKFCNIPDST 916
Query: 988 TYD-NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1046
+ W +++V PL + YFA + Y L
Sbjct: 917 VLGLSDWVIRRV-PLGQDITSVAYFAPTDSYILAT------------------------- 950
Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT--IP--MQSSENALTV------- 1095
NH E ++ + D WQ AT +P +QSS L+
Sbjct: 951 NHT--------------TEFQLPQDDEWHPEWQGEATKFLPSSIQSSLKLLSAKTHSIIS 996
Query: 1096 --------RVVTL------FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---N 1138
RV+ L + T E + L+ +GTA V+GE+V RG + +F
Sbjct: 997 QYSFDACERVLCLESLNLEVSEETHERKDLIVVGTAIVKGENVTTRGNLYIFDVVDVVPE 1056
Query: 1139 ADNPQ-NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILHKWT-GTELNGIAFYD 1193
D P+ +L ++ +KE ++GA+SAL + QG LL A G K ++ + +AF D
Sbjct: 1057 PDRPESDLKIKLITKEDVRGAVSALCDIGSQGFLLAAQGQKSMVRGLKEDMSILPVAFLD 1116
Query: 1194 APPLYVVSLNIV-KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
V+ + +LGD ++ + + E+ +L +L +D A EFL DG
Sbjct: 1117 MRYYVHVARELPGTGLCILGDAFSGLWLVGYSEEPYKLQILGRDLEDPPVLAAEFLPDGK 1176
Query: 1253 TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDR- 1310
L ++ SD+ +++ Y P+ ++ +G KLL R+ FH GA TK + L +A+ R
Sbjct: 1177 QLYIISSDDDGLLRVLQYDPENPKAERGTKLLLRSTFHSGAAPTKMILLPPQVASGRGRD 1236
Query: 1311 --------TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP-H 1361
+GA P + R +L T +GS+ + PL E T+RRL +LQ L+ ++ H
Sbjct: 1237 PEIDMDVDSGAGPAA---GRHRILVTTQEGSLCMLTPLSEATYRRLSALQTTLLTTLDFH 1293
Query: 1362 VAGLNPRSFRQFHS 1375
LNPR++RQ +
Sbjct: 1294 PCSLNPRAYRQVET 1307
>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
206040]
Length = 1441
Score = 226 bits (577), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 333/1473 (22%), Positives = 576/1473 (39%), Gaps = 199/1473 (13%)
Query: 57 NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
NLVV ++++I+ V+ V ++ ES G +
Sbjct: 28 NLVVAKGSLLQIFTVKAISTELDPEFQPSQPTETETRFDRQVNDDDGLESSFLGGESMFM 87
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
D + L L+ L G V LA + + + ++++LA++ AK+ + E+D
Sbjct: 88 RTDRTNNTKLVLIAEIPLAGTVIGLARVKT--KNTASGGEALLLAYKAAKMCLAEWDPKK 145
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARG-PLVKVDPQGRCGGVLVYGLQMIILKASQGG 210
+ L S+H +E E + E F ++ DP RC + IL ++
Sbjct: 146 NELETISIHYYEKEE-MQGSPWEEVFGEYVNYLEADPGSRCAAFKFGTRNLAILPFTRSE 204
Query: 211 SGLVG---DED-------------TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFV 252
L DED G G A S V+ L LD + H F+
Sbjct: 205 EDLEMEDWDEDLDGPRPVKEHTAAANGDGNNVEAAYTPSFVLRLPLLDPSLLHPVHLTFL 264
Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
H Y EP +L + + S H + + L + + I S LPHD YK++
Sbjct: 265 HEYREPTFGVLSSSQAPASSLGSKDHLSYKVFTLDLQQ--RASTTILSVTGLPHDLYKVI 322
Query: 313 AVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
A+P+P+GG L+VG N IH +A+N A S +S ++ L++
Sbjct: 323 ALPAPVGGALLVGQNELIHVDQSGKPNGVAINPMAKLATSFNLTDQSDLNLRLESCAIEL 382
Query: 372 L--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI---GNSL 422
L +N LL G L +++ DGR V L + + +++ S +T I G +
Sbjct: 383 LAIENGELLLILNDGRLGIISFKIDGRTVSGLGVKLVGADCGGNIIKSRVTCISRLGKNA 442
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
FFLGS DS+++ + S K D + + + +
Sbjct: 443 FFLGSETSDSVVLGW---SRKQTQEKRRKSRLIDTDLALDVDELDLEDDEEDDDLYGDDS 499
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------INAD- 527
+ +N SF + D+L++I P++D + G ++AD
Sbjct: 500 ATTKPNQTANGGTVKSGDISFRIHDTLLSIAPIQDITCGQSAFLPDSEEATLNKGVSADL 559
Query: 528 --ASATGISKQSNYELV------------ELPGCKGIWTVYHK----SSRGHNADSSRMA 569
A A G + + ++ E P +G WT+ K S G NA ++
Sbjct: 560 QLACAVGRGEAGSIAVINREIQPKVIGRFEFPEARGFWTMCVKKPVPKSLGTNAGAAGDY 619
Query: 570 AYDDEYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
++ ++I++ E + TA + E+ G T+ AG + + V
Sbjct: 620 DAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGNQMVV 679
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
IQV + R +G + Q L + +G+E V S SI DPY+L+ D S+ L
Sbjct: 680 IQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDASVFL 734
Query: 683 LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA 742
D + ++ + +S K + C LY D + GV +A G
Sbjct: 735 AQIDSNNEIEEIEKTDSGLTSTKWAAGC-LYKD----------------TKGVFQANQG- 776
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSET 801
D ++ + +GAL I+ +P+ + V+ + S H+ ++ + +
Sbjct: 777 DQAKKSGEEVMMFLLNTAGALHIYALPDLSKPVYVAEGLSSIPPHLSADFVAKKV----- 831
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
+E + + V +L H P+L + + Y+
Sbjct: 832 ------------ASREALTELVVADLG----DTVHYSPYLILRHSTDDLTIYEPIRL--- 872
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
+D P SA+ + PL+ ++ P P + I N+
Sbjct: 873 ----PTDSPTRNLSDTLFFKKSANSILAKSTVEDPLEDTAQQ--PRYVP---LRICANVG 923
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G+ FL G P + + + + + + + + C+ GFIY S+GI ++
Sbjct: 924 GYSTVFLPGPSPAFILKSSKSVPRVVGVQGLGVRGMSTFNTEGCDRGFIYSDSEGIARVT 983
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
QLPS + + K +PL + Y Y +V + L D +
Sbjct: 984 QLPSKTNFTELGVSVKKVPLGNDVRHVAYHHPTETYIAGCAV------TEGFELPKDDDY 1037
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
+ +LS H + ++++ P W +I M+ E+ ++ + L
Sbjct: 1038 HKEWAKESLS---FHPSTV--RGSLKLISPVT----WTVIHSIDMEPGESIECMKTLHLE 1088
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE--L 1154
+ TKE LLA+GTA +GED+ RGRV ++ P N ++ +KE
Sbjct: 1089 VSEETKERRMLLAVGTALTRGEDLPTRGRVQVYDIVTVIPEPGKPETNKKLKLLAKEEIP 1148
Query: 1155 KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--NF 1208
+G ++AL+ + QG +L+A G K ++ K G+ L +AF D +V S +
Sbjct: 1149 RGGVTALSEIGTQGLMLMAQGQKCMVRGLKEDGSLLP-VAFLDM-SCHVASARELPGTGL 1206
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
L+ D K ++F + E+ +L K GSL +FL DG LS+V D +I +
Sbjct: 1207 CLIADAFKGLWFAGYTEEPYTFKVLGKSSGSLPLLVADFLPDGEDLSMVAVDADGDIHVL 1266
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF 1327
+ P+ +S +G LL R F V + T L L +S A+ S LL
Sbjct: 1267 EFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATASQDSSTPQPHLLLL 1326
Query: 1328 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDS-VPHVAGLNPRSFRQFHSNGKAHR----- 1381
+ GS+ + PL E +RRL S+ +L+ + VPH GL+ R+ R G R
Sbjct: 1327 ASPSGSLAALTPLPESAYRRLLSVTNQLLPALVPH-GGLHARAHRAPEGGGGMSRMVGVE 1385
Query: 1382 --PGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+IVD +L+ + L ++ E+A + G
Sbjct: 1386 TAASGRAIVDGAILTRWNELGAAKRAEVASRGG 1418
>gi|429851266|gb|ELA26469.1| protein cft1 [Colletotrichum gloeosporioides Nara gc5]
Length = 1411
Score = 226 bits (576), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 323/1346 (23%), Positives = 545/1346 (40%), Gaps = 194/1346 (14%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSR 128
Y R+ ++ ES G V D + L LV Y + G V LA + NS+
Sbjct: 66 YDRRLNDDDGLESSFLGGDGMLVRADRTNNTKLVLVAEYPIFGVVAGLARIK---IQNSK 122
Query: 129 RR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL----- 182
+++++A A++S++++D H L S+H +E E S GPL
Sbjct: 123 SGGEALLIATRVARLSLVQWDPEKHALEDVSIHFYEKEEL------EGSPFDGPLSNYPT 176
Query: 183 -VKVDPQGRCGGV---------LVYGLQMIILKASQGGSGLVGDED---------TFGSG 223
+ DP RC + L + L + + G T G+
Sbjct: 177 HLAADPGSRCAALRFGSRYIAFLPFKLNDEDIDMDDWDEDVDGPRPAKEPSATAATNGTS 236
Query: 224 GGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
+S+V+ L LD + H F+H Y EP I+ + H +
Sbjct: 237 NLADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSMQRRSNTLPRKDHFSY 296
Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
+ L + + I S NLP D +K++A+P PIGG L+VG N IH +
Sbjct: 297 KVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPIGGALLVGTNELIHIDQSGKPNGV 354
Query: 341 ALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVV 398
A+N + S +S + L+ + + +N L+ G L ++ DGR V
Sbjct: 355 AVNAFTKETTSFPLADQSELDLRLEHCYIEQMSPENGELLMVLSDGRLAIIAFKIDGRTV 414
Query: 399 QRLDL----SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
L + ++ +V+ S I+ + + FF+GS DSL+V G + +
Sbjct: 415 SGLSVRIVPAEAGGNVVQCGASSISRLSKNAFFIGSTGSDSLVV------GVTRKQTQNA 468
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS-LYGSASNNTESAQKTFSFAVRDSLV 510
+ + D+ + D D + GE + + S + N SF V DSL+
Sbjct: 469 RKKTRLVDDSFADDLEDEDIDDDDDDDLYGETTTTVQSSTAANGVPKGGEISFRVHDSLL 528
Query: 511 NIGPLKDFSYG--------------------LRINA-----DASATGISKQSNYELV--- 542
++ P+KD + G L++ A +A+A I Q+ V
Sbjct: 529 SLAPVKDMTTGKQAFIPESEDEKNSVGVVADLQLAAAVGKGNAAAIAIMNQNIQPKVIGK 588
Query: 543 -ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETA 593
E P +G WT V + D AA E+ A ++I+S + ET+
Sbjct: 589 FEFPEARGFWTMCVQKPIPKSLQGDKGANAAVGSEFDASSIYDKFMIVS-KVDLDGYETS 647
Query: 594 DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSF 645
D+ + F G T+ AG + R+IQV + R DG ++Q L
Sbjct: 648 DVYALTGAGFEAFTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQILPM 707
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ E+G+ V+S SIADPY+LL D SI + D + ++ S +
Sbjct: 708 --LDEETGA---EPRVVSASIADPYLLLVRDDASIMVAQIDNNNELEEMEKQDDTILSTQ 762
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
++ C LY D +TGV I G P Q I+ + GAL I
Sbjct: 763 WLAGC-LYTD----------------TTGVFAPIQTDKGTPESQ-SIFMFLLSAVGALYI 804
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
+ +P+ + V +G T+ V ++ + + GT Q E + + V
Sbjct: 805 YALPDLSKPVYV---AAGMTY-VPPFL---------SADYAVRRGTVQ---ETLTEVLVA 848
Query: 826 ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
+L A S P+L + I Y+ E D +++L ++
Sbjct: 849 KLG----DATESSPYLILRHANDDITIYEPIRLE------SQDKSEGLAKTLHFQKIT-- 896
Query: 886 RLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
N +++P++ + E P P + NI+G+ FL G+ P + + +
Sbjct: 897 ---NPALAKSPVEVADDDANEQPRFVPLRPCA---NINGYSTVFLPGASPSFIIKSAKSA 950
Query: 944 RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKA 1003
L + + H C GFIY S+G ++ QLP+ ++++ ++K IP+
Sbjct: 951 PKVLGLQGIGVRGMSSFHTEGCERGFIYADSEGHTRVTQLPADTSFELGVSIRK-IPVGD 1009
Query: 1004 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY-TVE 1062
I Y Y + SV +P E+ D H + + T+ +E
Sbjct: 1010 AIGLIAYHPPMETYAVACSVS--EPF----------ELPKDDDYHKEWAKETITTFPQME 1057
Query: 1063 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQG 1121
+++L P W T+ + E A+ ++ + L + TKE L+AIGTA +G
Sbjct: 1058 RGIIKLLSP----ATWSVIDTVELDPHEVAMCMKTLHLEVSEETKERRMLIAIGTAINRG 1113
Query: 1122 EDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASG 1173
ED+ RGR+L++ P N ++ +KE +GA++AL + QG +L+A G
Sbjct: 1114 EDLPIRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEIPRGAVTALCEVGSQGLMLVAQG 1173
Query: 1174 PKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQ 1229
K ++ K GT L +AF D YV S+ V+ + L+ D K ++F+ + E+ +
Sbjct: 1174 QKCMVRGLKEDGTLLP-VAFMDMS-CYVTSVREVRGTGYCLMADAFKGVWFVGYAEEPYK 1231
Query: 1230 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1289
+ L K G + +FL+DG L +VV D+ I + + P+ +S +G LL+RA F
Sbjct: 1232 IMLFGKSTGKFEVLTADFLVDGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASF 1291
Query: 1290 HVGA-HVTKFLRLQMLATSSDRTGAA 1314
H T L L T++ A+
Sbjct: 1292 SAAPNHPTATLSLPRTTTTAQSASAS 1317
>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
Length = 1440
Score = 226 bits (575), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 338/1477 (22%), Positives = 589/1477 (39%), Gaps = 208/1477 (14%)
Query: 57 NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
NLVV ++++I+ V+ V ++ ES G +
Sbjct: 28 NLVVAKGSLLQIFTVKSISTELDPEFQPNQPAEVDTRFDRQVNDDDGLESSFLGGETMFM 87
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
D + L L+ L G V LA L + + ++LA++ AK+ + ++D
Sbjct: 88 RTDRTNNTKLVLIAEIPLAGTVIGLARLKTN--KTASGGEVLLLAYKAAKMCLAQWDPKK 145
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMIILKASQ 208
+ L S+H +E E L E F G V + DP RC + IL +
Sbjct: 146 NELETISIHYYEKEE-LQGSPWEEVF--GEYVNHLEADPGSRCAAFKFGTRNLAILPFRR 202
Query: 209 GGSGLVG---DEDTFG---------SGGGFSARIESSHV------INLRDLDMKHVKDFI 250
L DED G + G S +E+++ + L D + H
Sbjct: 203 SEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTPSFVLRLPLLDPSLLHPVHLT 262
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP +L + A + K H ++ + I S LPHD YK
Sbjct: 263 FLHEYREPTFGVLSSSQAP-AASLGLKDHLSY-KVFTLDLQQRASTTILSVTGLPHDLYK 320
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AA 367
++A+P+P+GG L+VG N IH +A+N A + S ++ ++ L+ A
Sbjct: 321 VIALPAPVGGALLVGQNELIHVDQSGKPNGVAVNPMAKLVTSFSLTDQADLNLRLENCAI 380
Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSD---ITTIGN 420
++N LL G L +++ DGR V RL + +V+ S I+ +G
Sbjct: 381 ELLAVENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVGADCGGNVIKSRAACISRLGK 440
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD-APSTKRLRRSSSDALQDMV 479
+ FF+GS GDS+++ ++ + + + I+ D A L + D+
Sbjct: 441 NTFFVGSETGDSVVLGWS-----RRQTQEKRRKSRLIDPDLALEVDELDLEDDEEDDDLY 495
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------IN 525
+ + +N + SF + D L++I P++D + G ++
Sbjct: 496 GDDSAATKPQTTNGGAAKSGDLSFRIHDVLLSIAPIQDITCGQAACLPDSEEATLIKGVS 555
Query: 526 AD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
+D A A G + + ++ E P +G WT+ K + S+ A
Sbjct: 556 SDLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFWTMCVKKPVPKSLGSNVGVA 615
Query: 571 YDDE---YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ-------GRTIAAGNLFGR 620
D + H +I + ET+D+ + + G T+ AG + +
Sbjct: 616 GDYDAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGKQ 675
Query: 621 RRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
VIQV + R +G + Q L + +G+E V S SI DPY+L+ DGS
Sbjct: 676 MMVIQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDGS 730
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
+ L D + ++ +S K V+ C LY D + GV ++
Sbjct: 731 VFLAQIDSNNEIEEMEKADGGLTSTKWVAGC-LYKD----------------TKGVFQSN 773
Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--REAL 796
+ G D+G + + +GAL I+ +P+ + V+ + S H+ ++ R A
Sbjct: 774 LNSAAGKADEG-VMMFLLNSAGALHIYSLPDLSKAVYIAEGLSSIPPHLSAGFVARRGAT 832
Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+++ TEI V +L + HS P+L + + Y+
Sbjct: 833 RETLTEI-------------------VVADLG----DSVHSSPYLILRHSTDDLTIYEPI 869
Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
T D + +S + S+++ S + + P D + P P +
Sbjct: 870 RLPTASATHALSDTLFFKKSAN-SSLAKSAVED------PSD--DTAQPPRYVPLRTCA- 919
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
N+ G+ FL G P + + + + L + + H C+ GFIY S+G
Sbjct: 920 --NVGGYSAVFLPGPSPAFIIKSSKSIPRVVGLQGLGVRGMSTFHTEGCDRGFIYADSEG 977
Query: 977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
I ++ QLPS + K +PL + Y Y ++ + L
Sbjct: 978 IARVTQLPSKTNLTELGVSVKKVPLGHDIRHVAYHHPTETYIAGCTI------TENFELP 1031
Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
D + + +LS + ++ ++++ P W +I M+ E+ ++
Sbjct: 1032 KDDDYHKEWARESLSFLP-----SMARGALKLINPIT----WTVIHSIDMEPGESIECMK 1082
Query: 1097 VVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYS 1151
+ L + TKE LLA+GTA +GED+ RGRV ++ P N ++ +
Sbjct: 1083 TLHLEVSEETKERRMLLAVGTALTRGEDLPTRGRVQVYDIVTVIPEPGKPETNKRLKLLA 1142
Query: 1152 KE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1205
KE +G ++AL+ + QG +L+A G K ++ K G+ L +AF D + +
Sbjct: 1143 KEEIPRGGVTALSEIGTQGLMLVAQGQKCMVRGLKEDGSLLP-VAFLDMSCHVSTARELP 1201
Query: 1206 -KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
L+ D K ++F + E+ +L K GSL +FL DG LS+V D +
Sbjct: 1202 GTGLCLIADAFKGLWFAGYTEEPYTFKVLGKSSGSLPLLVADFLPDGEDLSMVAVDADGD 1261
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRLQMLATSSDRTGAAPGSDKTNRF 1323
I + + P+ +S +G LL R F V + T L L +S +P S +
Sbjct: 1262 IHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATTSPDSSSSQPH 1321
Query: 1324 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS-VPHVAGLNPRSFRQFHSNGKAHR- 1381
LL + G + + PL E +RRL S+ +L+ + VPH GL+ R+ R G R
Sbjct: 1322 LLLLASPSGCLASLTPLPESAYRRLLSVTNQLLPALVPH-GGLHARAHRTPEGGGGMSRT 1380
Query: 1382 ------PGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+IVD +L+ + L ++ E+A + G
Sbjct: 1381 VGVETAASGRAIVDGAILARWNELGAAKRAEVATRGG 1417
>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
Length = 1427
Score = 225 bits (574), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 334/1440 (23%), Positives = 558/1440 (38%), Gaps = 202/1440 (14%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
R ++ ES G V D + L LV L G V LA + GG
Sbjct: 68 RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124
Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
+++++A++ AK+ + +D L S+H +E E LH SF ++
Sbjct: 125 ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179
Query: 186 DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
DP RC + IL Q L D+ +T G S +E
Sbjct: 180 DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETATVANGDSDTVEPP 239
Query: 234 HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
+ + L D + H F F+H Y EP IL + H T + L
Sbjct: 240 YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQEPAHSLGQKDHLTYKVFTLD 299
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + I S +LP D +K+LA+P+P+GG L++G N IH + +A+N+ A
Sbjct: 300 LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357
Query: 347 VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S ++ ++ L+ ++N LL G + +++ + DGR V L +
Sbjct: 358 RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSVK 417
Query: 405 ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ +VL S +T +G + FF+GS +GDS+++ +T G K D
Sbjct: 418 MVTDENGGNVLKSRASTASKLGKNAFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ + D D+ E + + + N SF + D+L++I P+KD
Sbjct: 475 DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534
Query: 518 FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
+ G + +D A G K + ++ E P +
Sbjct: 535 LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEY-----HAYLIISLEARTMVLETADLLT------ 597
G WT+ K S A DEY H +I + ET+D+
Sbjct: 595 GFWTMSVKKPLPKALGGS--AGVGDEYETFGQHDKYMIVAKVDLDGYETSDVYALTGAGF 652
Query: 598 EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E + ++ G T+ AG + + R+IQV + R DG +TQ L + E+G+
Sbjct: 653 ETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA- 709
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
V S SI DPY+LL D S+ L D + V+ A + K + C LY D
Sbjct: 710 --EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYAD 766
Query: 716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CV 774
T + +D G ++ I + +GAL ++ +P+ + V
Sbjct: 767 -----------------TKGAFQLSASDKGETEK--IMMFLLSSTGALHVYALPDLSKPV 807
Query: 775 FTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
+ + H+ D +R L KE + + V +L
Sbjct: 808 YVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG----D 845
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLRN 889
P+L + Y+ P+ R SN+SA+ + N
Sbjct: 846 TISQSPYLILRNQTDDLTIYE---------------PIRHVRPGGESNLSAALSFKKTSN 890
Query: 890 LRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
+ + TP E E P P +R NI+G+ FL GS P + + + +
Sbjct: 891 VTLATTPAQTEDDEVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVIG 947
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQI 1008
L I + H C+ GFIY +GI ++ Q PS + + K +PL + I
Sbjct: 948 LQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRGI 1007
Query: 1009 TYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVR 1067
Y Y I +P E+ D H + L T+ ++
Sbjct: 1008 AYHQPTGAY--IAGCMTSEPF----------ELPKDDDYHKEWAKETLSFPPTMPRGILK 1055
Query: 1068 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1126
++ P W I ++S E+ ++ + L + TKE L+A+GTA +GED+
Sbjct: 1056 LISPIT----WTVIHDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLPI 1111
Query: 1127 RGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIIL 1178
RGRV ++ P T K + +G ++A++ + QG +L+A G K ++
Sbjct: 1112 RGRVHVYDIVTVIPEPGKPETNRRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMV 1171
Query: 1179 H--KWTGTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAK 1235
K G+ L +AF D + + + L+ D K ++F + E+ +L K
Sbjct: 1172 RGLKEDGSLLP-VAFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLGK 1230
Query: 1236 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH- 1294
G L +FL DG L++V +D ++ I + P+ +S +G LL R F V +
Sbjct: 1231 SHGRLPVVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPNP 1290
Query: 1295 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1354
+ L L S T P LL + G + + PL E +RRL S+ +
Sbjct: 1291 PSTTLLLPRTTPPSHPTPQDP------PHVLLLASSSGHLSSLIPLPETAYRRLLSVTNQ 1344
Query: 1355 LVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
L+ ++ GLN ++ R G +IVD +L+ + L ++ EIA + G
Sbjct: 1345 LLPALTPHGGLNAKAHRLPVGTRTVGVEAAGGRAIVDGAVLARWAELSAAKRAEIAGKGG 1404
>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Ornithorhynchus anatinus]
Length = 831
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 215/738 (29%), Positives = 340/738 (46%), Gaps = 141/738 (19%)
Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I++R LD K ++ D F+HGY EP ++IL+E TW GRV+ + TC I A+S++
Sbjct: 8 SYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILYEPNQTWPGRVAVRQDTCSIVAISLNI 67
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS + Y VSL+
Sbjct: 68 LQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQS------VPPYGVSLN 121
Query: 351 S----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
S + P R + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 122 SLTAGTTAFPLRLREGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFH 181
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA- 461
K SVLT+ + T+ FLGSRLG+SLL+++T S +E D AD
Sbjct: 182 FDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPAGSA-REPARDSGADKQ 240
Query: 462 -PSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R + A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 241 EPPVKKKRVEQALSWAGGKSAAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 296
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 297 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 356
Query: 548 KGIWTVYHK-------SSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLT 597
+WTV S +G A+S D + H +LI+S E TM+L+T +
Sbjct: 357 YDMWTVIAPVRKEEGDSPKGEGAESEPTPPEPEDDGKRHGFLILSREDSTMILQTGQEIM 416
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 417 ELDTS-GFATQGPTVYAGNIGDDRYIVQVSPLGLRLLEG---VNQLHFIPVDL------- 465
Query: 658 NSTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPVSSCTL 712
S ++ ++ADPYV++ ++G + LL D T +++ P + S K ++ C +
Sbjct: 466 GSPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRTHRLALHKP-PLHSQSKVIALC-V 523
Query: 713 YHD-----------KGP--EPWLRKTSTDAWLSTGVGEAIDGADG--------------- 744
Y D GP +P LR S L + +D +
Sbjct: 524 YRDVSGMFTTESRASGPRDDPSLRGQSEAEPLLQELSHTVDDEEEMLYGDSSSLFSPSRD 583
Query: 745 -------GPLDQGDI--------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
P D+ + V+ ++GA+EI+ +P + VF V F G+ +VD
Sbjct: 584 EPRRSSLPPADRDAPQYRAEPTHWCVLVRDNGAMEIYQLPEWRLVFLVKNFPMGQRVLVD 643
Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAI----- 844
+ + S + + EE QG + + +V L +RP+L +
Sbjct: 644 SSFGQPAA-SAAQAEAKKEEPARQGELPLVKEVLLVALG-----NRQTRPYLLRLKWAIR 697
Query: 845 ---LTDGTILCYQAYLFE 859
LT T + Q Y+ +
Sbjct: 698 DSELTSITFIDMQLYIHQ 715
Score = 107 bits (267), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 55/140 (39%), Positives = 85/140 (60%), Gaps = 8/140 (5%)
Query: 1168 LLIASG-----PKIILHKWT--GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
LL+A G P ++ KW +EL I F D LY+ + VKNFIL D+ KSI
Sbjct: 676 LLVALGNRQTRPYLLRLKWAIRDSELTSITFIDMQ-LYIHQIISVKNFILAADVMKSISL 734
Query: 1221 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 735 LRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGG 794
Query: 1281 QKLLSRAEFHVGAHVTKFLR 1300
+LL RA+FHVGAHV F R
Sbjct: 795 MRLLRRADFHVGAHVNAFWR 814
>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 880
Score = 221 bits (564), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 161/497 (32%), Positives = 252/497 (50%), Gaps = 62/497 (12%)
Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 8 SYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNI 67
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSL 349
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 68 TQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGT 127
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 128 TAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAA 187
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SVLT+ + T+ FLGSRLG+SLL+++T S+ E D E KR+
Sbjct: 188 SVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVD 245
Query: 469 RS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 246 ATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 301
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHKS 557
L I + + + + K ++V ELPGC +WTV
Sbjct: 302 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 361
Query: 558 SR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 362 RKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFAT 420
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + S ++ ++A
Sbjct: 421 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVA 470
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 471 DPYVVIMSAEGHVTMFL 487
Score = 124 bits (312), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 128/271 (47%), Gaps = 16/271 (5%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 602 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 657
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 658 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 707
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 708 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 767
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 768 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 827
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
WPV+K IPL+ T H + Y E +Y + S
Sbjct: 828 PWPVRK-IPLRCTAHYVAYHVESKVYAVATS 857
>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1433
Score = 221 bits (562), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 332/1393 (23%), Positives = 548/1393 (39%), Gaps = 191/1393 (13%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
R ++ ES G V D + L L+ L G V LA + + +
Sbjct: 70 RANDDDGLESSFLGGESLIVRADPSNITKLVLITEIPLAGTVIGLARVKV--KNTPSGGE 127
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++LA++ AK+ + E+ H L TS+H +E E L+ + G V + DP
Sbjct: 128 ALLLAYKAAKMCLTEWHPQRHTLETTSIHYYEKDE---LQGAPWEMSFGDYVNYLEADPG 184
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
RC + IL +Q L D ED T G G G +
Sbjct: 185 SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPRPVKEELPLTNGDGPGDHDLVKS 244
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
R S V+ L LD + H F+H Y EP IL + A H T + L
Sbjct: 245 RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSAALGIKDHLTYKVFTL 304
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
+ + I S LP D ++++A+P+P+GG L+VG N IH +A+N+
Sbjct: 305 DLQQ--RASTTILSVTGLPQDLFRVMALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
A + S + +S + L+ L ND+ LL G L ++ DGR V ++ +
Sbjct: 363 AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIVCFHIDGRTVSKISI 422
Query: 404 ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
++ +++ S ++ I G++ FLGS DS+++ ++ G K +
Sbjct: 423 RLVSAEYGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476
Query: 457 IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
+ + D D + G + SL S + N S SF ++DSL++I P+
Sbjct: 477 LLDPDLALDVDDLDLDDDEDDDLYGNDASLAKPSQTINGGSKPGEVSFRIQDSLLSIAPI 536
Query: 516 KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
+D + G + D+ +SK EL + P
Sbjct: 537 RDVACGAPALVPDSEEATLSKGVTAELELACAVGRGSSGSVAILNREIQPKVIGRFDFPE 596
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
+G WT+ K A + +Y Y+I++ E + TA +
Sbjct: 597 ARGFWTMCAKKPLSKGAAVASDFDTTGQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
++ G T+ AG + + R+IQV + R DG ++Q L P E +
Sbjct: 657 DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
T S SI DPY+LL D SI + + V P S K S C LY+D
Sbjct: 714 T--SASIVDPYLLLIRDDSSIFIAQIHSNNELEEVLKPDGTLKSTKWASGC-LYND---- 766
Query: 720 PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD-IYSVVCYESGALEIFDVPNFN-CVFTV 777
T V E D+ D I + GAL ++ +P+ + VF
Sbjct: 767 -------TQGIFQNNVNEQ-------QADETDRIMMFLLSSVGALHVYALPDVSRPVFVA 812
Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
+ S + ++ A + +G KE+I + V +L A
Sbjct: 813 EALTS-----IPPFLSAAF---------VARKGAS---KESITEILVADLG----DAISQ 851
Query: 838 RPFLFAILTDGTILCYQA--YLFEGPENTSKS---DDPVSTSRSLSVSNVSASRLRNLRF 892
P+L + Y+ Y EG S S V+TS + + VS
Sbjct: 852 TPYLIVRHASDDLTIYEPVRYQAEGDAELSASLLFKKCVNTSLAKTAPEVSED------- 904
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG 952
DA E P P +R N++G+ FL + P + + L
Sbjct: 905 -----DA----EPPRFVPLRRCA---NVNGYGAVFLPNASPSFVLKSSHSEPRVMGLQGL 952
Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFA 1012
+ + H C+ GFIYV +GI ++ QLPS + K I L I+Y
Sbjct: 953 GVRGMSTFHTEGCDRGFIYVDMEGIARVTQLPSNANLTELGVSVKKIALDGDVGMISYHH 1012
Query: 1013 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1072
Y +V L+ +E + N T+ ++++ P
Sbjct: 1013 PTGTY--VVGCTKLEQFELPRDDDYHKEWAKETSNF---------PPTMARGILKLINPV 1061
Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
W + ++ E+ +++ + L + TKE + L+A+GTA +GED+ RGRV
Sbjct: 1062 T----WTVIHELELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQ 1117
Query: 1132 LFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KW 1181
+F P N ++ +KE +G ++AL+ + QG +L+A G K ++ K
Sbjct: 1118 VFDIVTVIPEPGRPETNKRLKLIAKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKE 1177
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1239
G+ L +AF D +V S+ + ++ D+ K ++F + E+ +L K G
Sbjct: 1178 DGSLLP-VAFLDM-SCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGKSSGK 1235
Query: 1240 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1299
L A +FL DG LS+V D + ++ I + P+ +S +G LL R F V +
Sbjct: 1236 LPLLAADFLPDGEDLSMVAVDAEGDLHILEFNPEHPKSLQGHLLLHRTSFAVTPNTPSS- 1294
Query: 1300 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS- 1358
+L + S ++ LL G + ++PL E T+RRL S+ +L +
Sbjct: 1295 --TLLLPRTHSPSYPQASSSSSSHMLLLACPSGQLAALSPLAESTYRRLLSVTNQLHPAI 1352
Query: 1359 VPHVAGLNPRSFR 1371
VPH GL+ ++ R
Sbjct: 1353 VPH-GGLHSKAHR 1364
>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1429
Score = 220 bits (561), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 330/1437 (22%), Positives = 560/1437 (38%), Gaps = 201/1437 (13%)
Query: 75 EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII 134
++G + S G V D + L LV L G V LA + + ++++
Sbjct: 72 DDGLESSFLGGGESMLVRTDRTNNTKLVLVAELPLTGTVIGLAKIKTKYTKSGG--EALL 129
Query: 135 LAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQG 189
LA++ AK+ + E+D + L S+H +E E W + + + ++ DP
Sbjct: 130 LAYKAAKMCLCEWDPKKNTLETLSIHYYEKDELQGAPW---EVAFDEYVN--FLEADPGS 184
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGD---EDTFGS---------GGGFSARIESSHV-- 235
RC + IL Q L D ED G G S +E+++
Sbjct: 185 RCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKESTAVANGDSDTLEAAYTPS 244
Query: 236 ----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
+ L D + H F+H Y EP IL + H T + L +
Sbjct: 245 FVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ- 303
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
+ I S +LP D YK++A+P+P+GG L++G N IH + +A+N+ A +
Sbjct: 304 -RASTTILSVTDLPRDLYKMIALPAPVGGALLIGENEFIHIDQSGKANGVAVNSMARQMT 362
Query: 351 SSQELPRSSFSVELDAA--HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS---K 405
S ++ ++ L+ +++N LL G L +++ DGR V + + +
Sbjct: 363 SFSLSDQADLNLRLEGCIIEQLYIENGELLLILNDGRLGIVSFRIDGRTVSGISIKMIPE 422
Query: 406 TNPSVL----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
N L S + +G + FF+GS GDS+++ G M ++
Sbjct: 423 ENGGRLIKSRASTASKLGKNTFFIGSETGDSVVL----GWSRKMSQEKRRKTRLVDADLG 478
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
L D D + G E + + + N SF + D+L++I P++D + G
Sbjct: 479 LDVDDLDLEDDDDEDDDLYGTETAAKPTQALNGAGKSGELSFRIHDTLLSIAPIRDLTSG 538
Query: 522 -LRINADASATGISKQ--SNYELV--------------------------ELPGCKGIWT 552
D+ +SK S+ +L E P +G WT
Sbjct: 539 KAAFLPDSEEATLSKGVVSDLQLACVVGRGNSGSLAILNRHIQPKIIGRFEFPEARGFWT 598
Query: 553 VYHK----SSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVTES 602
+ K S G N ++ Y+I++ E + TA + E+
Sbjct: 599 MCVKKPVPKSLGGNVTVGNDYETFGQHDKYMIVAKVDLDGYETSDVYALTAAGFETLKET 658
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
G T+ AG + + RVIQV + R DG +TQ L + E+G+ V
Sbjct: 659 EFDPAAGFTVEAGTMGKQMRVIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRV 713
Query: 662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 721
+S SIADPY+LL D S+ + D + V+ + S K + C LY D
Sbjct: 714 ISASIADPYLLLIRDDSSVLIAQIDSNNELEEVEKTDSTLQSTKWHAGC-LYTD------ 766
Query: 722 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKF 780
+ GV + G G D I + +GAL ++ +P+ + V+ +
Sbjct: 767 ----------TKGVFQPSVGDKGA--DTSKIMMFLLSSTGALHVYALPDLSKPVYVAEGL 814
Query: 781 VSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
H+ D +R L KEN+ + V +L P
Sbjct: 815 CYVPPHLSADYTLRRGLA------------------KENLRELLVADLG----DTVSQSP 852
Query: 840 FLFAILTDGTILCYQA--YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
+L + Y+ Y EG E T S +L+ S + L +
Sbjct: 853 YLILRNQTDDLTIYEPLRYQPEGAEPT--------LSATLTFKKTSNAALATSPVETSQE 904
Query: 898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
DA + P P + N++G+ FL G P + + + + L I
Sbjct: 905 DAV---QQPRFVPLRTCA---NVNGYSTVFLPGPSPSFILKSSKSIPRVIGLQGLGIRGM 958
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
+ H C+ GFIY +GI ++ QLPS + + + K +PL + I Y Y
Sbjct: 959 STFHTEGCDRGFIYADDEGIARVTQLPSETNFTDLGISVKKVPLDSDVCGIAYHQPTGTY 1018
Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1077
+ N+ L D + + L+ T+ ++++ P
Sbjct: 1019 IAGCTT------NEPFELPRDDDYHKEWAKETLTFAP-----TMPRGVLKLISP------ 1061
Query: 1078 WQTRATI----PMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
T+ ++S E+ ++ + L + TKE LL +GTA +GED+ RGRV +
Sbjct: 1062 --VSLTVIHDQELESCESIECMKTLQLEVSEETKERRFLLTVGTALSKGEDLPIRGRVHV 1119
Query: 1133 FSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWT 1182
F P T K + +G ++A++ + QG +L+A G K ++ K
Sbjct: 1120 FDIVTVIPEPGKPETNKRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKED 1179
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
G+ L +AF D + + + ++ D K ++F + E+ +L K G L
Sbjct: 1180 GSLLP-VAFLDMSCHVSSARELPRTGLCVMADAFKGVWFAGYTEEPYTFKILGKSHGRLP 1238
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1301
+FL DG L++V +D ++ I + P+ +S +G LL R F V +
Sbjct: 1239 LLVADFLPDGEDLAIVAADADGDLHILEFNPEHPKSLQGHLLLHRTTFSVSPNPPT---- 1294
Query: 1302 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1361
ML A P ++ LL + G + + PL E T+RRL S+ +L+ ++
Sbjct: 1295 SMLLLPRTTPPAHPSPSDPSQI-LLLASPSGHLSTLVPLPEATYRRLLSVTNQLLPALTP 1353
Query: 1362 VAGLNPRSFRQFHSNGKAHRP-GPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1412
GLN + +R RP G D +IVD +L+ + L ++ EIA + G
Sbjct: 1354 YGGLNAKGYRL----PSGTRPVGVDAAAGRTIVDGAILARWAELGAAKRAEIAGKGG 1406
>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
Length = 1269
Score = 219 bits (558), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 311/1370 (22%), Positives = 547/1370 (39%), Gaps = 214/1370 (15%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
A LEL+ Y L G V + + DN DS+ ++ + AK ++ ++ S +
Sbjct: 51 APRLELITEYYLDGTVTGVTRIKT--IDN-YDLDSLYISVKHAKAVIVAWNASSFTIDTK 107
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
S+H +E + L E V + +L +M L + G + D+
Sbjct: 108 SLHYYE--KGLVESNFFEPECSSVAVSDEANSFYTCLLFQNDRMAFLPIIEKG---LDDD 162
Query: 218 DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ SG F + S ++ LD +++V D F+H Y E M IL + + W G +
Sbjct: 163 EMPESGQVF----DPSFIVKASRLDKRIENVMDICFLHEYRETTMGILFQPKRAWVGMKN 218
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
T + +S+ K +I + LP DA K++ +P+P+GG L++ ANTI Y S
Sbjct: 219 ILKDTVSYAIVSVDVHQKNSTVIGTLNGLPVDAQKVIPLPAPLGGSLIICANTILYIDSS 278
Query: 336 ASCALALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQN--DVALLSTKTGDLVLLTV 391
AS + N +S + R S+ + L+ A ++Q + ALL T+ G L
Sbjct: 279 ASYTGVMVNNTHRQNSDLIVSRDQSTLDLRLEGAEVCFIQELGNTALLVTEDGQFFSLLF 338
Query: 392 VYDGRVVQRLDLSKTNPS--VLT--SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
DGR V L+L P +L+ S + + FLGSR GDSLLV++ G S
Sbjct: 339 NKDGRRVASLELRPIEPDNFILSQPSSVAAGPDGTIFLGSRAGDSLLVKWYHGEPESQPE 398
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVR 506
L D N + LYG + TE + + +
Sbjct: 399 ETL--------------------------DDGNESDDDLYGGDTAQTEDTTNRPLKLRLA 432
Query: 507 DSLVNIGPLKDFSYGLRINAD----ASATGISKQSNYELV--------------ELPGCK 548
D ++ +GP++ + G + + TG+ S ++ ++PG +
Sbjct: 433 DRMLGMGPMQSLALGKNRGSQGVEFVTTTGVGANSALAILTSALMPYKRKSLYKDMPGGQ 492
Query: 549 GIWTVYHK-SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
W+V + G A S D ++YL A V+E L T+ ++ +FV
Sbjct: 493 -FWSVPVRFEEEGEVAKSRTYVVSSDSENSYLYYVDAAG--VIEDVSLSTKKKKTKKHFV 549
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
T + ++QV I D S + +T + +
Sbjct: 550 SNVTTIFSSSMLDSALLQVCLETVNIYDAKI---------GQPHKYSLPQGTTAVEARVL 600
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
YVL+ +SDG +++L VS+ +++++ + + G +T
Sbjct: 601 GNYVLVLLSDGQVKILEA------VSINKRPFLKAAQVSIEPASESKAIG------IYAT 648
Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
D+ L+ G G P VVCY G+L + S I
Sbjct: 649 DSSLTFGAPSKKRTRQGSPAQDSRPVVVVCYADGSL------------LLQGLNSDDRLI 696
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
+D ++++ +E GQ +++V++A+ H +LT
Sbjct: 697 LDA----------SDLSGFIKEKDGQLYDA---PLELVDIALSPLGDDHILRDYLVLLTP 743
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
++ Y+ Y + LRF + L E TP
Sbjct: 744 QQLVVYEPYHYND----------------------------KLRFRKIFL-----ERTPT 770
Query: 908 GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD----GSIVAFTVLHNV 963
+R+T I+G ++G + + L P+L + VAFT
Sbjct: 771 INSDRRLTQVPLINGKHTLGVTGET---AYILVKTLHTSPRLIEFGETKGAVAFT----- 822
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
+ + F Y+T G + C+ + + WPV+ V T ++TY ++Y
Sbjct: 823 SWDGKFAYLTQAGEVAECRFDPSFSLETNWPVKHVQLCGETISKVTYHETMDVY------ 876
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
V+ V ++ D+ D+ + S+ D+ T + +RI+ P W
Sbjct: 877 -VIATHKTVPHVVRDE------DDEVIESLTPDIMPATTYQG-AIRIVNP----YSWTVI 924
Query: 1082 ATIPMQ-SSENALTVRVVTLFNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGR-- 1137
+ + +E AL V L + K + ++A+GT+ ++GED+AARG + LF
Sbjct: 925 DSYEFEMPAEAALCCESVKLSISDRKSQKREVVAVGTSILRGEDLAARGALYLFDVIEIV 984
Query: 1138 -NADNPQN--LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-ELNGIAFYD 1193
+ P+ + ++ ++GA +A+ + G LL G K+++ L +AF D
Sbjct: 985 PEKERPETNRRLKKLVQDRVRGAFTAVCEVSGRLLAVQGQKLLVQALQDDLTLVPVAFLD 1044
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
YV + + +LLGD +S+ F+ + Q+ A+D + +F I+G
Sbjct: 1045 MQ-TYVAVAKSLNSMLLLGDATRSVQFVGFSMDPYQMIPFARDLQRVLVTTCDFAIEGEN 1103
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
L+ VV+D QK + I Y P +S+ G +LL R+ F+ G + D +
Sbjct: 1104 LTFVVADLQKRLHILEYDPDDPQSYSGARLLRRSVFYSGKVI-------------DSSAM 1150
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
P ++ +RF ++ DGS+ + P E +RRL ++Q ++ D HV GL+PR++R
Sbjct: 1151 VPINE--DRFMVIGVCSDGSVTDVVPCPEDAYRRLYAIQTQITDKEAHVCGLHPRAYRYD 1208
Query: 1374 H----SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
+ HRP I+D L + LP +Q A++ G Q++
Sbjct: 1209 PILPGTGNSPHRP----ILDGHTLIRFANLPRNKQNVYANRLGQRYQQLI 1254
>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
Length = 1025
Score = 219 bits (558), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 253/999 (25%), Positives = 418/999 (41%), Gaps = 150/999 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS +L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARISQNPDCDSFLLGFDDAKLSIVAVNPADRCLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + + L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 S-CALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F ++ LD A T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + C S L
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS---ASNNTESAQKTFSFAVRDS 508
+TK+++ S+ + E+ LYG + ++ + V D
Sbjct: 389 -----------ATKKIKLSTEPNANE--EDEDFELYGEMLPKVAKPDITEELLNIRVLDK 435
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G + +S R SS
Sbjct: 436 LLNVGPCKKITGGCPSISAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 496 SIEGVVQYWAIGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554
Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSES-----GSGSENSTVLSVS 665
+QV G +I Y+ L+F N+ ++N +L
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTFPVRNASIVDPYIAICTQNGRLLMYE 611
Query: 666 IAD-PYVLLGMSDGSIRL------------------LVGDPSTCTVSVQTPAAIESSKKP 706
+ + P+V L D S RL ++ S +S Q A + P
Sbjct: 612 LTNHPHVHLKEIDISKRLRHETSPITSLSVYRDMSGIIRFCSAANMSQQQQATGANMHIP 671
Query: 707 -------VSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGPLDQGDI----Y 753
V LY D RK + G+ E D +D I +
Sbjct: 672 EQEDFEDVDDLLLYGDSKKS---RKETLSKRRIVGMKLTEQNTHFDTDVIDPNTIVPSHW 728
Query: 754 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG- 812
+ E+G + I+ +P + V+ V K +H+ D + D E ++ EGT
Sbjct: 729 IAIARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDE----PATGEGTDA 780
Query: 813 -QGRKENIHSMK----VVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
G + ++K ++EL + + RP LF +L D T+ Y+ + + N
Sbjct: 781 MSGTMTDTFAVKPEEVIMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQ 835
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQ 924
+ L + V+ R+ RF T D E+ A + + F+ I
Sbjct: 836 GHLAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLN 889
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQL 983
G F+ S PC + R+HP DG I++FT +N C +GFIY+T + ++++ +L
Sbjct: 890 GVFICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERDRLMRVAKL 949
Query: 984 PSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
PS D +PV++ I + AT H + Y N Y ++ S
Sbjct: 950 PSDMILDASYPVKR-INVGATVHSVVYLLHSNTYAVLTS 987
>gi|119602515|gb|EAW82109.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
CRA_b [Homo sapiens]
Length = 377
Score = 218 bits (555), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 205/369 (55%), Gaps = 31/369 (8%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDIT 416
SVLT+ ++
Sbjct: 367 ASVLTTSVS 375
>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
Length = 1441
Score = 217 bits (553), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 341/1447 (23%), Positives = 573/1447 (39%), Gaps = 202/1447 (13%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
+V ++ ES G V + + L L+ L G V LA L + + +
Sbjct: 68 QVNDDDGLESSFLGGETMLVRTERTNNTKLVLITEIPLAGTVIGLARLRT--SRTASGGE 125
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++A++ AK+ + E+D + L S+H +E E L E F G V + DP
Sbjct: 126 VLLIAYKAAKLCMAEWDPRKNELETISIHYYEKEE-LQGAPWEEVF--GEYVNHLEADPG 182
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVG---DEDTFG---------SGGGFSARIESSHV- 235
RC + + IL + L DED G + G S +E+++
Sbjct: 183 SRCAALKFGTRNLAILPFRRSEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTP 242
Query: 236 -----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
+ L D + H F+H Y EP +L + A + H + + L +
Sbjct: 243 SFVLRLPLLDPSLLHPVHLTFLHEYREPTFGVLSSSQAPAASLGARDHLSYKVFTLDLQQ 302
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
+ I S LPHD Y+++A+P+P+GG L+VG N IH S +A+N A
Sbjct: 303 --RASTTILSVTGLPHDLYRVIALPAPVGGALLVGQNELIHVDQSGKSNGVAVNPMAKLA 360
Query: 350 DSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDL 403
S +S + L+ A ++N LL G L +++ DGR V RL
Sbjct: 361 TSFSLTDQSDLKLRLENCAIEVLAIENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVG 420
Query: 404 SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ +VL S T + G + F+GS DS+++ + S K D +
Sbjct: 421 ADCGGNVLKSRATCVSRLGKNTLFVGSETSDSVVLGW---SRRQTQEKRKKSRLIDPDLA 477
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ + + + N + +F + D L++I P++D +
Sbjct: 478 LEVDELDLEDDEEDDDLYGDDSVATKPQQLPNGGPAKSGDLTFRIHDVLLSIAPIQDVTC 537
Query: 521 GLR--------------INAD---ASATGISKQSNYELV------------ELPGCKGIW 551
G + AD A A G + + ++ E P +G W
Sbjct: 538 GQAAFPPDSEEATLNRGVRADLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFW 597
Query: 552 TVYHKSS--RGHNADSSRMAAYDD--EYHAYLIIS------LEARTMVLETADLLTEVTE 601
T+ K + A++ YD ++ ++I++ E + TA + E
Sbjct: 598 TMCVKKPVPKSLGANAGVAGDYDTPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKE 657
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
+ G T+ AG + + VIQV + R +G + Q L + +G+E
Sbjct: 658 TEFEPAAGFTVEAGTMGKQMVVIQVLKSEVRCYNGDLNLIQILPM----LDEETGAEPRA 713
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
V S SI DPY+ + DGS+ L D + ++ + +S K V+ C KG
Sbjct: 714 V-SASIVDPYLFIVRDDGSVFLAQIDSNNEIEEMEKTDSSLTSTKWVAGCLYKDTKG--- 769
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDK 779
+ + +D+ T EA+ + +GAL IF +P+ + V+ +
Sbjct: 770 IFQSSYSDSTKQTS--EAV-------------MMFLLNSTGALHIFALPDLSKAVYVAEG 814
Query: 780 FVSGRTHIVDTYM--REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
S H+ Y R A +++ TEI V +L A H+
Sbjct: 815 LSSIPPHLSAGYAARRGATRETLTEI-------------------VVADLG----DAVHA 851
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP- 896
P+L + + Y+ P N T+ +LS L F ++P
Sbjct: 852 SPYLILRHSTNDLTIYEPIRL--PAN--------ETAHTLS---------DTLFFKKSPN 892
Query: 897 -LDAYTREETPHGAPCQR-----ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
+ A + E P Q + I N+ G+ FL G P + + + L
Sbjct: 893 AVLAKSAVEDPSDDTAQPPRYVPLRICANVGGYSSVFLPGPSPAFVIKSSRSVPRVVGLQ 952
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
+ + H C+ GFIY S+GI ++ QLPS + + K +PL + Y
Sbjct: 953 GHGVRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNFTELGISVKKVPLGFDVRHVAY 1012
Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN---LSSVDLHRTYTVEEYEVR 1067
Y I V + + E+ D H SV L T ++
Sbjct: 1013 HHPTETY--IAGCAVTE----------NFELPKDDDYHKEWARESVPLPPTAV--RGALK 1058
Query: 1068 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1126
++ P W +I M++ E+ ++ + L + TKE LLA+GTA +GED+
Sbjct: 1059 LINPIT----WTVIHSIDMEAGESIECMKTLHLEVSEETKERRMLLAVGTALSRGEDLPT 1114
Query: 1127 RGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIIL 1178
RGRV ++ P N ++ +KE +G ++AL+ + QG +L+A G K ++
Sbjct: 1115 RGRVQVYDIVTVIPEPGKPETNKRLKLLAKEDIPRGGVTALSEIGTQGLMLVAQGQKCMV 1174
Query: 1179 H--KWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLA 1234
K G+ L +AF D +V S+ + L+ D K ++F + E+ +L
Sbjct: 1175 RGLKEDGSLLP-VAFLDM-SCHVSSVRELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLG 1232
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1294
K GSL +FL DG LS+V D ++ + + P+ +S +G LL R F V +
Sbjct: 1233 KSSGSLPLLVADFLPDGEDLSMVAVDADGDMHVLEFNPEHPKSLQGHLLLHRTTFSVTPN 1292
Query: 1295 -VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
T L L +S + + S T LL + GSI + PL E +RRL S+
Sbjct: 1293 PPTSTLLLPRTLPASQSSQDSSSSSSTQPHILLLASPSGSIAALTPLPESAYRRLLSVTN 1352
Query: 1354 KLVDS-VPHVAGLNPRSFRQFHSNGKAHR-------PGPDSIVDCELLSHYEMLPLEEQL 1405
+L+ + VPH GL+ R+ R G R +IVD +L+ + L ++
Sbjct: 1353 QLLPALVPH-GGLHARAHRTPEGGGGMSRTVGVETAATGRAIVDGTVLTRWNELGAAKRA 1411
Query: 1406 EIAHQTG 1412
E+A + G
Sbjct: 1412 EVATRGG 1418
>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
Length = 1376
Score = 217 bits (553), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 169/630 (26%), Positives = 295/630 (46%), Gaps = 63/630 (10%)
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSK----------------------------SDD 869
RPFLFA++ + +L Y+A+ + P+ + +DD
Sbjct: 755 RPFLFAVVEE-QLLIYEAFHYPYPQQRYRLSVRFKKVRHTAILQRFRRIGRDDFKLLADD 813
Query: 870 -----PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISG 922
R S + + SR R R S +A+ E + AP ++++ F+N++G
Sbjct: 814 FQFSEQYRRRRKRSKHDSNRSR-RGDRHSGRRQEAHEHEPYRLTYEAPARQLSPFENVAG 872
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+ G P +C + ++ LR+HP DG +VAF + F Y T+ G++++
Sbjct: 873 YAGLFIGGGYPYFCFLSKQGDLRLHPMHIDGPVVAFAPYCSPKQLRAFAYFTADGMMRVS 932
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
LPS +D P KV L H + Y E + Y L S + P ++V++L+ D +
Sbjct: 933 SLPSKFDFDRSIPSMKV-ELGRAAHFVVYLMESHTYALTTSEQM--PCHKVVTLIGDDKQ 989
Query: 1042 GHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTR--ATIPMQSSENALTVRVV 1098
D H Y T+E++++++ D W A + E+ + V
Sbjct: 990 FETFDREAP-----HFIYPTMEQFKLQLYSADT----WLPVPGAELDFDEFEHVTACQEV 1040
Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSK 1152
L + + ++ LAIGT GE+V RGR+L+ P +T+ VYSK
Sbjct: 1041 QLKSEGSASGLQSYLAIGTVLNYGEEVLIRGRLLIIDVVEVVPEPDRPMTKFKLKVVYSK 1100
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
E KG +++L SL+G+LL G K+ + ++ L GI+F D +YV + ++ L
Sbjct: 1101 EQKGPVTSLCSLRGYLLTGMGQKVYIWQYKDNALVGISFLDLQ-VYVHQMASIRYLALTA 1159
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D + L ++E+ L+L+++D + A EFL+D + LS +++ +I + Y P
Sbjct: 1160 DAFFGVSLLRYQEEYKALSLVSRDPRPDEVLAVEFLVDRTDLSFLMTSAAGDILTYVYLP 1219
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
+ +S+ GQ+L+ +A++H G+ V F+R++ + + R L+F + DG
Sbjct: 1220 ESLDSFGGQRLVPQADYHFGSQVNAFVRMR---CHAQEIAGRKRQEVLQRQGLIFASSDG 1276
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
S+ + PL E +R L LQ L+D +P AGLN +R R +I+D +
Sbjct: 1277 SVNYLLPLPEREYRLLGMLQSLLIDMLPSFAGLNVDDYRTVRFPNSCLREPTKNIIDGNI 1336
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
Y + +Q +I Q G++ SQI+ L
Sbjct: 1337 CMLYLYIDALQQEDIVRQIGSSHSQIMLEL 1366
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 165/616 (26%), Positives = 286/616 (46%), Gaps = 74/616 (12%)
Query: 99 ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
AS ELV +++G + S+AI G + D I+LA +DAK+SV+ +D H L S
Sbjct: 65 ASFELVLSEQVYGRLASVAIARLTGF----QLDVILLAIDDAKLSVVGYDIETHSLVTLS 120
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVG 215
MH +E + K G F P++++DP+ RC + +YG +++L + S S +
Sbjct: 121 MHYYEDDLF---KLGFTRFEIPPMLRMDPERRCAAMTIYGAHLVVLPLVRESLYESMNIV 177
Query: 216 DEDTFGSGGGFSARIESSHV-INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
D G FS R+ S V N D M +V D F+HG+ EP +++L+E T AGRV
Sbjct: 178 DPSQ-RPGWPFSLRLTSYTVAFNAIDAKMHNVTDMCFLHGFYEPTVLLLYEPTQTTAGRV 236
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
+ T I A+S++ K H +IW+ NLP DA+ LLA+P P+GGVL+ N+I Y +Q
Sbjct: 237 VVRQDTYQILAVSLNPKDKTHAVIWTLGNLPFDAFALLALPKPLGGVLLFSVNSIIYLNQ 296
Query: 335 SA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
S C + +N+ + RS V LD +HA + + A L ++G + ++++++
Sbjct: 297 SVPCCGILINDNGRGFTNYPLRDRSELMVTLDGSHAALIDSANAALVLRSGLVFVVSLLF 356
Query: 394 DG-RVVQRLDLSKTN-----PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
D +V+ + L+ ++ PS +++ + ++ F+GS +G+S L + +++
Sbjct: 357 DRLNMVKEILLTASSVRGAAPSTVSA---CVSSNCLFVGSAIGNSALYAYEAIEQVDVVA 413
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN--NTES-AQKTFSFA 504
L R + + L DM LYG TE+ Q F F
Sbjct: 414 VTLPA---------------RDTGLNLLDDM------QLYGELIRPCTTETLVQTKFEFR 452
Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
D L ++GP + + G A + S++ + PG G +TV +S R
Sbjct: 453 RLDQLASLGPCRAITVGESSVAMVNNFYEDYVSDWLVAGGPGTDGSFTVMQRSVRPRLLT 512
Query: 565 SSRMAAYDDEYHA-----------------YLIISLEARTMVLETADLLTEVTESVDYFV 607
+R+ + + Y++++ + RT+V + +TE+ ++ + +
Sbjct: 513 QTRVEDVLNAWSVGAQLIGSVDRSASPRPQYMLLTTKQRTVVFTLSSGITEIFDT-GFEI 571
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
+ TIA G++ V+QV + +L Q ++ V S+
Sbjct: 572 RFETIACGDMMNGAYVVQVTKENLVLLHRGQQVQCINL----------RVFEEVCQASVI 621
Query: 668 DPYVLLGMSDGSIRLL 683
DPYV L + G + L
Sbjct: 622 DPYVALIVRHGHVLLF 637
>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
Length = 1153
Score = 214 bits (544), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 270/1232 (21%), Positives = 501/1232 (40%), Gaps = 180/1232 (14%)
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDS 351
K I S LP+D ++++ + P+GG L+VG N IH + +A+N +A
Sbjct: 10 KASTTILSVGGLPYDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTG 69
Query: 352 SQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP- 408
L ++ + L+ L +N L+ +GD+ +L+ DGR V L + + +
Sbjct: 70 FSLLDQADLDLRLEGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAE 129
Query: 409 ---SVLT---SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
++LT S ++++G F+GS + DS+++ + SG + + E D
Sbjct: 130 LGGAILTGAASCVSSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVD-- 187
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
+ D + G+ ++ + +N T S KT ++F + DS+VNI P+ + ++
Sbjct: 188 -EAMFDEEDLEDDDDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAF 246
Query: 521 G---LRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
G L + D QS +LV +LP +GIW
Sbjct: 247 GEAALSLGKDEELKSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIW 306
Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
T+ K + +G + + D +Y +I+S +A + E+A D
Sbjct: 307 TMSAKRPAPKGLQVNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGF 366
Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E ++ G TI AG L RV+Q+ + R DG + Q L + E+G+
Sbjct: 367 EALTGTEFEPAAGSTIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA- 423
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
++S S ADP++LL D SI + D ++ I S K ++ C LY D
Sbjct: 424 --EPKIISASFADPFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD 480
Query: 716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
+D+ S GE ++ + GAL I+ +P+ +
Sbjct: 481 ------YSGAFSDSK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPV 520
Query: 776 TVDK---FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V + FV + A +++ TEI L
Sbjct: 521 YVAEGICFVPPVLSADYAARKSAARETLTEI-----------------------LVANLG 557
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ P+L ++ + Y+ + + S S L S + +++N
Sbjct: 558 DSVSQSPYLILRPSNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHL 605
Query: 893 SRTPLDAYTREETPHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
++ P + EE GA P + I+ N+ G+ F+ G P + + +
Sbjct: 606 TQAP--DVSAEEQVDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKV 660
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPH 1006
L + + + H C+ GFIY +++GI ++ Q P +T+ + + I + H
Sbjct: 661 LSLQGTGVRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDVH 720
Query: 1007 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL-SSVDLHRTYTVEEYE 1065
+ Y Y + S D E+ D+ ++ ++E+
Sbjct: 721 AVAYHPPLQTYVIGTST------------FTDFELPKDDDHRKTWQEENIALKPSIEKSF 768
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDV 1124
++++ P W I ++ E ++ + L + T E + L+ +GTA +GED+
Sbjct: 769 LKLVSPVN----WSVIDAIELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDL 824
Query: 1125 AARGRVLLF---STGRNADNPQN------LVTEVYSKELKGAISALASL--QGHLLIASG 1173
A GR+ ++ + D P+ + +E+ ++ G ++ L+ + QG +L+A G
Sbjct: 825 ATTGRLYVYDVVTVVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQG 884
Query: 1174 PKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQ 1229
K ++ K GT L +AF D YV S+ + ++ D K ++F + E+ +
Sbjct: 885 QKCMVRGLKEDGTNLP-VAFMDMN-CYVTSVKELPGTGLCVMADALKGVWFAGYTEEPYR 942
Query: 1230 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1289
+ L K ++ + L DG L +V +D N+ I Y P+ +S +G LL R F
Sbjct: 943 MLLFGKSAAKMEVLCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTF 1002
Query: 1290 HVGAH-------VTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPL 1340
+GAH + L L T+ + + T + LL + G++ ++PL
Sbjct: 1003 SLGAHHPTTMTLLPTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLLLTSRTGTLALLSPL 1062
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
E +RR +L L +++ H GLNPR++R + G +I+D +L + L
Sbjct: 1063 TESQYRRFGTLVSHLTNTLYHPCGLNPRAYR-IDRDANEGIVGGRTIIDGGVLGRWMELG 1121
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
+ + E+A + G ++ L+ L G F+
Sbjct: 1122 SQRRGEVAGRVGVDVLELRDELSGLRGGLGFI 1153
>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
Length = 1451
Score = 213 bits (541), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 179/679 (26%), Positives = 315/679 (46%), Gaps = 61/679 (8%)
Query: 760 SGALEIFDVPNFNCVFTV-DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
+G+LEI+ +P+ C+ D+ + I++T S EG+ +GR+ +
Sbjct: 814 NGSLEIYSLPD--CLLRFGDRNFANAPRILET---------------SRFEGS-EGRRVD 855
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
+ + V E+ + S P++ ++ D ++ Y F N +++ PV + R +
Sbjct: 856 V--LDVQEMNVFNMGPS-SLPYIVVMIGDQLMI----YRFRATLNRFQTESPVLSGRFIK 908
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ + + + LR D ++ + + ++ F NIS H G FL G+ P W
Sbjct: 909 LQD----KTKLLRRIPGVHDESSKTKNRNNKIMRQ---FMNISDHNGIFLGGAYPTWIFC 961
Query: 939 FRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQ 996
+ RL +H +G + AFT N C GF+Y S L + L YD WP +
Sbjct: 962 GQNGRLNIHSMWQEGFVNAFTPFDNEKCADGFLYFRHSTKTLTVANLQPFLKYDADWPFK 1021
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL--IDQEVGHQIDNHNLSSVD 1054
K I L TP +Y E+ + + V ++ + +L I+ E GH+ + +L V
Sbjct: 1022 K-IKLNYTPCFSSYDLEQKV------LTVCGSRSEKIEMLPKINAE-GHK-EYEDLPEVQ 1072
Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE-NETL 1111
T ++ V + P W+ + I M + E+ L R V L + + +
Sbjct: 1073 NVETQLFPQFFVEMFSP----ASWEVIPNSRIEMDAHEHILCCRSVYLKSEASMSGRKQY 1128
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQG 1166
+AIGT+ + GED +RGR++L P +T V+ +G +SA+ SL G
Sbjct: 1129 IAIGTSNICGEDFQSRGRLILLEVIDVVPEPGKPLTRYKYKTVFDASQRGPVSAVDSLDG 1188
Query: 1167 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1226
L+ A G K+ +H + L F D LY + + KN+ L+GDI + I L + +
Sbjct: 1189 ALIAAIGQKVFIHAFQDDNLRATGFVDTQ-LYTHATHCFKNYALVGDIQQGITLLRHQGE 1247
Query: 1227 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
++ +++ + + A L+DG+ + LV +D Q+N+Q++ Y P ES G++L+ +
Sbjct: 1248 RNCISQISRARRAGEVTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQ 1307
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1346
A+ ++G V L +D ++ R + LDGSIG I P+ E FR
Sbjct: 1308 ADINLGKRVISIW--NSLGRQNDTFTKVALTENDARHVTFYAGLDGSIGDIVPVSEKVFR 1365
Query: 1347 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
RL+ LQ + +PH GLNPR +R + + +I+D +LL + L EQ +
Sbjct: 1366 RLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSFTEQTD 1425
Query: 1407 IAHQTGTTRSQILSNLNDL 1425
++ + G TR +L ++ D+
Sbjct: 1426 LSRKIGVTREALLDDMMDV 1444
Score = 188 bits (478), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 181/686 (26%), Positives = 311/686 (45%), Gaps = 77/686 (11%)
Query: 57 NLVVTAANVIEIYVVR--VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
NL V A N++ +Y +R V E G+ + EL + L G V
Sbjct: 45 NLAVAAGNMLSVYRIRSSVDEAGNHFDR------------------FELCDEFELWGIVV 86
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
+ L G+ RDS++L+ E++K ++E++ L SMH F+ + L+RG
Sbjct: 87 CMTRLRLAGS----VRDSLLLSIEESKCVIVEYEPDTGSLSTISMHFFQDED---LRRGF 139
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS-GLVGD--EDTFGSGGGFSARIE 231
+ L +VD RC VLVYG + +L + L G + F GF A
Sbjct: 140 RKLSSMALARVDGFNRCAAVLVYGSYLAVLPFRRSTERDLSGQRHQAVFYENSGFIA--- 196
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
++I+L+ L +K V DF F+ GY +P +++L+E TW GRV+ + TC + ALSI+
Sbjct: 197 --NMIDLQSLPVKIASVLDFQFLEGYNDPTILLLYEALPTWTGRVTERQDTCGMVALSIN 254
Query: 290 TTLKQHPLIWSAMNLPH-DAYK--LLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNY 345
+ HP+IW LP + Y L +P P+GG L+ N++ Y QS +ALN+
Sbjct: 255 LIDETHPVIWQMAGLPFPNPYSSALFPIPKPLGGSLLFATNSLIYLDQSVPPYGVALNSL 314
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
+ + + + L A L +D +S ++GD+ ++T+ D V+R L
Sbjct: 315 PLGCTNFALKTQDVAPLNLQNCKACMLSDDSICVSLESGDVYIITLKKDSLNNVRRFYLD 374
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+ SV+ + ++ + ++L FLGSRLG+SLL+++ C + S+ L+ D
Sbjct: 375 QVASSVIPTTLSKLSDNLIFLGSRLGNSLLLRYKCKENSKKSSTSLENGEKDGVEIENKE 434
Query: 465 KRLRRSSSDALQDMVNG------EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
+ + + + NG +++ YG N + ++ F D+L NIGP
Sbjct: 435 EEKNELNFEIEKSSENGSPENKRKKMRYYGDEIFNLD-VNTSYDFETMDNLSNIGPCGPV 493
Query: 519 SYGLRINADASATGI---SKQSNYELVELPGCK----GIWTVYHKSSRGHNA-------- 563
N + + + ++ N ++ L G G TV HKS R A
Sbjct: 494 ELIHTANHNDNYDHVGSDARDRNIDVCVLSGKDKTGFGSITVLHKSVRPSIASQFPFPMN 553
Query: 564 --DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV-TESVDYFVQGRTIAAGNLFGR 620
D + ++E H+ L+++ + +TMV +T +L E+ E +TI +
Sbjct: 554 FSDMWTLRRSEEETHSLLVMTKKDQTMVFQTGAILEELKKEECGLATNAKTIFCATIGNG 613
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS- 679
+ ++QV R ++D TQ+ SG ++ V+ DPYV++ S G+
Sbjct: 614 KYIVQVLPRAVVLVDMD--TQETIQNKPFDLSGQ------IIQVA-CDPYVVILASKGTI 664
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKK 705
I L++ + S T ++T A E +
Sbjct: 665 ISLVLFENSDGTAMLKTSTAPECKNQ 690
>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1283
Score = 213 bits (541), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 242/988 (24%), Positives = 419/988 (42%), Gaps = 145/988 (14%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYELVELPG----------- 546
+ F + D L N+GP++D + G + D S +N ELV G
Sbjct: 376 YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQPVSSILANLELVTTQGYGKAGGLAILR 435
Query: 547 ---------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
G +VY K + + S Y YL++S + +
Sbjct: 436 REIDPFVIDSLMIKDTDGARSVYVKDPKLPSQSGSLPLNPGSNYDHYLLLSKSKGLDKEK 495
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLS 644
++V + E T++ ++ + RTI G L RV+QV + R D G + Q
Sbjct: 496 SVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLASGTRVVQVLKGEVRSYDSGLGLAQIFP 555
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
+ SE +V+ S ADPYVL+ D SI LL D S +T I S+
Sbjct: 556 VWDEDM-----SEEKSVVHTSFADPYVLIIRDDQSILLLQADDSGDLDEAETDGIINSTT 610
Query: 705 KPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGD-IYSVVCYESGA 762
S +LY DK + +G P + Q D + +
Sbjct: 611 --WISGSLYQDKY-------------------RSFKSHEGPPNMKQSDNVLLFLLSSESK 649
Query: 763 LEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
L +F +PN VFT + D +I S+ +E I
Sbjct: 650 LYVFHLPNAREPVFTTESI-----------------DLLPQILSTEPPPRRVTYRETITE 692
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L + P+L ++ ++ Y+ Y + ST R S
Sbjct: 693 LLVADLG----DSVSRSPYLILRSSNSDLILYEPYHYTS-----------STERQFS--G 735
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAPCQRIT----IFKNISGHQGFFLSGSRPCWCM 937
+ ++ N F ++ ++ + H A C I+ + ++ G++ F+ G+ PC+ +
Sbjct: 736 LRFVKIANHHFPKSHSESNAGK---HPANCTAISKPLRVLGDVCGYRTVFMPGNSPCFII 792
Query: 938 VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
+ L ++ + + + C GF+YV + ++++C+ P + +D W +K
Sbjct: 793 KSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARK 852
Query: 998 VIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHR 1057
I L + Y + Y + + V +L D E+ + N +S +
Sbjct: 853 -IGLGEQVDAVEYSSSSETYVIGTNQKV------DFNLPEDDEIHPEWRNEVISFLP--- 902
Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGT 1116
+++ V++L P W + ++++E + V+ + L + T E + + +GT
Sbjct: 903 --QIDKGSVKLLTPRT----WSIIDSYNLRTAERIMCVKCLNLEVSEITHERKDTIVVGT 956
Query: 1117 AYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISALASL--QGHLL 1169
A +GED+AARG + +F D P+ + + +E+KGA+++L+ + QG L+
Sbjct: 957 ALTKGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGSLI 1016
Query: 1170 IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKE 1225
A G K I+ K G+ L +AF D YV L +K ++GD K ++F + E
Sbjct: 1017 AAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDALKGLWFAGYSE 1074
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+ +L+L +KD G+L A +FL DG+ L ++V+D+ NI + Y P+ S KG +LL
Sbjct: 1075 EPYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLH 1134
Query: 1286 RAEFHVGAHVTKFLRLQMLAT-SSDRTGAAPGSDKTNRFALLFGTL----DGSIGCIAPL 1340
R+ F G + L AT SS R A P + L L GSI I P+
Sbjct: 1135 RSTFQTGHFASTMTLLPRTATSSSQRPDADPDMMDLDSSGPLHHVLVTSETGSIALITPV 1194
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
E ++RRL +LQ +L +++ H GLNPR+FR S+G R +VD +L+ + L
Sbjct: 1195 SETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGIGGR----GMVDGDLVKRWLDLG 1250
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDLALG 1428
+ + EIA++ G +I ++L + G
Sbjct: 1251 TQRKAEIANRVGADVWEIRADLEAIGKG 1278
Score = 47.4 bits (111), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 84/180 (46%), Gaps = 21/180 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V +++++ + GS + +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKTTLLQVFNLVNVVYGSGPGQPDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D H + TS+H +E + +++ +
Sbjct: 81 GRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIHYYERDD-VNISPWTPN 137
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIESSH 234
A P + VDP RC VL +G + + IL Q G LV D+ F + +E H
Sbjct: 138 LASCPSYLTVDPNSRC-AVLNFGKKNLAILPFHQVGDDLVMDD--------FDSDVEEQH 188
>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
Length = 1454
Score = 208 bits (530), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 193/749 (25%), Positives = 344/749 (45%), Gaps = 84/749 (11%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
++ DA +S+ GE D +D YS VV +++G + I +P+ V+
Sbjct: 737 KRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLVVAHDNGRITIHSLPDLELVYQ 791
Query: 777 VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT--------GQGRKENIHSM------ 822
+ +F + +VD + E K+ + + ++ E+ + ++ ++S
Sbjct: 792 IGRFSNVPELLVDMTVEEEEKEKKAKQTAAQEKEKETEKKKDDAKNEEDQVNSEMKKLCE 851
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
KVVE + + + P L AI+ D ++ Y+ + P+ V+ + + +
Sbjct: 852 KVVEAQIVGMGINQAHPVLIAII-DEEVVLYEMFASYNPQPGHLG---VAFRKLPHLIGL 907
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE 941
S N+ R P + E HG I F+ IS + G + G+ P +V+
Sbjct: 908 RTSPYVNIDGKRAPFEM----EMEHGKRYTLIHPFERISSINNGVMIGGAVPTL-LVYGA 962
Query: 942 --RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKV 998
++ H DGSI AFT +N N HGF+Y+T Q L+I ++ YD +PV+K
Sbjct: 963 WGGMQTHQMTIDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKK- 1021
Query: 999 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLH 1056
I + T H + Y ++Y ++ SVP KP N++ ++ D QE H+ D + + +
Sbjct: 1022 IEVGKTVHNVRYLMNSDIYAVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAP 1077
Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN------ET 1110
YT+ + Q A +P E V + + K +T
Sbjct: 1078 PKYTLNLFSS------------QDWAAVPNTEFEFEDMEAVTAMEDVPLKSESRYGGLDT 1125
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQ 1165
LA+ T GE+V RGR++L P + +Y KE KG ++ L ++
Sbjct: 1126 YLALATVNNYGEEVLVRGRIILCEVIEVVPEPGQPTSNRKIKVLYDKEQKGPVTGLCAIN 1185
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
G LL G K+ + ++ +L GI+F D YV L+ ++ L D +S+ + ++E
Sbjct: 1186 GLLLSGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSIRTIALALDARESMSLIRFQE 1244
Query: 1226 QGAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+ +++ ++D C A+EFL+DG + ++SDE NI +F Y+P+ ES G+
Sbjct: 1245 ENKAMSIASRD--DRKCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGE 1302
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
+L +A ++G ++ FLR++ + D + + R +FG+LDGS G I PL
Sbjct: 1303 RLTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLT 1362
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHY 1396
E ++RRL LQ + P +AGL+ + R Q NG+ R +++D +++ Y
Sbjct: 1363 EKSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNAR----NLIDGDVVEQY 1418
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
L + ++ ++A + G R IL +L L
Sbjct: 1419 LHLSVYDKTDLARRLGVGRYHILDDLMQL 1447
Score = 188 bits (478), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 150/576 (26%), Positives = 266/576 (46%), Gaps = 81/576 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G ++ P+V+ DP
Sbjct: 92 QDSILMTFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFTTYFNPPIVRTDPAN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + ++ S++I L+ +D + +V
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLEGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL++P P+GG +V G+NTI Y +Q+ C + LN+ D + P +
Sbjct: 251 CNSLLSIPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + ++ GDL LL +V G V+ L+ SK + + +T
Sbjct: 308 LDCSTSVYMEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG 481
F+GSRLGDS L+++T ++ S K+ R + + ++
Sbjct: 368 HLFVGSRLGDSQLLEYTL-----------------LKVTKESAKKQRLEQQNPSEIELDE 410
Query: 482 EELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
+++ LYG A +++ E ++ F D L+N+GP+K +G R N ++ +K+
Sbjct: 411 DDIELYGGAIEMQQNDDDEQISESLQFRELDRLLNVGPVKSMCFG-RPNYMSNDLIDAKR 469
Query: 537 SN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLIISL 583
+ ++LV G G V+ +S R SS + ++E H YLI+S
Sbjct: 470 KDPVFDLVTASGHGKNGALCVHQRSMRPEIITSSLLEGAEQLWAVGRKENESHKYLIVS- 528
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQD 642
R+ ++ E + T+AAG L +QV A + DG M Q+
Sbjct: 529 RVRSTLILELGEELVELEEQLFVTNEPTVAAGELLQGALAVQVTSTCIALVTDGQQM-QE 587
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
+ N V+ SI DPYV + +G
Sbjct: 588 VHI----------DSNFPVVQASIVDPYVAVLTQNG 613
>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
Length = 1452
Score = 207 bits (526), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 329/1440 (22%), Positives = 572/1440 (39%), Gaps = 222/1440 (15%)
Query: 91 VLMDGISAASLELVCHYRLHGNVESLAIL-----SQGGADNSRRRDSIILAFEDAKISVL 145
+L D L LV + G + LA L S GG ++++LA+ AK+ +
Sbjct: 94 LLRDRSQHTKLVLVAELPVAGTIIGLARLKLPHTSSGG-------EALLLAYRGAKMCLT 146
Query: 146 EFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMI 202
E++ L S+H +E E L+ G V + DP RC +
Sbjct: 147 EWNPRRAALETVSIHFYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNLA 203
Query: 203 ILKASQGG--------------------SGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
IL Q + L + D G G ++ S V+ L LD
Sbjct: 204 ILPFRQAEEDLEMEDWDEALDGPKPPKEASLATNGDANGDANGTQSQHSPSFVLRLPLLD 263
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ H F+H Y EP IL + T H T + L + + I S
Sbjct: 264 PTLLHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHLTYKVFTLDLKQ--RASTTILS 321
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA-----VSLDSSQE 354
LP D +++ +P+P+GG L+VGAN IH + +A+N A SL+ E
Sbjct: 322 VTGLPQDLSRVIPLPTPVGGALLVGANELIHIDQSGKANGVAVNPMARQMTSFSLNDQSE 381
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL 411
L ++ +E A +++ LL L +++ DGR V + L S+ N L
Sbjct: 382 L---NYRLEGCAIEPVSMESGELLLILNDASLAIVSFKIDGRTVSGISLVPVSQENGGNL 438
Query: 412 ----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S I+ IG S F+GS GDS+++ ++ S +++ ++A+
Sbjct: 439 LKSHVSCISRIGKSSMFIGSEYGDSVVLGWS-----RKQSQEKRKKSRVLDAELALDVDD 493
Query: 468 RRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
D + G E + S + N + F ++DSL+ + P+ D + G
Sbjct: 494 IDLDDFDEDDDLYGTESTAAKPSLATNGVTKGGELIFRLQDSLLCLAPIHDVAPGKAVFP 553
Query: 522 -------LRINAD-----ASATGISKQS-----NYEL-------VELPGCKGIWTVYHKS 557
LR A A G K N E+ E P +G WT+ K
Sbjct: 554 LDSEEVVLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWTMCVKK 613
Query: 558 ----SRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
+ G NA S + YD E + +I + ET+D+ +
Sbjct: 614 PLPKALGSNAVVS--SEYDSMELYDRFMIVAKVDLDGYETSDVYALTDAGFESLKDTEFE 671
Query: 607 -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
G T+ AG + + R+IQV + R DG ++Q L + +G+E V+S
Sbjct: 672 PAAGFTVMAGTMGKQMRIIQVLKSEVRCYDGDLGLSQILPM----MDEDTGAE-PRVVSA 726
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SIADPY+++ D SI + A I+S+ + + DKGP ++
Sbjct: 727 SIADPYLMVIRDDNSIFI---------------AKIDSNDE---LDEVEKDKGPLASIK- 767
Query: 725 TSTDAWLSTGVGEAIDGADGGPL--DQGDIYSVVCY---ESGALEIFDVPNFNCVFTVDK 779
W TG A P D+G ++ + +GAL I+D+ N +
Sbjct: 768 -----W-QTGCLYADHDGHFQPKQPDEGSSPRILMFLMSTTGALHIYDLDNLS------- 814
Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
Y+ E L S S++ G KE + + V +L P
Sbjct: 815 --------EPVYVAEGLT-STPPFLSANFTGRKAAAKETLTEILVADLG----DVVAKSP 861
Query: 840 FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
+L + Y+ + P ++S S +L + S + + D
Sbjct: 862 YLILRHDTDDLTLYEPVRYHEPNSSS-----APLSDTLFFKKSTNSTIAKSAPASDKEDD 916
Query: 900 YTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
T+++ P Q + N+ G+ FLSG P + + + + L + +
Sbjct: 917 ETQQK--RFVPLQ---LCANVGGYSAVFLSGDSPSFILKSAKSIPRIVGLQGQGVQGMST 971
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL 1019
H C+ GFIY ++GI ++ QLP+ + Y K IPL +++++ + Y
Sbjct: 972 FHTEGCDRGFIYADTKGIARVSQLPTDTNYAELGISVKKIPLDCDVNRVSFHSHTATY-- 1029
Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPW 1078
I + +P E+ D H + ++ T+ ++++ P W
Sbjct: 1030 IAACSTREPF----------ELPKDDDYHKEWARETVNFAPTMPRGILKLISP----AAW 1075
Query: 1079 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1137
++ ++S E ++ + L + TKE ++A+G+A +GED+ RGRV +F
Sbjct: 1076 TVIHSLDLESCETIESMMALHLEISEETKERRMVVAVGSAICKGEDLPTRGRVQVFDIVT 1135
Query: 1138 NADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILHKWTGTELNG- 1188
P N ++ +KE +G +++L+ + G LLIA G K ++ G +G
Sbjct: 1136 VIPEPGRPETNKRLKLLAKEELPRGGVTSLSEIGTSGLLLIAQGQKCMVR---GLREDGG 1192
Query: 1189 ---IAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+AF D +++ + ++ L+ D K ++F + E+ +L K G +
Sbjct: 1193 LLPVAFLDMN-CHILGVRELRGTGLCLMADAFKGMWFAGYTEEPYTFKVLGKSGGQIPML 1251
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKFLRL 1301
+FL DG L+++ D ++ +F + P +S +G LL R F + + T L
Sbjct: 1252 VADFLPDGEDLNMIGVDADGDLHVFEFNPDHPKSLQGHLLLHRTTFSLSPNEPTTTVLLE 1311
Query: 1302 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1361
+ + S + G++ + LL G + + PL E +RRL SL +L+ +V
Sbjct: 1312 RTIPASQPQPQGTTGAETPH--TLLLSCPTGQLAALTPLSESAYRRLLSLANQLMPAVVP 1369
Query: 1362 VAGLNPRSFRQFHSNGK---AHRPGPDS------IVDCELLSHYEMLPLEEQLEIAHQTG 1412
GL+P++ R G A G ++ IVD +L+ + L ++ E+A ++G
Sbjct: 1370 YGGLHPKAHRLPEGRGAQSHARAVGVETAASGRMIVDGAVLARWTELGAAKRAEMATKSG 1429
>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1444
Score = 206 bits (525), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 329/1450 (22%), Positives = 573/1450 (39%), Gaps = 235/1450 (16%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
GET +L D L LV + G V LA L ++ ++++LA+ AK+ +
Sbjct: 85 GET--LLLRDRAQNTKLVLVAEIPVAGTVIGLARLKLQNTESGG--EALLLAYRGAKMCL 140
Query: 145 LEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQM 201
E++ L S+H +E E L+ G V + DP RC +
Sbjct: 141 TEWNPQKAALDTVSIHYYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNL 197
Query: 202 IILKASQGGSGL-VGDEDTFGSG-----------GGFSARIESSH----VINLRDLD--M 243
IL Q L + D D G G ES H V+ L LD +
Sbjct: 198 AILPFRQAEEDLEMEDWDEALDGPKPAKEAALATNGDDHETESQHSPSFVLRLPLLDPTL 257
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
H F+H Y EP IL + T H T + L + + I S
Sbjct: 258 LHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHMTYKVFTLDLKQ--RASTTILSVTG 315
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
LP D +++ +P+P+GG L+VG N IH + +A+N A + S +S +
Sbjct: 316 LPQDLKRVIPLPTPVGGALLVGENELIHIDQSGKANGVAVNPMARQMTSFSLADQSELNY 375
Query: 363 ELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL----TS 413
L+ A +++ LL L +++ DGR V + L S+ N L S
Sbjct: 376 RLEGCAIEPISMESGELLLILNDASLAIISFKIDGRTVSGISLAAVSQENGGNLLKSRVS 435
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
I+ IG + F+GS GDS+++ ++ S +++ ++ D L D
Sbjct: 436 CISRIGKASMFIGSESGDSVVLGWS-----RKQSQEKRKKSRALDTD------LALDVED 484
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLRIN 525
D E+ LYG+ S + +Q F ++D+L+ + P+ D + G +
Sbjct: 485 IDLDDDFDEDDDLYGTESAAAKPSQAGAGATKGGEPVFRLQDALLCLAPIHDVAPGKAVF 544
Query: 526 AD-----------------ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK 556
A A G K N E+ E P +G W + K
Sbjct: 545 PSDSEEAFLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWAMCVK 604
Query: 557 SSRGHNADSSRM--AAYD--DEYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
SS + + YD ++Y ++I++ + ET+D+ +
Sbjct: 605 KPVPKALGSSAVISSEYDSTEQYDRFMIVA-KVDLDGYETSDVYALTDAGFESLKDTEFE 663
Query: 607 -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
G T+ AG + + R++QV + R DG ++Q L + +G+E V+S
Sbjct: 664 PAAGFTVMAGTMGKQMRIVQVLKSEVRCYDGDLGLSQILPM----LDEDTGAE-PRVVSA 718
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SIADPY+++ D S+ + + + +E +K DKGP
Sbjct: 719 SIADPYLMIIRDDNSVFI---------AKIGSNDELEEVEK---------DKGP------ 754
Query: 725 TSTDAWLSTGVGEAIDGA--DGGPLDQGDIYSVVCYES--GALEIFDVPNFNCVFTVDKF 780
+ W + + DG P D +++ S GAL ++D+ N +
Sbjct: 755 LVSTKWQTGCLYTDYDGTFQAKKPDDNASPRTMMFLMSTAGALHMYDLDNLS-------- 806
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L S S++ G KE + + V +L PF
Sbjct: 807 -------EPVYVAEGLT-STPPFLSANFTGRKAAAKERLTEILVADLG----DVVSKSPF 854
Query: 841 LFAILTDGTILCYQAYLFEGPENTS---------KSDDPVSTSRSLSVSNVSASRLRNLR 891
L + Y+ ++ P ++S K + ++S S + + R
Sbjct: 855 LILRHDTDDLTLYEPVRYQEPNSSSPPLTDTLFFKKSANATIAKSASAFDKEEDETQQRR 914
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
F PL PC N+ G+ FLSG P + + + + L
Sbjct: 915 F--VPLQ-----------PC------GNVGGYSTVFLSGDSPSFVLKSAKSIPRIVGLQG 955
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
+ + H C+ GFIY ++GI ++CQLP+ + Y K IPL +++++
Sbjct: 956 QGVQGMSTFHTAGCDRGFIYADTKGIARVCQLPTDTNYAELGISVKKIPLDCDVNRVSFH 1015
Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL-HRTYTVEEYEVRILE 1070
+ Y I + +P E+ D H + ++ T+ ++++
Sbjct: 1016 SHTATY--IAACSTREPF----------ELPKDDDYHKEWAREVVSFAPTMPRGMLKLIS 1063
Query: 1071 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1129
P W ++ ++S E ++ + L + TKE L+A+G+A +GED+ RGR
Sbjct: 1064 P----AAWTVIHSLDLESCETIESMMALHLEISEETKERRMLVAVGSAICKGEDLPTRGR 1119
Query: 1130 VLLF-------STGRNADNPQNLVTEVYSKELKGAISALASL--QGHLLIASGPKIILHK 1180
V +F GR N + L + + +G +++L+ + G LLIA G K ++
Sbjct: 1120 VQVFDIVTVIPEPGRPETN-KRLKLQAKEELPRGGVTSLSEIGTSGLLLIAQGQKCMVR- 1177
Query: 1181 WTGTELNG----IAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLA 1234
G +G +AF D +++ + ++ L+ D K ++F + E+ +L
Sbjct: 1178 --GLREDGGLLPVAFLDMN-CHILGVRELRGTGLCLMADAFKGMWFAGYTEEPYTFKVLG 1234
Query: 1235 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1294
K G + +FL DG LS++ D ++ +F + P +S +G L+ R F + +
Sbjct: 1235 KSGGQIPMLVADFLPDGEDLSMIGVDADGDLHVFEFDPDHPKSLQGHLLIHRTTFSLSPN 1294
Query: 1295 --VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1352
T L + + S + G++ + LL G + + PL E +RRL SL
Sbjct: 1295 EPTTTVLLERTIPASQPQPKGTTGAETPH--TLLLSCPTGQLAALTPLSESAYRRLLSLT 1352
Query: 1353 KKLVDS-VPHVAGLNPRSFR-------QFHSN--GKAHRPGPDSIVDCELLSHYEMLPLE 1402
+++ + VPH GL+P++ R Q HS G IVD +L+ + L
Sbjct: 1353 NQVLPAVVPH-GGLHPKAHRLPEGRGAQSHSRAVGVETAASGRMIVDGAVLARWTELGAA 1411
Query: 1403 EQLEIAHQTG 1412
++ E+A ++G
Sbjct: 1412 KRAEMALKSG 1421
>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
Silveira]
Length = 1387
Score = 206 bits (524), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 154/534 (28%), Positives = 266/534 (49%), Gaps = 42/534 (7%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ +I G++ F+SGS PC+ M +L ++ + + H C GF YV +
Sbjct: 878 YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASN 937
Query: 977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
++++C+LP + +DN W +KV + + YFA +Y L S V L +
Sbjct: 938 MVRMCRLPGNTRFDNSWVTRKV-HVGDQIDCVEYFAHSEIYALGSSHKVDFKLPE----- 991
Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
D E+ + + +S + +E +++L P W + + +E + ++
Sbjct: 992 -DDEIHPEWRSEVISFMP-----QLERGCIKLLSPRT----WSVVDSYELGDAERVMCMK 1041
Query: 1097 VVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYS 1151
+ + + T E + +L +GTA V+GED+ RG + +F A +P N ++++
Sbjct: 1042 TINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEIIEVAPDPDRPETNRKLKIFA 1101
Query: 1152 KE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVK 1206
K+ +KGA++A++ + QG L++A G K ++ K G+ L +AF D YV L ++
Sbjct: 1102 KDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVKVLKELQ 1159
Query: 1207 N--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
++GD K I+F + E+ +L L KD L A +FL DG L ++V+D+
Sbjct: 1160 GTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADFLPDGKRLYILVADDDCT 1219
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN--- 1321
I + Y P+ S KG +LL R+ FH+G H T + L + SS + PG D +
Sbjct: 1220 IHVLEYDPEDPTSSKGDRLLHRSSFHMG-HFTSTMTL-LPQHSSSPSADDPGEDDMDVDY 1277
Query: 1322 ---RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ +L + +GSIG + PL E ++RRL +LQ +LV S+ H GLNP+++R S+G
Sbjct: 1278 VPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGF 1337
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
R IVD LL + + ++ + EIA + G I +L ++ G FL
Sbjct: 1338 GGR----GIVDGNLLLRWLDMGVQRKAEIAGRVGADIESIRVDLEKISGGLDFL 1387
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 175/731 (23%), Positives = 291/731 (39%), Gaps = 94/731 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV GD D G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + S ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ + S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437
Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
S L K E D + D VN LS S +N +
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
F F D L N+GP+ D + G R +AD + N
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
+ +++ G+W++ + DS+ Y YL+ S + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
++V E ++ ++ + T+ G L G RV+QV + R D + +
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
P E S+ +V+S S A+PYVL+ D S+ LL D S V I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725
Query: 706 PVSSCTLYHDK 716
+S C LY DK
Sbjct: 726 WLSGC-LYLDK 735
>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
Length = 1448
Score = 206 bits (524), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 185/737 (25%), Positives = 339/737 (45%), Gaps = 70/737 (9%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
R+ DA +S+ GE D +D YS +V +++G L I +P+ V+
Sbjct: 741 RRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLMVAHDNGRLSIHSLPDMELVYQ 795
Query: 777 VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR----W 832
+ +F + ++D E K+ + + ++++ + K+ E M+
Sbjct: 796 IGRFSNVPELLMDMTTDEEEKERKAKAQQAAKDTAADEDQLTTEMKKLCERVMEAQIVGM 855
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ S P L AI+ D ++ Y+ + P+ ++ + + S N
Sbjct: 856 GINQSHPVLMAIV-DEQVVMYEMFSHYNPQAGHLG---IAFRKLPHFICLRTSSHLNSDG 911
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR-ERLRVHPQLC 950
R P + E +G I F+ IS + G + G+ P + ++ H
Sbjct: 912 KRAPFEM----EVENGKRYTLIHPFERISSINNGVMIGGAVPTLVVYGAWGGMQTHQMTI 967
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
DG I AFT +N N HGF+Y+T Q L+I ++ Y+ +P++K I + T H +
Sbjct: 968 DGPIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYEMPYPMKK-IEVGRTIHNVR 1026
Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVR 1067
Y ++Y ++ S+P KP N++ ++ D QE H+ D + + + YT+ +
Sbjct: 1027 YLMNSDVYVVVSSIP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLF--- 1079
Query: 1068 ILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDV 1124
+ W+ I + E V+L + +T ET LA+GT GE+V
Sbjct: 1080 ------SSQDWKAVPNTEIEFEDMEAVTACEDVSLKSESTISGVETYLAVGTVNNYGEEV 1133
Query: 1125 AARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILH 1179
RGR++L P + ++ KE KG ++ L ++ G LL G K+ +
Sbjct: 1134 LVRGRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLSGMGQKVFIW 1193
Query: 1180 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--F 1237
++ +L G++F D YV L+ ++ L D +S+ + ++E+ +++ ++D
Sbjct: 1194 QFKDNDLMGLSFLDMH-YYVYQLHSLRTIALACDARESMSLIRFQEENKAMSIASRDDRR 1252
Query: 1238 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1297
+ A +F++DG+ L ++SDE NI +F Y+P+ ES G++L RA ++G +V
Sbjct: 1253 TAKPPMAAQFVVDGAHLGFLLSDENGNITLFNYSPEAPESNGGERLTVRAAMNIGTNVNA 1312
Query: 1298 FLRLQ----MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
FLR++ +L SD + R + +FG+LDGS G + PL E ++RRL LQ
Sbjct: 1313 FLRVKGHTSLLNLQSDEEKES----VEQRMSTIFGSLDGSFGFVRPLSEKSYRRLHFLQT 1368
Query: 1354 KLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+ P +AGL+ + R Q NG+ R +++D +++ Y L L ++ ++A
Sbjct: 1369 FIGSVTPQIAGLHIKGARSARPAQPIVNGRNAR----NLIDGDVVEQYLHLSLYDKTDLA 1424
Query: 1409 HQTGTTRSQILSNLNDL 1425
+ G R I+ +L L
Sbjct: 1425 RRLGVGRYHIIDDLMHL 1441
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 155/578 (26%), Positives = 263/578 (45%), Gaps = 86/578 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++AF+DAK+S++ ++ ++ S+H FE+ +L+ G ++ P+V+ DP
Sbjct: 92 QDSILMAFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFINYFHPPIVRTDPSN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + ++ S++I L+ +D + +V
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL +P P+GG LV G+NTI Y +Q+ C + LN+ D + P +
Sbjct: 251 CTSLLPIPKPLGGALVFGSNTIVYLNQAVPPCGVVLNS---CYDGFTKFPLKDMKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD A + ++++ + + G L LL +V G V+ ++ S+ + + +T
Sbjct: 308 LDCATSVYMEDGRIAVGGRDGVLYLLRLVTSSGGATVKSMEFSRVWETSIAYCLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
F+GSRLGDS LV++T T S ++++ G+IE D
Sbjct: 368 HLFIGSRLGDSQLVEYTLLKMTKESAKRQKIEKDPGEIELDE------------------ 409
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+++ LYG A +++ E ++ F D L N+GP+K +G R N +S
Sbjct: 410 --DDMELYGGAIEMQLNDDEEQILESLEFRELDRLRNVGPVKSMCFG-RPNYMSSDLAEM 466
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++LV G G V+ +S R SS + ++E H YLI+
Sbjct: 467 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSILEGAEQLWAVGRKENESHKYLIV 526
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
S R+ ++ E + T+AAG L +QV A + DG M
Sbjct: 527 S-RVRSTLVLELGEELVELEEQLFVTNEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 584
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
Q++ N V+ SI DPYV + +G
Sbjct: 585 QEVHI----------DSNFPVVQASIQDPYVAVLTQNG 612
>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
Length = 1399
Score = 205 bits (522), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 321/1424 (22%), Positives = 545/1424 (38%), Gaps = 192/1424 (13%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GG 123
Y R ++ ES G V D + L LV L G V LA + GG
Sbjct: 65 YDHRANDDDGLESSFLGGESMLVRTDRTNLTKLVLVAELPLSGTVTGLAKVKTKHSKCGG 124
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPL 182
+++++A++ AK+ + +D L S+H +E E LH SF
Sbjct: 125 -------EALLIAYKAAKLCMAVWDPEKSNLETISIHYYEKEE-LHGAPWEVSFDEYTNY 176
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
++ DP RC + IL Q L D+ G + ES+ V N D D
Sbjct: 177 LEADPGSRCAAFQFGSRNLAILPFRQAEEDLEMDDWDEDLDGPRPVK-ESTTVAN-GDSD 234
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
EP IL + H T + L + + I S
Sbjct: 235 TLEPA---------EPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ--RASTTILSVT 283
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
+LP D +K++ +P+P+GG L++G N IH S +A+N+ A + S ++ +
Sbjct: 284 DLPRDLFKIIPLPAPVGGSLLIGENELIHVDQSGKSNGVAVNSMARQITSFSLTDQADLN 343
Query: 362 VELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSDI 415
+ L+ ++N LL G + ++T DGR V R+ + +++ S
Sbjct: 344 LRLEHCVIETLSIENGELLLVLNDGRIGIVTFQIDGRTVSGLTVRMVADENGGNLIKSRA 403
Query: 416 TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+T +G + +F+GS +GDS+++ +T G K D E
Sbjct: 404 STASKLGKNAYFVGSEVGDSVVLGWTRKMGQEKRR---KPRLIDAEIGLEMDDLDLEDED 460
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-LRINADASAT 531
D D+ E + + + N SF + D+L++I P+KD + G + + D+
Sbjct: 461 DEDDDLYGTESAAAKPAQALNGGGKTGELSFRIHDTLLSIAPIKDLTPGKVSFHPDSEEA 520
Query: 532 GISKQSNYEL----------------------------VELPGCKGIWTVYHKS----SR 559
+S+ +L E P + WT+ K +
Sbjct: 521 TLSQGVVSDLHLACVVGRGKAGSLAILNRNIQPKIIGRFEFPEARDFWTMSVKKPMPKAL 580
Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT------EVTESVDY-FVQGRTI 612
G N ++ Y+I++ + ET+D+ E + ++ G T+
Sbjct: 581 GGNVGMGNEYETFGQHDKYMIVA-KVDLDGYETSDVYALTGAGFETLKDTEFDPAAGFTV 639
Query: 613 AAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
AG + + R+IQV + R DG +TQ L + E+G+ V S SIADPY+
Sbjct: 640 EAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRVTSASIADPYL 694
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
LL D S+ L D + V+ A + K S C KG + ++D
Sbjct: 695 LLIRDDSSLMLAQIDSNNELEEVEKMDATLQNTKWHSGCLYADTKGA---FQPNASDKGA 751
Query: 732 STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHI-VD 789
T I + +GAL ++ +P+ + V+ + H+ D
Sbjct: 752 ETE----------------KIMMFLLSSTGALHVYALPDLSKPVYVAEGLCYVPPHLSAD 795
Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
+R L KEN+ + V +L P+L
Sbjct: 796 YTLRRGLA------------------KENLREILVADLG----DTTSQSPYLILRNQTDD 833
Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
+ Y+ P + S S +L+ S + L + D E P
Sbjct: 834 LTIYE------PLRHVRDGGETSLSATLTFKKTSNTTLATIPVETEQDDV----EQPRFV 883
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
P + NI+G+ FL G P + + + + L + + H C+ GF
Sbjct: 884 PLRPCA---NINGYSTVFLPGPSPSFVIKSSKSIPRVIGLQGLGVRGMSTFHTEGCDRGF 940
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
IY +GI ++ QLP + + K +PL A I Y Y I + +P
Sbjct: 941 IYADDKGIARVTQLPPDTNFTELGISVKKVPLGADVRGIAYHQPTGAY--IAGCMISEPF 998
Query: 1030 NQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1088
E+ D H + L T+ ++++ P W + ++S
Sbjct: 999 ----------ELPKDDDYHKEWAKETLTFPPTMPRGVLKLISPVS----WTVIHEVELES 1044
Query: 1089 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1147
E+ ++ + L + TKE L+ +GTA +GED+ RGRV +F P T
Sbjct: 1045 CESIECMKTLHLEVSEDTKERRFLVTVGTAVSKGEDLPIRGRVHVFDIVTVIPEPGRPET 1104
Query: 1148 EVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL 1197
K + +G ++A++ + QG +L+A G K ++ K G+ L +AF D
Sbjct: 1105 NKRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKEDGSLLP-VAFLDMSCH 1163
Query: 1198 YVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
+ + + L+ D K ++F + E+ +L K G L +FL DG L++
Sbjct: 1164 VSTARELPRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHGRLPVLVADFLPDGEDLAI 1223
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKFLRLQMLATSSDRTGAA 1314
V +D ++ I + P+ +S +G LL R F V + T L + L S
Sbjct: 1224 VAADADGDLHILDFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTLPPSHPPP--- 1280
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
+ LL + G + + PL E T+RRL S+ +L+ ++ GLN ++ H
Sbjct: 1281 ----QDPPHILLLASSSGHLATLVPLPETTYRRLLSVTNQLLPALTPHGGLNAKA----H 1332
Query: 1375 SNGKAHRP------GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
RP G +IVD +L+ + L ++ EIA + G
Sbjct: 1333 RLPDGIRPVGVEAAGGRTIVDGAILARWAELGAAKRAEIAGKGG 1376
>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
Length = 1387
Score = 204 bits (520), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 161/560 (28%), Positives = 275/560 (49%), Gaps = 50/560 (8%)
Query: 891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
RF +P AY PH + + + +I G++ F+SGS PC+ M +L
Sbjct: 860 RFDPSP-KAYM----PHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLR 911
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
++ + + H C GF YV + ++++C+LPS + +DN W +KV + + Y
Sbjct: 912 GEAVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKV-HVGDQIDCVEY 970
Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1070
FA +Y L S V L + D E+ + + +S + +E +++L
Sbjct: 971 FAHSEIYALGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLS 1019
Query: 1071 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1129
P W + + +E + ++ + + + T E + +L +GTA V+GED+ RG
Sbjct: 1020 PRT----WSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGS 1075
Query: 1130 VLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--K 1180
+ +F A +P N ++++K+ +KGA++A++ + QG L++A G K ++ K
Sbjct: 1076 IYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLK 1135
Query: 1181 WTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1238
G+ L +AF D YV L ++ ++GD K I+F + E+ +L L KD
Sbjct: 1136 EDGSLLP-VAFMDMQ-CYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNE 1193
Query: 1239 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1298
L A +FL DG L ++V+D+ I + Y P+ S KG +LL R+ FH G H T
Sbjct: 1194 YLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTG-HFTST 1252
Query: 1299 LRLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1352
+ L + SS + P D + + +L + +GSIG + PL E ++RRL +LQ
Sbjct: 1253 MTL-LPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1311
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+LV S+ H GLNP+++R S+G R IVD LL + + ++ + EIA + G
Sbjct: 1312 SQLVTSMEHPCGLNPKAYRAVESDGFGGR----GIVDGNLLLRWLDMGVQRKAEIAGRVG 1367
Query: 1413 TTRSQILSNLNDLALGTSFL 1432
I +L ++ G FL
Sbjct: 1368 ADIESIRVDLETISGGLDFL 1387
Score = 113 bits (283), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 172/731 (23%), Positives = 294/731 (40%), Gaps = 94/731 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSAPPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++++++ +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVSTRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE-----DTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV DE D G
Sbjct: 142 LRLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDEDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QAAVANDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + + ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LHDRKDIVSYAVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ ++ S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-Q 439
Query: 444 SMLSSGLK---EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
SM ++ + FG + D VN LS S +N +
Sbjct: 440 SMRKPKVESADDVFG--DHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
F F D L N+GP+ D + G R +AD + N
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
+ +++ G+W++ + DS+ Y YL+ S + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
++V E ++ ++ + T+ G L G RV+QV + R D + +
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
P E S+ +V+S S A+PYVL+ D S+ LL D S V I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725
Query: 706 PVSSCTLYHDK 716
+S C LY DK
Sbjct: 726 WLSGC-LYLDK 735
>gi|147827332|emb|CAN62175.1| hypothetical protein VITISV_001516 [Vitis vinifera]
Length = 1989
Score = 201 bits (511), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 121/228 (53%), Positives = 148/228 (64%), Gaps = 49/228 (21%)
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
+G+L+LLT+V DGRVV +L LSK+ SV TS I IG+SL F GS+LGDSLLVQF
Sbjct: 1657 SGELLLLTLVCDGRVVYKLGLSKSRASVFTSGIAAIGSSLSFPGSQLGDSLLVQF----- 1711
Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
T++ SS ++++ GD E D PSTKR RRSSSDALQDM NG++L LY
Sbjct: 1712 TAIPSSSVEKKVGDSEGDVPSTKRSRRSSSDALQDMDNGDKLPLY--------------- 1756
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
V DSL+N+GPLKDF+YGLRIN D ATGI KQSNYEL
Sbjct: 1757 --VSDSLINVGPLKDFAYGLRINTDLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRP 1814
Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMA-AYDDEYHAYLIISL 583
VELPGCKGIWTVYHK++RGHNADS +M+ +D E+ A++ SL
Sbjct: 1815 ERITEVELPGCKGIWTVYHKNTRGHNADSIKMSHVFDLEFRAFIFFSL 1862
>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 1342
Score = 200 bits (509), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 332/1439 (23%), Positives = 568/1439 (39%), Gaps = 254/1439 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G V SLA DN+ D+II+AF DAK+S++E+D H + S+H
Sbjct: 46 LSLVAEYPLAGTVISLA--RTKPRDNASGGDAIIIAFRDAKLSLVEWDPENHRISTISLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+E + G ++ VDP RC + Q+ IL G L G+E+
Sbjct: 104 YYEGDNVITPPFGPTLAESESILTVDPSSRCAALKFGARQLAILPFRHFGDELAGEEEED 163
Query: 221 GSG----GGFSARIESSH--------------VINLRDLD--MKHVKDFIFVHGYIEPVM 260
G S R ES+H V+ L LD + H F+H Y EP
Sbjct: 164 GFENEPMSAVSKRRESTHLNGEEEQTPYKASFVLPLTALDPTLSHTVHLAFLHEYREPTF 223
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
IL + + + ++ + + + LP +K+ +P PIGG
Sbjct: 224 GILSAPMEPSNALLEERKDVLTYTVYTLDLEQRASTNLITVPKLPSTLWKVKPLPLPIGG 283
Query: 321 VLVVGANTIHYHSQSASC-ALALNNYA-------VSLDSSQELPRSSFSVELDAAHATWL 372
L+VG N + + QS A A+N +A +S S L S+E + L
Sbjct: 284 ALLVGTNELVHVDQSGKANATAVNEFAKLESDFGMSDQSHLNLKLEDCSIETIDPKSGQL 343
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-------TNPSVLTSDITTIGNSLFFL 425
LL T G L ++ GR + ++++ T+ S S I + N F+
Sbjct: 344 -----LLVTSDGALAIIEFKLLGRSISAINVTPVTEDNGVTSLSAAPSCIANLANGSVFI 398
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD------IEADAPSTKRLRRSSSDALQDMV 479
GS G S L+ ++ + + G +A L ++ +A + V
Sbjct: 399 GSEDGASSLMGWSQPTAPLTRKRSHAQMLGKDGDEEDEDAIEEDDDDLYDAAPEAKKRAV 458
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD-----ASATGIS 534
+ EL S+ + F +RD L ++GP+ G + + A+ATG
Sbjct: 459 SDTELG----------SSNAAYQFEIRDHLQSLGPIHRMCVGRQGKSSDKLQLAAATG-R 507
Query: 535 KQS------NYELVELPGCKGIWTVYHKSSRGHNADSS---RMAAYDDEYHAYLIISLEA 585
KQS N ++V PG ++SR NA S+ R DE +L+
Sbjct: 508 KQSGRLTLLNRDVVPTPG---------RASRFENAKSAWAVRAHQAGDES------TLDN 552
Query: 586 RTMVLETADLLT-EVTESVDYFV--------------QGRTIAAGNLFGRRRVIQVFERG 630
+ V E A+ E++ + ++FV +G T+ L + ++Q ++
Sbjct: 553 KLFVFEGANTKAYEISSADEHFVEDRYPEHAKSEWESEGETLEVVALADGKIIVQFRKQE 612
Query: 631 ARILDGSY-MTQDLSFGPSNSESGSGSENS-TVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
R D + M Q L P E +EN ++ +++ DPYVL+ D SI++L
Sbjct: 613 VRTYDANLAMNQIL---PMEDE----AENELNIVHIAVCDPYVLVIRDDSSIQIL----- 660
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
SVQ + +P+ + +K WL+ + G L
Sbjct: 661 ----SVQG-----NELEPLEAEGSVAEK------------KWLTGSLY-------AGTLT 692
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI-VDTYMREALKDSETEINSSS 807
QG + G L F +P+ +F + I VD R A
Sbjct: 693 QGSAAVFLLNADGGLHAFALPDLQPLFAIPTLPHLPPVIAVDAAQRRA------------ 740
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
G +E + + V +L ++P+L ++ Y+ + + P+ + +
Sbjct: 741 ------GTRETLTEVLVSDLGQHGV----TQPYLVLRTAMDDVVLYEPFHY--PQTSGRK 788
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
S + L R R + FS P + + E+ P ++ I +
Sbjct: 789 ----SWHQDL--------RFRKVPFSHIPKYSESIAESQSARPPPLKSV--KIDTYSAIA 834
Query: 928 LSGSRPCWCM----VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
+ G+ PC + + L + + ++ V C +GF + + L+ QL
Sbjct: 835 IPGAPPCLLLKEPSTLPKVLEIRQSAELNRLSMLCPINRVGCENGFFMINADEELEEQQL 894
Query: 984 PSGSTYDNYWPVQKVIPLKATPHQ------ITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
P + Y W V +V P+ P+Q I Y E+ LY V+ +V
Sbjct: 895 PLNTWYGTGWSVHQV-PI-GHPNQIEDVRRIAYHEERGLY-------VVATCREVDFYFA 945
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP------DRAGGPWQTRAT----IPMQ 1087
+++ H + D+ V +Y V ++ D P+ T + ++
Sbjct: 946 EEDGRHPEQD------DITLRPKVPQYNVHLISAISHHIIDTVHMPYLAAITDLQVMMLE 999
Query: 1088 SSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1145
+SEN T E + L+ + A +GED+ A+G + ++ +P
Sbjct: 1000 ASEN-------------THEQKPLVVVSAAAQRGEDMPAKGTLYVYDIIDVVPDPDIAES 1046
Query: 1146 ---VTEVYSKELKGAISALAS--LQGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL- 1197
+ ++ +E +GAI+ALA G + A G K+++ K G+ L +AF DA
Sbjct: 1047 GVKLHQLAREENRGAITALAGPFPGGFIGTAQGLKVMIRGMKEDGSCLP-VAFLDAQSYT 1105
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG-SLDCFATEFLIDGSTLSL 1256
+V+ + L GD K ++F + E+ ++ +L K ++ + EFL L +
Sbjct: 1106 HVLKTLPGRGMWLAGDAWKGLWFGGFTEEPYRVTVLGKAPKMHMEVMSAEFLPFDGALYI 1165
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDR----- 1310
VV D ++ + Y P+ +S G +LL R+ FH+G T + L LA+ + +
Sbjct: 1166 VVLDADCDMHVLQYDPENPKSLNGMRLLHRSTFHIGHFTTNSMLLPSTLASFAAQQHEMM 1225
Query: 1311 TGAAPGSDKTNRFA-LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
G + K + +L + G+IG I PLDE +RRL +LQ L + H AGLNPR+
Sbjct: 1226 NGGSKAEVKPDPLQHVLTSSTSGAIGLITPLDEQAYRRLSALQTHLTSILEHAAGLNPRA 1285
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
+R S G +VD L+ L + ++ + G + + S+L + G
Sbjct: 1286 YRSIESESFG---GARGVVDGLLVRRIHELGAARRADVLGRAGVSAWGLRSDLEIIGGG 1341
>gi|38014465|gb|AAH60475.1| LOC398931 protein, partial [Xenopus laevis]
Length = 363
Score = 199 bits (507), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 178/303 (58%), Gaps = 23/303 (7%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL+ + GN+ S+A + GA +RD+++L+F++AK+SV+E+D H L+ S+H
Sbjct: 66 LELMASFSFFGNIMSMASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVG 215
FE PE L+ G P V+VDP GRC +L+YG Q+++L ++ GLVG
Sbjct: 122 YFEEPE---LRDGFVQNVHIPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHEGLVG 178
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GR
Sbjct: 179 E--------GQKSSFLPSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGR 230
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
V+ + TC I A+S++ K HP+IWS +LP+D + LAVP P+GGV++ N++ Y +
Sbjct: 231 VAVRQDTCSIVAISLNIMQKVHPIIWSLNSLPYDCTQALAVPKPVGGVVIFAVNSLLYLN 290
Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
QS ++LN+ S P+ + LD + AT++ D ++S K G++ ++T++
Sbjct: 291 QSVPPYGVSLNSLTNGTTSFPLKPQEEVRITLDCSQATFISYDKMVISLKGGEIYVVTLI 350
Query: 393 YDG 395
DG
Sbjct: 351 TDG 353
>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 1408
Score = 198 bits (503), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 149/530 (28%), Positives = 265/530 (50%), Gaps = 40/530 (7%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV + ++
Sbjct: 900 DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 959
Query: 979 KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1038
++C+ P + +D W +K I L + Y + Y L S V L + D
Sbjct: 960 RMCRFPRNTHFDGSWAARK-IGLGEQVDSVEYSSSSETYVLGTSQKVDFKLPE------D 1012
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1098
E+ + N +S +++ V++L P W + +++SE + V+ +
Sbjct: 1013 DEIHPEWRNEVISFFP-----QIDKGSVKLLNPRT----WSIIDSYQLRTSERVMCVKCL 1063
Query: 1099 TL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSK 1152
L + T E + ++A+GTA +GED+AARG + +F + D P+ + + +
Sbjct: 1064 NLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKE 1123
Query: 1153 ELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN- 1207
E+KGAI++L+ + QG L+ A G K I+ K G+ L +AF D YV L +K
Sbjct: 1124 EVKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVSVLKELKGT 1181
Query: 1208 -FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
++GD K ++F + E+ +L+L +KD GSL A +FL DG L ++V+D+ NI
Sbjct: 1182 GMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGSLQVMAADFLPDGKRLYIMVADDDCNIH 1241
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1323
+ Y P+ S KG +LL R+ FH G + +T R +L+ + A D +
Sbjct: 1242 VLQYDPEDPGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPETEANAMDLDLSGPL 1301
Query: 1324 -ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
+L + GSI I P+ E+ +RRL +LQ ++++++ H GLNPR+FR S+G R
Sbjct: 1302 HQVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR- 1360
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
+VD +L+ + L + + EIA + G +I ++L A+G + L
Sbjct: 1361 ---GMVDGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1405
Score = 125 bits (314), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 184/750 (24%), Positives = 304/750 (40%), Gaps = 121/750 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++Y + GS ++ +T+ + + L LV Y L G + L
Sbjct: 28 NLIVAKTTLLQVYNLVNVVYGSSPGQSDEKTRSQY-------SKLVLVAEYALSGTITDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++I++A +AK+S++E+D H + TS+H +E + +H+ +
Sbjct: 81 GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137
Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-------------------- 214
A P + VDP RC VL +G + + IL Q G LV
Sbjct: 138 LAACPSHLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMDDFDSDHDDERQIDTNHTA 196
Query: 215 --GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
DE G + SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 197 EERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQVAAS 256
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-I 329
+ + + S ++ + + S LP+D +K++ +P P+GG L+VG+N +
Sbjct: 257 SALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSNELV 316
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLV 387
H + A+ +N +A S +S + L+ L +N LL G +
Sbjct: 317 HVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNGVMA 376
Query: 388 LLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
+++ DGR V + L +T PS +G F GS GDS+L+
Sbjct: 377 VVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SAPVGRGKIFFGSEEGDSILI- 431
Query: 437 FTCGSGTSMLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY----- 487
G S LS+G K E G+ D + D D + E LY
Sbjct: 432 -----GWSRLSAGAKVSPAPETGE---DNVAELSEDEEDDDDDDDEEDAYEDDLYATPVT 483
Query: 488 -GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNY 539
G NT S T + F + D L N+GP++D + G + D + S +
Sbjct: 484 PGINPRNTASMNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYL 543
Query: 540 ELVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDD 573
ELV G G+ +V+ K + + S A
Sbjct: 544 ELVTTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPSQSGSLPANAGS 603
Query: 574 EYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVF 627
Y YL++S + +++V + + E T + ++ + RTI G L G RV+QV
Sbjct: 604 NYDHYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVL 663
Query: 628 ERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
+ R D G + Q ++ SE +V+ S A+PYVL+ D SI LL D
Sbjct: 664 KGEVRSYDSGLGLAQIYPVWDEDT-----SEERSVMHASFAEPYVLIIRDDSSILLLQAD 718
Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
S ++T I+S+ S +LY DK
Sbjct: 719 ESGDLDEIETDGIIKSTT--WISGSLYQDK 746
>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
Length = 1394
Score = 196 bits (499), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 155/553 (28%), Positives = 267/553 (48%), Gaps = 61/553 (11%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-- 974
+ +I G++ F+SGS PC+ M +L ++ + + H C GF YV +
Sbjct: 866 YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASV 925
Query: 975 -----------------QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
Q ++++C+LP + +DN W +KV + + YFA +Y
Sbjct: 926 CVPKQYFVPWNKLILVIQNMVRMCRLPGNTRFDNSWVTRKV-HVGDQIDCVEYFAHSEIY 984
Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1077
L S V L + D E+ + + +S + +E +++L P
Sbjct: 985 ALGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLSPRT---- 1029
Query: 1078 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
W + + +E + ++ + + + T E + +L +GTA V+GED+ RG + +F
Sbjct: 1030 WSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEII 1089
Query: 1137 RNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELN 1187
A +P N ++++K+ +KGA++A++ + QG L++A G K ++ K G+ L
Sbjct: 1090 EVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP 1149
Query: 1188 GIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
+AF D YV L ++ ++GD K I+F + E+ +L L KD L A
Sbjct: 1150 -VAFMDMQ-CYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAA 1207
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1305
+FL DG L ++V+D+ I + Y P+ S KG +LL R+ FH+G H T + L +
Sbjct: 1208 DFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMG-HFTSTMTL-LPQ 1265
Query: 1306 TSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1359
SS + PG D + + +L + +GSIG + PL E ++RRL +LQ +LV S+
Sbjct: 1266 HSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSM 1325
Query: 1360 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
H GLNP+++R S+G R IVD LL + + ++ + EIA + G I
Sbjct: 1326 EHPCGLNPKAYRAVESDGFGGR----GIVDGNLLLRWLDMGVQRKAEIAGRVGADIESIR 1381
Query: 1420 SNLNDLALGTSFL 1432
+L ++ G FL
Sbjct: 1382 VDLEKISGGLDFL 1394
Score = 116 bits (290), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 175/731 (23%), Positives = 291/731 (39%), Gaps = 94/731 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV GD D G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + S ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ + S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437
Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
S L K E D + D VN LS S +N +
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
F F D L N+GP+ D + G R +AD + N
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
+ +++ G+W++ + DS+ Y YL+ S + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
++V E ++ ++ + T+ G L G RV+QV + R D + +
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
P E S+ +V+S S A+PYVL+ D S+ LL D S V I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725
Query: 706 PVSSCTLYHDK 716
+S C LY DK
Sbjct: 726 WLSGC-LYLDK 735
>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
Length = 1204
Score = 194 bits (494), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 144/548 (26%), Positives = 262/548 (47%), Gaps = 47/548 (8%)
Query: 898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
D + EE P + I NISG F G P + + L G +
Sbjct: 678 DQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKGGYAQSL 734
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
+ C GFI + S+G++ +CQ+P G D W +Q+ IP+ + Y + +Y
Sbjct: 735 SKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQ-IPIGEQVDHLAYSSSSGMY 793
Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1077
+ S L D E+ + N S V+ ++++ P
Sbjct: 794 VIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRSSLKVVSPKT---- 838
Query: 1078 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
W + + +E+ + V+ ++L + T E + ++ +GTA+ +GED+A+RG V +F
Sbjct: 839 WTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVYVFEVI 898
Query: 1137 RNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELN 1187
+ +P+ + V + +KGA++AL+ + QG L++A G K I+ K G+ L
Sbjct: 899 KVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKEDGSLLP 958
Query: 1188 GIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+AF D +++VK ++ D K ++F + E+ +++L AKD L+
Sbjct: 959 -VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDLDYLEV 1013
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1302
A +FL DG+ L ++V+D N+ + Y P+ +S G +LLSR++FH G ++ L
Sbjct: 1014 LAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLP 1073
Query: 1303 MLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1358
+ SS++ A K R +L + +GS+G + + E ++RRL +LQ +L ++
Sbjct: 1074 RTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTNT 1133
Query: 1359 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+ H GLNPR+FR S+G A R ++D +LL + + + ++EIA + G +I
Sbjct: 1134 IEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKVEIASRVGANEWEI 1189
Query: 1419 LSNLNDLA 1426
++ ++
Sbjct: 1190 KADFEAIS 1197
Score = 99.4 bits (246), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 159/355 (44%), Gaps = 40/355 (11%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
P S +G+ FLGS DS+L+ G S+ SSG K+
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKK 435
>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
plexippus]
Length = 328
Score = 194 bits (494), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 183/320 (57%), Gaps = 20/320 (6%)
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1166
+AIGT Y GED+ +RGR+L++ P +N E+Y+KE KG ++AL + G
Sbjct: 16 IAIGTNYNYGEDITSRGRILIYDIIDVVPEPGQPLTKNRFKEIYAKEQKGPVTALTQVLG 75
Query: 1167 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1226
L+ A G KI L + +L G+AF D +YV + VKN IL+ D++KSI L ++ Q
Sbjct: 76 FLISAVGQKIYLWQLKDNDLVGVAFIDTQ-IYVHRMLAVKNLILVADVYKSISLLRYQHQ 134
Query: 1227 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
L+L+++D + + +F+ID ++L +VS+ + N ++ + P+ ES+ GQ+L+ +
Sbjct: 135 HRTLSLVSRDLRTAQIYDMQFMIDNTSLGFLVSESEGNFAMYMHQPQARESYGGQRLIRK 194
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1346
++H+G V RL AA G +T+ +F TLDG +G + P+ E +R
Sbjct: 195 CDYHLGQRVHAMFRL-----------AARGERQTH--VTMFTTLDGGVGYVLPVSEKVYR 241
Query: 1347 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG-PDSIVDCELLSHYEMLPLEEQL 1405
RL LQ + + H+AGLNP+++R + + +A G ++D +L+S Y +P EQ
Sbjct: 242 RLLMLQNVINNYCCHLAGLNPKAYRTYKVSRRALCGGAARGVLDGDLVSLYTSMPRTEQQ 301
Query: 1406 EIAHQTGTTRSQILSNLNDL 1425
+IA + GT +I+S+L ++
Sbjct: 302 DIARKIGTKVEEIMSDLYEI 321
>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1377
Score = 194 bits (493), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 146/530 (27%), Positives = 263/530 (49%), Gaps = 50/530 (9%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV + ++
Sbjct: 877 DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 936
Query: 979 KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1038
++C+ P + +D W +K I L + Y + Y L S
Sbjct: 937 RMCRFPRNTHFDGSWAARK-IGLGEQVDSVEYSSSSETYVLGTS---------------- 979
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1098
Q+V ++ + ++H + EE V++L P W + ++++E + V+ +
Sbjct: 980 QKVDFKLPEDD----EIHPEWRNEE-SVKLLNPRT----WSIIDSYQLRTAERVMCVKCL 1030
Query: 1099 TL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSK 1152
L + T E + ++A+GTA +GED+AARG + +F + D P+ + + +
Sbjct: 1031 NLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKE 1090
Query: 1153 ELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN- 1207
E+KGAI++L+ + QG L+ A G K I+ K G+ L +AF D YV L +K
Sbjct: 1091 EVKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVSVLKELKGT 1148
Query: 1208 -FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
++GD K ++F + E+ +L+L +KD GSL A +FL G L ++V+D+ NI
Sbjct: 1149 GMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGSLQVMAADFLPHGKRLFIMVADDDCNIH 1208
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1323
+ Y P+ S KG +LL R+ FH G + +T R +L+ + A D +
Sbjct: 1209 VLQYDPEDPGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPEAEANAMDLDSSGPL 1268
Query: 1324 -ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
+L + GSI I P+ E+ +RRL +LQ ++++++ H GLNPR+FR S+G R
Sbjct: 1269 HQVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR- 1327
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
+VD +L+ + L + + EIA + G +I ++L A+G + L
Sbjct: 1328 ---GMVDGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1372
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 181/747 (24%), Positives = 302/747 (40%), Gaps = 110/747 (14%)
Query: 53 GPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
G V V ++++Y + GS ++ +T+ + + L LV Y L G
Sbjct: 4 GAVAAFRVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGT 56
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
V L + D+ ++I++A +AK+S++E+D H + TS+H +E + +H+
Sbjct: 57 VTDLGRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISP 113
Query: 173 GRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GDEDTFGSGGGFSAR 229
+ A P + VDP RC VL +G + + IL Q G LV GD F S +
Sbjct: 114 WTPNLAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGD---FDSDHDEERQ 169
Query: 230 IESSHVINLRD--------------------------LDMKHVKDFIFVHGYIEPVMVIL 263
I+++H RD M H F++ Y EP IL
Sbjct: 170 IDTNHTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGIL 229
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
+ + + + + S ++ + + S LP+D +K++ +P P+GG L+
Sbjct: 230 YSQVAASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALL 289
Query: 324 VGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLS 380
VG+N +H + A+ +N +A S +S + L+ L +N LL
Sbjct: 290 VGSNELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLV 349
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRL 429
G + +++ DGR V + L +T PS +G F GS
Sbjct: 350 LLNGVMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEE 405
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
GDS+L+ ++ S + + E D A+ + DA +D + ++ G
Sbjct: 406 GDSMLIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGI 463
Query: 490 ASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELV 542
S NT S T + F + D L N+GP++D + G + D + S + ELV
Sbjct: 464 NSRNTASVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELV 523
Query: 543 ELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYH 576
G G+ +V+ K + S Y
Sbjct: 524 TTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYD 583
Query: 577 AYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERG 630
YL++S + +++V + + E T + ++ + RTI G L G RV+QV +
Sbjct: 584 HYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGE 643
Query: 631 ARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
R D G + Q ++ SE +V+ S ADPYVL+ D SI LL D S
Sbjct: 644 VRSYDSGLGLAQIYPVWDEDT-----SEERSVVHASFADPYVLIIRDDSSILLLQADESG 698
Query: 690 CTVSVQTPAAIESSKKPVSSCTLYHDK 716
++T IES+ S +LY DK
Sbjct: 699 DLDEIETDGIIESTT--WISGSLYQDK 723
>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
Length = 1065
Score = 193 bits (491), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 181/699 (25%), Positives = 302/699 (43%), Gaps = 104/699 (14%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
+ E+G + I+ +P + V+ V K +H+ D + D E + S++
Sbjct: 445 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMT 500
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
T + E + ++EL M + RP LF +L D T+ Y+ + + N
Sbjct: 501 DTFAAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGH 551
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
+ L + V+ R+ RF LD E+ A + + F+ I G
Sbjct: 552 LAVRFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 605
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
F+ S PC + R+HP DG I++FT +N C +GFI
Sbjct: 606 FICSSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFI---------------- 649
Query: 987 STYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1046
Y E+ + +PV K + +LI+ + +
Sbjct: 650 -----------------------YLTERERLMRVAKLPVTK-----MCVLINDD--KTFE 679
Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV---RVVTLFNT 1103
H D +++Y++++ P+ W+ + + E + VV
Sbjct: 680 EH--EKPDTFVYPEMDQYKLQLYSPED----WKPVQNVEVLFEEFEVVTCCEEVVLRSEG 733
Query: 1104 TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAI 1158
T + LA+GTA GE+V RGR+++ P ++ + +Y KE KG +
Sbjct: 734 TVSGVQNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPV 793
Query: 1159 SALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
++L S G+LL G K+ + + L GI+F D YV L V+N L D+++S+
Sbjct: 794 TSLCSCNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYVHQLIGVRNLALACDMYRSV 852
Query: 1219 YFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
L ++E+ L+L ++D S A +F+ID + V+SDE NI IF Y P+ E
Sbjct: 853 ALLRYQEEYKALSLASRDMRSDVQPPMAAQFIIDNKQMGFVMSDEAANIAIFNYLPETLE 912
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL-----LFGTLD 1331
S G+KL RAE ++G V F+R++ +S G + F+L LF +LD
Sbjct: 913 SLGGEKLTLRAEINIGTVVNSFIRVKGHISS--------GFVENELFSLERQSVLFASLD 964
Query: 1332 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
GS G + PL E FRRL LQ+ + VP AGLN + R H ++VD +
Sbjct: 965 GSFGFLRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGD 1024
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
++ Y L L E+ ++A + GT+R I+ +L ++ T+
Sbjct: 1025 MVMQYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTA 1063
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/387 (23%), Positives = 150/387 (38%), Gaps = 49/387 (12%)
Query: 349 LDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
+D + P F + LD T + + LL + G L L +V D V+ L+L
Sbjct: 1 MDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLVTDATNSVKSLELK 60
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+V+ +T+ F+GSRL DS+ + T ++ AP
Sbjct: 61 FQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------------LDESAPKK 107
Query: 465 KRLRRSSSDALQDMVNGEELSLYGSA---SNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+L + +A +D E+ LYG +SA++ + V D L+N+GP K + G
Sbjct: 108 IKLN-TELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDKLLNVGPCKKITGG 162
Query: 522 LRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSSRMAAY-------- 571
+ K ++LV G G ++ +S R SS +
Sbjct: 163 CPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSSSIEGVVQYWAVGR 222
Query: 572 -DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
+D+ H Y I S E T+ LET + L E+ E+ + TIAAG L +QV
Sbjct: 223 REDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGELADGGLAVQVTTSS 281
Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPS 688
++ Q + + V S SI DPY+ + +G + + L P
Sbjct: 282 LVMVAEGQQIQHIPLQLT----------FPVRSASIVDPYIAICTQNGRLLMYELTSHPH 331
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ + P++S ++Y D
Sbjct: 332 VHLKEIDISKRLRHETSPITSLSIYRD 358
>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1392
Score = 191 bits (486), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 155/534 (29%), Positives = 265/534 (49%), Gaps = 46/534 (8%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLCDGSIVAFTVLHNVNC--NHG 968
+ I NISG F+ G+ + VFR + P +L G + +V+ ++G
Sbjct: 877 LRILPNISGFSTIFMPGASSSF--VFRTA-KSSPHIIRLRGGFTRWLSSFDSVDTGRDNG 933
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
FIYV SQ ++ CQLPS + +D W ++KV P++ + Y Y L S
Sbjct: 934 FIYVDSQNCVRACQLPSQTQFDYPWTLRKV-PIEEQVDFLAYSTSSETYVLGTS------ 986
Query: 1029 LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1088
L ++ + N LS + E ++++ P W + P+
Sbjct: 987 REGDFKLPEGDDLHPEWRNEELSFCP-----KIPESSIKVVSPKT----WTIIDSYPLDP 1037
Query: 1089 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1147
E V+ V + + T E L+ +GTA V+GED+ ARG + +F + A +P+ T
Sbjct: 1038 DEQVTAVKNVNIEVSENTHERRDLIVVGTAIVKGEDMPARGTIYVFDVIKVAPDPEKPET 1097
Query: 1148 ----EVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLY 1198
++ KE +KGA++AL+ + QG +++A G K ++ K G+ L +AF D Y
Sbjct: 1098 GHKLKLIGKESVKGAVTALSGIGGQGFVIVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CY 1155
Query: 1199 VVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
V +K ++LGD K ++F + E+ ++ L KD L+ A +FL DG+ L +
Sbjct: 1156 VTVAKELKGTGLVILGDAVKGLWFAGYSEEPYRMTLFGKDPEYLEVVAADFLPDGNKLYM 1215
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGA 1313
+V+D N+ + Y P+ +S G +LLSR++F+ G + VT R + + ++ +
Sbjct: 1216 LVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLLPRTAVSSERTESSEE 1275
Query: 1314 APGSDKT-NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
D+T R +L + +GS+ + + E ++RRL +LQ +L+++V H AGLNPR+FR
Sbjct: 1276 GMDLDETFARHQVLIASQNGSLALVTSVAEESYRRLSALQSQLINTVDHPAGLNPRAFRA 1335
Query: 1373 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
S+G A R +VD LL + + + Q EIA + G T +I ++L +
Sbjct: 1336 IESDGAAGR----GMVDGNLLRLWLNMGKQRQTEIAGRVGATEWEIKADLETIG 1385
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 170/731 (23%), Positives = 296/731 (40%), Gaps = 151/731 (20%)
Query: 57 NLVVTAANVIEIY--VVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRL 109
NLVV ++++++ V V + KE S S + + +++++ Y L
Sbjct: 28 NLVVVRTSLLQVFSLVKIVSSQPQKEVPEPLSSQSSQPETKLVLEK----------EYPL 77
Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
G V L S+ N+R ++I++A +AK+S++E+D G+ S+H +E +
Sbjct: 78 SGTVTDL---SRVKILNTRSGGEAILIAVRNAKLSLIEWDPERRGISTISIHYYERDDLT 134
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDED 218
+ G ++ VDP RC V +G++ + IL Q G LV G+
Sbjct: 135 RSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELDGERP 193
Query: 219 TFGSGGGFSARIE-------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
+ SGGG A+IE SS V+ L LD + H F++ Y EP IL
Sbjct: 194 SQNSGGG--AQIEKRKEEPDHQTPYSSSFVLPLTALDPSLLHPISLAFLYEYREPTFGIL 251
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
+ + T + + + ++ + + S LP D +K++A+P P+GG L+
Sbjct: 252 YSQVATSTALLHERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALL 311
Query: 324 VGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
+G+N I H + A+ +N ++ + S +S + L+ L D LL+
Sbjct: 312 LGSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLA 371
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDS 432
+G++ L+ DGR V + + P+ DI T +G+ F+GS DS
Sbjct: 372 LASGNMALIKFKLDGRSVSGITVHSL-PAYAGGDILKSAASCSTCLGDGNVFIGSEDADS 430
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---RSSSDALQDM----VNGEELS 485
+L++++ S ST++ R + ++D L D+ E+
Sbjct: 431 VLLEWSHTSA--------------------STRKARLESKQTADGLDDLSDEDDQMEDDD 470
Query: 486 LYGSASNNTE---------SAQKTFSFAVRDSLVNIGPLKDFSYGL---RINADASATGI 533
LY SA + S + ++F + D L +IGPL+D + G N + AT
Sbjct: 471 LYSSAPGPIQVDNRMGTDSSTPEFYNFRLNDKLSSIGPLRDITLGKAFSNTNRKSQATTG 530
Query: 534 SKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADSSR 567
+ + ELV G +W+ RG
Sbjct: 531 TVAAELELVASQGSDRGGGLVVIKREIDPLTTMSLKVDDADAVWSASVTKRRG------- 583
Query: 568 MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGR 620
++ D+ Y++IS + E ++ +S+ F + T+ G+L G
Sbjct: 584 ASSTDNPSCQYVVISRSTDSE-QEVNEVFIVEEQSLKPFKAPEFNPNEDCTVDIGSLAGN 642
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSE---SGSGSENSTVLSVSIADPYVLLGMSD 677
R++QV R SY D+ G S S+ S S DPY+++ D
Sbjct: 643 TRLVQVLRNEVR----SY---DIDLGLSQIYPVWDEDTSDERVAASASFIDPYLVIIRDD 695
Query: 678 GSIRLLVGDPS 688
S+ LL D S
Sbjct: 696 SSVLLLQADES 706
>gi|194374339|dbj|BAG57065.1| unnamed protein product [Homo sapiens]
Length = 330
Score = 191 bits (486), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 114/302 (37%), Positives = 170/302 (56%), Gaps = 35/302 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS Y V+L
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVAL 300
Query: 350 DS 351
+S
Sbjct: 301 NS 302
>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
Length = 1440
Score = 191 bits (485), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 175/623 (28%), Positives = 297/623 (47%), Gaps = 93/623 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++AF+DAK+S++ ++ ++ S+H FE+ +L+ G + P+V+ DP+
Sbjct: 91 QDSILMAFDDAKLSIITINEKERNMQTISLHAFENE---YLRDGFVKYFHPPIVRTDPEN 147
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + S RI S ++I L+ +D + +V
Sbjct: 148 RCAASLVYGKHIAILPFHEN-----------------SKRIHS-YIIPLKQIDPRLDNVA 189
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 190 DIVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 249
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL +P P+GG +V G+NTI Y +Q+ C + LN+ D + P S +
Sbjct: 250 CATLLPIPKPLGGAIVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKSMKMT 306
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + T+ G+L LL +V G V+ L+ SK + + +T
Sbjct: 307 LDCSTSVYMEDGRIAVGTRDGELFLLRLVTSSGGATVKSLEFSKVWDTSIAYTLTVCAPG 366
Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
FLGSRLGDS L++++ T S+ +++E +EA+ +
Sbjct: 367 HLFLGSRLGDSQLLEYSLIKTTRESVKRHKMEQEQNHVEAE------------------L 408
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ ++L LYG A +++ E ++ F+ D L NIGP+K G R N ++ +
Sbjct: 409 DEDDLELYGGAIEEQQNDDEEQITESLQFSELDRLRNIGPVKSMCVG-RPNYMSNDLVDA 467
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++++ G G V+ +S R SS + ++E H YLI+
Sbjct: 468 KRRDPVFDVITASGHGKNGSLCVHQRSLRPEIVTSSLLEGAEQLWAVGRKENESHKYLIV 527
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQVFERG-ARILDGSYM 639
S T+VLE + L E+ E + FV G+ T+AAG L +QV A + DG +
Sbjct: 528 SRIRSTLVLELGEELIELEEPL--FVTGQPTVAAGELSQGAFAVQVTSTSIALVTDGQQL 585
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSV--- 694
+ N V+ SI DPYV + +G + L LV +P +
Sbjct: 586 AE-----------VKIDSNFPVVQASIVDPYVAVLTQNGRLLLYTLVSNPYMQLQEIDLA 634
Query: 695 QTPAA--IESSKKPVSSCTLYHD 715
QTP + I S ++S ++Y D
Sbjct: 635 QTPFSTFIAQSASQITSISMYAD 657
Score = 179 bits (455), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 187/741 (25%), Positives = 334/741 (45%), Gaps = 81/741 (10%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS 782
++ DA S GE D D + V+ +E+G L + +P V+ + +F +
Sbjct: 736 KRLGHDAIQSGRGGEQSDAIDPSSYTSISHWLVLAHENGRLSVHSLPEMELVYQIGRFPN 795
Query: 783 GRTHIVD-----TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
+VD +K +S EE K+N +++E + + S
Sbjct: 796 VPELLVDLTPEEEEKERRIKAQLAAKEASDEEQLNAEMKKNCE--RIMEAQIVGMGINQS 853
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
P L AI+ D ++ Y+ + +P S L ++ LR S
Sbjct: 854 HPILMAIV-DEQVIMYEMFA-----------NPNSQPGHLGIAFRKLPHFICLRSSPYLK 901
Query: 898 DAYTR------EETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQ 948
R EE P I F+ +S + G + G+ P +V+ ++ HP
Sbjct: 902 SDGKRAAFQIVEEDGKRYPL--IHSFERVSTVNNGVIIGGAVPT-LLVYGAWGGMQTHPM 958
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
DGSI AFT + N +GF+Y+T + L+I ++ + Y+ +PV+K I + T H
Sbjct: 959 TIDGSIKAFTPFNIDNVPYGFVYMTQKKSELRIAKMHADFDYEMPYPVKK-IEVGRTIHS 1017
Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYE 1065
+ Y ++Y ++ SVP KP N++ ++ D QE H+ D + + + YT+ +
Sbjct: 1018 VRYLMNSDVYVVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLF- 1072
Query: 1066 VRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGE 1122
+ W+ I + E V L + +T ET LAIGT GE
Sbjct: 1073 --------SSQDWKAVPNTEISFEDMEAVTACEDVALKSESTHTGFETYLAIGTVNNYGE 1124
Query: 1123 DVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKII 1177
+V RGR++L P + ++ KE KG ++ L +++G LL G K+
Sbjct: 1125 EVLVRGRIILAEVIEVVPEPGQPTSNRKIKVLFDKEQKGPVTGLCAMEGLLLSGMGQKVF 1184
Query: 1178 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1237
+ ++ +L G++F D YV L+ +++ L D +S+ + ++E+ +++ ++D
Sbjct: 1185 IWQFKDNDLMGLSFLDMH-YYVYQLHSLRSIALACDARESMSLIRFQEENKAMSVASRD- 1242
Query: 1238 GSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1293
C A +F++DG+ + ++SDE NI +F YAP+ ES G++L RA ++G
Sbjct: 1243 -DRKCAQAPMAAQFMVDGAHIGFLLSDENGNITLFNYAPEAPESNGGERLTVRAAINIGT 1301
Query: 1294 HVTKFLRLQ----MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1349
++ FLR++ +L AA R + +F +LDGS G I PL E ++RRL
Sbjct: 1302 NINAFLRVKGHTALLNLHEFEKEAA-----EQRMSTIFASLDGSFGFIRPLTEKSYRRLH 1356
Query: 1350 SLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
LQ + +AGL+ + R Q NG+ R +++D +++ Y L ++
Sbjct: 1357 FLQTFIGSVSQQIAGLHIKGARSAKPPQPIVNGRNAR----NLIDGDVVEQYLNLSTYDK 1412
Query: 1405 LEIAHQTGTTRSQILSNLNDL 1425
++A + G + I+ +L +L
Sbjct: 1413 TDLARRLGVGKYHIIDDLMEL 1433
>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 1348
Score = 190 bits (482), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 151/559 (27%), Positives = 269/559 (48%), Gaps = 60/559 (10%)
Query: 904 ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM--------VFRERLRVHPQLCDGSIV 955
E H P + + +I G++ F+ G PC+ + V R R + ++
Sbjct: 820 EGKHPFPRKPLRALSDICGYKTVFMPGQNPCFILKSAITQPHVLRLRGK--------AVQ 871
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
+ + H C GF YV I+++ +LPS + +D+ W +K IPL I Y +
Sbjct: 872 SLSGFHIAACERGFAYVDEDNIIRMSRLPSNTRFDSTWATRK-IPLGEQVDCIVYSSASE 930
Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1075
Y + SV + L D E + N ++ + +E V++L+P
Sbjct: 931 SYVIGTSV------KEDFKLPEDDESHTEWQNEFITFLP-----QLERGTVKLLDPKN-- 977
Query: 1076 GPWQTRATIP----MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRV 1130
W P ++ +E + V+ L + T E + ++ +G+A V+GED+ +G +
Sbjct: 978 --WSIADIAPSSHELEPAERITCIEVIRLEISEITHERKDMVVVGSAIVKGEDIVPKGCI 1035
Query: 1131 LLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KW 1181
+F +P N +++++E +KGA++AL+ + QG L++A G K ++ K
Sbjct: 1036 RVFEIIDVVPDPDHSEMNKRLKLFAREEVKGAVTALSGIGSQGFLIVAQGQKCMVRGLKE 1095
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1239
G+ L +AF DA YV L +K ++GD K ++F + E+ +L+L K+ +
Sbjct: 1096 DGSLLP-VAFKDAQ-CYVSVLKELKGTGMCIVGDAIKGLWFTGYSEEPYKLDLFGKENEN 1153
Query: 1240 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1299
+ A +FL DG+ L ++V+D+ N+ + Y P+ S KG +LL R FHVG H +
Sbjct: 1154 IAVIAADFLPDGNRLYVLVADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVG-HFASTM 1212
Query: 1300 RLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
L + + + A + T+ ++ +L GS+G I PL+E ++RRL +LQ
Sbjct: 1213 TLLPQGSHTPHSPADRDAMDTDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLLALQS 1272
Query: 1354 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1413
+LV+++ H GLNPR +R S+G + G ++D LL + + + + EIA + G
Sbjct: 1273 QLVNALEHPCGLNPRGYRAVESDGIGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGA 1329
Query: 1414 TRSQILSNLNDLALGTSFL 1432
I +L L G ++L
Sbjct: 1330 DVGAIRMDLEKLHGGLAYL 1348
Score = 104 bits (260), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 160/744 (21%), Positives = 297/744 (39%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + + R D A L LV Y++ G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSSLANHPDQKSRH---DRSQHAKLVLVAEYQVSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ + + D+I+++ +AK+S++E+D HG+ S+H +E E +
Sbjct: 85 ERVKISNSKSGG--DAILVSSRNAKLSLIEWDPRNHGISTISIHYYEGEESHMSPWVPDL 142
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE------------------ 217
+ + VDP G C + +G+ + IL Q G LV D+
Sbjct: 143 GSCASNLTVDPNGNCA-IFNFGIHSLAILPFHQTGDDLVMDDYDSVLNGDSAADTINDTQ 201
Query: 218 -DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
T G S E S V+ L LD + H F+H Y EP IL+ +
Sbjct: 202 KPTAGDSTVHSKPYEPSFVLPLAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLS 261
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ + ++ + + + LP D +K++++P P+GG L++G N +H
Sbjct: 262 IDRKDVVSYAIFTLDLQQRASTSLLTVSRLPSDMFKVVSLPPPVGGALLIGTNELVHVDQ 321
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + + +S + L+ L +D LL G + +LT
Sbjct: 322 AGKTNAVGVNEFARQASAFSMVDQSDLEMRLEDCVVEQLGSDAGEVLLILTDGRMAILTF 381
Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTS 444
DGR V + L ++ S++ + + +G S F GS GDS+L+ ++ S +
Sbjct: 382 KVDGRSVSGISLHYVAEQSGGSIIKARPSCSAGLGRSKLFCGSEEGDSILLGWSKPSSNT 441
Query: 445 MLSSGLKE---EFGDIEADAPSTKRLRR-----------SSSDALQD--MVNGEELSLYG 488
+ E E G E + + + LQ+ +VNG++ +
Sbjct: 442 KKPTKANEDTNEDGTTEFSGEDEQDDDDDDIYEDDLYSANPAPTLQEKRVVNGDDTA--- 498
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------LRINAD-----------ASAT 531
F F + D L ++GP +D + G L+ D +A
Sbjct: 499 -----------DFVFKIHDRLWSLGPFRDITLGRPPKSKLKDKRDNVPSISASLELVAAR 547
Query: 532 GISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYL 579
G K + +++ G+W++ + A ++ +Y YL
Sbjct: 548 GFGKSGGLAVLKREIDPFTIDSLKMDNVYGVWSIRVTDPKSKEASAT---GNSRDYDKYL 604
Query: 580 II-----SLEARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARI 633
++ S + ++V + + ++ ++ + TI G L RV+QV R
Sbjct: 605 LLAKAKCSDKEESVVYSVGNSGLDSIDAPEFNPNEDCTIDIGTLAAGSRVVQVLRTEIRS 664
Query: 634 LDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
D + +TQ ++ SE TV+ S A+PY+L D S+ +L D +
Sbjct: 665 YDYNLGLTQIYPVWDEDT-----SEERTVVQASFAEPYLLAIRDDHSLLVLQADKTGDLD 719
Query: 693 SVQTPAAIESSKKPVSSCTLYHDK 716
V+ + +S VS C LY D+
Sbjct: 720 EVEI-QGLATSADWVSGC-LYEDR 741
>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum NZE10]
Length = 1347
Score = 190 bits (482), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 314/1410 (22%), Positives = 556/1410 (39%), Gaps = 212/1410 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G V +LA + D D+++LAF+DAK++++E+D H + S+H
Sbjct: 51 LVLVGEYSLSGTVTNLAQVKL--PDTKTAGDALLLAFKDAKLTLIEWDPENHRISTISIH 108
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+E + G ++ VDP RC + Q+ +L Q L +ED
Sbjct: 109 YYEGDNVVSQPFGPGLGECENILTVDPNWRCAALKFGTRQLAVLPFRQLDDELGVEEDGD 168
Query: 221 GSGGGFSAR-----------------IESSHVINLRDL--DMKHVKDFIFVHGYIEPVMV 261
+ + ++S V+ L L D+++ D F++GY E +
Sbjct: 169 AEPASTTLKRSESILQNVNGEVQQTPYKASFVLALSTLLEDIRYTVDLGFLYGYRESTLG 228
Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
IL + + + + + + LP+ +K++ +P+P+GG
Sbjct: 229 ILSSSLQPSSSLLDIRKDELEYRMFKLELEQGESTELQVVKQLPNSLWKVVPLPAPVGGA 288
Query: 322 LVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP-RSSFSVELDAAHATWL--QNDVA 377
L+VG N+ +H + ++A+N +A +L+S + + +S +++L+ L ++
Sbjct: 289 LLVGTNSFVHVDLNAKVNSVAVNEFA-ALESDRGMEDQSDLNLKLEGCSVEILDAESRQV 347
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLG 426
L+ + G L + GR +Q L DL KT PS + + ++ F+G
Sbjct: 348 LVVLRDGSLATIYFEQSGRSIQGLKVSRVREEHGGDLVKTAPSC----VARLDHNKVFVG 403
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
S G S LV+++ S LS K G + D E
Sbjct: 404 SEDGASSLVRWS--RSISTLSR--KRTHGQMLGQHGDEDDEEALEDDDDDLYDAAPETK- 458
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ATGISKQS--- 537
A++ T++ + SF ++D L ++GP+ D G A TG + S
Sbjct: 459 -KRATSTTDAFETPPSFQIQDVLHSLGPINDVCLGKSDGAQVDKLQMMLGTGRGRSSRIS 517
Query: 538 --NYELVELPG-------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
N ++V + K W V+ K + DD++H L+ + + +
Sbjct: 518 CLNRDIVPVSARKSTIGRAKSAWAVHAKRND-----------RDDDFHDNLLFAYDGQET 566
Query: 589 VLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFG 646
+ D + + + F +G TI L V+Q + R D ++Q +
Sbjct: 567 KIYDVDEVGYMERTAQEFEHEGETIDVQMLAKDTIVVQCRKSEIRTYDADLALSQIIPMV 626
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
++ E ++ +S DPY+L+ +D
Sbjct: 627 DEETD-----EEYEIVYLSFCDPYLLVVRND----------------------------- 652
Query: 707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIF 766
SS + H +G E + D +G +I G L + + G + +F
Sbjct: 653 -SSIQVLHVRGKEIEPLEGEGDIAEKKWLGGSIHT---GSLTKDVPALFLLSAQGTMHVF 708
Query: 767 DVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVE 826
+P+ V Y AL ++S + + G KE + + V E
Sbjct: 709 SLPSLEPV----------------YHAPALPHLPPVLSSDAPQRRA-GPKEALTELLVAE 751
Query: 827 LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
L ++ P+L A ++ Y+ F PE P + + +
Sbjct: 752 LG----ASGVDTPYLVARTALDDLVLYEP--FRHPE-------PAPSDQWYT-------- 790
Query: 887 LRNLRFSRTPL-------DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
NLRF + P+ +A +EE+ P + I ++ + + GS P ++
Sbjct: 791 --NLRFRKVPVTYIPKYNEAIAQEESTRPLPLRSI----HVGDYDAVTIPGSPPL--LLV 842
Query: 940 RER------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
+E L V + +H +C GF V + G+L+ LP + Y W
Sbjct: 843 KEASSLPRVLEVRISNESNRVATLLPIHLDHCKKGFAAVNADGLLEEYHLPLSAWYGTGW 902
Query: 994 PVQKVI--PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
VQ+V + Y + +Y + V + + Q G Q D
Sbjct: 903 SVQQVDLGSEDLEVRHLAYHETRGVYVVATCKDVDFYFAEDDHRHLGQSGGGQDDITLRP 962
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
V + + V R+++ +RA +P + AL V + + + T E +
Sbjct: 963 QVKQYSIHLVSSKTHRVID---------SRA-MPYLEAITALQVMPLEV-SELTHEQDLR 1011
Query: 1112 LAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQNLVT-EVYSKE-LKGAISALASLQG 1166
+ + TA ++GED+ ARG +++F+ D P++ + V ++E KGAI+ALA G
Sbjct: 1012 ILVSTAAMRGEDMPARGAIIVFNIIDVVPAPDVPESGIKLHVNAREETKGAITALAPFPG 1071
Query: 1167 HLL-IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNI-VKNFILLGDIHKSIYFLS 1222
+ G KI++ K G+ L +AF DA V + L GD K ++F
Sbjct: 1072 GFVGSGQGQKIMIRGLKEDGSCLP-VAFLDAQCHTTVIKTLGTSGMWLAGDAWKGLWFGG 1130
Query: 1223 WKEQGAQLNLLAK-DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+ E+ +L +L K ++ A EFL L +++ D ++ + Y P+ +S G
Sbjct: 1131 FTEEPYKLTVLGKAPERQMEVMAAEFLPFDGALYILIIDADMDLHVLQYDPENPKSQNGM 1190
Query: 1282 KLLSRAEFHVGAHVTKFLRL---------QMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
+LL R+ FH+G T L L T+ D G +P + + F +L +L G
Sbjct: 1191 RLLHRSTFHLGHFATNMLLLPSSLNPFGENQPFTNGDTNGESP-EESSPLFHVLTTSLTG 1249
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
SIG I PLDE ++RRL +LQ L + H A LNPR++R S G +VD +
Sbjct: 1250 SIGMITPLDESSYRRLSALQTHLTTILEHPASLNPRAYRAIESESFG---GARGVVDGNI 1306
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ L + ++ + G I S+L
Sbjct: 1307 VRRINELGAARRADVLARAGADAWSIRSDL 1336
>gi|134025022|gb|AAI35011.1| LOC564406 protein [Danio rerio]
Length = 348
Score = 189 bits (479), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 104/283 (36%), Positives = 165/283 (58%), Gaps = 13/283 (4%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
++LN+ + P+ + LD + A+++ +D ++S
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVIS 338
>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
Length = 1398
Score = 187 bits (476), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 147/539 (27%), Positives = 260/539 (48%), Gaps = 39/539 (7%)
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
C+R+ ++ G++ F+SG PC+ ++ R H G V + + H C GF
Sbjct: 882 CKRLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 940
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
YV ++++ +LPS + +D+ W +K I I Y + Y + S
Sbjct: 941 AYVDEDNVIRMSRLPSNTRFDSGWATRK-IAFGEQVDSIVYSSASECYVIGTSA------ 993
Query: 1030 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1089
+ L D E + N ++ + +E V++LEP W T + ++ +
Sbjct: 994 KEDFKLPEDDESHTEWRNEFITFLP-----QLERGTVKLLEPRN----WSTIDSHELEPA 1044
Query: 1090 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QN 1144
E + + V+ L + T E + ++ +G++ V+GED+ +G + +F P ++
Sbjct: 1045 ERIMCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKS 1104
Query: 1145 LVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYV 1199
++++KE +KGA++AL+ + QG L++A G K ++ K G+ L +AF D YV
Sbjct: 1105 KKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYV 1162
Query: 1200 VSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
L +K ++GD K ++F + E+ +L+L K+ +L +FL DG+ L ++
Sbjct: 1163 NVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYIL 1222
Query: 1258 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--AHVTKFLRLQMLATSS--DRTGA 1313
V+D+ N+ + Y P+ S KG +LL R+ FH G A L TSS D
Sbjct: 1223 VADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVTLLPHGAHTTSSPVDEDAM 1282
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
S +++ +L GSI I PL E ++RRL +LQ +LV+++ H LNPR +R
Sbjct: 1283 DTDSPPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAV 1342
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
S+G + G ++D LL + + + + EIA + G I +L L G ++L
Sbjct: 1343 ESDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRIDLEKLHGGLAYL 1398
Score = 106 bits (265), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 178/735 (24%), Positives = 288/735 (39%), Gaps = 98/735 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R + A L L Y + G + SL
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTAAQPDQKGRN---ERSQHAKLVLAAEYEVPGTITSL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+II++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVK---ISNSKSGGDAIIVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVP 140
Query: 176 SFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIES- 232
P + DP G C + +G+ + IL Q G LV D+ G SA + S
Sbjct: 141 DLGSCPSSLTADPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGNDSADVVSD 199
Query: 233 -----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
S V+ + LD + H F+H Y EP IL+ +
Sbjct: 200 PQKSAPENTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 259
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ S ++ K + + LP D +K++ +P P+GG L++G N +H
Sbjct: 260 TIDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVD 319
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIIEQLGSGTGDILLILADGRMSILS 379
Query: 391 VVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
DGR V + L ++N S+ + T ++G + F GS GDS+L+ ++ S T
Sbjct: 380 FKVDGRSVSGISLHFVAEQSNGSITIARPTCSASLGRNKLFCGSEEGDSILLGWSRPSST 439
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES------A 497
S K G E A D D + ++L AS E +
Sbjct: 440 IKRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSANLASTRQEKQVVNGDS 497
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WT 552
F F D L ++GP +D + G + + S +EL +G
Sbjct: 498 PADFIFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPEIAAPLELVAARGFGKSGGLA 557
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-------------MVLETADLLTEV 599
V + + DS +M DD Y + I ++ ++ ++ +T D +
Sbjct: 558 VLKREIDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAKTKD--DDK 612
Query: 600 TESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
ESV Y V + TI G L RV+QV R D Y
Sbjct: 613 EESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLAAGTRVVQVLRTEIRSYD--YNLGLA 670
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAIE 701
P E SE TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 671 QIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA-- 726
Query: 702 SSKKPVSSCTLYHDK 716
S K +S C LY DK
Sbjct: 727 -SGKWISGC-LYEDK 739
>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
2508]
Length = 1437
Score = 187 bits (474), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 168/624 (26%), Positives = 281/624 (45%), Gaps = 67/624 (10%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
K IP+ + Y Y +V ++P L D + + N++
Sbjct: 952 LGLSVKKIPVGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1110
+ V+ +++L +G W T+ M+ E L V + L + +T E +
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1161
L+A+GTA ++GED+ RGRV +F P T SK+LK GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113
Query: 1162 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1215
+ + QG +L+A G K ++ K GT L +AF D YV S+ + L+ D
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
K ++F + E+ ++ L K ++ +FL DG L +V SD +I I + P+
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+S +G LL R F+ GAH L + A + + + S++ + LL + G +
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1388
+ PL E +RRL SL +L + +PH AGLNP+ +R + A PG D+ IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTG 1412
D ++L + L ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374
Score = 127 bits (318), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ I+ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
Length = 1437
Score = 187 bits (474), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 168/624 (26%), Positives = 281/624 (45%), Gaps = 67/624 (10%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
K IP+ + Y Y +V ++P L D + + N++
Sbjct: 952 LGLSVKKIPVGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1110
+ V+ +++L +G W T+ M+ E L V + L + +T E +
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1161
L+A+GTA ++GED+ RGRV +F P T SK+LK GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113
Query: 1162 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1215
+ + QG +L+A G K ++ K GT L +AF D YV S+ + L+ D
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
K ++F + E+ ++ L K ++ +FL DG L +V SD +I I + P+
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+S +G LL R F+ GAH L + A + + + S++ + LL + G +
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1388
+ PL E +RRL SL +L + +PH AGLNP+ +R + A PG D+ IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTG 1412
D ++L + L ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374
Score = 127 bits (318), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ I+ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEKERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1437
Score = 186 bits (473), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 168/624 (26%), Positives = 281/624 (45%), Gaps = 67/624 (10%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
K IP+ + Y Y +V ++P L D + + N++
Sbjct: 952 LGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005
Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1110
+ V+ +++L +G W T+ M+ E L V + L + +T E +
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1161
L+A+GTA ++GED+ RGRV +F P T SK+LK GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113
Query: 1162 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1215
+ + QG +L+A G K ++ K GT L +AF D YV S+ + L+ D
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
K ++F + E+ ++ L K ++ +FL DG L +V SD +I I + P+
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+S +G LL R F+ GAH L + A + + + S++ + LL + G +
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1388
+ PL E +RRL SL +L + +PH AGLNP+ +R + A PG D+ IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTG 1412
D ++L + L ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374
Score = 124 bits (311), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 154/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + ++ + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ ++ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
Length = 1454
Score = 186 bits (473), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 162/589 (27%), Positives = 273/589 (46%), Gaps = 84/589 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G + + PLV+ DP
Sbjct: 92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + S RI S +VI L+ +D + ++
Sbjct: 149 RCAACLVYGKHIAILPFHEN-----------------SKRIHS-YVIPLKQIDPRLDNIA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSS---FSVE 363
+LL +P P+GG LV G+NT+ Y +Q+ C L LN+ D + P +
Sbjct: 251 CSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS---CYDGFTKFPLKDLKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + ++ GDL LL ++ G V+ L+ SK + + +T
Sbjct: 308 LDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD--ALQDMV 479
F+GSRLGDS L+++T T + KRL+ + D A + +
Sbjct: 368 HLFVGSRLGDSQLLEYTLLKTTRDC----------------AVKRLKIDNKDPAAAEIEL 411
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ +++ LYG A +++ E ++ F D L N+GP+K G R N ++ +
Sbjct: 412 DEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVGPVKSMCVG-RPNYMSNDLVDA 470
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++LV G G V+ +S R SS + ++E H YLI+
Sbjct: 471 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQLWAVGRKENESHKYLIV 530
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
S R+ ++ E + T+AAG L +QV A + DG M
Sbjct: 531 S-RVRSTLILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 588
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDP 687
Q++ N V+ SI DPYV L +G + L LV +P
Sbjct: 589 QEVHI----------DSNFPVIQASIVDPYVALLTQNGRLLLYELVMEP 627
Score = 185 bits (470), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 176/706 (24%), Positives = 323/706 (45%), Gaps = 70/706 (9%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD-TYMREALKDSETEINSSSEEGTGQ 813
+V +E+G L I +P V+ + +F + +VD T E + ++ E
Sbjct: 777 IVSHENGRLSIHSLPEMEVVYQIGRFSNVPELLVDLTVEEEEKERKAKAQQAAKEASVPT 836
Query: 814 GRKENIHSM------KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
E +++ +V+E + + + P L AI+ D ++ Y+ + S
Sbjct: 837 DEAEQLNTEMKQLCERVLEAQIVGMGINQAHPILMAIV-DEQVVLYEMF---------SS 886
Query: 868 DDPVSTSRSLSVSNV-------SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
+P+ +S + ++S L N R P + + +G I F+ +
Sbjct: 887 SNPIPGHLGISFRKLPHFICLRTSSHL-NSDGKRAPFEM----KINNGKRFSLIHPFERV 941
Query: 921 SG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QG 976
S + G + G+ P +V+ ++ H DG I AFT +N N HG +Y+T +
Sbjct: 942 SSVNNGVMIVGAVPTL-LVYGAWGGMQTHQMTVDGPIKAFTPFNNENVLHGIVYMTQHKS 1000
Query: 977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
L+I ++ Y+ +PV+K I + T H + Y ++Y ++ S+P KP N++ ++
Sbjct: 1001 ELRIARMHPDFDYEMPYPVKK-IEVGRTIHHVRYLMNSDVYAVVSSIP--KPSNKIWVVM 1057
Query: 1037 ID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
D QE H+ D + + + YT+ + + D A P I + E
Sbjct: 1058 NDDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ----DWAAVP---NTEISFEDMEAVTA 1108
Query: 1095 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE----- 1148
V L + +T ETLLA+GT GE+V RGR++L P +
Sbjct: 1109 CEDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIEVVPEPDQPTSNRKIKV 1168
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
++ KE KG ++ L ++ G LL G K+ + ++ +L GI+F D YV L+ ++
Sbjct: 1169 LFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSLRTI 1227
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKN 1264
+ D +S+ + ++E +++ ++D C A++ ++DG+ + ++SDE N
Sbjct: 1228 AIACDARESMSLIRFQEDNKAMSIASRD--DRKCAQPPMASQLVVDGAHVGFLLSDETGN 1285
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
I +F YAP+ ES G++L RA ++G ++ F+RL+ + R
Sbjct: 1286 ITMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMT 1345
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKA 1379
+F +LDGS G + PL E ++RRL LQ + P +AGL+ + R Q NG+
Sbjct: 1346 TVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRN 1405
Query: 1380 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
R +++D +++ Y L L ++ ++A + G R I+ +L L
Sbjct: 1406 AR----NLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQL 1447
>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
Length = 1506
Score = 186 bits (473), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 180/690 (26%), Positives = 297/690 (43%), Gaps = 128/690 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV NV+ +Y + VQ +G+ + + E +DG+ + V R GN
Sbjct: 29 NLVTVQGNVLSVYNL-VQAQGAADKRCHLEADISFTLDGVP----QDVATVRPRGN---- 79
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLK 171
RD +I F+DA+++++ FD + L S+H FE + W +
Sbjct: 80 ------------SRDLLIFTFKDARVAIVRFDPKMRDLETVSLHAFEDTDTKLGGWHSEQ 127
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF-GSGGGFSARI 230
R R V VDP RC ++VYG ++I++ S G + + DT + F++R
Sbjct: 128 RLR--------VCVDPLHRCAALMVYGCKLIVISFSSGTATAAPEADTQEDTEQSFTSR- 178
Query: 231 ESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
VI+L L + V D F+ GY P + ILH+ W G ++ T ++ALS+
Sbjct: 179 ----VIDLLSLPSTIGRVDDMAFLDGYDVPCLAILHQPRPAWVGHMAKTKDTAHVTALSL 234
Query: 289 S------------TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
+ P++W NLP D + L VP+P+GGV+V+G N + Y +QS
Sbjct: 235 ALDEMTARRAPTAPPPPPPPVVWHQENLPSDTFALQPVPAPLGGVVVIGVNVLFYVTQSL 294
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
+LALN Y+ + ++ ++ S++LD AH L L + +GD+ LLT+V
Sbjct: 295 VRSLALNGYSRASTNAPIQEQTGISLDLDGAHHALLTPTQILFALPSGDIHLLTIVCTDV 354
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--K 451
V L + K SV+ SDI T+G F+ SR SLL+++ + T + SG+ +
Sbjct: 355 TVDGLRMDKLATSVIGSDICTLGRRHIFIASRHATSLLLEWAPIPLSATTHIDVSGVSGR 414
Query: 452 EEFGDIEADAPSTKRLRRSSS------------DALQDMVNGEELSLYGSASNNTESAQK 499
++ G + ST L S+S D D+V+G +G S
Sbjct: 415 DDAGLYGTSSDSTAALNTSASRDGSSTGGDDLDDVYGDVVDGGTTGAHGIGSGGR---VM 471
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE---------------- 543
T RD+L + P+K + G +A +S YELV
Sbjct: 472 TVKLMARDALPTVAPIKSTAVG--TSAQGVVPHADPRSQYELVSCIGHDKNGALANISYS 529
Query: 544 -----------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
L K W V+ +S+ +H +++ S +TMV
Sbjct: 530 LKPQVLLTEDALSSVKDCWAVHSNNSK---------------HHTHVVFSKPKKTMVFRV 574
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
A ++ + + T+ AGN+ GR+ V+QV + +LD +D F
Sbjct: 575 AGDFEQLRHPRGFDTEASTVFAGNVMGRQLVLQVTAKHVMLLDD----RDCVFDERM--- 627
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ + VS+ADPY+ L ++D + ++
Sbjct: 628 ---KKGVRITKVSVADPYIALLLNDATTKV 654
Score = 122 bits (307), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 163/724 (22%), Positives = 288/724 (39%), Gaps = 92/724 (12%)
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN------------ 804
C ++G L IF VP+ VF F D+ R+ + E E
Sbjct: 807 CDKNGVLSIFQVPDMREVFCCTVFSVLPNVAWDSVYRKEIGPVELEPEMPLKRAKTMDEK 866
Query: 805 -----------SSSEEGTGQGRKENIHSMKVVELAMQRWSA----HHSRPFLFAILTDGT 849
+ E G+ Q ++ ++ E+ + A SRP LF
Sbjct: 867 GQSVFVEADEEADDESGSAQAEEDEQDRLQRKEMTIVELLAIGLGRGSRPHLFLRNETQH 926
Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
++ Y+ + TS S R RLR T +D + +
Sbjct: 927 VIVYEIF-------TS------SYKRHEKYEGRLQIRLRKRHQHPTWIDERLAQSS--SI 971
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
P F +ISG G F+ RP W M + +R H DG++ FT L +
Sbjct: 972 PPAAFRPFADISGCDGVFVCARRPSWFMCDHTHKVVRHHAMRFDGAVQCFTQLKHAMHTS 1031
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK 1027
F+Y T +G++++ +G P ++ P+KA+ + + E +Y +V + +
Sbjct: 1032 CFLYFTGKGVMRMATTAAGQVLSTPLPSRRT-PIKASACYVDFDPESGVY--VVVLKHKE 1088
Query: 1028 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1087
P + E +D S L + E Y + + + WQ P++
Sbjct: 1089 PCAHLPKFGPPMEEAPAVDMKFASDEPLPQR---ERYSICLFSCED----WQLVPNSPVE 1141
Query: 1088 SSENALTVRVVTLFNTTTKENET----LLAIGTAYVQGEDVAARGRVLLFSTGRNADNP- 1142
+ V + N +++ + T +A+GT V GE RG + L+ P
Sbjct: 1142 IPADH-HVTAFKVINISSERHLTGKKPCVAVGTTPVLGERNLERGLLQLYDVLEVVPEPG 1200
Query: 1143 ----QNLVTEVYSKELKGAISALASLQGHLLIA----SGPKIILHKWTGTE-LNGIAFYD 1193
+N + + S + GA++AL S++G+++ A GPKI + + E L IAF +
Sbjct: 1201 KPTTKNRLKLMLSSDETGAVTALNSIEGYVIGALARRDGPKIFVWRVEDDEKLQPIAFLE 1260
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYF--LSWKEQGAQLNL-----------LAKDFGSL 1240
++ V+L + NF+++GD + L E LNL + +D
Sbjct: 1261 GS-MFTVTLKVALNFVIIGDYMGRVMLARLIKDETLKILNLSKGTTSQALLQVGRDVAPT 1319
Query: 1241 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKF 1298
+A +F++ G+ L ++ D+ N+ I + + +G ++L R + H +
Sbjct: 1320 SVYAADFIVRGAELHVLFLDQHANMTILAFDSD-DPTTRGGRILKRHSVYNTGHQRIVAL 1378
Query: 1299 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1358
RLQ + + R + L + TL+G G I + E FRRL LQ +L+
Sbjct: 1379 TRLQNVPPRNSRNAT------VDAHFLTYQTLEGGAGYITSIPEDIFRRLMLLQLRLLPH 1432
Query: 1359 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+ AGL+P +F+++ S + ++ + ML L+ Q E+A Q GTT Q+
Sbjct: 1433 LKFRAGLHPSAFKKYKSASLHMVHQEVRTICADVYTRLFMLDLDAQKEVARQVGTTTKQL 1492
Query: 1419 LSNL 1422
+
Sbjct: 1493 CDDF 1496
>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
127.97]
Length = 1267
Score = 186 bits (471), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 144/539 (26%), Positives = 260/539 (48%), Gaps = 39/539 (7%)
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
C+ + ++ G++ F+SG PC+ ++ R H G V + + H C GF
Sbjct: 751 CKLLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 809
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
YV ++++ +LPS + +D+ W +K I L I Y + Y + S
Sbjct: 810 AYVDEDNVIRMSRLPSNTRFDSGWATRK-IALGEQVDSIVYSSASECYVIGTSA------ 862
Query: 1030 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1089
+ L D E + N ++ + +E V++LEP W T + ++ +
Sbjct: 863 KEDFKLPEDDESHTEWRNEFITFLP-----QLERGTVKLLEPKN----WSTIDSHELKPA 913
Query: 1090 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QN 1144
E + V+ L + T E + ++ +G++ V+GED+ +G + +F P ++
Sbjct: 914 ERITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKS 973
Query: 1145 LVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYV 1199
++++KE +KGA++AL+ + QG L++A G K ++ K G+ L +AF D YV
Sbjct: 974 KKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYV 1031
Query: 1200 VSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
L +K ++GD K ++F+ + E+ +L+L K+ +L +FL DG+ L ++
Sbjct: 1032 NVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYIL 1091
Query: 1258 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS----DRTGA 1313
V+D+ N+ + Y P+ S KG +LL R+ FH G + L A + D
Sbjct: 1092 VADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGAYTPSAPVDEDAM 1151
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
S +++ +L GSI I PL E ++RRL +LQ +LV+++ H LNPR +R
Sbjct: 1152 DTDSLPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAV 1211
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
S+G + G ++D LL + + + + EIA + G I ++L L G ++L
Sbjct: 1212 ESDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRTDLEKLHGGLAYL 1267
Score = 94.4 bits (233), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 133/579 (22%), Positives = 232/579 (40%), Gaps = 65/579 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVVKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVP 140
Query: 176 SFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------D 218
P + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 141 DLGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSD 199
Query: 219 TFGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 200 AQKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 259
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ S ++ + + + LP D +K++ +P P+GG L++G N +H
Sbjct: 260 TIDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVD 319
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILS 379
Query: 391 VVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 380 FKVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSR 435
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES--- 496
S T+ S K G E+ A D D + ++L AS E
Sbjct: 436 PSSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASIRQEKQVV 493
Query: 497 ---AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI--- 550
+ F+F D L ++GP +D + G + + S + +EL +G
Sbjct: 494 NGDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKS 553
Query: 551 --WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
TV + + DS +M DD Y + I ++ ++
Sbjct: 554 GGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKS 589
>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus A1163]
Length = 1401
Score = 185 bits (469), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 152/539 (28%), Positives = 261/539 (48%), Gaps = 41/539 (7%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
+ I NIS F+ G RP ++ + H G V + L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1030
YV S+ +L+IC+ PS + +D W ++K+ + H + Y Y L S +
Sbjct: 943 YVDSKDVLRICRFPSDTLFDYTWALRKISIGEQVDH-LAYATSSETYVLGTS------HS 995
Query: 1031 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
L D E+ N L L + + ++++ P W + + E
Sbjct: 996 ADFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDE 1048
Query: 1091 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1148
+ V+ + L + T E ++ +GTA+ +GED+ +RG + +F + +P+ T+
Sbjct: 1049 YVMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEKPETDR 1108
Query: 1149 ---VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1200
+ KEL KGA++AL+ + QG L+ A G K ++ K G+ L +AF D YV
Sbjct: 1109 KLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVN 1166
Query: 1201 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
L +K ++GD K ++F + E+ +++L KD G L+ A EFL DG L ++V
Sbjct: 1167 VLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLFILV 1226
Query: 1259 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS- 1317
+D N+ + Y P+ +S G +LL+R++FH+G T L SS++ A P S
Sbjct: 1227 ADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSM 1286
Query: 1318 ---DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
+T +L + GS+G + + E ++RRL +LQ +L +S+ H GLNPR++R
Sbjct: 1287 EIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVE 1346
Query: 1375 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTSFL 1432
S+G A R ++D LL + + ++EIA + G +I ++L + A G +L
Sbjct: 1347 SDGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLGYL 1401
Score = 136 bits (343), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV +V++I+ +++VQ E+ + + D + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83
Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
+ + S+ G + +++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 84 ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
+ + G ++ VDP RC V +G++ + IL Q G L D+ F
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196
Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
G G ++ SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
T +S + + + ++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316
Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
+H + A+ +N +A + + + +S ++ L+ + + LL +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376
Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
+VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
+ ++ D +S DA + D+ E E G + +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
+ F D L NIGPL+D + G + + + K + EL EL +G
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551
Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
+WT G D ++ + EY Y+I+S +
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607
Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
T ET+++ T+ + F TI G L ++RV+QV R SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663
Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D+ G P E S+ +S S+ADPY+ + D ++ +L D S
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
V+ A + K SC LY DK
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740
>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
Length = 1401
Score = 184 bits (468), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 152/539 (28%), Positives = 261/539 (48%), Gaps = 41/539 (7%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
+ I NIS F+ G RP ++ + H G V + L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1030
YV S+ +L+IC+ PS + +D W ++K+ + H + Y Y L S +
Sbjct: 943 YVDSKDVLRICRFPSETLFDYTWALRKISIGEQVDH-LAYATSSETYVLGTS------HS 995
Query: 1031 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
L D E+ N L L + + ++++ P W + + E
Sbjct: 996 ADFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDE 1048
Query: 1091 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1148
+ V+ + L + T E ++ +GTA+ +GED+ +RG + +F + +P+ T+
Sbjct: 1049 YVMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEKPETDR 1108
Query: 1149 ---VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1200
+ KEL KGA++AL+ + QG L+ A G K ++ K G+ L +AF D YV
Sbjct: 1109 KLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVN 1166
Query: 1201 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
L +K ++GD K ++F + E+ +++L KD G L+ A EFL DG L ++V
Sbjct: 1167 VLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLFILV 1226
Query: 1259 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS- 1317
+D N+ + Y P+ +S G +LL+R++FH+G T L SS++ A P S
Sbjct: 1227 ADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSM 1286
Query: 1318 ---DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
+T +L + GS+G + + E ++RRL +LQ +L +S+ H GLNPR++R
Sbjct: 1287 EIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVE 1346
Query: 1375 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTSFL 1432
S+G A R ++D LL + + ++EIA + G +I ++L + A G +L
Sbjct: 1347 SDGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLGYL 1401
Score = 136 bits (342), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV +V++I+ +++VQ E+ + + D + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83
Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
+ + S+ G + +++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 84 ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
+ + G ++ VDP RC V +G++ + IL Q G L D+ F
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196
Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
G G ++ SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
T +S + + + ++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316
Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
+H + A+ +N +A + + + +S ++ L+ + + LL +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376
Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
+VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
+ ++ D +S DA + D+ E E G + +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
+ F D L NIGPL+D + G + + + K + EL EL +G
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551
Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
+WT G D ++ + EY Y+I+S +
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607
Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
T ET+++ T+ + F TI G L ++RV+QV R SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663
Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D+ G P E S+ +S S+ADPY+ + D ++ +L D S
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
V+ A + K SC LY DK
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740
>gi|260835073|ref|XP_002612534.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
gi|229297911|gb|EEN68543.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
Length = 318
Score = 184 bits (468), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 107/297 (36%), Positives = 168/297 (56%), Gaps = 38/297 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LVV + +Y ++ E S++ K +ELV + ++GN+ S+
Sbjct: 29 SLVVAGTTQLHVYRLKGDMEKSRKQK------------------MELVASFSMYGNIMSV 70
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G+D RD+++L+F DAK+S++E+D H L+ SMH FE E +K G S
Sbjct: 71 ESVQLAGSD----RDALLLSFMDAKLSIVEYDPGTHDLKTASMHYFEEEE---VKDGYVS 123
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+GRC +L+YG ++++L + G+ DE +G S I +++I
Sbjct: 124 NYHAPMVRVDPEGRCAVMLIYGKRLVVLPFRKEGAV---DEAEMSAGSKSS--ILPTYMI 178
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L+DLD + +V D F+HGY +P ++IL+E TW GRV+ + TC I A+S++ +
Sbjct: 179 KLQDLDERLINVVDLQFLHGYFDPTLLILYEPLQTWPGRVAVRQDTCCIVAVSLNIAQRV 238
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
HP+IWS NLP D + +AVP PIGGVLV N++ Y +QS Y VSL+S
Sbjct: 239 HPIIWSVGNLPFDCKQAVAVPKPIGGVLVFAVNSLLYLNQSVP------PYGVSLNS 289
>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
Length = 1333
Score = 183 bits (465), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 297/1402 (21%), Positives = 533/1402 (38%), Gaps = 200/1402 (14%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ + L L+ Y L G V S+A + D ++I+LAF++AK+S++E+D H +
Sbjct: 45 AQSKLVLIGGYPLAGTVTSIARVKT--LDTRTGGEAILLAFKNAKLSLIEWDPENHRIST 102
Query: 157 TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL------------ 204
S+H +E + G ++ VDP RC + Q+ IL
Sbjct: 103 VSIHYYEGENVIAQPYGPSLGEYESILTVDPGSRCAALKFGARQLAILPFRQFGDELLGE 162
Query: 205 ------KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
A+ G + D G + S V+ L LD + H D F+H Y
Sbjct: 163 EEGEFENANDGTTSKKHDAMQNGEDEAEQTPYKQSFVLPLTTLDPALSHTIDLAFLHEYR 222
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
EP I+ + + ++ K + + NLP +K++ +PS
Sbjct: 223 EPTFGIISSAIEPSYALFDERKDILSYTVFTLDLEQKASTNLITVPNLPSTLWKVVPLPS 282
Query: 317 PIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND 375
PIGG L++G N IH + A A+N +A+ +S +++L+ L
Sbjct: 283 PIGGALLIGTNEFIHVDQSGKANATAVNEFAMKESDFGMADQSGLNLKLEGCSVEILNAS 342
Query: 376 VA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLG 426
L+ + G L + GR + + ++ + S S ++ + + F+G
Sbjct: 343 TGEMLVVLRDGSLATVDFKMLGRSISAVIVTIISSENGGKVFSTAPSCVSRLDQNNLFIG 402
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
S G S L+ C + S LS K + + D +
Sbjct: 403 SEDGSSSLL--GCTNSQSGLSR--KRSHAQMLGQNSGDEEEDVLDEDDDDLYDAEPDSKK 458
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-------SATGISKQSNY 539
++ + + F ++D L +IGP+ + G R NA + TG S+ S
Sbjct: 459 RATSVAEQSAGDSSTHFVIKDDLHSIGPINNTCVG-RSNAAGEDKLQLLAGTGKSRSSRL 517
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
+ +R R ++ HA+ + + E E +
Sbjct: 518 ACI---------------NRDIIPQHIRKNQFEGARHAWAVCAREKNANDDEAGEGGYTE 562
Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSEN 658
+ ++ G T+ L V+QV R D ++Q + +++
Sbjct: 563 RSATEFEHDGETLEVLTLGHGTAVVQVRRMEIRTYDSHLALSQIIPMIDDETDA-----E 617
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP 718
+++ S DPY+L+ D SI++ ++ +K + + D
Sbjct: 618 FSIVHTSACDPYLLVIRDDSSIQV-----------------LQHERKDIEPLDITPDAAD 660
Query: 719 EPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD 778
+ W+ GE DG + + G+L + +P+ V
Sbjct: 661 KQWMSGCVYS-------GEFTDG---------NAALFLLSAEGSLHVLTLPDLQLV---- 700
Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR 838
YM AL + S+ G KE + + V +L +
Sbjct: 701 ------------YMTPALPHLPP-VLSADVSHRRMGVKETLTELLVADLGNDGVL----Q 743
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P+L ++ Y+ P +S S S + NLRF + P+
Sbjct: 744 PYLTVRTAMDDVVLYE---------------PFHSSPSASTGPWHS----NLRFRKVPVP 784
Query: 899 AYTR-EETPHGAPCQRITIFK--NISGHQGFFLSGSRPCWCMVFRER------LRVHPQL 949
+ ++P P R + I G+ + G+ C ++ +E L V+
Sbjct: 785 YIPKYNDSPLEDPNARPPALRRMQIGGYNTVSIPGAPSC--LLLKEASGPPKILEVNEPK 842
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
+ T L+ + C +GF V G L CQLP + + W +++ I L ++
Sbjct: 843 RSNATTILTPLNRIGCENGFATVDVNGALHECQLPPDAWFSTGWSIRQ-IDLGDDAREVR 901
Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS---SVDLHRTYTVEEYEV 1066
+ A + V+ + +E G + ++S V + + +
Sbjct: 902 HLAYHEARGIFVAATC-----TTVDFYFAEEDGRHPEQDDISIRPQVPQYSVHLISAKTH 956
Query: 1067 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAA 1126
+I+ + +P + AL V + + + E + ++ + T +GED+ A
Sbjct: 957 KIIHTHK----------LPYLETVTALKVMPAEV-SELSHEVKPVVVVSTGAQRGEDMPA 1005
Query: 1127 RGRVLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASLQGHLL-IASGPKIILHK 1180
+G +++F +P L V ++E +GAI+ALAS G ++ A G K+++
Sbjct: 1006 KGALIVFDVIDVVPDPDVEESGLHLHVLAREESRGAITALASFPGGMIGTAQGLKLMIR- 1064
Query: 1181 WTGTELNG----IAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLA 1234
G +G +AF DA Y L + + L GD K ++F + ++ +L LL
Sbjct: 1065 --GMREDGSCLPVAFLDAQ-CYTSLLKTLDSRGLWLAGDAWKGLWFGGFTQEPYKLTLLG 1121
Query: 1235 KDFGS-LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1293
K + ++ +FL L L+V D ++ + Y P+ +S GQ+LL R+ FH+G
Sbjct: 1122 KSPRTEMEVIEADFLPFDGALFLLVLDADADLHVLQYDPENPKSLNGQRLLHRSTFHIGH 1181
Query: 1294 HVTKFLRL-QMLATSSDRTGAAPGSDKTNR---------FALLFGTLDGSIGCIAPLDEL 1343
T + L LA +++ P D + F +L T GSIG I PLDE
Sbjct: 1182 FPTGSMLLPSTLAPFTEQARDLPNGDSEDTKQEEVNSPLFHVLTTTSSGSIGLITPLDES 1241
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA---HRPGPDSIVDCELLSHYEMLP 1400
T+RRL +LQ L + + H AGLNPR +R + KA G +VD L+ L
Sbjct: 1242 TYRRLSALQGHLTNILEHAAGLNPRMYRT-DTEMKATDSEMGGAKGVVDGSLIRRISELG 1300
Query: 1401 LEEQLEIAHQTGTTRSQILSNL 1422
+ ++ + G Q+ S+L
Sbjct: 1301 AARRADVLSRVGGDVWQLRSDL 1322
>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
Length = 1400
Score = 182 bits (463), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 150/543 (27%), Positives = 261/543 (48%), Gaps = 51/543 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRER----LRVHPQLCDGSIVAFTVLHNVNCNHGF 969
+ I NIS F+ G + + + R+ + G ++ L + + + GF
Sbjct: 885 LRILPNISDLSAVFMPGPSASFILKTAKSCPHVFRLRGEFVRG--LSIFDLASPSLDKGF 942
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
IYV S+ +L+IC+ PS + +D W ++K+ + H + Y Y L S
Sbjct: 943 IYVDSKDVLRICRFPSETLFDYTWALRKIGIGEQVDH-LAYATSSETYVLGTS------H 995
Query: 1030 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1089
+ L D E+ N +S + R +++ R W + + +
Sbjct: 996 SADFKLPDDDELHPDWRNEVISFLPELRQCSLKVVSPRT---------WTVIDSYSLGPA 1046
Query: 1090 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1148
E + V+ + L + T E ++ +GTA+ GED+ +RG + +F + +P+ T+
Sbjct: 1047 EYVMAVKNMDLEVSENTHERRNMIVVGTAFAWGEDIPSRGCIYVFEVIKVVPDPEKPETD 1106
Query: 1149 ----VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYV 1199
+ KEL KGA++AL+ + QG L+ A G K ++ K G+ L +AF D YV
Sbjct: 1107 RKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV 1164
Query: 1200 VSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
N+VK ++GD K ++F + E+ +++L KD G L+ A EFL DG L
Sbjct: 1165 ---NVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKL 1221
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
++V+D N+ + Y P+ +S G +LL+R++FH+G T L SS++ A
Sbjct: 1222 FILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMAD 1281
Query: 1315 PGS----DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1370
P S +T +L + GS+G + + E ++RRL +LQ +L +S+ H GLNPR++
Sbjct: 1282 PDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLTNSLEHPCGLNPRAY 1341
Query: 1371 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGT 1429
R S+G A R ++D LL + + ++EIA + G +I ++L + A G
Sbjct: 1342 RAVESDGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGL 1397
Query: 1430 SFL 1432
+L
Sbjct: 1398 GYL 1400
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 181/751 (24%), Positives = 310/751 (41%), Gaps = 127/751 (16%)
Query: 57 NLVVTAANVIEIY-VVRVQEE---GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
NLVV +V++I+ +++VQ G+ E K++ D + L L Y L G
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHLRGGTIEGKSARP-------DRVETTKLVLEREYPLSGT 80
Query: 113 VESLA---ILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
V + IL+ GG ++++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 81 VVDICRVKILNPKSGG-------EALLLAFRNAKLSLVEWDPERHGISTLSIHYYERDDL 133
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGD-------EDT 219
+ + G ++ VDP RC V +G++ + IL Q G L D +D
Sbjct: 134 TRSPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDD 192
Query: 220 FGS-----GGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
F G ++ SS V+ L LD + H F++ Y EP +L+
Sbjct: 193 FNQVSDHVGNDLKSKDRTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGVLY 252
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
+ T + + + + ++ + + S LP D +K++A+P P+GG L++
Sbjct: 253 SQIATSHALLPERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLI 312
Query: 325 GANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G+N +H + A+ +N +A + + + +S ++ L+ L + LL
Sbjct: 313 GSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHLSDSTGDLLLVL 372
Query: 382 KTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLL 434
+G++VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L
Sbjct: 373 SSGNMVLVHFQLDGRSVSGISLRPLPAQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVL 432
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASN 492
+ ++ S + ++ D +S D + D+ E E G +
Sbjct: 433 LSWSSMSSN---PKKPRPRMSNVAEDREEASVDSQSEEDVYEDDLYTAEPETPALGRRPS 489
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG-----------LRINADASATGISKQS---N 538
S + F + D L NIGPL+D + G L NA + I+ Q N
Sbjct: 490 AETSGVGVYIFQILDRLPNIGPLRDITLGKPASTVENTGRLIENACSELELIAAQGSGRN 549
Query: 539 YELV--------------ELPGCKGIWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIIS 582
LV + +G+WT G D R+ + EY Y+I+S
Sbjct: 550 GGLVLMKREIEPDVAASFDAQSVQGVWTAVVALGSGAPLVPDEQRI---NQEYRQYVILS 606
Query: 583 L-------EARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
++ + + DL E + D TI G L +RRV+QV
Sbjct: 607 KPEAPDKEQSEVFIADKQDLKPFKAPEFNPNNDV-----TIEIGTLSCKRRVVQVLRNEV 661
Query: 632 RILDGSYMTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
R SY D+ G P E S+ +S S+ADPY+ + D ++ LL D
Sbjct: 662 R----SY---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMLLQAD 712
Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
S V+ + + K SC LY DK
Sbjct: 713 DSGDLDEVELDDSTRAGK--WRSCCLYWDKA 741
>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
Length = 1401
Score = 182 bits (463), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 160/560 (28%), Positives = 263/560 (46%), Gaps = 56/560 (10%)
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG---------SRPCWCMVF 939
N R P D+ T + + + I +ISG+ F+ G SR C +
Sbjct: 861 NHVLPRIPPDSDTNISDKEPSNHRPLCILPDISGYSAVFMPGTSASFIFKTSRSC-PHIL 919
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
R R V L D FT + + GFIYV S+ +++ICQLP + YD W ++KV
Sbjct: 920 RLRGGVVRSLSD---FDFT---DPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVA 973
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1059
+ H + Y Y L S + L D E+ + N +S + R
Sbjct: 974 IGEHVDH-LAYSISSETYVLGTS------HSADFKLPEDDELHPEWRNEAISFLPELRQC 1026
Query: 1060 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAY 1118
+ +++ P W + + E + V+ + L + T E + ++ +GTA
Sbjct: 1027 CL-----KVVHPKT----WTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTAL 1077
Query: 1119 VQGEDVAARGRVLLFSTGRNADNPQNLVTE----VYSKEL-KGAISALASL--QGHLLIA 1171
+GED+ ARG + +F + +P+ T+ + KEL KGA++AL+ + QG L+ A
Sbjct: 1078 ARGEDIPARGCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAA 1137
Query: 1172 SGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQG 1227
G K ++ K G+ L +AF D YV L +K ++GD K I+F + E+
Sbjct: 1138 QGQKCMVRGLKEDGSLLP-VAFMDVQ-CYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEP 1195
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+++L KD + A +FL DG L ++V+D N+ + Y P+ S G KLL R+
Sbjct: 1196 YKMSLFGKDLEYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRS 1255
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSD-----KTNRFALLFGTLDGSIGCIAPLDE 1342
+FH+G H T L L T+S +A + +L + GSIG + + E
Sbjct: 1256 KFHMG-HFTSTLTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPE 1314
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
++RRL +LQ +L ++V H GLNPR++R S+G A R ++D LL + + +
Sbjct: 1315 ESYRRLSALQSQLANTVEHPCGLNPRAYRAIESDGTAGR----GMLDGNLLYQWLSMSKQ 1370
Query: 1403 EQLEIAHQTGTTRSQILSNL 1422
++EIA + G +I ++L
Sbjct: 1371 RRMEIAARVGAHEWEIKADL 1390
Score = 119 bits (297), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 167/723 (23%), Positives = 294/723 (40%), Gaps = 127/723 (17%)
Query: 57 NLVVTAANVIEIYV---VRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
NLVV +V++I+ V EG + S D + + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLNVSCSAEGEIIAAKSARP------DQLQSTKLILEREYSLSGTV 81
Query: 114 ESLA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
L + ++ G D +I+LAF +AK+S++E+D +G+ S+H +E +
Sbjct: 82 SDLCRVKLLKTKSGGD------AILLAFRNAKLSLVEWDPERYGISTISIHYYERDDITR 135
Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GD----------- 216
+ + G ++ VDP RC V +G++ + IL Q G LV GD
Sbjct: 136 SPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLVMGDYESDSQKQSHE 194
Query: 217 ---EDTFGS-----GGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
+D+ G+ G SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 195 HEMDDSAGNSKSKEGAVHQTPYASSFVLPLTALDSAILHPVSLAFLYEYREPTFGILYSQ 254
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + ++ + ++ S LP D +K++A+P P+GG L++G
Sbjct: 255 IATSNSLLHERKDAIFYTVFTLDLEQRASTMLLSVTRLPSDLFKVVALPPPVGGALLIGY 314
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + + +S ++ L+ L N LL+ +
Sbjct: 315 NELVHVDQAGKTNAVGVNEFSRQVSTFSMADQSELALRLEGCVVELLGNSSGDLLLALSS 374
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLV 435
G +VL+ DGR V + + + P +I ++G+ F GS +S+L+
Sbjct: 375 GTMVLVHFKLDGRSVSGISI-RPLPGHAGGNILKAAASASASLGSDKVFFGSEDAESVLL 433
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN-- 493
++ S + S + E IE D S D +D LY +A +
Sbjct: 434 GWSLSSSNARKS---RSESKRIEKDHEEGSDDSESEEDVYED-------DLYSAAPDTPA 483
Query: 494 -------TESAQKTFSFAVRDSLVNIGPLKDFSYG-------------------LRINAD 527
S ++ F V D L N PL+D + G L + A
Sbjct: 484 LGHRLSVAPSTFASYKFKVHDVLPNTAPLRDIALGQPAMPVEDTGSHLDNICSELELVAA 543
Query: 528 ASATG-----ISKQSNYELVE----LPGCKGIWT---VYHKSSRGHNADSSRMAAYDDEY 575
+ G + K+ +V+ + G+WT +++ + D + + +E+
Sbjct: 544 YGSNGNGGLVVMKRELEPVVKASLNVGPIHGVWTASIALGSAAKPMSGDQTNI----EEW 599
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFE 628
Y+I++ + +T+ E +++ ++ F +I G L R+RV+QV
Sbjct: 600 RQYVILT-KPQTIDKEESEVFIVDGLNLKPFKAPEFNPNNDISIQVGTLSNRKRVVQVLR 658
Query: 629 RGARILDGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
R D DL P E S+ LS S+ADPY+ + D ++ LL
Sbjct: 659 NEVRSYD-----SDLELAQIYPVWDE--DTSDERMALSASLADPYIAILRDDSTLLLLQA 711
Query: 686 DPS 688
D S
Sbjct: 712 DDS 714
>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
Length = 990
Score = 182 bits (461), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 167/640 (26%), Positives = 276/640 (43%), Gaps = 83/640 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + G+ L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F + LD T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + T
Sbjct: 333 TDATNSVKSLELKFQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------- 384
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA---SNNTESAQKTFSFAVRDS 508
++ AP +L + +A +D E+ LYG +SA++ + V D
Sbjct: 385 -----LDESAPKKIKL-NTELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDK 434
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G ++ +S R SS
Sbjct: 435 LLNVGPCKKITGGCPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSS 494
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 495 SIEGVVQYWAVGRREDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGEL 553
Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
+QV ++ Q + V S SI DPY+ + +
Sbjct: 554 ADGGLAVQVTTSSLVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQN 603
Query: 678 GSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
G + + L P + + P++S ++Y D
Sbjct: 604 GRLLMYELTSHPHVHLKEIDISKRLRHETSPITSLSIYRD 643
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 70/274 (25%), Positives = 123/274 (44%), Gaps = 30/274 (10%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEEGTG 812
E+G + I+ +P + V+ V K +H+ D + D E + S++ T
Sbjct: 733 ENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMTDTF 788
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+ E + ++EL M + RP LF +L D T+ Y+ + + N
Sbjct: 789 AAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGHLAV 839
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGFFLS 929
+ L + V+ R+ RF LD E+ A + + F+ I G F+
Sbjct: 840 RFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFIC 893
Query: 930 GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGST 988
S PC + R+HP DG I++FT +N C +GFIY+T + ++++ +LP+
Sbjct: 894 SSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLPNDMI 953
Query: 989 YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
D +PV++ I + A+ H +TY N Y ++ S
Sbjct: 954 LDTSYPVKR-IDVGASVHSVTYLLHSNTYAVLTS 986
>gi|49619061|gb|AAT68115.1| cleavage and polyadenylation specificity factor 1 [Danio rerio]
Length = 312
Score = 178 bits (452), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 149/253 (58%), Gaps = 18/253 (7%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFR---NDTLADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G S++I++R+LD + ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKFSFLPSYIIDVRELDETLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVP- 294
Query: 339 ALALNNYAVSLDS 351
+ VSL+S
Sbjct: 295 -----PFGVSLNS 302
>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1 [Clonorchis
sinensis]
Length = 1741
Score = 178 bits (452), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 188/755 (24%), Positives = 321/755 (42%), Gaps = 104/755 (13%)
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
D+ ++ + + +G LEI+ +P+F ++ V F +VD A + ++ E+N +
Sbjct: 1007 DKSRYFAFIVFTNGVLEIYSLPDFTLLYEVHHFSDLPAMLVDC---RAGQGNKVEVNLEN 1063
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
++NI V+E+ + + RP L + T I ++A S +
Sbjct: 1064 IPNCPAAEEDNIPP-TVLEITVFPIGRNRDRPVLL-VRTSQEIAFFEALC------PSHN 1115
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET-PHGAPCQRITI--------FK 918
+ S S S + R R L PL A R T P A Q + F+
Sbjct: 1116 EAHPFASESWSQEGL---RWRRLPIP-CPLVAPRRVRTDPKIADVQSTMLTRKNLLRPFE 1171
Query: 919 NISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+I GH G F+ G+ P W +RV DG + +F L+ C GF+Y T
Sbjct: 1172 DIDGHCGVFVCGATPIWLFSSDTGHIRVFNHSIDGIMGSFAPLNTDICPSGFVYFTYSNE 1231
Query: 978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
+++ L G ++ + ++ V PL+ TP+ + Y E Y L+ + +K + V L
Sbjct: 1232 MRLATLLPGYSFKEHLGMRWV-PLELTPYFLQYHIESKTYALVGTR--VKSCSSVYHLNA 1288
Query: 1038 D----QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA---GGPWQT--RATIPMQS 1088
+ +EV + L S+D Y +++ P + PWQ A I +
Sbjct: 1289 EGNKEEEVLLRPPTCVLPSLDY--------YVLQMYAPSTSLAEATPWQAIPHACIDFEP 1340
Query: 1089 SENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1147
E + L + T + LA+G GE++ RGR+++ P +T
Sbjct: 1341 WEVVTCMITAQLSSEQTFHGTKDYLALGANLSYGEEIPVRGRIIILDVIDVVPEPGQPLT 1400
Query: 1148 E-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1202
+Y E KG ++AL+S QGHL+ A G K+ + +L G+AF D+ LY+ SL
Sbjct: 1401 RHKLKTIYDGEQKGPVTALSSCQGHLVSAIGQKVYIWTLKNADLVGVAFVDSE-LYIHSL 1459
Query: 1203 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1262
VKN IL D+ KSI L ++ L+++++D + + + F +DG L +V+DE+
Sbjct: 1460 LCVKNLILAADVLKSIQLLRFQSDLRVLSVVSRDAIPREVYTSNFFVDGRRLGFLVTDER 1519
Query: 1263 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGA------HVTKFLRLQMLATSS-------- 1308
N+ I+ Y P S G++L+ RA+ + V LR +L+ S
Sbjct: 1520 GNVVIYSYDPLEPSSRSGRRLVRRADMCLPTRAISSLRVANRLRHALLSVKSAGTGTQTT 1579
Query: 1309 -------------DRTG-------AAPG------------SDKTN------RFALLFGTL 1330
+RTG APG TN + ++ GT
Sbjct: 1580 VPSAAGVGGSEVLERTGKTGVSSFVAPGRANSASAMTLSTPSATNIDPEKLKHSVYLGTQ 1639
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
G++ I PL + + RL+ +K L+ GL P+ + + + D
Sbjct: 1640 TGAVFLIGPLRDKMYSRLRITEKNLIHHFGPTCGLLPKLCWNYRPSAPELVNPSGQVADA 1699
Query: 1391 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+LL Y LP ++LEIA ++G + I+ ++ +L
Sbjct: 1700 DLLWRYLTLPHSQRLEIAKKSGQSLEGIMDDIAEL 1734
Score = 172 bits (436), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 141/482 (29%), Positives = 225/482 (46%), Gaps = 67/482 (13%)
Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
R DS++L+F +AK++V+ FD + L+ S+H +E + +LK GR F+ P+++VDP
Sbjct: 27 RLDSLLLSFTEAKVAVMGFDPVQYELKTLSLHNYE---FENLKSGRTHFSHLPILRVDPL 83
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDE--------DTFGSGGGFS------ARIESSH 234
RC VLVY + +L + + GD+ +T G S A + ++
Sbjct: 84 QRCAVVLVYDRHLAVLPFRRSEALAAGDKYLAKPVTNNTARGAGSLSWERRATAPLLATF 143
Query: 235 VINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
L + +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 144 TTCLSSSTGEKINNVLDMQFLNGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQ 203
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---V 347
+ +P+IW +LP+D + +VP PIGGVL++ N+I Y Q+ SC L LN YA
Sbjct: 204 KRTNPVIWFQESLPYDCTYVHSVPEPIGGVLILATNSIIYMKQTLPSCGLPLNCYAQVTT 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT--VVYDGRVVQRLDLSK 405
+ Q++P+ + LD + + L+ T+TG + LL+ V + + V L L +
Sbjct: 264 NFPMRQDVPQCG-PLTLDGCRIVTMTDSQFLIVTRTGKMCLLSLWVEHTTQTVSSLLLHE 322
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS------------GTSMLSSGLKEE 453
SV + + F+GSRL DS+L+ T + + + + +
Sbjct: 323 IGCSVPPYSVALLDKGYVFVGSRLCDSVLLHLTASTMFVNTLGRIVDLDETTTADNFRTD 382
Query: 454 FGDIEADA---------PSTKRLRRSSSDALQD----MVNGE------ELSLYGSASNNT 494
IE DA P+ K SS +V+G ++ LYG N
Sbjct: 383 IPMIERDAESIPVDKNNPTEKEAENVSSGTPSKPSGSIVHGPYVFDEVDVELYGDTILNP 442
Query: 495 ESAQK---TFSFAVRDSLVNIGPL-----KDFSYGLRINADASATG-ISKQSNYELVELP 545
S + T+ F V D LVN GP+ + Y N D + I+ Q+ VEL
Sbjct: 443 PSDVRELNTYKFEVADRLVNFGPMGLLTSGEVPYLAPGNTDPTDEALIAAQAEMHHVELL 502
Query: 546 GC 547
C
Sbjct: 503 AC 504
>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
Length = 1323
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 169/647 (26%), Positives = 278/647 (42%), Gaps = 96/647 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAIARISQNPDC---DSLLLGFDDAKLSIVAVNPADRCLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + + L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSTQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F ++ LD A T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + C S L
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA---SNNTESAQKTFSFAVRDS 508
+TK+++ S+ + E+ LYG + ++ + V D
Sbjct: 389 -----------ATKKMKLSTEPNANE--EDEDFELYGEVLPKVAKPDVTEELLNIRVLDK 435
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G + +S R SS
Sbjct: 436 LLNVGPCKKITGGCPSVSAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 496 SIEGVVQYWAVGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554
Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
+QV G +I Y+ L+F V S SI DPY
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTF--------------PVRSASIVDPY 597
Query: 671 VLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ + +G + + L P + + P++S ++Y D
Sbjct: 598 IAICTQNGRLLMYELTNQPHVSLKEIDISKRLRHETSPITSLSIYRD 644
Score = 147 bits (371), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 151/292 (51%), Gaps = 16/292 (5%)
Query: 1146 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1205
+ +Y KE KG +++L S G+LL G K+ + + L GI+F D Y+ L V
Sbjct: 1039 IKTLYDKEQKGPVTSLCSCNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYIHQLIGV 1097
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQK 1263
+N L D+++S+ L ++E+ L+L ++D S A +FLID + ++SDE
Sbjct: 1098 RNLALACDMYRSLALLRYQEEYKALSLASRDMRSDVQPPMAAQFLIDNKQMGFIMSDEAA 1157
Query: 1264 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1323
NI IF Y P+ ES G+KL RAE ++G V F+R++ +S G + F
Sbjct: 1158 NIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISS--------GFVENELF 1209
Query: 1324 AL-----LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+L LF +LDGS G + PL E FRRL LQ+ + V AGLN + R
Sbjct: 1210 SLERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRP 1269
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
H ++VD ++ Y L L E+ ++A + GT+R I+ +L ++ T+
Sbjct: 1270 NHYLNTRNLVDGDVAMQYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTA 1321
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 121/277 (43%), Gaps = 30/277 (10%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
+ E+G + I+ +P + V+ V K +H+ D + D E + S +
Sbjct: 731 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPVTGEGIDAMSGTMT 786
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
T + E + ++EL + + RP LF +L D T+ Y+ + + N
Sbjct: 787 DTFAVKPEEV----IMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQGH 837
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
+ L + V+ R+ RF T D E+ A + + F+ I G
Sbjct: 838 LAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 891
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQLPS 985
F+ S PC + R+HP DG I++FT +N C +GFIY+T + +++ +LPS
Sbjct: 892 FICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLPS 951
Query: 986 GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
D +PV++ I + AT H + Y N Y ++ S
Sbjct: 952 DMILDASYPVKR-INVGATVHSVVYLLHSNTYAVLTS 987
>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1911
Score = 175 bits (444), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/344 (31%), Positives = 185/344 (53%), Gaps = 24/344 (6%)
Query: 1112 LAIGTAYV--QGEDVAARGRVLLFST---------GRNADNPQNLVTEVYSKELKGAISA 1160
+ IGT YV GED + +GR+LL+ G + L + +GAI++
Sbjct: 1569 IVIGTGYVGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITS 1628
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIHKSIY 1219
+ L ++L + G K+I++++ +L G AFYDA +++ SL+++ K +++ D++KS+
Sbjct: 1629 VIQLGMYVLASVGSKMIVYEFKSDQLIGCAFYDAQ-MFITSLSVLRKEYVMYSDVYKSVS 1687
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
FL W+++ QL LLAKD+ L EF I + L+L+ +D ++N+ + YAP ES
Sbjct: 1688 FLRWRQKDRQLILLAKDYEPLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRG 1747
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG-AAPGSDKTNRFALLFGTLDGSIGCIA 1338
GQ+LL ++FHVG ++ LR +++ +S + A G N + + G+ +G I +
Sbjct: 1748 GQRLLRTSDFHVGVQISSILRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALI 1807
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS---------IVD 1389
P+ E FRRL +LQ ++ ++P LNPR FR +NG+ D+ +D
Sbjct: 1808 PVPERVFRRLFTLQNVMISALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLD 1867
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT-SFL 1432
++L + L Q E+A GT I+ NL++L T SFL
Sbjct: 1868 GQVLCRFLHLDYVAQKELARCIGTNPEVIIQNLSELQRNTMSFL 1911
Score = 167 bits (422), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 152/568 (26%), Positives = 253/568 (44%), Gaps = 123/568 (21%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+LD+K + DF F+ GY+EP ++ILHE + +GR + + T ++ LSI+
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
+ HP+IW NLP D ++++ PIGG L++ N I Y +Q+ + LN +A
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501
Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
+ + + LP +S S+ L H +L + LLS
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561
Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
L +L + RV + + T +L S +T N F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621
Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
L SGT ++S SG G D + +A ++L+ S D +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681
Query: 481 GEELSLYG-----SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGLR 523
++ LYG ++ + FSF+ D + IG + G++
Sbjct: 682 EDDAFLYGPTLSQESTGGAMPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGVQ 741
Query: 524 INADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNA 563
N+D++ + G SK + ++ EL GC+ +WTV SS +
Sbjct: 742 SNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPES 801
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+R Y +YLI+S+ RTM+L T + + + + ++ G T+ A NLF +RR+
Sbjct: 802 QVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRRI 855
Query: 624 IQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSEN 658
+QVF++G R++ + + TQ++ F + ESG + N
Sbjct: 856 VQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTAN 914
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGD 686
++SV DPY+LL ++DGSIRLL GD
Sbjct: 915 VGIVSVDTIDPYILLLLTDGSIRLLEGD 942
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 69/317 (21%), Positives = 127/317 (40%), Gaps = 65/317 (20%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
+CY G+L ++ VP+F + +++T E+ + S +T + S+ G
Sbjct: 1085 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1144
Query: 811 ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
T +GR H++ V ++A+ R H+S+ F L L +G +
Sbjct: 1145 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1204
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
+ Y+ P S D + + V+ +R ++ + +A T +E G+
Sbjct: 1205 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1260
Query: 910 P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
+T F N++ + G F G+ P W + + + L +
Sbjct: 1261 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1320
Query: 949 LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
D + +++FT H+ NC +GF+Y S G L++C+LPS N + +
Sbjct: 1321 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1380
Query: 996 QKVIPLKATPHQITYFA 1012
QKV AT H + Y
Sbjct: 1381 QKV-RFGATIHHLLYLG 1396
>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1912
Score = 175 bits (444), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/344 (31%), Positives = 185/344 (53%), Gaps = 24/344 (6%)
Query: 1112 LAIGTAYV--QGEDVAARGRVLLFST---------GRNADNPQNLVTEVYSKELKGAISA 1160
+ IGT YV GED + +GR+LL+ G + L + +GAI++
Sbjct: 1570 IVIGTGYVGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITS 1629
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIHKSIY 1219
+ L ++L + G K+I++++ +L G AFYDA +++ SL+++ K +++ D++KS+
Sbjct: 1630 VIQLGMYVLASVGSKMIVYEFKSDQLIGCAFYDAQ-MFITSLSVLRKEYVMYSDVYKSVS 1688
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
FL W+++ QL LLAKD+ L EF I + L+L+ +D ++N+ + YAP ES
Sbjct: 1689 FLRWRQKDRQLILLAKDYEPLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRG 1748
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG-AAPGSDKTNRFALLFGTLDGSIGCIA 1338
GQ+LL ++FHVG ++ LR +++ +S + A G N + + G+ +G I +
Sbjct: 1749 GQRLLRTSDFHVGVQISSILRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALI 1808
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS---------IVD 1389
P+ E FRRL +LQ ++ ++P LNPR FR +NG+ D+ +D
Sbjct: 1809 PVPERVFRRLFTLQNVMISALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLD 1868
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT-SFL 1432
++L + L Q E+A GT I+ NL++L T SFL
Sbjct: 1869 GQVLCRFLHLDYVAQKELARCIGTNPEVIIQNLSELQRNTMSFL 1912
Score = 166 bits (420), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 153/569 (26%), Positives = 252/569 (44%), Gaps = 124/569 (21%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+LD+K + DF F+ GY+EP ++ILHE + +GR + + T ++ LSI+
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
+ HP+IW NLP D ++++ PIGG L++ N I Y +Q+ + LN +A
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501
Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
+ + + LP +S S+ L H +L + LLS
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561
Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
L +L + RV + + T +L S +T N F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621
Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
L SGT ++S SG G D + +A ++L+ S D +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681
Query: 481 GEELSLYG------SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGL 522
++ LYG S + FSF+ D + IG + G+
Sbjct: 682 EDDAFLYGPTLSQESTGGGKLPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGV 741
Query: 523 RINADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHN 562
+ N+D++ + G SK + ++ EL GC+ +WTV SS
Sbjct: 742 QSNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPE 801
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
+ +R Y +YLI+S+ RTM+L T + + + + ++ G T+ A NLF +RR
Sbjct: 802 SQVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRR 855
Query: 623 VIQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSE 657
++QVF++G R++ + + TQ++ F + ESG +
Sbjct: 856 IVQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTA 914
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
N ++SV DPY+LL ++DGSIRLL GD
Sbjct: 915 NVGIVSVDTIDPYILLLLTDGSIRLLEGD 943
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 69/317 (21%), Positives = 127/317 (40%), Gaps = 65/317 (20%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
+CY G+L ++ VP+F + +++T E+ + S +T + S+ G
Sbjct: 1086 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1145
Query: 811 ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
T +GR H++ V ++A+ R H+S+ F L L +G +
Sbjct: 1146 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1205
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
+ Y+ P S D + + V+ +R ++ + +A T +E G+
Sbjct: 1206 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1261
Query: 910 P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
+T F N++ + G F G+ P W + + + L +
Sbjct: 1262 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1321
Query: 949 LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
D + +++FT H+ NC +GF+Y S G L++C+LPS N + +
Sbjct: 1322 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1381
Query: 996 QKVIPLKATPHQITYFA 1012
QKV AT H + Y
Sbjct: 1382 QKV-RFGATIHHLLYLG 1397
>gi|422295485|gb|EKU22784.1| cleavage and polyadenylation specificity factor subunit 1
[Nannochloropsis gaditana CCMP526]
Length = 395
Score = 174 bits (441), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 108/334 (32%), Positives = 184/334 (55%), Gaps = 22/334 (6%)
Query: 1109 ETLLAIGTAYV--QGEDVAARGRVLLFSTGRNA-----DNPQNLVTEVYSKELKGAISAL 1161
+T LA+GT V +GEDV ++GR+L++ + P + + YS+ G +A+
Sbjct: 68 DTYLAVGTCTVRAKGEDVPSKGRLLMYRISLDPYAGLTSPPTLTLVDQYSQR-SGPPTAI 126
Query: 1162 ASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
A L H++IA+GP + ++ ++ E L IAFYDA YVVSL +VK + + D + S++
Sbjct: 127 AQLGPHIIIAAGPTLWVYAFSAREKLKPIAFYDAD-FYVVSLRVVKTLVAVTDAYHSVHL 185
Query: 1221 LSWKEQGAQ--LNLLAKDFG---SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
L W E L L+ KD+ S + F++D +L ++V D + N+Q+ Y P
Sbjct: 186 LRWHEHDPAHTLELMGKDYSPIVSAQPGGSHFVVDPPSLGMLVGDSRGNLQLLQYDPADV 245
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
ES G +L+ RA+FH+ +H FL+ +A PG+ + ++FG+++G +G
Sbjct: 246 ESRGGNRLVRRADFHL-SHRLSFLQHTRMAEVPR-----PGAYRAGVRVMVFGSVEGGVG 299
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ P++E +RRL +LQ +V+++PHV NPR FR + G A +D ELL
Sbjct: 300 ALVPVEEKVYRRLYALQAVMVNALPHVGAFNPRGFRLVEARGWAQG-RKKGTLDGELLWR 358
Query: 1396 YEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1429
+ L + +Q ++A GT+R +L +L ++ + T
Sbjct: 359 FAGLSVGKQEDLASAIGTSREMVLESLLEVDMMT 392
>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
Length = 1383
Score = 174 bits (441), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 133/531 (25%), Positives = 243/531 (45%), Gaps = 56/531 (10%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
++ G+ +SG+ P + + L + I + + C G +YV ++
Sbjct: 873 LSDLGGYAAVVMSGASPNLIVRTSKSLPHVYSIQSDFIRGISGFNGAGCKKGLVYVDNER 932
Query: 977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
+++ CQL + + D WP+++ IPL + Y Y + +
Sbjct: 933 LVRTCQLYNNAQLDFSWPIRR-IPLNEQVDHLAYSTASGTYVVGTT-------------- 977
Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYE---------VRILEPDRAGGPWQTRATIPMQ 1087
H+ D +LH + EE ++++ P W+ +
Sbjct: 978 ------HEQDFKLPDDDELHPEWATEEISLLPKVAYGSIKLINPKT----WKVIDSYTFS 1027
Query: 1088 SSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ--- 1143
+E V + L + T + + ++ +GT Y +GED+AARG V +F +P
Sbjct: 1028 PAERITAVENINLEISEKTGKRKDMIVVGTTYAKGEDIAARGNVYVFDVIDVVPDPDEPG 1087
Query: 1144 -NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL 1197
NL ++ +E ++GA++A++ + QG +++A G K ++ K G+ L +AF D
Sbjct: 1088 TNLKLKLIGEESIRGAVTAVSGIGGQGFMIVAQGQKCMVRGLKDDGSLLP-VAFIDVQ-C 1145
Query: 1198 YVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
YV + +K L+GD K ++F + E+ ++ L KD L+ +FL DG L
Sbjct: 1146 YVSVIKELKGTGMCLIGDAFKGLWFTGYSEEPYKMTLFGKDLDELEVVTADFLPDGKKLY 1205
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1315
++V+D N+ + Y P+ +S G +LL+R +FH+G + L A SS+ +
Sbjct: 1206 ILVADGDCNLYVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTLTLLPRTAVSSELAVMSS 1265
Query: 1316 GSDKTNRFALLFGTL----DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
S + + L+ L GS+ I L E ++RRL +LQ +L +++ H GLNPR++R
Sbjct: 1266 DSMDIDSYTPLYQALITTQSGSMALITSLSEESYRRLTALQSQLSNTLEHPCGLNPRAYR 1325
Query: 1372 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
S+G R ++D +LL + L +LEIA + G +I ++L
Sbjct: 1326 SVESDGVVGR----GMIDGKLLMRWLDLSRSRKLEIAGRVGADEWEIRADL 1372
Score = 121 bits (304), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 181/751 (24%), Positives = 305/751 (40%), Gaps = 137/751 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++IY + + + +N + V + A L L Y L+G V +
Sbjct: 28 NLIVVKTSLLQIYTLVAETSTTLILENDQQADDDVKNE---ATKLHLHAEYDLYGTVTDI 84
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ + S+ G D +++L+F +AK+S++E++ G+ S+H +E
Sbjct: 85 SPVKILKSRSGGD------ALLLSFRNAKLSLIEWNPETQGISTMSIHYYE--------- 129
Query: 173 GRESFARGPLV----------KVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE---- 217
+E P V VDP RC +L +G++ I IL Q G LV DE
Sbjct: 130 -KEDITLSPWVPDLSQCDSHLTVDPSSRCA-LLNFGVRNIAILPFHQAGDDLVMDEYDPD 187
Query: 218 ------------------DTFGSGGGF--SARIESSHVINLRDLD--MKHVKDFIFVHGY 255
D+ + G +S V+ L LD + H F+H Y
Sbjct: 188 LDMDDLTDQEENKKPSHTDSKKAEGDLIHQTPYAASFVLPLTALDPTLIHPIGLTFLHEY 247
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ T A + + + S ++ + + S LP D ++A+P
Sbjct: 248 REPTFGILYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALP 307
Query: 316 SPIGGVLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
+P+GG L++G+N IH + A+A+N +A + + + +S + L+ + + N
Sbjct: 308 APVGGTLLIGSNEMIHIDQSGKASAVAVNEFAKQVSAFPMVDQSDLELRLEGSVVEVINN 367
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT--------IGNSLFF 424
+ LL+ TG+LVL+ DGR V + P+V D+ + +G+ F
Sbjct: 368 ESGDILLTLSTGELVLVHFKIDGRSVSGFVVFPI-PAVSGGDVVSAVASCAVALGSGKVF 426
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGE 482
+GS +S+L+ S S S + D E + + S A ++ VN
Sbjct: 427 IGSEDAESVLLDCYLPSAVSKKSRDYDRDHFDEEMNNEEDDDMYEDDLYSSAPKEAVN-- 484
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
+ G S+N ++F V D L+++GPL+ + G + D++A Q + + +
Sbjct: 485 KTVSNGRISDN-------YTFKVIDRLLSLGPLRAVAVGKPASRDSNAE--DAQQSVDDL 535
Query: 543 ELPGCKG-----------------------------IWTVYHKSSR-GHNADSSRMAAYD 572
EL G +W + +++ GHN DS
Sbjct: 536 ELAAAYGSGRGGGVALLQRTLHLDDVFTLGAESADSVWNITTSNTKSGHN-DSG------ 588
Query: 573 DEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQV 626
+E +Y+I++ T+V + E + D G TI L G RV+QV
Sbjct: 589 EENQSYVILTKANSPENEETLVYAVNERNLEPFNAPDVNPNGDPTIDIDVLAGNSRVVQV 648
Query: 627 FERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
RI D + M Q P E G E V S S AD Y+L+ D S+ LL
Sbjct: 649 LTGEVRIYDTNLGMAQ---IYPVWDED-EGDERFAV-SASFADHYLLIIRDDSSVLLLHS 703
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
D S + P + S +P LY D+
Sbjct: 704 DESGDLDELTKPETV--SSQPWLCGCLYTDR 732
>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
Length = 1259
Score = 174 bits (440), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 110/322 (34%), Positives = 176/322 (54%), Gaps = 21/322 (6%)
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGR----NADNPQNL-VTEVYSKELKGAISALASLQ 1165
+ +GT +GEDVA +G + LF + D+ N + + +E KGA+SA+ S
Sbjct: 950 FIGVGTCINRGEDVAVKGAMYLFEIAELIPSSKDSGNNYKLKMLMREETKGAVSAITSCS 1009
Query: 1166 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
G+ ++A G K+++ E L +AFYDA Y+VSL ++KNFIL+GD KSI FL+++
Sbjct: 1010 GYFVVAVGQKVLIRALEINERLISVAFYDAG-TYIVSLEVLKNFILVGDQVKSITFLAFQ 1068
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
E +L L++D ++ + FL +S V +D Q ++++ Y P + G+KL+
Sbjct: 1069 ESPYKLVQLSRDARQIETCVSNFLAHEDQISFVSNDIQGDLRLIDYNPFDPTAEGGEKLI 1128
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
EFH G+ T L L + P S+ LL G +DGS+ C++P+DE+T
Sbjct: 1129 RTTEFHKGSEATCSLLLP-------KPSVRPSSE------LLLGCVDGSLSCLSPVDEIT 1175
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
F+ L LQ LV +PH+A LNPR+ R N R I+D LLS Y+ + Q
Sbjct: 1176 FKALWLLQGALVRQIPHIAALNPRAHRHVR-NDYVSRSLSKGILDGLLLSAYQTIDHATQ 1234
Query: 1405 LEIAHQTGTTRSQILSNLNDLA 1426
+EIA + G +++++L L + +
Sbjct: 1235 VEIAKRIGYSKAELLGYLRNFS 1256
Score = 136 bits (343), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 164/666 (24%), Positives = 278/666 (41%), Gaps = 103/666 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV--- 113
N+V TA N ++IY + + +A L L Y+LHG +
Sbjct: 30 NIVTTANNTLKIYEIDIDS-------------------NTPSAKLILRREYQLHGEIIGI 70
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
+S+ ILS +D +++AF DAKI++LE+ D I+ + S+H +E + + + +
Sbjct: 71 QSIKILST----TEDGKDRLLIAFRDAKIALLEWSDEINDIVTVSIHTYERSQQV-ISQD 125
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
F +++ DP+ RC +L+ + IL + L D D S S
Sbjct: 126 MSRFK--AILRSDPENRCSALLLPDDSLAILPVHSAHAEL-EDLDQDVSNAIKDVPYAPS 182
Query: 234 HVINLR--DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++ L+ D D+ +V D+ F+ G+ P + +L E TW GR+S TC + L++
Sbjct: 183 FILPLKSIDSDICNVIDYTFLPGFHNPTLAVLCEPRQTWTGRLSDSQDTCQVFFLTLDLV 242
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
+ +P+I + NLP+D+ L A P IGGV ++ AN IH A N +A +L
Sbjct: 243 TQVYPIIATVDNLPYDSMSLKAAPKEIGGVAILSANAIIHVDQNGRPVGRATNGWA-TLT 301
Query: 351 SSQEL--PRSSFSVELDAAHATWLQ------NDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
S++ P V L+ A +LQ + ALL G++ + +GR + R+D
Sbjct: 302 SARNFDAPPKDLFVRLEGASIEFLQPKSKQTHPQALLFLPNGEIHAVQFYREGRTISRID 361
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
+SK I + L G L V GTS L + + D+E
Sbjct: 362 ISK---PFAKGSIPSGAYRLDIDGQGLSGGQFVFIPSMVGTSFLIR-VGKSLNDLEL--- 414
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS------FAVRDSLVNIGPLK 516
+ + + A DM + LYGS+ + ++ F + D + + GP++
Sbjct: 415 -FPKQEKVGTTAYDDMDVDVDEELYGSSDKKADEKEEEEEISSEPPFTICDYIESYGPIQ 473
Query: 517 DFSYG---------LRINADASA------TGISKQSNYE---LVELPGCKGIWTVYHKSS 558
D + G L+I A A T ++ +E +++ G G+WT ++ +
Sbjct: 474 DITIGRYMQTRNSPLQILAATGAGHVGGITAFHQEVPFESKHKLDVQGNHGLWT-FNVTG 532
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
G+ + A D + + L + + L D EV TIAA
Sbjct: 533 VGN-----VLVATDSKSKTKISKLLPSNEVALIAED--NEV-----------TIAADTAA 574
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
R++ + ++L + Q EN V SI+DPY+L S+G
Sbjct: 575 NSTRILMITSNAIKVLKEDGIEQ----------QSLQIENGEVQRASISDPYILTLQSNG 624
Query: 679 SIRLLV 684
SI L +
Sbjct: 625 SISLFI 630
>gi|291232724|ref|XP_002736306.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 304
Score = 173 bits (439), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 175/354 (49%), Gaps = 62/354 (17%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTGI +C G EE NL++
Sbjct: 2 YALYRQIHPPTGIEHCVYGH-------------FFSKEE--------------KNLIIAG 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
A + +Y + + + SK+ K+ E R + L GN+ SL
Sbjct: 35 ATDLHVYRL-LSDVDSKQKKSKLEHLRS----------------FSLFGNIMSLQTTRLA 77
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
GA RD+++L+F+DAK+SV+E+D H L+ S+H FE LK G S P
Sbjct: 78 GAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEEA---LKEGYVSNYYIPQ 130
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
V VDP RC +L+YG ++++L + G+ D+D G S+ + S++INL+D+D
Sbjct: 131 VVVDPDNRCAVMLMYGSKLVVLPFRREGAA--EDQDGVLPGSSKSSFL-PSYIINLQDID 187
Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
K ++ D F+HGY EP + IL E TW GRV+ + TC I A+S++ + HP+IWS
Sbjct: 188 QKLINIIDIKFLHGYYEPTLFILFEPLRTWPGRVAVRKDTCCIVAISLNIEQRVHPVIWS 247
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
NLP D K + VP PIGGVLV +++ Y +QS Y VSL+ E
Sbjct: 248 LNNLPFDCIKAIPVPKPIGGVLVFAVDSLLYLNQSVP------PYGVSLNGLTE 295
>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_b [Rattus norvegicus]
Length = 241
Score = 173 bits (438), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 139/239 (58%), Gaps = 13/239 (5%)
Query: 1189 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1248
+AF D LY+ + VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F+
Sbjct: 1 MAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFM 59
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
+D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R +
Sbjct: 60 VDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------T 112
Query: 1309 DRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1363
GAA G K N+ F TLDG IG + P+ E T+RRL LQ L +PH A
Sbjct: 113 PCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHA 172
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
GLNPR+FR H + + + +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 173 GLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 231
>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1627
Score = 173 bits (438), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 100/287 (34%), Positives = 166/287 (57%), Gaps = 6/287 (2%)
Query: 1143 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1202
Q + +Y K+ KG ++++A L G L+++ GPK+I++ ++ L G+AFYD +++VSL
Sbjct: 1336 QKRLNLLYEKDQKGPVTSIAGLNGLLIMSIGPKMIVNNFSSGSLIGLAFYDTQ-IFIVSL 1394
Query: 1203 NIVKNFILLGDIHKSIYFLSWKE---QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
N VKN+IL+GD+ KSI F K Q + LL KD+ + ++++F++D LS+V+S
Sbjct: 1395 NTVKNYILVGDMFKSISFFKLKVCIIQKKNIILLGKDYEEVSTYSSDFIVDEKKLSMVLS 1454
Query: 1260 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1319
D +NI++F + P ES GQ LL+++ FH+G KF+R+ M T+ D ++
Sbjct: 1455 DANRNIRMFSFDPSDPESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYDNNSSSSSIIV 1514
Query: 1320 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ-FHSNGK 1378
++ L +GTL G I + P+++ L +L+ KL+ AGLNPR FR H N
Sbjct: 1515 NDKHLLFYGTLGGGINLLMPINKRFHEILHALETKLMHR-GQTAGLNPRGFRYGHHVNNT 1573
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+VD +LL+ ++ L ++ ++A G+T IL LN L
Sbjct: 1574 LGHLHNQYVVDGDLLTKFQSLSPDDAKQLATSIGSTTPIILDLLNQL 1620
Score = 110 bits (275), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 114/243 (46%), Gaps = 38/243 (15%)
Query: 912 QRITIFKNISGHQGFFLSG-SRPCWCMVFRERLRVHPQ---------------LCDGSIV 955
+RI F NI +G F+SG S P W + R+HP I
Sbjct: 1018 RRIIPFSNIGNKRGIFVSGVSTPIWIFSEKNFPRIHPMKQQQQTTSSSSSSSSSSKRPIT 1077
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
FT HN+NC HGFIY G+L IC+LP G+ Y+N WP++K + ++ T H+I+Y +
Sbjct: 1078 TFTTFHNINCKHGFIYFDHTGMLCICRLPDGTNYENEWPIRK-LAIRMTCHKISYHPVQK 1136
Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRA 1074
Y L++S P ++ ++E+ L + +EE Y++++++P
Sbjct: 1137 CYVLVLSYPQAPQSDEDEQEEQEREL-------------LKKPLVLEEKYQLKLIDP--- 1180
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN----ETLLAIGTAYVQGEDVAARGRV 1130
W + + E L +++ L + + + + +GTAY GED +GR+
Sbjct: 1181 ANNWNIIDSFSLAEKETVLCSKIIYLRHADESDIIPKLKPFVIVGTAYTHGEDTVCKGRI 1240
Query: 1131 LLF 1133
L+F
Sbjct: 1241 LIF 1243
>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
[Schistosoma mansoni]
Length = 1825
Score = 172 bits (437), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 182/770 (23%), Positives = 312/770 (40%), Gaps = 126/770 (16%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ + + +G LEI+ +P+F ++ V F ++D + + ++ +
Sbjct: 1086 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1142
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
++I ++E+ + RP L + T I ++A L P+
Sbjct: 1143 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1190
Query: 873 TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
S L RLR R PL A R T P Q + F+NI
Sbjct: 1191 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1249
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
H+G F+ G P W +LRV P DG + +F L+ C+ GF+Y T +++
Sbjct: 1250 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1309
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
LP G +++ + + K I L P+ + Y E Y +V + +P V L +
Sbjct: 1310 TLPPGYSFNEHLGI-KWITLDPVPYYVQYHVESKTY-AVVGIHS-EPCKSVFRLNAEGNK 1366
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG------PWQTRATIPMQSSENALTV 1095
+ + V T++ Y +++ P+ PW IP E
Sbjct: 1367 EEDVLVRPKTCV----LPTLDYYSLQMYAPNLNANHRNKQPPW---LLIPNTLIEFEPWE 1419
Query: 1096 RVVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1148
V L ET LA+G GE++ RGR+L+ P +T
Sbjct: 1420 VVTCLITAQLASEETFHGTKDYLALGANLTYGEEIPVRGRILILDVIDVVPEPGQPLTRH 1479
Query: 1149 ----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1204
++ E KG ++AL S QGHL+ A G KI + T+L G+AF D+ LY+ +L
Sbjct: 1480 KLKIIHDGEQKGPVTALTSCQGHLISAIGQKIYIWTLKNTDLVGVAFVDSE-LYIHNLLC 1538
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
VKN +L D+ KS+ L ++ L+++++D S + + + F +DG L +VSDE N
Sbjct: 1539 VKNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNISREVYTSNFFVDGRRLGFMVSDELGN 1598
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT------KFLRLQMLATSSDRTGAAPG-- 1316
+ I+ Y P S G++L+ A+ + + T LR +L+ T A
Sbjct: 1599 VTIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRVANRLRHALLSVKPSSTTTASAMT 1658
Query: 1317 -------SDKTN--------------------------------------------RFAL 1325
D TN R ++
Sbjct: 1659 AGTSATIQDSTNTVLDNLSRVDSVNQMNNLRQSQQQSTAAQQGTTNPNSGVDPEKFRQSI 1718
Query: 1326 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1385
FG+ +GSI I P+ + + RL+ +K L+ + + G+ P+S + +RP P+
Sbjct: 1719 YFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSCWSY------NRPQPE 1772
Query: 1386 ------SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1429
+ D +L+ Y LP ++LEIA ++G + I+ ++ +L T
Sbjct: 1773 LANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATT 1822
Score = 156 bits (394), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 146/586 (24%), Positives = 252/586 (43%), Gaps = 123/586 (20%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +K + PT + NC +TH + + NLV+T
Sbjct: 15 AVFKHISPPTAVDNCLYCHLTHPK---------------------------LKNLVITRG 47
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
IEIY V+ S SGET+ V ++ N+ + + G
Sbjct: 48 GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 87
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
S++L+F +AK++V+ F+ LR S+H +E + +LK GR +F + P++
Sbjct: 88 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 140
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
++DP RC +LVY + +L + + + D + + + R + +
Sbjct: 141 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 200
Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+ +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 201 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 260
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
+ +P+IW +LP D +++VP PIGGV+V+ AN+I Y Q+ SC L LN YA +
Sbjct: 261 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 320
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
Q++P S + +D L L+ T++G+L LL++ + + V L K
Sbjct: 321 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 379
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
+V + + + F+GSR DS+L++ + G+ ++
Sbjct: 380 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 439
Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
S L + +E D + STKR +D + D
Sbjct: 440 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 499
Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
+ ++ LYG + + S + +SF V D L+N+GP+ + G
Sbjct: 500 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 545
>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum Pd1]
gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum PHI26]
Length = 1271
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 135/468 (28%), Positives = 235/468 (50%), Gaps = 44/468 (9%)
Query: 978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
++ CQLPS + +D W ++KV P++ + + Y Y L S L
Sbjct: 822 IRACQLPSQTQFDYSWTLRKV-PIEEQVNFLAYSTSSETYVLGTS------RQGDFKLPE 874
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
E+ + N LS + E ++++ P W + P+ E V+
Sbjct: 875 GDELHPEWRNEELSFCP-----KIPESSIKVVSPKT----WTIIDSYPLDPDEQVTAVKN 925
Query: 1098 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT----EVYSK 1152
V + + T E L+ +GTA +GED+ ARG + +F + A +P+ T ++ K
Sbjct: 926 VNIEVSENTHERMDLIVVGTAIAKGEDMPARGTIYVFDVIKVAPDPERPETGRKLKLIGK 985
Query: 1153 E-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN 1207
E +KGA++AL+ + QG +++A G K ++ K G+ L +AF D YV N+VK
Sbjct: 986 ETVKGAVTALSGIGGQGFIIVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV---NVVKE 1040
Query: 1208 F-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1262
++LGD K ++F + E+ ++ L KD L+ A +FL DG+ L ++V+D
Sbjct: 1041 LKGTGMVILGDAVKGLWFAGYSEEPYRMTLFGKDPEYLEVVAADFLPDGNKLYMLVADSD 1100
Query: 1263 KNIQIFYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDK 1319
N+ + Y P+ +S G +LLSR++F+ G + VT R + + ++ + A D+
Sbjct: 1101 CNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLLPRTAVSSELTESSEEAMDVDE 1160
Query: 1320 T-NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
T ++ +L + +GS+ + + E ++RRL LQ +L+++V H AGLN R+FR S+G
Sbjct: 1161 TFAKYQVLIASQNGSLALVTSVAEESYRRLSGLQSQLINTVDHPAGLNARAFRATESDGA 1220
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
A R +VD LL + + + Q EIA + G T +I ++L +
Sbjct: 1221 AGR----GMVDGNLLRLWLNMGKQRQAEIAGRVGATEWEIKADLETIG 1264
Score = 114 bits (286), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 175/758 (23%), Positives = 301/758 (39%), Gaps = 153/758 (20%)
Query: 57 NLVVTAANVIEIYVV------RVQEEGSK---ESKNSGETKRRVLMDGISAASLELVCHY 107
NL+V ++++I+ + ++Q+EGS+ + ETK L L Y
Sbjct: 28 NLIVIRTSLLQIFSLVKIVSSQLQKEGSEPHGSQFSQPETK------------LVLEKEY 75
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
L G V L+ + +N ++I++A +AK+S++E+D HG+ S+H +E +
Sbjct: 76 PLSGTVTDLSRVKI--LNNKSGGEAILIAVRNAKLSLIEWDPERHGISTISIHYYERDDL 133
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDE 217
+ G ++ VDP RC V +G++ + IL Q G LV G+
Sbjct: 134 TRSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELEGER 192
Query: 218 DTFGSGGGFSARIE-----------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
SGGG + SS V+ L LD + H F++ Y EP IL
Sbjct: 193 PIQNSGGGAEPKKSKEGPAYQTPYCSSFVLPLTALDPSLLHPISLAFLYEYREPTFGILF 252
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
+ T + + + ++ + + S LP D +K++A+P P+GG L++
Sbjct: 253 SQVATSTALLYERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALLL 312
Query: 325 GANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G+N I H + A+ +N ++ + S +S + L+ L D LL+
Sbjct: 313 GSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLAL 372
Query: 382 KTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+GD+ L+ DGR V + D+ K+ S + +G+ F+GS
Sbjct: 373 ASGDMALIKFKLDGRSVSGITIHLLPAHAGGDMLKSAASC----SSCLGDGNVFIGSEDA 428
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEE 483
DS+L++++ S STK+ R S D E+
Sbjct: 429 DSVLLEWSRSSA--------------------STKKARLESKQTADGFDDLEDDDDQMED 468
Query: 484 LSLYGSASNNTESAQKT---------FSFAVRDSLVNIGPLKDFSYGLRIN---ADASAT 531
LY SA +T+ + ++F ++D L +IGPL+D + G + + AT
Sbjct: 469 DDLYSSAPGSTQVDNRMGTENLTTEFYNFRLKDCLPSIGPLRDITLGKVFSNTYREKQAT 528
Query: 532 GISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADS 565
+ + ELV G G+W+ K RG
Sbjct: 529 CEAVSAELELVASQGSDRGGGLVVIKREIDPLTTMSLKIDDADGVWSASVKKRRG----- 583
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLF 618
++ D+ Y+++S + E ++ +++ F + T+ G+
Sbjct: 584 --ASSTDNPSRQYVVVSRSTDSEQ-ELNEVFVAEEQNLKPFRAPEFNPNEDCTVDIGSFA 640
Query: 619 GRRRVIQVFERGARILDGSYMTQDLS-FGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
G R++QV R D M LS P E S+ +S S DPY+++ D
Sbjct: 641 GDTRLVQVLRNEVRSYD---MELGLSQIYPVWDE--DTSDERVAVSASFIDPYLMIIRDD 695
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S+ LL D + V I SS+ S LY+D
Sbjct: 696 SSVLLLQADENGDLDEVPLSTLIISSR--WRSGCLYYD 731
>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana RWD-64-598
SS2]
Length = 1490
Score = 169 bits (428), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 177/350 (50%), Gaps = 19/350 (5%)
Query: 1078 WQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
W T +E V VVTL +T+ ++ +A+GT +GED+A RG +F
Sbjct: 1142 WITLDGYEFAPNEFVNAVEVVTLETLSTETGSKEFVAVGTTINRGEDLAVRGATYIFEVV 1201
Query: 1137 RNADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGI 1189
+P + + Y ++ KG ++AL + G+L+ + G KI + + E L G+
Sbjct: 1202 EVVPDPSSKLDRWYKLKMRVRDDAKGPVTALCGINGYLVSSMGQKIFIRAFDLDERLVGV 1261
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF DA +YV SL +KN +L+GD KS++ ++++E +L +L+KD + +F
Sbjct: 1262 AFLDAG-VYVTSLKALKNLLLIGDAVKSVWLVAFQEDPYKLVILSKDIRRQYAASVDFFF 1320
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
LS+V DE+ ++ + Y P ES GQ+LL EFH + L + +
Sbjct: 1321 ANGELSIVTEDEEGVLRAYEYDPNDPESRSGQQLLCHTEFHGHKECSTTLTIARRTKTEH 1380
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
A L+ G DGS+ + P+DE F+RLQ LQ +L +V H+AGLNPR+
Sbjct: 1381 EIPQA---------KLISGFGDGSLSALTPVDEAAFKRLQLLQGQLTRNVQHIAGLNPRA 1431
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
FR N +P I+D +LLS +E + Q E+ Q GT R+ IL
Sbjct: 1432 FRIVR-NETVSKPLSKGILDGQLLSSFEAQGITRQGEMTRQIGTERTTIL 1480
Score = 121 bits (303), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 196/913 (21%), Positives = 349/913 (38%), Gaps = 130/913 (14%)
Query: 57 NLVVTAANVIEIYVVR-------VQEEGSKESKN---------SGETKRRVLMDGI---- 96
NLV +N+I IY VR Q E KE K+ GE + DG
Sbjct: 40 NLVTARSNIIRIYEVREDAASLSSQVEAEKERKSHVRKGTEAVEGEVEMDTGGDGWVNMG 99
Query: 97 ----------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLE 146
+ V + +HG V + + + + N R D ++++F+DAKI++LE
Sbjct: 100 SVKSTSSGPPTVTRFHFVREHVVHGIVTGMDCI-RTISSNEDRMDRLLVSFKDAKIALLE 158
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+ D+ H L S+H +E E L R L +VDP RC + + + IL
Sbjct: 159 WSDAAHDLITVSIHTYERSE--QLMSIDAPLFRSSL-RVDPLSRCAALSLPNNALAILPF 215
Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVIL 263
Q + E + G S +++L D + +V DF F+ G+ P + +L
Sbjct: 216 YQTQAEFDVIEGEGETEGMRDVPYSPSFILDLPVDVDSSLCNVIDFAFLPGFNNPTLAVL 275
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP------ 317
+ E TWAGR+ T ++ ++ P++ + LP DA+ L P
Sbjct: 276 CQSEQTWAGRLKEHRDTTLVVTFTLDLLSCTFPILSTLRGLPSDAFSLSPATLPPDFTSG 335
Query: 318 -------IGGVLVVGANTIHYHSQSASCA-------------LALNNYAVSLDSSQELPR 357
GV+V+ + + Y Q A C L+++N ++ ++++
Sbjct: 336 LSGGASNAHGVVVLTPDAVLYADQ-ARCVGAAVSGWATRTSDLSISNAYLTGGTAKDAEG 394
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDIT 416
+ L+ A L LL ++G++ ++ +V +GR V R+D+ +V+ + +
Sbjct: 395 DVKPLALEGAFPLLLTPTALLLVLRSGEMHVVRLVTEGRSVGRVDVGPCVGQTVMPATVV 454
Query: 417 TIGNSLFFLGSRLGDS--------LLVQFTCGSGTSMLSSGLKEEFGDIEADA--PSTKR 466
+ LG G+ + V G T +LS+ EE + S
Sbjct: 455 RVKAPQRALGQGQGEGEKAKERRMVFVGSIVGPAT-LLSAERVEETAAANGNGVNGSGAN 513
Query: 467 LRRSSSDALQDM--VNGEELSLYGSASNNTE----SAQKTFSFAVRDSLVNIGPLKDFSY 520
+ DA +M ++ LYG + ++ SA++ FA D++ GP+ D ++
Sbjct: 514 GHVENKDAGMEMDVDLDDDDDLYGPTTLTSQPSSGSAEEALRFAFCDAIPAHGPILDMAF 573
Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTVYHKSS-RG 560
L D ++TG + L + L G +GIW++ K S RG
Sbjct: 574 ALGKWGDRYVPELVASTGAEHLGGFTLFQRDLPIRTKRKLHVLGGARGIWSISVKQSPRG 633
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLF 618
A S+ + + ++IS +A V A T ++ + G T+ AG F
Sbjct: 634 SAASSAGAGPNPELANDTVVISTDANPSPGVSRIATRSTRTDLAIPTRIPGTTVGAGPFF 693
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
GR ++ V R+L+ D + S ++ + + SI DP VL+ D
Sbjct: 694 GRTAILHVMTNSIRVLE-----PDGTERQSIKDTDGNMPRAKIRWCSICDPVVLIIREDD 748
Query: 679 SIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKG-----PEPWLRKTSTDAWLS 732
++ L +G+P + + + + E S + ++ C G +P S+
Sbjct: 749 TLGLFIGEPERGRIRRKDMSPMGEKSSRYIAGCFFADTSGLFEAFMDPKAAAASSKGDKD 808
Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
G + + + + V+ G LEI+ +P VF+ + D+Y
Sbjct: 809 KGATQTMQSVVNAATNSQ--WLVLVRPQGVLEIWTLPKLTLVFSTTLIATLDNVCADSYD 866
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
AL Q + V + M + + P L L G +
Sbjct: 867 PAALS-------------LPQDPPRKPQELDVENIVMAQLGESNPTPHLMVFLRSGQVAI 913
Query: 853 YQAYLFEGPENTS 865
Y+ P + S
Sbjct: 914 YETVHHPPPPDPS 926
>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
Length = 1187
Score = 167 bits (423), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 148/492 (30%), Positives = 225/492 (45%), Gaps = 83/492 (16%)
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQI 1008
L DG + F + NC +GF+Y S+ L+IC L TYD WPV KV PL+ T H I
Sbjct: 767 LVDGYVKCFAPFNIANCPNGFLYFNSEEDLRICVLDQRFTYDCPWPVHKV-PLRNTLHFI 825
Query: 1009 TYFAEKNLYPLIVS-VPVLKPLNQVLSL---LIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1064
T+ Y +I S + V + + + + I E G + + + L + T E +
Sbjct: 826 THHFVTKTYVIISSTMTVCEKMPHITTEDKEFIPVEKGDRFIHAPVEKFCL-QLITSETW 884
Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGED 1123
E+ PD A I M E+ ++ V L + T + +A+GT V GE+
Sbjct: 885 EII---PD---------AEIQMAEWEHVTCLKSVKLKSEETVSGLKEFIAVGTTNVCGEE 932
Query: 1124 VAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIIL 1178
VA RGR+++F P +N + Y KE KG ++A+ ++G L+ + G KI +
Sbjct: 933 VACRGRIVIFDVIEVVPEPGKPLTKNKIKTYYDKEQKGPVTAITCVEGFLVTSIGQKIYI 992
Query: 1179 HKW-TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1237
++ +L G+AF D +Y+ SL D
Sbjct: 993 WEFRDNKDLIGMAFIDTL-IYIHSL---------------------------------DR 1018
Query: 1238 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1297
L+ F T F ++ + L V AP ES GQ L+ RAE G++
Sbjct: 1019 HQLEIFNTNFYVNKNQLGFV-------------AP---ESHGGQFLVRRAEIQTGSNAHA 1062
Query: 1298 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1357
F R ++ A + + N+ FGTLDGSIG + P+DE +RRL SLQ KL
Sbjct: 1063 FFRTKVRALNQRQ--------NENKHITWFGTLDGSIGLLLPVDEKEYRRLFSLQAKLSI 1114
Query: 1358 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1417
+ AGLN ++FR F S+ K + +I+D +LL Y L E+ ++A Q +T Q
Sbjct: 1115 YLEQNAGLNQKAFRTFRSHQKKLQNSMRNILDGDLLKRYFHLGFVERRDLAKQIMSTPEQ 1174
Query: 1418 ILSNLNDLALGT 1429
I+++L L L T
Sbjct: 1175 IINDLTKLELST 1186
Score = 144 bits (364), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 144/299 (48%), Gaps = 19/299 (6%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+ I +Y + +E S + D LE + Y +G + +
Sbjct: 29 NLLTAGPTCIRVYDIIKDQEDIDLDNRSDNADNHLNKDNKLHPELEFLASYSFYGKIYGI 88
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ RDS+ + F DAK+S++E+D L S+H FE E LK G
Sbjct: 89 ----ESVRFRHHHRDSLFICFADAKLSLVEYDADNSNLTTLSLHTFEDDE---LKNGFSR 141
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-DTFGSGGGFSARIESSHV 235
P+++VDP RC ++V + + IL G + D + G + + S+V
Sbjct: 142 NLSIPIIRVDPDNRCAAMVVSNVHLAILPFRHRGPAEQQVQIDPKNTSGKYP--LMPSYV 199
Query: 236 INLRDLDMKHVKDFI---FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+++RDL + V I F+ GY EP ++IL E TW+GRV+ + TC I A+S++T
Sbjct: 200 VDVRDLGNEKVSRLIDIRFLEGYYEPTILILCEILRTWSGRVAVRQDTCSILAVSLNTID 259
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP D + VP PIGGVL+ AN + + +QS YA SL+S
Sbjct: 260 KVHPVIWSLNNLPFDCLGAITVPRPIGGVLIFAANCLLHLNQSKP------PYAESLNS 312
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/391 (22%), Positives = 159/391 (40%), Gaps = 100/391 (25%)
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT--ESAQKTFSFAVRDSLVNIGPL 515
+ D P++K+LR +++ LY + ++ T ES ++++F V D ++++GP
Sbjct: 336 DTDEPTSKKLRTDDEKEDEELE-----KLYSAHTSCTAKESYLRSYTFEVCDRILHVGPC 390
Query: 516 KDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCKG 549
+ G +T + ++S+ E+V +LPGC
Sbjct: 391 ASIAIG------QISTFVQEESDVEVVICSGHDKNGALSVLNKGIKPQVVASYDLPGCVD 444
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+WTV K R ++ + + H +LIIS + TM+L T +TEV E + + Q
Sbjct: 445 MWTV--KDIRLNDENDGDFET--ENTHKFLIISRDNLTMILRTGKEITEV-EQLGFLTQT 499
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
+T+ AGNL +IQV ++ Q L S ++ S+ DP
Sbjct: 500 KTVFAGNLDNGNCIIQVTPYEVILVSKGEKIQQLEL----------ENESPIVFCSLQDP 549
Query: 670 YVLLGMSDGSIRLL---VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------- 715
Y+ L + GSI +L + D V + + S+ +++C L+ D
Sbjct: 550 YISLLLEGGSIMMLAFELSDNGEKQVKLVNTTPLNHSR--IAACCLFQDNNGRMSVSDGI 607
Query: 716 --KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI--------------------- 752
+ P P T+ A L ID + LD D
Sbjct: 608 SIRTPSP----TNEPAELMEDEKFTIDDDELLYLDVNDTNLQTNDVPVASTSYTDNLERK 663
Query: 753 ---YSVVCYESGALEIFDVPNFNCVFTVDKF 780
+ +C ++G LE++ +P+++ V+TV+ F
Sbjct: 664 VSYWLFLCLDNGKLEVYSIPSYDKVYTVNGF 694
>gi|147864212|emb|CAN80950.1| hypothetical protein VITISV_016701 [Vitis vinifera]
Length = 262
Score = 167 bits (422), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 89/148 (60%), Positives = 105/148 (70%), Gaps = 26/148 (17%)
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLV 510
+++ GDIE D PS KR RRSSSDALQDM N ++L LYG A N+TE++QKTFSF+V DSL+
Sbjct: 53 RKKVGDIEGDVPSAKRSRRSSSDALQDMFNSDKLPLYGLAPNSTETSQKTFSFSVSDSLI 112
Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYEL--------------------------VEL 544
N+GPLKDF+YGLRINAD ATGI KQSNYEL VEL
Sbjct: 113 NVGPLKDFAYGLRINADLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRPERITEVEL 172
Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYD 572
PGCKGIWTVYHK++RGHNADS +M + D
Sbjct: 173 PGCKGIWTVYHKNTRGHNADSIKMVSAD 200
>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
Length = 398
Score = 166 bits (419), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/329 (32%), Positives = 172/329 (52%), Gaps = 11/329 (3%)
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1163
+ LA+GTA GE+V RGR+++ P ++ + +Y KE KG +++L S
Sbjct: 72 QNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCS 131
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
G+LL G K+ + + L GI+F D YV L V+N L D+++S+ L +
Sbjct: 132 CNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYVHQLIGVRNLALACDMYRSVALLRY 190
Query: 1224 KEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+E+ L+L ++D S A +F+ID + V+SDE NI IF Y P+ ES G+
Sbjct: 191 QEEYKALSLASRDMRSDVQPPMAAQFIIDNKQMGFVMSDEAANIAIFNYLPETLESLGGE 250
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
KL RAE ++G V F+R++ +S + R ++LF +LDGS G + PL
Sbjct: 251 KLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLE---RQSVLFASLDGSFGFLRPLT 307
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E FRRL LQ+ + VP AGLN + R H ++VD +++ Y L L
Sbjct: 308 EKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSL 367
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
E+ ++A + GT+R I+ +L ++ T+
Sbjct: 368 PEKNDLARKLGTSRYHIIDDLIEICRVTA 396
>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1376
Score = 164 bits (415), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 148/554 (26%), Positives = 253/554 (45%), Gaps = 87/554 (15%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVFRE------RLRVHPQLCDGSIVAFTVLHNVNCN 966
R+ ++ G++ F+ GS PC+ M RL+ P + + + H C
Sbjct: 876 RLRAIPDLCGYKTMFMPGSNPCFIMKSSTSSPHVLRLKGEP------VSSLSSFHMPACE 929
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
GF YV ++ ++++C+LP + +DN W +K I + + YFA Y L S
Sbjct: 930 KGFAYVDAKNMVRMCRLPGNTRFDNAWAARK-IHIGEQVDCVEYFARSETYVLGTS---- 984
Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
++ L D EV + + +S + ++ V++L P W +
Sbjct: 985 --YHEDFKLPEDDEVHTEWRSEVISFMP-----QLDRGRVKLLSPRT----WSIIDCYDL 1033
Query: 1087 QSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---TGRNADNP 1142
++E L ++ + + + T E + ++ +GTA V+GED+ RG + +F + D P
Sbjct: 1034 GATERILCLKTINMEVSEITHERQDMVVVGTAIVRGEDITPRGSIYVFEIIDVAPDPDRP 1093
Query: 1143 Q-NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPP 1196
+ N ++++KE +KGA++A++ + QG L+ A G K ++ K G+ L +AF D
Sbjct: 1094 ETNQKFKLFAKEDVKGAVTAISGIGGQGFLIAAQGQKCLVRGLKEDGSLLP-VAFMDMQ- 1151
Query: 1197 LYVVSLNIVKN--FILLGDIHKSIYFLSWK------------EQGAQLNLLAKDFGSLDC 1242
YV L ++ ++GD K ++F + E+ +L L KD L
Sbjct: 1152 CYVSVLKELQGTGLCIMGDALKGLWFTGYSVQLSSAVDVETCEEPYKLTLFGKDSEYLQV 1211
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1302
A +FL P S KG +LL R+ FH G H L L
Sbjct: 1212 VAADFL-----------------------PDDPSSSKGDRLLHRSSFHTG-HFISTLTLI 1247
Query: 1303 MLATSSDRTGAAPGSDKTNR----FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1358
TSS TGA+ + + + ++ + GS+G I PL E T+RRL +LQ +LV S
Sbjct: 1248 PQYTSSG-TGASEDNMDVDYMPAGYQVVVTSQSGSVGVITPLTEETYRRLSALQSQLVMS 1306
Query: 1359 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+ H GLNP+++R S+G + R +VD LL + + ++ + EIA + G I
Sbjct: 1307 MEHPCGLNPKAYRAVESDGFSGR----GLVDGNLLLRWLDMGVQRKAEIAGRVGADLQSI 1362
Query: 1419 LSNLNDLALGTSFL 1432
++L + G FL
Sbjct: 1363 RADLERINGGLDFL 1376
Score = 114 bits (284), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 186/745 (24%), Positives = 298/745 (40%), Gaps = 123/745 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V +V++++ + G+ S ++ + R ++ L L+ Y L G V L
Sbjct: 28 NLIVAKTSVLQVFSLVNVAYGASTSPSTDDKTR---VERQQYTRLVLLAEYDLPGTVTGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E LH
Sbjct: 85 GRVKT--LDSKSGGEALLVATRNAKLSLVEWDHERHGISTVSIHYYER-EDLHNSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD---EDTFG---------- 221
P L+ VDP RC +L +G+ + IL Q G LV D ED G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDDFDEDLRGEKPEDMDNAL 200
Query: 222 ---SGGGFSAR----IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ AR SS V+ L LD + H F++ Y EP IL+ T
Sbjct: 201 VESTAANDVARHKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHVATSFA 260
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHY 331
+ + + ++ + + + LP D + ++ +P PIGG L++G+N IH
Sbjct: 261 LLGERKDVVSYAVFTLDIQQRTSTTLVTVSRLPSDLWNVVPLPPPIGGSLLIGSNELIHV 320
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL---QNDVALLSTKTGDLVL 388
+ A+ +N +A +S + L+ L D+AL+ +G + +
Sbjct: 321 DQAGKTNAVGVNEFARQASEFSMADQSDLELRLEGCVIEQLGTESGDIALV-LASGRMAI 379
Query: 389 LTVVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS 441
+ DGR V +L ++ S+L + + ++G FLGS DS+LV +T S
Sbjct: 380 VRFKVDGRSVSGIFVQLVSTQAGGSILKARPSCSASLGRGKIFLGSEETDSVLVGWTRPS 439
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGSASNNT 494
S KRL+R SS D D + E LY + +N T
Sbjct: 440 Q--------------------SIKRLKRDSSGPRAGETDTDDDEDDIYEDDLYSTPTNQT 479
Query: 495 ESAQKT----------FSFAVRDSLVNIGPLKDFSYGLRINA-DASATGISKQS-NYELV 542
+ F F D L ++GP+KD + G D ++ SK S + ELV
Sbjct: 480 TVPKTVSQTNGLIKDEFVFRCHDRLWSLGPMKDITLGRTPGTRDQASKKTSKPSTDLELV 539
Query: 543 EL--PGCKGIWTVYHKSSRGHNADSSRMAAYDD-------------------EYHAYLII 581
G G T+ K + DS +M D Y YL+
Sbjct: 540 VTHGQGDAGGLTILRKELDPYIIDSMKMDNVDGVWSVQIAPSNTSNPSTTSRNYDKYLVF 599
Query: 582 SLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARIL 634
S ++R E + + T +D + T+ G L G RV+QV R
Sbjct: 600 S-KSRGHAKEQSVVYTVGGNGIDEMKAPEFNPNEDHTVDIGTLAGGTRVVQVLTSEVRSY 658
Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
D DL+ P E S+ +V S A+PY+L+ D S+ LL D S
Sbjct: 659 D-----TDLALAQIYPVWDE--DTSDELSVTGASFAEPYLLITRDDQSLLLLQPDSSGDL 711
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
V + +S K + C LY DK
Sbjct: 712 DEVNIDGLL-TSNKWLCGC-LYFDK 734
>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
Length = 374
Score = 164 bits (414), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 172/329 (52%), Gaps = 11/329 (3%)
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1163
+ LA+GTA GE+V RGR+++ P ++ + +Y KE KG +++L S
Sbjct: 48 QNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCS 107
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
G+LL G K+ + + L GI+F D Y+ L V+N L D+++S+ L +
Sbjct: 108 CNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYIHQLIGVRNLALACDMYRSLALLRY 166
Query: 1224 KEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+E+ L+L ++D S A +FLID + ++SDE NI IF Y P+ ES G+
Sbjct: 167 QEEYKALSLASRDMRSDVQPPMAAQFLIDNKQMGFIMSDEAANIAIFNYLPETLESLGGE 226
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
KL RAE ++G V F+R++ +S + R ++LF +LDGS G + PL
Sbjct: 227 KLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLE---RQSVLFASLDGSFGYLRPLT 283
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E FRRL LQ+ + V AGLN + R H ++VD +++ Y L L
Sbjct: 284 EKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVVMQYLHLSL 343
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
E+ ++A + GT+R I+ +LN++ T+
Sbjct: 344 PEKNDLARKLGTSRYHIIDDLNEICRVTA 372
>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like protein
[Galdieria sulphuraria]
Length = 1454
Score = 162 bits (409), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 157/579 (27%), Positives = 249/579 (43%), Gaps = 85/579 (14%)
Query: 906 PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD-----GSIVAFTVL 960
PH P F N+S H G FL+GS P ++ + + H + D G I++ T +
Sbjct: 877 PHLRP------FYNLSSHFGVFLTGSVPSIIVLSKGYPQKHEIMIDSGVEYGDILSITNM 930
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGS--TYDNYWPVQKVIPLKATPHQITYFAEKNLYP 1018
+ N + S G + ++ + + WPV+ V + + Y A +
Sbjct: 931 GDPENNRKLWILDSNGRIHFGEIRETQLESINWAWPVE-VFRMNGCVKNVVYHATTGTFG 989
Query: 1019 LIVSVPV----LKPLNQVLSLLIDQE---VGHQIDNHNLSSVDLHRTY-------TVEEY 1064
++VS V L+ Q+ E +G Q ++ + VE Y
Sbjct: 990 VVVSSIVSMSRLERKRQIFERQKRDERAILGSQAPPEEENNTEFEENEPKNALPIEVEAY 1049
Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENAL--TVRVVTLFNTTTKEN-------------- 1108
E++I D W+ + E L T V + T +EN
Sbjct: 1050 ELQIYRAD----TWELVDKFAFKEEEAVLSATFMQVDAYKITEEENNDDKSSRATQQQAE 1105
Query: 1109 --------------ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN-------LVT 1147
+ + IGT +++GED RGR++LF R + +
Sbjct: 1106 AAISQSSRSIKFKPKECIVIGTGFIKGEDAGTRGRLMLFEVARQEAYTEESGAFSAIQLM 1165
Query: 1148 EVYSKELKGAISALASLQGHLLIASGPKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVK 1206
+ KELK +S++A L+G++ A GPK+ ++K +EL +FY L+ S+N VK
Sbjct: 1166 LIAEKELKSVVSSIARLEGYICCAVGPKVEIYKLVNESELVCCSFYSGFQLFSTSINTVK 1225
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
++ +GD++K YFL W+++ LN L KDF + +TEFLI + VVSD N+
Sbjct: 1226 QYVFVGDMYKGGYFLFWRDRNKSLNFLGKDFDPVQTLSTEFLILNEFILFVVSDNFGNLH 1285
Query: 1267 IFYYA-PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM---LATSSDRTGAAPGSDKTNR 1322
+ YA P ES G+KLL R H+G + +RL+ S DR G+
Sbjct: 1286 LLEYAGPHEIESRGGEKLLRRGVLHLGTRSSSMIRLRTDWKENNSEDRAGSH-------- 1337
Query: 1323 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKK--LVDSVPHVAGLNPRSFRQFHSNGKAH 1380
++ GT DG + C+ PL + + + L KK L +VAGLNP+ FR K
Sbjct: 1338 -IVVLGTWDGGLACLLPLQQEEYEQKNELLKKVYLHSYSLYVAGLNPQEFRIPRGLSKKT 1396
Query: 1381 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
R + ++D LLS + L +EIA G SQ+L
Sbjct: 1397 RTFGERLLDSTLLSSLQGLEYTSIVEIAKSCGLDASQLL 1435
Score = 97.1 bits (240), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 120/528 (22%), Positives = 198/528 (37%), Gaps = 89/528 (16%)
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
KVDP+ VL+ ++++ ++ D+ + + + +++LR L
Sbjct: 166 KVDPEHGLIAVLIRKKNLLLI----AKYPILSHRDSLSAECSSNKLLSDPVILDLRRLGH 221
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
F F+ GY P + +L E+ TW+G S + ++S + + K+ IW
Sbjct: 222 FETIHFCFMFGYSLPTLALLEEKTPTWSGSFSVTRDSRLVSVVQFDLSDKKMKRIWQVEE 281
Query: 304 LPHDAYKLLAVPS-PIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFS 361
LPH+ + + +VP GG LV G N I Y + L+ N+ S L
Sbjct: 282 LPHECFMVSSVPFLQGGGFLVFGWNIILYFRDGSFVDGLSCNDLGDVYLSKWSLRSQDAP 341
Query: 362 VELDA--------AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
+ LD +H T+++N V +L + G L + G + L + S
Sbjct: 342 ISLDGCEVVSEFDSHDTFMKNPVIIL--RDGAFFELCIPKKGG-DSVISLRYCKILIQPS 398
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
++ GN L FLGS + S L++ + T + D
Sbjct: 399 TVSYCGNGLIFLGSHVSPSALLEIIWKNSTEL-----------------------HPEDD 435
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
L+ S +G +SN + S RDSL IGP++D I + +
Sbjct: 436 ELE--------SFFGKSSNKNFVVETIDS---RDSLFCIGPIQDLEVFDNIIGSSRKMEL 484
Query: 534 SK---QSNYELV---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
NY V L C+ IW V + G S +
Sbjct: 485 IAAVGSRNYGAVIIFRRTVSPSLLTSIRLEDCQQIWNVLCQRKMGERNGSVPL------- 537
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
LI+S + T+VL +D + E+ +S + RT+ + R +IQVF+ G RIL
Sbjct: 538 ---LILSTQRNTIVLSVSDTIDELVDS-QFQTSSRTLWVSRVLHDRYIIQVFDEGLRILG 593
Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
L P V + DPYV+L +S + +L
Sbjct: 594 NWDSLISLYELPP---------GDVVTQAFVCDPYVMLHLSSSYLVIL 632
>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM 1558]
Length = 1408
Score = 160 bits (406), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 138/521 (26%), Positives = 237/521 (45%), Gaps = 42/521 (8%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRER-LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
+ N+ G G F++G +P W M + LR++ L G++ H + F+ +
Sbjct: 916 YDNLEGQSGAFITGEKPYWIMSSEKHPLRLY-GLKQGAMAFGPTTHLGSMGEYFMKIDDG 974
Query: 976 GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY--PLIVSVPVLKPLNQVL 1033
IC P D P + ++ T + + Y +SVP
Sbjct: 975 CF--ICYFPQSLNTDLTMPCDRY-EMQRTYTNVVFDPPSGHYLGATAISVPFQA------ 1025
Query: 1034 SLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1091
D+E Q+ + NL L+ ++E + R PW+ +EN
Sbjct: 1026 ---YDEEGEIQLGPEGENLVP-PLNERSSLELFS-------RGSDPWRVIDGYDFDQNEN 1074
Query: 1092 ALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN---LVT 1147
L+++ V L +++ +A+GT + GED A RG V +F P
Sbjct: 1075 VLSMQSVLLESSSVPGGYRDFVAVGTGFDFGEDRATRGNVYIFEVVEVVPEPGQKSAWAL 1134
Query: 1148 EVYSKE-LKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIV 1205
++ K+ + +SAL ++ G+LL ++GPK+ + E L G+AF D +Y+ S+ +
Sbjct: 1135 KLRCKDPCRNPVSALGNINGYLLHSNGPKMYVKGLDFDERLMGLAFVDVM-IYLTSIKVF 1193
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1265
KNFIL+ D+ KSI+FLS++E + +++KD + + +FL+ ++ + D +I
Sbjct: 1194 KNFILISDMVKSIWFLSFQEDPYKFTVISKDLMPISVTSADFLVHDGHVTFLTYDRSGDI 1253
Query: 1266 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1325
++ + P ES G++L+ R E+H G+ VT M+A R G + + +
Sbjct: 1254 RMVDFDPANPESINGERLIVRTEYHGGSPVTV---STMIAR---RRGVE--EEFAPQTQI 1305
Query: 1326 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1385
+ DGSI FRRL + +L+ + HVAGLNPR+FR N +P
Sbjct: 1306 ICAHADGSISTFVSTKPARFRRLHFVSDQLIRNAQHVAGLNPRAFRTVR-NDLVAKPLSR 1364
Query: 1386 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
I+D ELL + + P++ Q E+ Q GT + S+L L
Sbjct: 1365 GILDGELLGRFAIQPIDRQREMLKQIGTDGGTVASDLQALG 1405
Score = 123 bits (309), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 150/656 (22%), Positives = 279/656 (42%), Gaps = 94/656 (14%)
Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L L+C + LHG + LA L + D D ++++F+DAK+++LE+ S + S
Sbjct: 117 LHLLCQHTLHGWITGLAPLRTIESSVDG---LDRLLVSFKDAKMALLEW--SRGDIATVS 171
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
+H +E + + G F PL++ DP R + + + IL Q S L E+
Sbjct: 172 LHTYERCQ--QMVTGDLQFYT-PLLRSDPLSRLAVLTLPEDSLAILPVLQEQSDLDPLEN 228
Query: 219 TFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
F +S S V++L D+ +K+++D +F+ G+ P + +L+ TWAGR
Sbjct: 229 -FTKDAPYSP----SFVLSLADVAPTIKNLQDLLFLPGFHSPTLAVLYSPYHTWAGRYHS 283
Query: 277 KHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGV-LVVGANTIHYHSQ 334
+ T + + T +PL+ S LP D+ ++A P+ +GGV LV +H
Sbjct: 284 QRDTFCLEVRTFDITAGGSYPLLTSVSGLPSDSLYIVACPAELGGVVLVTTTGLLHIDQS 343
Query: 335 SASCALALNNY-----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
+ A ++N + + D S E S + L+ + + ++ LL + GD+ +
Sbjct: 344 GRTVATSVNAWWSHITTLPCDKSSE----SRKISLEGSKSVFVTERDMLLVLQNGDVHQV 399
Query: 390 TVVYDGRVVQRLDLSKTNPSV-LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
+GR + + + + + +V S + T GN F+G GDSLL +
Sbjct: 400 RFEMNGRAIGAIKVDEQSSNVPAPSSMVTTGNQAIFVGCAEGDSLLANVDIKRAVA---- 455
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE----SAQKTFSFA 504
+++ IEA+A D +D+ ++ L A+N + + +
Sbjct: 456 -IEDRKPAIEAEA---------EVDWDEDLYGDIDVPLTNGATNGAKYQAITGPANIVLS 505
Query: 505 VRDSLVNIGPLKDFSYGLRINADASAT--------GISKQSNY-------------ELVE 543
D L +G + D +G+ + + T G SK+S + E
Sbjct: 506 PADVLTGVGKIVDMEFGIASTDEGTRTYPQLVTIGGGSKRSTFNAFRRGIPISKRRRFNE 565
Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMVLETADLLTEVT 600
L + +W + + G + + + ++ ++ S EA R L ++
Sbjct: 566 LFNTESVWFLPIQRPSGQH-----LKSIPEDRRTTMLFSSEATQTRIFSLSAKPNPEQIG 620
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
+ G+++ G F R V+ V + +LD TQ G+E
Sbjct: 621 R-----ISGKSLTVGPFFQRSNVLVVTQTEVLLLDSDGKTQ----------QSIGNEGEE 665
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHD 715
++S SI+DPYV++ +GS + VGD +S V+ P+ +S + P + ++ D
Sbjct: 666 IVSASISDPYVVIRRVNGSGSMFVGDTVARQLSEVKIPS--DSLQPPYQAIEVFSD 719
>gi|380494933|emb|CCF32776.1| cft-1, partial [Colletotrichum higginsianum]
Length = 542
Score = 160 bits (406), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 143/545 (26%), Positives = 240/545 (44%), Gaps = 47/545 (8%)
Query: 889 NLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
N +++P++ E E P P + NI+G+ FL G+ P + +
Sbjct: 1 NPTIAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIVKSAKSSPKV 57
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPH 1006
L + + H C GFIY S+G ++ QLP+ S + + IP+
Sbjct: 58 VGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSVRKIPIGDAVG 117
Query: 1007 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1066
I Y Y + S+ ++ L D + + +S L E V
Sbjct: 118 LIAYHPPMETYAVACSI------SEHFELPKDDDYHKEWAKETTTSYPL-----TERGIV 166
Query: 1067 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1125
+++ P W T+ ++ E A+ ++ + L + TKE L+ IGTA +GED+
Sbjct: 167 KLMSPTT----WSVIDTVELEPHEVAMCMKTLHLEVSEETKERRMLITIGTAINRGEDLP 222
Query: 1126 ARGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKII 1177
RGR+L++ P N ++ +KE +GA++ L + QG +L+A G K +
Sbjct: 223 IRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEIPRGAVTGLCEVGSQGLMLVAQGQKCM 282
Query: 1178 LH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLL 1233
+ K GT L +AF D YV ++ V+ + L+ D K ++F+ + E+ ++ L
Sbjct: 283 VRGLKEDGTLLP-VAFMDMN-CYVTAVREVRGTGYCLMTDAFKGVWFVGYAEEPYKMMLF 340
Query: 1234 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1293
K G+ + +F++ G L +VV D+ I + + P+ +S +G LL+RA F
Sbjct: 341 GKSMGNFEVLTADFVVAGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAP 400
Query: 1294 -HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1352
H T L L S T + K LL + G++ + PL E +RRL SL
Sbjct: 401 NHPTITLSLPRTPISPSATSVS----KNPPTTLLLASPTGALASLTPLSEQAYRRLTSLA 456
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEI 1407
+ ++PH A NP+ R + A PG D SIVD LL+ + L + E+
Sbjct: 457 NSIAGALPHAAATNPKGHRLQPLD--ARTPGVDTSAGRSIVDGALLARWNELGAGRRSEV 514
Query: 1408 AHQTG 1412
A + G
Sbjct: 515 AGKGG 519
>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1678
Score = 159 bits (402), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/418 (27%), Positives = 200/418 (47%), Gaps = 53/418 (12%)
Query: 1062 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF---------NTTTKENETL- 1111
E + V I++ A W + + E+ +T+ ++ L N T + L
Sbjct: 1267 EAFAVWIVD---AANNWMVVDSFKLDEYEHGMTLSIMELTEFPEEPGSSNDTDVSGDELS 1323
Query: 1112 ----LAIGTAYV--QGEDVAARGRVLLFSTGRNADNPQNLVTEV------YSKEL-KGAI 1158
+A+GT + GEDVA+RGR +L R + + +V Y KE+ GA+
Sbjct: 1324 KRMFVAVGTGVLDHNGEDVASRGRAILLELKRTNSSAKAAGRQVVELSFCYEKEIFHGAV 1383
Query: 1159 SALASL----QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
++L L + LLI +G I + +W +L + F+ A + V+ K+F+LL D
Sbjct: 1384 TSLVCLSSEGKNRLLIGAGADINVEQWGNAKLTQVGFFRAT-MQVLHTIPFKSFLLLSDA 1442
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+ S+YFL W+E L LLAKD+ + +A + G ++ + D+++N+Q F YAP
Sbjct: 1443 YDSLYFLIWRESDKSLTLLAKDYDPIPVYAAGVMSRGPAMTFLCHDDRQNLQFFQYAPGE 1502
Query: 1275 SESWKGQKLLSRAEFHVGAHVTK----FLRLQMLATSSDRTGAAPG----------SDKT 1320
+ + G +L+ RA++H+G T F R ++ S+ T S++
Sbjct: 1503 AAARGGNRLVCRADYHLGTQTTSFASHFCRSSLMIHSATPTSTLAALKQQDSYFGRSEED 1562
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
R FGT DG +G + PL E + RL +LQ + +++ L PR++R + + +
Sbjct: 1563 QRLGAYFGTADGGMGAVVPLSEPVYWRLTALQSIVANALESDCALAPRAWRLYRRSTR-- 1620
Query: 1381 RPGPDS------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
R G S ++D +L+ Y L + +Q +IA G+T IL NL +L G+ L
Sbjct: 1621 RGGCRSNDRKKGVIDGDLVLQYADLSISKQEDIASAIGSTVDLILDNLLELQCGSLVL 1678
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 135/333 (40%), Gaps = 79/333 (23%)
Query: 251 FVHGYIEPVMVILHE--RELTWAGRVSWKHHTC-----MISALSISTTLKQHPLIWSAMN 303
F+ GY+EPV+V+LH W+GR+ + ++ALSIS + ++WS +
Sbjct: 243 FLSGYLEPVLVLLHSDVEGPVWSGRLGRERGVAGAPPLFVTALSISVVHGRTAVLWSQV- 301
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSS------QELP 356
+ DA K+L+ G LVVGANT + +A+N +A S + Q P
Sbjct: 302 VSADATKILSFGKT--GCLVVGANTLVILEIGKVQQVIAMNGWARSTCPAALQTALQANP 359
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--------------------- 395
+++LD TWL A+++ +TG L +L D
Sbjct: 360 VVKLAIQLDGCCVTWLSEHSAIMALRTGQLYVLQRTDDRWAVMPLGQTLGAVGEVAHLAS 419
Query: 396 ------RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
R ++++ + + S + + F GSR GDSL + + +M +
Sbjct: 420 LPIGGLRWLEKMKMDENKASEMQMGV-------LFAGSRTGDSLFLGYAL-EIVTMPWAA 471
Query: 450 LKEE---FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS----------ASNNTES 496
+K E F + E S ++ L ++ EE +LYG+ S E+
Sbjct: 472 IKSEGQTFINFEGSELSKVATTAPIANGLDRILQLEEEALYGTDRSTPLHIVRDSEEEET 531
Query: 497 AQ--------KTFSFAV------RDSLVNIGPL 515
A + +F V D LVN+GPL
Sbjct: 532 ADIPSDAKRLRPVAFTVVRTIVPLDVLVNLGPL 564
>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1820
Score = 158 bits (400), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 206/417 (49%), Gaps = 39/417 (9%)
Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET 1110
+S+ L + ++EV +++ A W + E+A ++V+ L + +E
Sbjct: 1408 TSIGLAPSIPTRQFEVWLVD---AASQWTVLDKYQLCDFEHATALKVLFLTDVVEDSDEP 1464
Query: 1111 -----LLAIGTAYVQ--GEDVAARGRVLLFS--TGRNADNPQNLVTEVYSKELK----GA 1157
+A+GT ++ GED+A++GR+LLF+ ++ + +++ E++ K K G
Sbjct: 1465 PKKSLFVAVGTGRIERDGEDIASKGRILLFNLKKKKHQKDKRSMTLELHLKHEKDITIGP 1524
Query: 1158 ISALASLQGH----LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
+++L+SL+ + + +G ++ + +W +L + FY A + V ++++ K F LL D
Sbjct: 1525 VTSLSSLRSEDIFRVAVGAGAEVTVEQWGSGKLVQVGFYHAH-MQVQNISLFKTFFLLSD 1583
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ +++FL W+E L LLAKD+ FA + G +S V D+++NIQ YAP
Sbjct: 1584 AYDALHFLVWRESDKSLTLLAKDYEPTQVFAAGMISRGGAMSFVCHDDRQNIQFLQYAPT 1643
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKF--------------LRLQMLATSSDRTGAAPGSDK 1319
+ G KL+ RA+FH+G+ T LA+ + D
Sbjct: 1644 DVAARGGNKLVCRADFHLGSQTTSLNSHWAQSSLLFNSCTVSSTLASLKQQDSLFGRLDD 1703
Query: 1320 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN--- 1376
RFA+ FGT DGS I PL E T+ RL +LQ + +++ A L+ R++R + +
Sbjct: 1704 DQRFAVNFGTTDGSFVSIIPLSEPTYWRLTALQSVMSNALESNAALSHRAWRLYRRSTRR 1763
Query: 1377 -GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
G ++D +L+ + LPL EQ ++ G+T ++ NL +L+ S +
Sbjct: 1764 GGCRTNDRKKGVIDADLVMKFVDLPLPEQEDLTSSIGSTVGLVMDNLLELSCAGSVV 1820
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 91/410 (22%), Positives = 145/410 (35%), Gaps = 141/410 (34%)
Query: 246 VKDFIFVHGYIEPVMVILHERE-----LTWAGRVSWKHHTCM------------------ 282
+ D F+ GYIEP +++LH WAGR+ +
Sbjct: 398 IVDIAFLSGYIEPTLLVLHSNPKRGGGRAWAGRLGRTEEVPLSNNGGSGESKDDYGEDID 457
Query: 283 -----------------------ISALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPSPI 318
++A+S++ ++ ++WS ++ LP DA+KL VP P
Sbjct: 458 LEGGDAAKKGPDLVSTGTKYGLSLTAISLAIHQRRSVVLWSLLDALPADAWKL--VPHPS 515
Query: 319 GGVLVVGANTIHYHSQSA--SCALALNNY------------------AVSLDSSQELPRS 358
GV+V G NT Y S SCALA N + AV L+ + P
Sbjct: 516 DGVIVWGVNTAVYVSMGGKISCALAANGFAKIGCPIGLIPPSGRIGSAVYLEPNPS-PLP 574
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV--------------------------- 391
+++LD A ++ DVA++ G L L +
Sbjct: 575 MLALQLDGARVGFVTEDVAIVCLGNGSLYSLELHRAKSMVSPSMFLSMSPLGHRVGGLGV 634
Query: 392 -------------------VYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRL 429
+ D V+ D +K + SV I + G L F GSR+
Sbjct: 635 ASCLSVLAMACHSNSVGHFLVDNEGVKDEDHAKETISKESVSGPKIRSRG--LIFAGSRM 692
Query: 430 GDSLLVQFT---------------CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SD 473
GD L+ F+ G+G L E+ + P+ K+L++ S
Sbjct: 693 GDCSLLAFSLNVPIHLVITDVDSETGAGKRKLGGSRPEQLSSMP--EPAQKQLKKEEISP 750
Query: 474 ALQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ D +GEE + S + + + + DSL +GPL YG
Sbjct: 751 SRTDSEDGEEDIVCAMSSPRRSVRTLSMFRTVSALDSLTGLGPLGQGCYG 800
>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 1450
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 143/574 (24%), Positives = 248/574 (43%), Gaps = 45/574 (7%)
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-----GAPCQRITIFK--N 919
S+D ++ ++ S S LRF + P A + + AP +R+ + N
Sbjct: 871 SNDDLTIYEPFKIAESSQSLSGTLRFRKLPNPAVAKSQDTKVSDDAPAPMRRMPLRACGN 930
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
I+G+ FL G P + + + L + A + H C+ GFIY +G+ +
Sbjct: 931 IAGYSCVFLPGHSPSFLIKSSKSTPRVIGLQGPGVRAMSPFHTKGCDRGFIYADYEGVAR 990
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
+ Q+P+ ++ K +PL I Y +Y +V+ +P L D
Sbjct: 991 VAQIPNDCSFAELGLSVKKVPLNMDADGIAYHTPSGVY--VVTCSFWEPFE----LPSDD 1044
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E + N++ E ++++ P W T +E A+ ++ +
Sbjct: 1045 ESHREWAKENITF-----KPQTEHSVLKVINPVN----WSEIWTEEFDKNEVAMCIKSLN 1095
Query: 1100 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---STGRNADNPQ---NLVTEVYSK 1152
L + +T E L+ +GTA +GED+ RG V ++ S D P+ L +
Sbjct: 1096 LEVSQSTNERRHLITVGTAICKGEDLPVRGCVYVYDLASVVPQKDRPETDKKLKLMAKDE 1155
Query: 1153 ELKGAISALASL--QGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVSLNIVKN-- 1207
+GA++AL+ + QG +L+A G K ++ L +AF D YV +
Sbjct: 1156 VPRGAVTALSEIGTQGLMLVAQGQKCLVRGLGEDGRLLPVAFMDMN-CYVSCAKELPGTG 1214
Query: 1208 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1267
F + D K ++F + E ++ + K +L+ +FL DG L LV +D + N+ I
Sbjct: 1215 FCAMADAFKGVWFTGYTEGPYKMMIFGKSSTNLEVINVDFLPDGRNLLLVAADAEGNLHI 1274
Query: 1268 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--- 1324
F + P+ +S +G LL+R F GAH + L M TSS+ + A D + A
Sbjct: 1275 FQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ-KSLLMPTTSSNPSQPATNGDASAAAAGPQ 1333
Query: 1325 -LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1383
+L G + + PL + + RL +L L SVPH A LNP+++R + +
Sbjct: 1334 HILMAAPTGVLAAVQPLGQGVYTRLSALASNLAASVPHHAALNPKAYRMPPAPARNQVAA 1393
Query: 1384 PD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1412
D ++VD LL+ + L + E+A + G
Sbjct: 1394 VDISVGRAVVDGALLARWAELGSGRRAEVAGRAG 1427
Score = 93.6 bits (231), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 157/686 (22%), Positives = 262/686 (38%), Gaps = 126/686 (18%)
Query: 91 VLMDGISAASLELVCHYRLHGNVESLA------ILSQGGADNSRRRDSIILAFEDAKISV 144
V D S + L+ + L G V LA + GG S D +++AF+DAK+S+
Sbjct: 86 VRSDRASHTKIVLIAEFPLSGTVTGLARVKPPNVSKTGGG--SGVGDLLLIAFKDAKLSL 143
Query: 145 LEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGL 199
+ +D L S+H +E E W +F + DP RC +
Sbjct: 144 VAWDSERRSLETFSIHYYEQDELQGNPWECPLSDYANF-----LVADPGSRCAALKFGPR 198
Query: 200 QMIILKASQGGSGL-VGDEDTFGSGG------------GFSARIES-----SHVINLRDL 241
+ IL Q + +GD D G ++ IE S V+ L +L
Sbjct: 199 SLAILPFKQADEDIGMGDWDEALDGPRPAQSQSAAVAINGTSTIEDTPYSPSFVLRLPNL 258
Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
D + H F++ Y EP IL +T + + K H + ++ K I
Sbjct: 259 DPALLHPVHLAFLYEYREPTFGILSS-SITPSNCLDRKDH-LTYTVFTLDLQQKASTTIL 316
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
S LP D +++A+P+P+GG L+VGAN IH + +A+N + S S
Sbjct: 317 SVGGLPKDLTRVIALPAPVGGALLVGANELIHIDQSGKANGVAVNPFTKQCTSFGLADHS 376
Query: 359 SFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT 412
++ L+ L ++ L+ G L +T DGR V L + P ++L
Sbjct: 377 DLNLRLEGCTIEVLSAEHGELLVVLDDGRLATITFHIDGRTVSGLKVRIIPPEAGGNILP 436
Query: 413 SDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
+ ++ IG + F GS GDS+++ G + SS + + ++ L
Sbjct: 437 TSVSCLSRIGRNAMFAGSERGDSIVI------GWNRKSSQVSRKKSRVQ-----DPDLDL 485
Query: 470 SSSDALQDMVNGEELSLYGSASNNT--------ESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ ++ LYG T ++ + F D L++I P++D +YG
Sbjct: 486 DIDFDDLEDDEDDDDDLYGDTEKTTTVAGLASGQAKLEDLVFRCHDRLISIAPIRDMAYG 545
Query: 522 LRINADASATGISK----QSNYELV--------------------------ELPGCKGIW 551
TG QS +LV + P +G+W
Sbjct: 546 KPPPPAEGETGSRNSTPIQSELQLVAVVGRDRASSLAIMNREMTPVSIGRFDFPEARGLW 605
Query: 552 TVY-----------HKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETAD 594
T+ K ++ D YD +++++ E+ + + TA
Sbjct: 606 TLACQKPLPKVLQGEKGTKPVGGDFGVPVQYDK----FMVVAKEDDDNFESSNIYVLTAA 661
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
++ + G TI AG + ++IQV + R DG +TQ + P E
Sbjct: 662 GFEKLVGTEFEPAAGFTIEAGTMGNHTKIIQVLKSEVRCYDGDLGLTQII---PMLDEET 718
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGS 679
+ +T S SIADPY+L+ D S
Sbjct: 719 NHEPRAT--SASIADPYLLIIRDDSS 742
>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification [Scheffersomyces
stipitis CBS 6054]
gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification [Scheffersomyces
stipitis CBS 6054]
Length = 1341
Score = 157 bits (397), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 137/588 (23%), Positives = 258/588 (43%), Gaps = 69/588 (11%)
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
H +L + G +L Y+ Y F+G N + ++L +
Sbjct: 786 HKEEYLTILTIGGEVLLYKLY-FDG-------------------ENYEFKKEKDLAITGA 825
Query: 896 PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
P +AY P G +R + F N++G+ F++G P + + Q
Sbjct: 826 PENAY-----PIGTAVERRLAYFPNLNGYTCIFVTGVTPYLILKSLHSIPRIYQFSKIPA 880
Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
V+ + H+ +G I++ +Q +ICQLP Y+N WP+ K+I + + ITY
Sbjct: 881 VSISPFHDSKVANGLIFLDNQQNARICQLPLDFNYENTWPM-KLIHIGESIRAITYHESS 939
Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY---TVEEYEVRILEP 1071
+ Y V+ + D+E G I V LH+ + + ++++ P
Sbjct: 940 HTY-------VVSTFKDIDYECFDEE-GKPI-------VGLHKDKPPSSAYKGSIKLISP 984
Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIGTAYVQGEDVAAR 1127
W TI + +E +TV+ + L ++TK+ + + IG+ + ED++A
Sbjct: 985 FN----WSVIDTIELADNELGMTVKSMILDVGSSTKKFKHKKEFIVIGSGKYRMEDLSAN 1040
Query: 1128 GRVLLFSTGR---NADNPQ--NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1182
G ++ D P+ + EV+ ++ KGA++++ + G L++ G K+I+
Sbjct: 1041 GSFRIYEIIDIIPEPDRPETNHKFKEVFKEDTKGAVTSVCEVSGRFLVSQGQKVIVRDLQ 1100
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+ +AF D +YV N ++LGD KS++ + + + ++ +L KD LD
Sbjct: 1101 DDGVVPVAFLDTA-VYVSEAKSFGNMMILGDSLKSVWLVGFDAEPFRMIMLGKDLQGLDV 1159
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1302
+F+ + ++++D + + Y P+ + GQ+LLS++ F + + VT L
Sbjct: 1160 NCADFITKDEEVFILIADNNNVLHLVQYDPEDPTALNGQRLLSKSSFSINSFVTCLKSLP 1219
Query: 1303 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHV 1362
D T F + T+DGS + P++E ++RR+ LQ++L D H
Sbjct: 1220 KTEEKYD----------TGNFQTIGSTIDGSFFSVVPINEASYRRMYILQQQLTDKEYHY 1269
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
GLNPR R + A+ I+D +++ Y L E + +A +
Sbjct: 1270 CGLNPRLNRFGGLSMTANDTNTKPILDYDVIRAYGKLNEERKKNLASK 1317
Score = 96.7 bits (239), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 98/415 (23%), Positives = 183/415 (44%), Gaps = 55/415 (13%)
Query: 55 VPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
V +LVV A +++I+ V VQ + S SK L+L+ ++LHG +
Sbjct: 26 VKHLVVGKATLLQIFEV-VQLKSSTPSK--------------PQHRLKLIDQFKLHGLIT 70
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
+ + + N D ++++ + AK SV+++D +H + S+H +E+
Sbjct: 71 DIKPIRTVESPNF---DYLLVSTKSAKFSVIKWDHHLHTISTVSLHYYENAIQ---NSTY 124
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGL----------QMIILKASQGGSGLVGDEDTFGSGG 224
E ++ L+ ++P G C + L ++ A +V E G
Sbjct: 125 EKLSKSELL-LEPYGSCSCLRFKNLLCFLPFETAEELDDDDADSENEDMVKSEKKEHENG 183
Query: 225 GFSARI--------ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
+ + ++S +I+ + LD + + D F+ Y EP IL +R+ WAG +
Sbjct: 184 TVNVPVTDQPGSFFDTSFLIDGQSLDSSIGSIIDMQFLFKYREPTFGILSQRQQAWAGNL 243
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
L++ T K + NLP+D +++ +PSP+ G L++G N IH +
Sbjct: 244 PKIKDNVQFCILTLDLTTKSTVSVLKIDNLPYDVDRIVPLPSPLNGCLLLGCNEIIHVDN 303
Query: 334 QSASCALALNNYAVSLDSSQEL--PRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLT 390
+A+N + + +S + ++ +++L+ L ND ALL TG+ L
Sbjct: 304 GGIVRRIAVNQFTSLITASTKAYQDQTHLNLKLEDCSVVALPNDHRALLVLSTGEFYYLN 363
Query: 391 VVYDGRVVQRLDLSKTNPSVLTSD--------ITTIGNSLFFLGSRLGDSLLVQF 437
DG+ +++ + + +L SD I T+ N+L F + G+S LVQF
Sbjct: 364 FEVDGKSIKKFTIESVD-KLLYSDIKLTFPGQIATLDNNLLFFANHNGNSPLVQF 417
>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
Length = 1370
Score = 156 bits (394), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 136/535 (25%), Positives = 241/535 (45%), Gaps = 61/535 (11%)
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
C+ + ++ G++ F+SG PC+ ++ R H G V + + H C GF
Sbjct: 846 CKLLRALPDVCGYRTVFMSGHSPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 904
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
YV I L I Y + Y + S
Sbjct: 905 AYVDED-----------------------ITLGEQVDSIVYSSASECYVIGTSA------ 935
Query: 1030 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1089
+ L D E + N ++ + +E +++LEP W T + ++ +
Sbjct: 936 KEDFKLPEDDESHTEWRNEFITFLP-----QLERGTIKLLEPRN----WSTIDSHELEPA 986
Query: 1090 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QN 1144
E + V+ L + T E + ++ +G++ V+GED+ +G + +F P +N
Sbjct: 987 ERITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKN 1046
Query: 1145 LVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYV 1199
++++KE +KGA++AL+ + QG L++A G K ++ K G+ L +AF D YV
Sbjct: 1047 KKLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYV 1104
Query: 1200 VSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
L +K ++GD K ++F+ + E+ +L+L K+ +L +FL DG+ L ++
Sbjct: 1105 NVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYIL 1164
Query: 1258 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRTGA 1313
V+D+ N+ + Y P+ S KG +LL R+ FH G + L ++ D
Sbjct: 1165 VADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGGHTPSSPVDEDAM 1224
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
S +++ +L GSI I PL E ++RRL +LQ +LV+++ H LNPR +R
Sbjct: 1225 DTDSPPPSKYQILMTFQTGSIAIITPLGEDSYRRLLALQSQLVNALEHPCSLNPRGYRAV 1284
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
S+G + G ++D LL + + + + EIA + G I +L L G
Sbjct: 1285 ESDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRVDLEKLHGG 1336
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 181/736 (24%), Positives = 287/736 (38%), Gaps = 100/736 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E +
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVPD 141
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
+ + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 142 LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDLVSDA 200
Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ K + + LP D +K++ +P P+GG L++G N +H
Sbjct: 261 IDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + S + L+ L + LL G + +L+
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADHSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380
Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLLGWSRP 436
Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
S T+ S K G E A D D + ++L AS E
Sbjct: 437 SSTTKRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASTRQEKQVVN 494
Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
+ F+F D L ++GP +D + G + + S +EL +G
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQQDSVPEIAAPLELVAARGFGKSG 554
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
TV + + DS +M DD Y + I L+ + T + + D LL +
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVLDPKSKDTGLSRSYDKYLLLAKAKGED 611
Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
ESV Y V + TI G L RV+QV R D Y
Sbjct: 612 KEESVVYSVGSSGLDSIDTPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD--YNLGL 669
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
P E SE TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 670 AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726
Query: 701 ESSKKPVSSCTLYHDK 716
S K +S C LY DK
Sbjct: 727 --SGKWISGC-LYEDK 739
>gi|224135031|ref|XP_002321966.1| predicted protein [Populus trichocarpa]
gi|222868962|gb|EEF06093.1| predicted protein [Populus trichocarpa]
Length = 180
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/152 (55%), Positives = 100/152 (65%), Gaps = 22/152 (14%)
Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
TG+ EAIDGADGG DQGDIY V+CYE+GALEIFDVPNFN VF VDKFVSG+TH+VD++M
Sbjct: 4 TGISEAIDGADGGAHDQGDIYRVICYETGALEIFDVPNFNSVFIVDKFVSGKTHLVDSFM 63
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
E +D +N EE G GRKE +V L F ILT GTILC
Sbjct: 64 GEPPRDLTKGMN---EEVAGAGRKE------IVLL-------------FFGILTYGTILC 101
Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA 884
Y A LFEGP+ SK +DPVS S+ S++SA
Sbjct: 102 YHACLFEGPDGNSKLEDPVSAQNSVGDSSISA 133
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/50 (62%), Positives = 41/50 (82%)
Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
I + V +P+NQVLS + DQE GHQI+N NLSSV++HRT +V+E+EVRIL
Sbjct: 131 ISAFAVQRPVNQVLSSMADQEFGHQIENPNLSSVEIHRTDSVDEFEVRIL 180
>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1407
Score = 155 bits (392), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 142/537 (26%), Positives = 241/537 (44%), Gaps = 68/537 (12%)
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPC 934
S S S++ LR ++ S+ + Y + + + ++ G+ F G+ P
Sbjct: 831 SRSASDLWTKNLRWVKLSQQHVPRYIEDNGSEDPGFESTLVALDDVCGYSTVFQRGTTPA 890
Query: 935 WCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
+ +F+E P+ L + + T H C GF Y+ S L+ICQLP + Y +
Sbjct: 891 F--IFKEASSA-PRVIGLSGKPVKSLTSFHTSKCQRGFAYLDSTDTLRICQLPPQTHYGH 947
Query: 992 YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID----- 1046
+ +P+ + H +TY LY IV Q +Q+D
Sbjct: 948 LGWATRRMPMDSEVHALTYHP-SGLY--IVGT--------------GQTEDYQLDPTETY 990
Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1105
+++L DL ++E V++L+ W T + E L+++ + L + T
Sbjct: 991 HYDLPKEDLTFKPSIERGVVKLLDEKS----WTIIDTHILDPQEIVLSIKTLNLEVSEIT 1046
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISA 1160
+ + L+A+GT+ V GED+A +G + +F P T + E+KGA+SA
Sbjct: 1047 HQRKDLIAVGTSVVHGEDLATKGCIRIFEVITVVPQPDRPETNKRLKLIVKDEVKGAVSA 1106
Query: 1161 LASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDI 1214
++ L QG L++A G K ++ K GT L +AF D YV L + + +GD
Sbjct: 1107 ISELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CYVSDLKNLPGTGMLAMGDA 1164
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
++ ++F + E+ +++L A+ +L+ A +FL L LVV+D N+QI + P
Sbjct: 1165 YRGVWFTGYTEEPYKMSLFARSKHNLETIAVDFLPFDQQLHLVVADADMNLQILQFDPDN 1224
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFAL------- 1325
+S G +LL +A FH G H+ L L L S AA S+ + FA+
Sbjct: 1225 PKSEAGSRLLHKATFHTG-HLPTSLHLIHSHLKLPSATDFAATNSNPADAFAMDTSPNTT 1283
Query: 1326 -----------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
L T G++ + PL E ++RRL +L L +++ LNPR+FR
Sbjct: 1284 TDTPQQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLTAYLANTLDSACSLNPRAFR 1340
Score = 139 bits (350), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 168/697 (24%), Positives = 292/697 (41%), Gaps = 86/697 (12%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V + S+N+ E L + A L LV
Sbjct: 28 NLVVAKNSLLQIFELKSTTTEVTPGAGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+ L G V SLA + A N++ + +++++AF DAK+S++E+D + L S+H +E+P
Sbjct: 88 FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144
Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ----------- 208
+ W + +F + DP RC + + IL Q
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLVEDDYDSD 199
Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
G + G+ G SS V+ L +LD + H F+H Y EP I+
Sbjct: 200 ADGPKETKADQANGTNGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 259
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T ++ + S ++ K + S LP+D K++ +PSPIGG L+VG
Sbjct: 260 SRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLVG 319
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
N IH + +A+N +A + S +S ++ L+ L + L+
Sbjct: 320 GNEIIHVDQGGKTNGVAVNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVLN 379
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
G L++LT DGR V + + S + +G F+GS G+S+++
Sbjct: 380 NGRLLILTFTLDGRTVSGMTIQTVAADHGGHLLKSAASCTSNLGRGRLFIGSEDGESVML 439
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
+T L++ L+ + + + D D D+ N +++ +A+ +
Sbjct: 440 GWTG------LTNQLRRKLSNADLDG-EDDSEEEEIDDMEDDLYNDTAPTMHKITAAVSE 492
Query: 495 ESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL- 541
+A T++F + D L +I P+KD + G + ++ A G + EL
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRELH 552
Query: 542 ------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EART 587
ELP G+W V+ + + D+ A D +Y YL++S E T
Sbjct: 553 PISVATKELPSAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDGT 612
Query: 588 MVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
+V E D LTE + +G T+ G L +V+QV RI D +
Sbjct: 613 VVYEVNGDKLTETDKGDFEREEGTTLLVGILAAGTKVVQVMRTEVRIYDSELNLVHIQSM 672
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
E GS E + +++ S ADPY+L+ D S+++
Sbjct: 673 EEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708
>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
mansoni]
Length = 1958
Score = 155 bits (391), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 147/586 (25%), Positives = 255/586 (43%), Gaps = 106/586 (18%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +K + PT + NC + H + +D+ L + NLV+T
Sbjct: 15 AVFKHISPPTAVDNCLYCHLKH----------ISPPTAVDNCLYCHLTHPKLKNLVITRG 64
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
IEIY V+ S SGET+ V ++ N+ + + G
Sbjct: 65 GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 104
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
S++L+F +AK++V+ F+ LR S+H +E + +LK GR +F + P++
Sbjct: 105 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 157
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
++DP RC +LVY + +L + + + D + + + R + +
Sbjct: 158 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 217
Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+ +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 218 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 277
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
+ +P+IW +LP D +++VP PIGGV+V+ AN+I Y Q+ SC L LN YA +
Sbjct: 278 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 337
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
Q++P S + +D L L+ T++G+L LL++ + + V L K
Sbjct: 338 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 396
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
+V + + + F+GSR DS+L++ + G+ ++
Sbjct: 397 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 456
Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
S L + +E D + STKR +D + D
Sbjct: 457 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 516
Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
+ ++ LYG + + S + +SF V D L+N+GP+ + G
Sbjct: 517 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 562
Score = 152 bits (385), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 145/577 (25%), Positives = 245/577 (42%), Gaps = 55/577 (9%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ + + +G LEI+ +P+F ++ V F ++D + + ++ +
Sbjct: 1103 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1159
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
++I ++E+ + RP L + T I ++A L P+
Sbjct: 1160 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1207
Query: 873 TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
S L RLR R PL A R T P Q + F+NI
Sbjct: 1208 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1266
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
H+G F+ G P W +LRV P DG + +F L+ C+ GF+Y T +++
Sbjct: 1267 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1326
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
LP G +++ + + K I L P+ + Y E Y +V + +P V L +
Sbjct: 1327 TLPPGYSFNEHLGI-KWITLDPVPYYVQYHVESKTY-AVVGIHS-EPCKSVFRLNAEGNK 1383
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG------PWQTRATIPMQSSENALTV 1095
+ + V T++ Y +++ P+ PW IP E
Sbjct: 1384 EEDVLVRPKTCV----LPTLDYYSLQMYAPNLNANHRNKQPPW---LLIPNTLIEFEPWE 1436
Query: 1096 RVVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1148
V L ET LA+G GE++ RGR+L+ P +T
Sbjct: 1437 VVTCLITAQLASEETFHGTKDYLALGANLTYGEEIPVRGRILILDVIDVVPEPGQPLTRH 1496
Query: 1149 ----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1204
++ E KG ++AL S QGHL+ A G KI + T+L G+AF D+ LY+ +L
Sbjct: 1497 KLKIIHDGEQKGPVTALTSCQGHLISAIGQKIYIWTLKNTDLVGVAFVDSE-LYIHNLLC 1555
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
VKN +L D+ KS+ L ++ L+++++D S + + + F +DG L +VSDE N
Sbjct: 1556 VKNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNISREVYTSNFFVDGRRLGFMVSDELGN 1615
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1301
+ I+ Y P S G++L+ A+ + + T LR+
Sbjct: 1616 VTIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRV 1652
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 67/130 (51%), Gaps = 15/130 (11%)
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
T++ +G P + R ++ FG+ +GSI I P+ + + RL+ +K L+ + + G+
Sbjct: 1835 TTNPNSGVDP---EKFRQSIYFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGM 1891
Query: 1366 NPRSFRQFHSNGKAHRPGPD------SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
P+S + +RP P+ + D +L+ Y LP ++LEIA ++G + I+
Sbjct: 1892 PPKSCWSY------NRPQPELANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIM 1945
Query: 1420 SNLNDLALGT 1429
++ +L T
Sbjct: 1946 DDIAELIATT 1955
>gi|403178252|ref|XP_003336695.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375164075|gb|EFP92276.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1149
Score = 154 bits (388), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 170/357 (47%), Gaps = 18/357 (5%)
Query: 1076 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1133
G W T Q +E ++ V L +T + +GT + ED+AARG + +F
Sbjct: 800 GKWVTIDGYEFQQNEWVTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFE 859
Query: 1134 ----STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNG 1188
+ +N + Y +E K ++A+ ++ G+ L G K+ + E L
Sbjct: 860 IVVVNPAQNHRTYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLA 919
Query: 1189 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1248
+ F D P Y + I KNFILLGD K I ++++E+ +L L + L C +FL
Sbjct: 920 VGFLDIKP-YTTCMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL 978
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
+ L++V +D I+IF Y P ES GQKLL R+EF+ + +T ++
Sbjct: 979 VIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQF------G 1032
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
R A D+ F +LDGSI + P E ++RLQ +Q +L + H AGLNP+
Sbjct: 1033 KRLSA---KDEAKVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPK 1089
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
R N R I+D ELL + +L + +Q EIA G+ R +L NL +L
Sbjct: 1090 GHRTVR-NDLVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNL 1145
>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1513
Score = 153 bits (387), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 170/357 (47%), Gaps = 18/357 (5%)
Query: 1076 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1133
G W T Q +E ++ V L +T + +GT + ED+AARG + +F
Sbjct: 1164 GKWVTIDGYEFQQNEWVTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFE 1223
Query: 1134 ----STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNG 1188
+ +N + Y +E K ++A+ ++ G+ L G K+ + E L
Sbjct: 1224 IVVVNPAQNHRTYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLA 1283
Query: 1189 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1248
+ F D P Y + I KNFILLGD K I ++++E+ +L L + L C +FL
Sbjct: 1284 VGFLDIKP-YTTCMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL 1342
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
+ L++V +D I+IF Y P ES GQKLL R+EF+ + +T ++
Sbjct: 1343 VIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQF------G 1396
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
R A D+ F +LDGSI + P E ++RLQ +Q +L + H AGLNP+
Sbjct: 1397 KRLSA---KDEAKVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPK 1453
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
R N R I+D ELL + +L + +Q EIA G+ R +L NL +L
Sbjct: 1454 GHRTVR-NDLVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNL 1509
Score = 122 bits (307), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 173/771 (22%), Positives = 308/771 (39%), Gaps = 167/771 (21%)
Query: 48 SKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKE---SKNSGETKRRVLMDGISAASLELV 104
SK P+ NL+V + +++++ + + E+ E ++N E K + L +
Sbjct: 37 SKTRPRPITNLIVARSTLLQVFELCLVEDDQAENNHTRNHHELKNK-------NYKLFHL 89
Query: 105 CHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
C +RLHG V L L+ D ++++F+DAK+++LE+ +S L S+H FE
Sbjct: 90 CEHRLHGRVTGLQRLTTLDTQEDGL-DRLLVSFQDAKMTLLEWSNSAADLVPISLHTFEK 148
Query: 165 -PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
P+ R+ + ++VDP RC +L+ + +L Q L D+ G
Sbjct: 149 LPQITQGDLPRDFQGQ---LEVDPLSRCAVLLLPQATLAVLPFFQDQLDL----DSLGLS 201
Query: 224 GGFSARI------------ESSHVINLRD----------LDMKH--VKDFI---FVHGYI 256
GG + + SS +++ LD +H +K I F+ G+
Sbjct: 202 GGLKSALGSEQQRFQTFPYASSFILDFNQQLLNHLPPSSLDSQHRPIKSVIALKFLPGFS 261
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
EP + +L++ + TW+ R+ +T + L++ P+I NLP+DA+ L+A P
Sbjct: 262 EPTLAVLYQSQYTWSARLENHANTAALIVLTLDLGSNHFPIISHTTNLPYDAHGLVACPK 321
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR------------------- 357
+ GVLV+ A+ I + QS+ N V S ++PR
Sbjct: 322 ELAGVLVLCADMILHVDQSSKIIGLATNGWVKHTSELQIPRQDTVRLITPTNKISGHRST 381
Query: 358 --------------------------SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
V L+ A + + D A + +TG++ L
Sbjct: 382 TNKSDERPEDLEDGEEQDESGVPEGHEKLLVRLENAKIVFSRADRAFVFLRTGEVFSLQF 441
Query: 392 VYDGRVVQRLDLSKTN-PSVLTSDITTIGNSLFFLGSRLGDSLL-----------VQFTC 439
+ DGR + +L L K + S++ S + + N F+GS GDS L
Sbjct: 442 LRDGRTLTKLVLEKLDLLSIIPSTVLKVNNECLFVGSMAGDSALYILDHLRPRSSSDDDN 501
Query: 440 GSGTSMLSSGLKE-----------EFG-DIEADAPSTKRLRRSSSDALQD--MVNGEELS 485
G + SS + + +F DI D T +RR+ L D NG +
Sbjct: 502 DDGHQLPSSSIIQPDKAAKNQSSLDFDEDIYGDRTETDPVRRTDHSQLYDDRPSNGADDG 561
Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
G+ ++ E + + D + GP++DF+ +ATG+ +EL
Sbjct: 562 RPGAGAHLAEPFLR-----LGDVIQAHGPIRDFTM--------AATGVENMP----LELL 604
Query: 546 GCKGI-----WTVYH-----KSSRGHNADSSRMAAYDDEYHAYLII-----SLEARTMVL 590
C G TV+H + R + +S + + + L++ S E + + +
Sbjct: 605 ACTGTGDLGGLTVFHREIPLRKRRKLSFESPSASHINALFFTSLVVESGGLSEERKVVWM 664
Query: 591 ETADLLTEVT---ES-----VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
+ TE+ ES ++ F + +T+A FG++ V+QV ++ S
Sbjct: 665 GRSGPRTEIATYGESGELSLINTFPE-KTLAVSPFFGKQFVVQVTNTAIKLFTSSL---- 719
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
++ +L SI D YV+L G + GD + T+S
Sbjct: 720 -----EEAQVIQPEPAVKILRASIVDDYVMLETHCGLKLIYQGDHDSKTLS 765
>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
Length = 1305
Score = 153 bits (387), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 169/673 (25%), Positives = 300/673 (44%), Gaps = 98/673 (14%)
Query: 55 VPNLVVTAANVIEIY-----VVRVQE---EGSKESKNSGETKRRVLMDGISAASLELVCH 106
V NLVV +N +++Y V VQ +GS S +T+ L+L+
Sbjct: 37 VRNLVVARSNFLQVYEVLEEPVPVQSSVTDGSSASMREDQTR------------LQLLAE 84
Query: 107 YRLHGNVESLAILSQGGADNSRR--RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
+ HG V LA LS ++R+ R ++++F DAK++V+E+ D +H L SMH FE
Sbjct: 85 HVCHGIVTGLARLS---TLDTRQDGRHRLVISFRDAKMTVMEWSDQLHDLAPVSMHSFE- 140
Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGG 224
L +G + A +++VD RC +L+ + IL Q S L ED G
Sbjct: 141 -RLPQLSQG-DLGAFQAVLRVDQASRCVALLLPDNTLGILPFFQDLSEL---EDMTREGL 195
Query: 225 GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCM 282
S S I+L ++ +++V DF F+ G+ EP + IL +R+ TW GR+ +
Sbjct: 196 Q-SLPYAPSLTIDLSEIGPGIRNVVDFAFLPGFSEPTIAILFQRKPTWTGRIDFAKDITS 254
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALA 341
+ +++ + +P+I+ A LP+DA L P +GGV+++ AN+ +H S +A
Sbjct: 255 LVMVTLDIGSRNYPVIFEADGLPYDALSLSVCPRELGGVVILCANSLVHIDQSSKMTGIA 314
Query: 342 LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
+N + +L ++ R + + L+ A ++ VA+L T+TG+ L + DGR V +
Sbjct: 315 VNGWTSTLTDARLDSRPTLRLVLEGAQCAFVGQQVAVLCTRTGETFSLHLEKDGRNVSSM 374
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
D + + + I T+G + F+GS G S+L+++ SG DI
Sbjct: 375 DCRPRAVTCIPACIETVGAAYVFVGSAQGQSVLLRWASQSGAG----------ADILDIT 424
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSA-SNNTESAQ-----KTFSFAVRDSLVNIGPL 515
S L + SDA+ D LY +A ++N Q K + D+L G +
Sbjct: 425 ESGTGLVQ--SDAMDD-------DLYATAGAHNGNGHQIAPTGKDVQLELCDTLPGYGTI 475
Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
+ + D ++ + + S + G+ T++ D A D +
Sbjct: 476 RHIAV-----LDHTSASLDEPSLVACTGVQAMAGLTTIHRHVPSVRQVDLDLPTARDIRH 530
Query: 576 HAYLIISLEAR----------TMVLETADLLTEVTESVDYFVQGRT---------IAAGN 616
+ LE R ++ T + + ++D Q T +AAG+
Sbjct: 531 --IWTVGLEQRQKMGRGPITHQIICSTGS--SSMVYTLDQDTQAATLARKSAEVPLAAGS 586
Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
F R +V++V E R+ G +E+ G ++ + V+++DP+V + +
Sbjct: 587 FFSRSQVLEVTEDMLRLYSPD--------GQITTEAPHGQADA--IDVTVSDPFVAVLSA 636
Query: 677 DGSIRLLVGDPST 689
++ + GDP+T
Sbjct: 637 ARNVTVFFGDPTT 649
Score = 125 bits (314), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 128/529 (24%), Positives = 230/529 (43%), Gaps = 49/529 (9%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRE---RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
+R+ F + +G G F++GS P + + R RL P G AF +
Sbjct: 806 RRLVSFISTTGRSGVFITGSAPFYLLTDRAGIARLYRAPY---GRASAFGAFDPPSSTP- 861
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
+ V + G + L ++ PV V AT T A + +V+ V+
Sbjct: 862 -LLVLADGAMHTYDLSDQASLARELPVTHV----ATSKCFTSTAYHDSSHTLVAARVV-- 914
Query: 1029 LNQVLSLLIDQEVG-HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1087
N L D+ ++ + ++ S + R+ + +L P G W
Sbjct: 915 -NAPFELFDDEGAPVYRAPSEDMISPTVFRSC------LELLVP----GSWDCIDGHEFP 963
Query: 1088 SSENALTVRVVTLFNTTTKENETLLAIG-TAYVQGEDVAARGRVLLF--------STGRN 1138
+E+ L + TL + T I T +GED+ RG + +F +
Sbjct: 964 QNESILQLICATLPSATDPSGRARFVIASTCNNRGEDLQTRGGLYVFRISTTESTAASDQ 1023
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPL 1197
A ++ V++ +L+ + A+ + GH++ + G K+ + + + L + F D L
Sbjct: 1024 AQARSAKLSLVHADDLRHPVGAICEVNGHIIHSLGQKVFIKAFDSDQRLITVGFLDVG-L 1082
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
V ++ +KN +++GD YF++++E +L LL K+ D + +FL+ + L L+
Sbjct: 1083 DVSAMRSIKNLLIIGDSLTGTYFVAFQEDPFKLVLLGKEARKTDVYCVDFLVQENRLGLL 1142
Query: 1258 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPG 1316
+ ++ Y P +ES G++LL R E+H+G + L + L+T D +
Sbjct: 1143 SVSRKGLLRQLEYNPGNAESRAGERLLDRTEYHLGKQIIDSLSFAKRLSTDEDLRQSG-- 1200
Query: 1317 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
+L G DGS+ + P+ E+ +RRL L+++L +PH AGLNPR+FR N
Sbjct: 1201 -------VMLVGA-DGSLTWVTPVREVVYRRLALLERQLHRQLPHFAGLNPRAFRTAR-N 1251
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
RP ++D +LL+ Y L Q +A + + NL +L
Sbjct: 1252 DYYSRPLARGMLDGDLLAIYANLHASRQQSLASHINSDPDTLSVNLGNL 1300
>gi|12697776|dbj|BAB21613.1| polyadenylation specificity factor [Homo sapiens]
Length = 216
Score = 153 bits (386), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 81/212 (38%), Positives = 122/212 (57%), Gaps = 12/212 (5%)
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+
Sbjct: 2 KSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAK 61
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTL 1330
ES+ G +LL RA+FHVGAHV F R + GA G K N+ F TL
Sbjct: 62 ESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATL 114
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
DG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D
Sbjct: 115 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDG 174
Query: 1391 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 175 ELLNRYLYLSTMERSELAKKIGTTPDIILDDL 206
>gi|257215708|emb|CAX83006.1| Cleavage and polyadenylation specificity factor subunit 1
[Schistosoma japonicum]
Length = 462
Score = 152 bits (384), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 115/402 (28%), Positives = 197/402 (49%), Gaps = 55/402 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV+T + IEIY ++ S SGET+ + + S+ + N+ +
Sbjct: 41 NLVITRSGFIEIYNIK--------SSVSGETR----FNWVYGTSV--------YENIADI 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G S++L+F +AK++V+ F+ LR S+H +E + +LK GR +
Sbjct: 81 VSVRFAGDLLD----SLLLSFSEAKVAVMNFNPITFELRTLSLHNYE---FENLKSGRMN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG------------- 223
F + P++++DP RC +LVY + +L + + + D G
Sbjct: 134 FTKLPILRLDPYQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHIGVRNFLLWQQRATA 193
Query: 224 ---GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
F+ + +S + +V D F+HG+ EP +++L+E TWAGRVS + T
Sbjct: 194 PLLATFTTCLSTS-----TGEKINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDT 248
Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCA 339
C I ALS + + +P+IW +LP D ++ VP PIGGV+++ AN+I Y Q+ SC+
Sbjct: 249 CCIVALSFNLQKRTNPVIWFQESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCS 308
Query: 340 LALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD-- 394
L LN YA + Q++P S + +D L L+ T++G+L LL++ +
Sbjct: 309 LPLNCYAQISTNFPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQA 367
Query: 395 GRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
+ V L K +V + + + F+GSR DS+L++
Sbjct: 368 TQTVTSLLFHKVGHAVPPHCMVLLESKYLFIGSRFCDSVLMK 409
>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
Length = 1320
Score = 152 bits (383), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 131/550 (23%), Positives = 250/550 (45%), Gaps = 53/550 (9%)
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMV 938
SN + ++L + P +AY P G +R ++ +SG F++G P +
Sbjct: 787 SNFKLKKEKDLLITGAPDNAY-----PAGTSIERRLVYIPLVSGFSSIFVTGVVPYFITR 841
Query: 939 FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
R + + + +F + ++G I++ + +IC+LP YDN PV+KV
Sbjct: 842 TRHSIPRIFKFTKIAAQSFASFSDSKVSNGLIFLDNAKNARICELPRDFNYDNNLPVKKV 901
Query: 999 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVD 1054
P+ T +TY N Y +VS P N +D+E G + D + +S
Sbjct: 902 -PIGETVKSVTYHELSNTY--VVSTYREIPYNA-----LDEEGNPIAGLKKDKPSANSY- 952
Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NET 1110
+ ++++ P W T+ ++ +E A+TV+ + L ++TK +
Sbjct: 953 --------KGSLKLISPYN----WTVIETVELRDNEIAMTVKSMVLDIGSSTKRFKHRKE 1000
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALAS 1163
LL +GT + ED+ A G ++ G+ N + E +++ KGA++++
Sbjct: 1001 LLVVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHK--FKEYNTEDTKGAVTSMCE 1058
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
+ G L+A G KII+ + +AF D +YV N ++LGD KS++ +
Sbjct: 1059 VSGRFLVAQGQKIIVRDVQDDGVVPVAFLDTS-VYVSEAKSFGNLVILGDTLKSVWLAGF 1117
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+ ++ +L KD S+D EF+ + ++++ + + + P+ S GQ+L
Sbjct: 1118 DAEPFRMIMLGKDLQSVDVSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRL 1177
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
+ RA F+V + T ++M+ + + + ++ F + T+DGS + P++E
Sbjct: 1178 VHRASFNVSSSTTC---MRMVPKNEE-----INTQYSDVFQTVGSTIDGSFFTVFPVNEF 1229
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
T+RR+ +Q++L D H GLNPR R + G I+D +++ Y L +
Sbjct: 1230 TYRRMYIIQQQLTDKEYHYCGLNPRLNRFGGEAFDDSQTGVKPILDHQVIKRYAKLNEDR 1289
Query: 1404 QLEIAHQTGT 1413
+ IA + +
Sbjct: 1290 KQTIAQKVSS 1299
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 127/611 (20%), Positives = 237/611 (38%), Gaps = 68/611 (11%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L+ ++L+G V +L + +S D I++A + AK+S++ +D H + S+H
Sbjct: 52 LRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESLH 108
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-----G 215
+E E+ L+ V+P + + + L S G
Sbjct: 109 YYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDEG 164
Query: 216 DEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ GF + + SS +++ + L+ + + D F+H Y EP + IL + TW G
Sbjct: 165 ETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSSKAATWTG 224
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHY 331
+ ++I K + NLP D +L+ + P+ G L++G N IH
Sbjct: 225 LLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIHV 284
Query: 332 HSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVL 388
+ LA+N Y + +S + ++ ++ L+ L ND LLS TG L
Sbjct: 285 DNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLSTGSLFH 344
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
+ D + ++R L + +S D+T G F DS L+ +G S L
Sbjct: 345 INFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL- 397
Query: 448 SGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
+E + + + S+ +DM EEL +A Q +
Sbjct: 398 ---------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFKH 448
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT-------- 552
D L+N GP+ DF+ G A I+ N + G C I+
Sbjct: 449 DELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPIIR 508
Query: 553 ---VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+ + +R N ++ + DD I +E L++ D + +
Sbjct: 509 SSLTFSQVNRMWNINNKYLITSDDVNSKSEIFQIEKSYSRLKSKDFIND----------E 558
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TIA L + ++Q+ + + + + + +SF E + ++S ++ D
Sbjct: 559 MTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHDD 614
Query: 670 YVLLGMSDGSI 680
Y+++ + G +
Sbjct: 615 YLMIFFASGEV 625
>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1447
Score = 151 bits (381), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 135/588 (22%), Positives = 256/588 (43%), Gaps = 48/588 (8%)
Query: 872 STSRSLSVSNVSASRLRNLRFSRTP-LDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
STS +L S + ++ N ++ P + A + + + + N+ G+ F+ G
Sbjct: 879 STSPNLLSSTLQFLKIHNTHLAQAPDVSAEEQADETQQTSDKPMRAVSNLGGYSVVFMPG 938
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P + + + L L + + H C+ GFIY ++GI+++ Q P +T+
Sbjct: 939 GSPSFIVKSSKTLPKVLSLQGTGVRGLSSFHTEGCDRGFIYADTEGIVRVAQFPPTTTFA 998
Query: 991 NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
+ + + + H + Y + Y + S D E+ D+H
Sbjct: 999 DIGMALRKVEIGEDVHAVAYHSPLQTYVIGTST------------FTDFELPKD-DDHRR 1045
Query: 1051 S--SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF-NTTTKE 1107
S D+ ++E+ ++++ P W TI ++ E ++ + L + T E
Sbjct: 1046 SWQEEDIAFKPSIEKSSLKLISPVN----WSVIDTIELEPCEVITCIKTMNLVVSEVTNE 1101
Query: 1108 NETLLAIGTAYVQGEDVAARGRVLLFSTG---RNADNPQN------LVTEVYSKELKGAI 1158
+ LL +GTA +GED+A GR+ ++ D P+ + E ++ G +
Sbjct: 1102 RKPLLVVGTAITKGEDLATTGRLYVYDVVIVVPEPDRPETNKKLKLISAETITRGAGGPV 1161
Query: 1159 SALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLG 1212
+ L+ + QG +L+A G K ++ K GT L +AF D YV S+ + ++
Sbjct: 1162 TGLSEIGTQGFMLVAQGQKCMVRGLKEDGTNLP-VAFMDTN-CYVTSIKELPGTGLCVIA 1219
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D K ++F + E+ ++ L K ++ + L DG L +V +D N+ I Y P
Sbjct: 1220 DALKGVWFAGYTEEPYKMLLFGKSATRMEVLCADLLPDGKDLFIVAADADGNLHIMQYDP 1279
Query: 1273 KMSESWKGQKLLSRAEFHVGAH-------VTKFLRLQMLATSSDRTGAAPGSDK--TNRF 1323
+ +S +G LL R F +GAH + L L T+S + + + +
Sbjct: 1280 EHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPAIPSLHPLTTASSSSLSPSPQEDSPSPSQ 1339
Query: 1324 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1383
+LL + G+ ++PL E +RR +L L +++ H GLNPR++R + G
Sbjct: 1340 SLLLTSRTGTFALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPRAYR-VDKDANEGIVG 1398
Query: 1384 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+I+D +L + L + + E+A + G ++ L++L G F
Sbjct: 1399 GRTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDELSELRRGLEF 1446
Score = 129 bits (324), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 184/745 (24%), Positives = 307/745 (41%), Gaps = 153/745 (20%)
Query: 57 NLVVTAANVIEIYV-----VRVQEEGSKES---KNSGETKRRVL-MDGISAA-------- 99
NLVV A++++I+ V + E K+S K+ T R DG+ A+
Sbjct: 28 NLVVAKASLLQIFTTKTVSVDLDELSGKDSSTVKDVTSTDPRAHDEDGVEASFLGADSIL 87
Query: 100 ---------SLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
L L+ Y L G V SL I S+ G + ++++ F+DAK+S++E
Sbjct: 88 PRSELARTTKLVLIAEYNLSGTVTSLVRVKTISSKTGGE------ALLVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQGGSGLVGDED--------------TFGSGGGFSARIESSHVINLRDLDMKHV--KDFI 250
Q + D D + G + SS V+ L LD +
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKSAAENGILATPYGSSFVLRLSSLDPSLIFPIHLE 261
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F++ Y EP IL + + + H T M+ L I K I S LP+D
Sbjct: 262 FLYEYREPTFGILSSTMAPSSALLQERKDHLTYMVFTLDIHQ--KASTTILSVGGLPYDL 319
Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
+ ++ + P+GG L+VGAN IH + +A+N +A + L +S ++ L+
Sbjct: 320 FMIVPLAPPVGGALLVGANELIHIDQAGKANGVAVNMFAKQCTNFSLLDQSDLALRLEGC 379
Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLT---SDITTI 418
L +N L+ +GD+ +L+ DGR V L + + + +LT S ++++
Sbjct: 380 KIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGDILTGAASCVSSL 439
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
G F+GS + DS+++ ++ SG PS ++ R SS A+ D+
Sbjct: 440 GAGALFVGSEVSDSVILGWSRKSGQ------------------PSRRKSRLDSS-AIADV 480
Query: 479 ----------------VNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
+ G+ ++ +A+N T S K ++F++ DS+VNI P+ + ++
Sbjct: 481 DEAMLDEEDLEDDDDDLYGDGPTISPTAANVTASNSKAGDYTFSIHDSMVNIAPITNITF 540
Query: 521 G-----------LRINADAS------ATGISKQSNYELV------------ELPGCKGIW 551
G L++N S A G K + ++ ELP +GIW
Sbjct: 541 GEVALSSDKEEELKLNGVQSELQLLAAVGREKGGSLAVINRNIQPNVIGRFELPEARGIW 600
Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
T+ K + +G + + D +Y +I+S EA + E+A +
Sbjct: 601 TMSAKKPAPKGLQVNKEKTVIGGDYGVDAQYDRLMIVSKASEAEDAIDESAVYALTNAGF 660
Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E ++ G TI AG L RVIQV + R DG + Q L + E+G+
Sbjct: 661 EALSGTEFEPAAGSTIEAGTLGNGMRVIQVLKSEVRSYDGDLGLAQILPM--LDDETGA- 717
Query: 656 SENSTVLSVSIADPYVLLGMSDGSI 680
++S S ADP++LL D SI
Sbjct: 718 --EPKIISASFADPFLLLIRDDASI 740
>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
Length = 1342
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 121/501 (24%), Positives = 231/501 (46%), Gaps = 47/501 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
N + ++L + P +AY+ T +R+ F N++G F++G P +
Sbjct: 810 NFKLVKEKDLIITGAPDNAYSLGTTIE----RRLVYFPNVNGFTSIFVTGITPYYISKTT 865
Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIP 1000
+ + V+F + +G IY+ + +IC++P Y+N WP++K IP
Sbjct: 866 HSVPRIFKFTKLPAVSFAPYSDDKIKNGLIYLDNSKNARICEIPVDFNYENNWPIKK-IP 924
Query: 1001 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS--SVDLHRT 1058
+K + +TY N + V+ ++ +D+E G I + S S + ++
Sbjct: 925 IKESIKSVTYHELSNTF-------VISTYEEIPYDCLDEE-GKPIVGVDKSKPSANSYKG 976
Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAI 1114
Y ++++ P W TI + E + V+ + L ++TK+ + L+ I
Sbjct: 977 Y------IKLISPYN----WSVIDTIELVDGEIGMNVQSMVLDVGSSTKKFKNKKELIVI 1026
Query: 1115 GTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGH 1167
GT + ED++A G +F G+ N + E++ ++ KGA++++ + G
Sbjct: 1027 GTGKYRMEDLSANGSFKIFEIIDIIPEPGKPETNHK--FKEIHQEDTKGAVTSICEISGR 1084
Query: 1168 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
L++ G KII+ + +AF D +YV N ++LGD KSI+ + +
Sbjct: 1085 FLVSQGQKIIIRDLQDDGVVPVAFLDTS-VYVSEAKSFGNLLILGDSLKSIWLAGFDAEP 1143
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
++ +L KD SLD +F+I + ++++D + + Y P+ S GQ+L+ +A
Sbjct: 1144 FRMVMLGKDLQSLDVNCADFIIKDEEIFILIADNNSTLHLVKYDPEDPTSSNGQRLIHKA 1203
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1347
F++ + T + + P S T F + T+DGS + P++E ++RR
Sbjct: 1204 SFNINSTPT------CIRSIPKNEEINPSS--TEVFQSIGSTIDGSFYTVFPINEASYRR 1255
Query: 1348 LQSLQKKLVDSVPHVAGLNPR 1368
+ LQ+++ D H GLNPR
Sbjct: 1256 MYILQQQITDKEYHFCGLNPR 1276
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/414 (21%), Positives = 178/414 (42%), Gaps = 64/414 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L+V A V++++ + E +++ K L+LV ++LHG + +
Sbjct: 29 LIVGKATVLQVFEIITTETKTQQYK------------------LKLVEQFKLHGLITDIK 70
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ +NS+ D ++++ + AK+S++++D ++ + S+H +E+ E
Sbjct: 71 AIRT--VENSQL-DYLLVSSKGAKMSLIKWDHHLNSISTVSLHYYENSIQ---SSTYEKL 124
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMII-----------------LKASQGGSGLVGDEDTF 220
LV V+P C + L + + S G +++
Sbjct: 125 TTTDLV-VEPNNNCTCLRFKNLLTFLPFETLDEEEEDDDDDEEMNGSSGSDKKATNKENG 183
Query: 221 GSGGG-FSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
S G S ESS +I+ R LD + + D F++ Y EP + I+ + WAG +
Sbjct: 184 NSNGEEVSELFESSFMIDGRTLDSRIGDIIDMQFLYNYREPTIAIIFSKAHAWAGNLPKV 243
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSA 336
LS+ K + NLP D K++ +P P+ G L++G N IH +
Sbjct: 244 KDNINFIVLSLDLVTKASTTVLKIDNLPFDIDKIIPLPQPLNGSLLMGCNEIIHVDNGGI 303
Query: 337 SCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVY 393
+ LALN + S+ +S + +S +++L+ + ND L+ GD +
Sbjct: 304 TRRLALNQFTSSITTSLKNYHDQSDLNLKLENCSVKPIPNDNKVLMILNNGDFYYINFKI 363
Query: 394 DGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
DG+ +++ D+ T P +I T+ N+L F+ ++ G++ L++
Sbjct: 364 DGKTIKKFFVEKVSDLNYDDIQLTYP----GEIATLDNNLMFISNKNGNNPLLE 413
>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
Length = 1388
Score = 150 bits (380), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 130/495 (26%), Positives = 220/495 (44%), Gaps = 61/495 (12%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
+ +I G+ F G+ P + + L + + T H +C GF Y+
Sbjct: 870 LVALDDICGYSTVFQRGTTPAFILKEASSAPRVIGLSGKPVKSLTSFHTSSCQRGFAYLD 929
Query: 974 SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVL 1033
S L+ICQLP + Y + + +P+ + H +TY LY IV
Sbjct: 930 STDTLRICQLPPQTHYGHLGWATRRMPMDSEVHTLTYHP-PGLY--IVGT---------- 976
Query: 1034 SLLIDQEVGHQID-----NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1088
Q +Q+D +++L DL ++E +++L+ W T +
Sbjct: 977 ----GQAEDYQLDPTETYHYDLPKEDLTFKPSIERGVIKLLDEKS----WTIIDTHVLDP 1028
Query: 1089 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1147
E L+++ + L + T + + L+A+GT+ V GED+A +G + +F P T
Sbjct: 1029 QEVVLSIKTLNLEVSEITHQRKDLIAVGTSVVHGEDLATKGCIRIFEVITVVPQPDRPET 1088
Query: 1148 E-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLY 1198
+ E+KGA+SA++ L QG L++A G K ++ K GT L +AF D Y
Sbjct: 1089 NRRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CY 1146
Query: 1199 VVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
V L + + +GD ++ ++F + E+ +++L A+ +L+ A +FL L L
Sbjct: 1147 VSDLKNLPGTGMLAMGDAYRGVWFTGYTEEPYKMSLFARSKHNLETIAVDFLPFDQQLHL 1206
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAA 1314
VV+D N+QI + P + G +LL +A FH G H L L L S AA
Sbjct: 1207 VVADADMNLQILQFDPDNPKGEAGSRLLHKATFHTG-HFPTSLHLIHSHLKLPSATDFAA 1265
Query: 1315 PGSDKTNRFAL------------------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1356
++ + FA+ L T G++ + PL E ++RRL +L L
Sbjct: 1266 TNNNPADAFAMDTSPNTTTDTPQQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLSAYLA 1325
Query: 1357 DSVPHVAGLNPRSFR 1371
+++ LNPR+FR
Sbjct: 1326 NTLDSACSLNPRAFR 1340
Score = 135 bits (341), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 168/698 (24%), Positives = 290/698 (41%), Gaps = 88/698 (12%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V + S+N+ E L + A L LV
Sbjct: 28 NLVVAKNSLLQIFELKSTTTEVTPGSGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + + +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE--- 217
W + +F + DP RC + + IL Q LV D+
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQ--RDLVEDDYDS 198
Query: 218 -----------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
+ G SS V+ L +LD + H F+H Y EP I+
Sbjct: 199 DAEVPKETKADQANDTSGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVA 258
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
T ++ + S ++ K + S LP+D K++ +PSPIGG L+V
Sbjct: 259 ASRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLV 318
Query: 325 GAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G N IH + +ALN +A + S +S ++ L+ L + L+
Sbjct: 319 GRNEIIHVDQGGKTNGVALNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVL 378
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLL 434
G L++LT DGR V + + S + +G F+GS G+S++
Sbjct: 379 NNGRLLILTFTLDGRTVSGMTIQTVAADHGGHLVKSAASCTSNLGRGRLFIGSEDGESVM 438
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNN 493
+ +T L++ L+ + + + D D D+ N +++ +A+ +
Sbjct: 439 LGWTG------LTNQLRRKLSNADLDG-EDDSDEEEIDDMEDDLYNDTAPTMHKITAAVS 491
Query: 494 TESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL 541
+A T++F + D L +I P+KD + G + ++ A G + EL
Sbjct: 492 EPTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDREL 551
Query: 542 -------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EAR 586
ELP G+W V+ + + D+ A D +Y YL++S E
Sbjct: 552 HPISVATKELPLAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDG 611
Query: 587 TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
T+V E D LTE + +G T+ G L +V+QV RI D +
Sbjct: 612 TVVYEVNGDQLTETDKGDFEREEGTTLLVGVLAAGTKVVQVMRTEVRIYDSELNLVHIQS 671
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
E GS E + +++ S ADPY+L+ D S+++
Sbjct: 672 MEEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708
>gi|298715584|emb|CBJ28137.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 255
Score = 150 bits (379), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 83/244 (34%), Positives = 140/244 (57%), Gaps = 12/244 (4%)
Query: 1189 IAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1247
I F+D P +YV+SL+++K+ FIL+GD + S+ + W+E+ L L+KD F+ E+
Sbjct: 23 IGFHD-PRVYVMSLSVIKHKFILVGDAYGSVQLVVWREEDHSLTALSKDHEDCQVFSAEY 81
Query: 1248 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS 1307
LID +++VV+D ++N+++ YAP + S G KLL +++F++G+ V K R +
Sbjct: 82 LIDEPGMAIVVADGRRNVKVLQYAPNATNSRGGTKLLCQSDFYLGSRVGKLTRRRTRGNL 141
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1367
D GA R+ LL GTLDG +G + P+DE FRRL +LQ + +++ H NP
Sbjct: 142 RD--GA--------RYCLLAGTLDGGLGAVLPVDERVFRRLYALQGIMSNALGHNGAANP 191
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
R++R F +++D LL + L + Q ++ GTT ++++NL D+ L
Sbjct: 192 RAYRLFDHGPTFRYETKQNMLDGSLLWRFVGLDAKTQHDLTRAIGTTVDRVMANLLDIDL 251
Query: 1428 GTSF 1431
+ F
Sbjct: 252 ASLF 255
>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
Length = 1320
Score = 150 bits (378), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 130/549 (23%), Positives = 249/549 (45%), Gaps = 53/549 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +AY P G +R ++ +SG F++G P +
Sbjct: 788 NFKLKKEKDLLITGAPDNAY-----PAGTSIERRLVYIPLVSGFSSIFVTGVVPYFITRT 842
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
R + + + +F + ++G I++ + +IC+LP YDN PV+KV
Sbjct: 843 RHSIPRIFKFTKIAAQSFASFSDSKVSNGLIFLDNAKNARICELPRDFNYDNNLPVKKV- 901
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDL 1055
P+ T +TY N Y +VS P N +D+E G + D + +S
Sbjct: 902 PIGETVKSVTYHELSNTY--VVSTYREIPYNA-----LDEEGNPIAGLKKDKPSANSY-- 952
Query: 1056 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETL 1111
+ ++++ P W T+ ++ +E A+TV+ + L ++TK + L
Sbjct: 953 -------KGSLKLISPYN----WTVIETVELRDNEIAMTVKSMVLDIGSSTKRFKHRKEL 1001
Query: 1112 LAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASL 1164
L +GT + ED+ A G ++ G+ N + E +++ KGA++++ +
Sbjct: 1002 LVVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHK--FKEYNTEDTKGAVTSMCEV 1059
Query: 1165 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
G L+A G KII+ + +AF D +YV N ++LGD KS++ +
Sbjct: 1060 SGRFLVAQGQKIIVRDVQDDGVVPVAFLDTS-VYVSEAKSFGNLVILGDTLKSVWLAGFD 1118
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
+ ++ +L KD S+D EF+ + ++++ + + + P+ S GQ+L+
Sbjct: 1119 AEPFRMIMLGKDLQSVDVSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLV 1178
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
RA F+V + T ++M+ + + + ++ F + T+DGS + P++E T
Sbjct: 1179 HRASFNVSSSTTC---MRMVPKNEE-----INTQYSDVFQTVGSTIDGSFFTVFPVNEFT 1230
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
+RR+ +Q++L D H GLNPR R + G I+D +++ Y L + +
Sbjct: 1231 YRRMYIIQQQLTDKEYHYCGLNPRLNRFGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRK 1290
Query: 1405 LEIAHQTGT 1413
IA + +
Sbjct: 1291 QTIAQKVSS 1299
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 127/611 (20%), Positives = 237/611 (38%), Gaps = 68/611 (11%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L+ ++L+G V +L + +S D I++A + AK+S++ +D H + S+H
Sbjct: 52 LRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESLH 108
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-----G 215
+E E+ L+ V+P + + + L S G
Sbjct: 109 YYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDEG 164
Query: 216 DEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ GF + + SS +++ + L+ + + D F+H Y EP + IL + TW G
Sbjct: 165 ETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSLKAATWTG 224
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHY 331
+ ++I K + NLP D +L+ + P+ G L++G N IH
Sbjct: 225 LLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIHV 284
Query: 332 HSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVL 388
+ LA+N Y + +S + ++ ++ L+ L ND LLS TG L
Sbjct: 285 DNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLLTGSLFH 344
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
+ D + ++R L + +S D+T G F DS L+ +G S L
Sbjct: 345 INFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL- 397
Query: 448 SGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
+E + + + S+ +DM EEL +A Q +
Sbjct: 398 ---------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFKH 448
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT-------- 552
D L+N GP+ DF+ G A I+ N + G C I+
Sbjct: 449 DELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPIIR 508
Query: 553 ---VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+ + +R N ++ + DD I +E L++ D + +
Sbjct: 509 SSLTFSQVNRMWNINNKYLITSDDVNLKSEIFQIEKSYSRLKSKDFIND----------E 558
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TIA L + ++Q+ + + + + + +SF E + ++S ++ D
Sbjct: 559 MTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHDD 614
Query: 670 YVLLGMSDGSI 680
Y+++ + G +
Sbjct: 615 YLMIFFASGEV 625
>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
Length = 429
Score = 150 bits (378), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 200/392 (51%), Gaps = 33/392 (8%)
Query: 1061 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYV 1119
+E V++LEP W T + ++ +E + V+ L + T E + ++ +G++ V
Sbjct: 51 LERGTVKLLEPRN----WSTIDSHELEPAERITCIEVIRLEISELTHERKDMVVVGSSIV 106
Query: 1120 QGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-LKGAISALASL--QGHLLIAS 1172
+GED+ +G + +F P ++ ++++KE +KGA++AL+ + QG L++A
Sbjct: 107 KGEDIVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQ 166
Query: 1173 GPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGA 1228
G K ++ K G+ L +AF D YV L +K ++GD K ++F+ + E+
Sbjct: 167 GQKCMVRGLKEDGSLLP-VAFKDTQ-CYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPY 224
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
+L+L K+ +L +FL DG+ L ++V+D+ N+ + Y P+ S KG +LL R+
Sbjct: 225 KLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSV 284
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAP--------GSDKTNRFALLFGTLDGSIGCIAPL 1340
FH G F L RT ++P S +++ +L GS+ I PL
Sbjct: 285 FHTG----HFASTMTLLPHGARTPSSPVDEDAMDTDSPPPSKYQILMTFQTGSVAVITPL 340
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
E ++RRL +LQ +LV+++ H LNPR +R S+G + G ++D LL + +
Sbjct: 341 GEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGGQRG---MIDGNLLLRWLDMG 397
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
+ + EIA + G I +L L G ++L
Sbjct: 398 AQRKAEIAGRVGADVGAIRVDLEKLHGGLAYL 429
>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
Length = 1324
Score = 149 bits (375), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 174/735 (23%), Positives = 305/735 (41%), Gaps = 115/735 (15%)
Query: 739 IDGADGGP-LDQGDIYSVVCY------ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
D A+ GP + D Y+ Y ESGAL I VP+ F F G + + D
Sbjct: 661 FDAAELGPSVATSDDYAAPTYWALITTESGALYICTVPDLKIAFHCPSFGDGHSLVWDRL 720
Query: 792 MREALK---DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
+A+ D + + + G +E I ++ L + RP L A +D
Sbjct: 721 PNQAIPQAGDGADAPDEARNDDDAHGSEEYIVETLLIGLGQGQ------RPHLLARTSDH 774
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L Y+ + PV V +V+ + +R
Sbjct: 775 HLLMYEVF-------------PV-------VPSVTEASVR-------------------- 794
Query: 909 APCQRITIFKNISGHQGFFLSGSRPCW--CMVFRERLRVHPQLCDGSIVAFTVLHNVNCN 966
R+ F+NI+G G ++G RP C + + + P + ++ F LH +
Sbjct: 795 ----RLKPFQNIAGCDGVCVTGPRPLLVACGHQLKAITIVPLALEDAVKTFHPLHMDDVE 850
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
+GFIY T G L P G + ++ + L T +I + + L L++ P
Sbjct: 851 NGFIYFTKAGTLCCATAPDGLMLNRGVLARRAV-LGRTIQKIAFDLDSRLAALLLMEP-- 907
Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHR-TYTVEE-------YEVRILEPDRAGGPW 1078
+P E+ N++ S +L +Y +E +++++L P
Sbjct: 908 RP-----------ELKPSRGNNDPPSNELPNISYRPDEPKALTPFFQLQLLSPKSMKLLP 956
Query: 1079 QTRATIPMQSSENALT-VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1137
TR + + VR+ + N+T K+N +A+G ++G+ G V ++
Sbjct: 957 DTRIEYDLHHHVTSFAAVRLSSSLNSTGKQN--YIAVGVTLLEGQRATTTGFVDFYTVDV 1014
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIAS-----GPKIILHKWT-GTELNGIAF 1191
+ D + + + S + G +SA+ + L+A+ G KI + + G EL +A+
Sbjct: 1015 H-DGKETRLEKRASCKQPGCVSAMDCTEDGFLVAAVGQRLGSKIYVWNFQDGQELQPLAY 1073
Query: 1192 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW-KEQGAQ--------------LNLLAKD 1236
++A +Y + ++KN ++GD + L + +++G Q L + D
Sbjct: 1074 FEAG-IYTSCIRVIKNLAIVGDYESGVQLLRFSRQKGLQQMPVFRGTKHRFYSLVKVGAD 1132
Query: 1237 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1296
+C+ +F++ S L+++ D N+ Y ++ G+ L+ A FH+G ++
Sbjct: 1133 PHKSNCYCADFVVRESDLAMIYGDADGNLVALDYDADSPDTRGGRILVRSANFHLGTRLS 1192
Query: 1297 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1356
LRLQ A R + FG ++G G + PL E +RRL+ LQKKLV
Sbjct: 1193 AMLRLQ--AAPVVRAPGGLAEAQKCHVVHTFG-IEGQQGVVIPLHEAEYRRLEMLQKKLV 1249
Query: 1357 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1416
S +AGL+P FR F S+ R I+D LL Y L EQL++A Q G +
Sbjct: 1250 -SHSSLAGLHPFQFRAFKSSIWRPRSFAQGILDGALLRQYFCLGRREQLDVAEQLGVSAQ 1308
Query: 1417 QILSNLNDLALGTSF 1431
Q+ ++ AL +F
Sbjct: 1309 QLERDMAH-ALDAAF 1322
Score = 126 bits (316), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 142/291 (48%), Gaps = 29/291 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL +RL+G ++ ++ + RD+++L+F DAKIS ++F+ S L +
Sbjct: 34 LELAASFRLNGVATAMVAITL----PKQLRDTVVLSFADAKISAIQFEPSTRTLITQKLI 89
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
E E ++ + P+++ DP RC G LVYG +++I+ A
Sbjct: 90 NLEI-EAVYGSKVNADLP--PVLQADPLHRCIGALVYGCRLVIIPAH------------- 133
Query: 221 GSGGGFSARIESS-HVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
R VI+L L + K F F+ GY P ++LHE W GR +
Sbjct: 134 ----ALQPRTNVQFRVIDLEKLSSPLGQAKSFCFLTGYTTPTALLLHEPRPVWVGRHAVG 189
Query: 278 HHTCMISALS--ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
+C++SALS + TT P +W+ +LP D + L+ P P+GG L+V N + + +Q+
Sbjct: 190 RDSCVLSALSCELDTTDDFAPTVWAKDSLPSDCFALVPTPQPLGGALIVSPNMVLHTNQA 249
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
+S A+A+N A ++ S+ LD A T++ + A+ S ++G L
Sbjct: 250 SSSAVAVNAIAARATGYPHTTQAGLSLNLDNARVTFITSVDAIFSLQSGQL 300
>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
WM276]
gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
gattii WM276]
Length = 1431
Score = 147 bits (372), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 130/530 (24%), Positives = 228/530 (43%), Gaps = 49/530 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
I F NI G G F++G +P W + HP L ++ H F
Sbjct: 931 IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
I + IC LP D P + ++ T IT+ Y S+ V P
Sbjct: 987 IRIEDGSF--ICYLPPTLNTDFAIPCDRY-QMERTYTNITFDPTSAHYVGAASIEV--PF 1041
Query: 1030 NQVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1087
D+E Q+ D +L R+ T+E + + PW+
Sbjct: 1042 QAY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFD 1088
Query: 1088 SSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL- 1145
+E +++ V L + +A+GT + GED A RG +F +
Sbjct: 1089 QNEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGG 1148
Query: 1146 --------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPP 1196
+ + ++A+ + G+LL +GPK+ + + +L G+AF D
Sbjct: 1149 PGSVPGWKLVRRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGFDYDAQLMGLAFLDIQ- 1207
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
LY ++ + KNF+L+GD+ KS +F+S +E + ++KD + +FL+ ++
Sbjct: 1208 LYATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTF 1267
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1316
+ SD ++++ + P +S G++L+ R E+H G+ T + T+ + AP
Sbjct: 1268 ISSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTTEEE--FAPQ 1325
Query: 1317 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
+ +++ T DG++ + + + F+RLQ + +LV + HVAGLNPR+FR N
Sbjct: 1326 TQ------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-N 1378
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
RP I+D +LL+ + + P+ Q E+ Q GT + S+L L
Sbjct: 1379 DLLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428
Score = 109 bits (273), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 160/733 (21%), Positives = 298/733 (40%), Gaps = 121/733 (16%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
+ P + IG NLVV A + ++ +R + E K ++ E K+ V M+ +
Sbjct: 39 DTPDVKVIG---NLVVAGAEALRVFEIREESVPIIEKVKLEEDVAEGKKDVQMEEVGDGF 95
Query: 97 --------------SAASLELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDA 140
+ L L+ + L+G V LA + D D +I++F+DA
Sbjct: 96 FDDGHAERAPLKYQTTRRLYLLAQHELNGTVTGLAATRTLESAIDG---LDRLIVSFKDA 152
Query: 141 KISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ 200
K+++LE+ S + S+H +E ++ +S+ PL++ DP R + +
Sbjct: 153 KMALLEW--SRGDIATVSLHTYERCPQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDS 207
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEP 258
+ +L Q S L D G A S V++L D+ +K+++D +FV G+ P
Sbjct: 208 LAVLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFVPGFHSP 263
Query: 259 VMVILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+ +L TW+GR+ + K C+ I +S+ +PL+ S LP D+ L+A PS
Sbjct: 264 TIALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPS 322
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQN 374
+GG+++V + I + Q A A N S +S + +S S + L+ + ++
Sbjct: 323 ELGGIVLVTSTGIVHIDQGGRVAAACVNAWWSRITSLKCSMASVSQKLTLEGSRCVFVTP 382
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLD-LSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
LL + G + + +GR V ++ L K SD+ G+ F+GS GDS
Sbjct: 383 HDMLLILQNGAVHQVRFSMEGRAVGLIEVLDKGCVVPPPSDLIVTGDGAVFVGSAEGDSW 442
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
L + +R+ R+ + V+ +E LYG ++
Sbjct: 443 LAKVNV-----------------------VRQRVERAEEKKDEMEVDWDE-DLYGDINDA 478
Query: 494 T--ESAQKTF-----SFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSN 538
E AQ+ F + + D L +G + D +G+ + + +G S+ S
Sbjct: 479 ALDEKAQEQFGPAAITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNST 538
Query: 539 YELV-------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
+ + EL +G+W + G + + A +++S E
Sbjct: 539 FNVFRRGIPITKRRRFNELLNAEGVWFLSIDRQTGQ-----KFKDIPEAERATILLSSEG 593
Query: 586 ---RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
R L + ++ + G+T++A F R ++ V +LD +
Sbjct: 594 NATRVFALSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN----- 643
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
G + +++ SI+DP+ ++ +D S+ VGD TV+ + P E
Sbjct: 644 ---GKIIQTVCPRGDGPKIVNASISDPFAIIRRADDSVTFFVGDTVARTVA-EAPIVSEG 699
Query: 703 SKKPVSSCTLYHD 715
+ ++ D
Sbjct: 700 ESPVCQAVEVFTD 712
>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
98AG31]
Length = 1427
Score = 147 bits (371), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 173/384 (45%), Gaps = 45/384 (11%)
Query: 1076 GPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFS 1134
G W T Q +E ++++V+L + + + + GT + ED+AARG V +F
Sbjct: 1051 GSWDTIDGHEFQQNEWVTSMKLVSLDSKSKRSGRRDFIGAGTTCNRAEDLAARGGVYVFE 1110
Query: 1135 TGRNADNPQNL-----VTEVYSKELKGAISALASLQGHLLIASG-------PKIILHKWT 1182
+P++ + Y + K ++A+ L G+ + G P+ K++
Sbjct: 1111 VIEIVPDPKHPERNRGLRLRYHETTKACVTAVDGLNGYFIHTMGQKVDPGYPRSPTRKYS 1170
Query: 1183 GT---------------------ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
L + F D P Y +L ++KNFI+LGD K I +
Sbjct: 1171 DILADQIIAFYSKLYAKCFEQDERLLAVGFLDIRP-YTTTLKVLKNFIVLGDAVKGITLV 1229
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+++E+ +L L F L C +FL+ + LS+V SD I+IF Y P ES G
Sbjct: 1230 AFQEEPYKLIELGHTFVDLRCSTIDFLVLENKLSIVTSDLGGTIRIFEYNPTNIESQGGL 1289
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
KLL R EF + L +S + + LF LDGSI + P+
Sbjct: 1290 KLLCRTEFGTAGEMGSSLGFGKRLSSKEEAKS---------IGTLFAGLDGSISSLVPVK 1340
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E F+RLQ +Q +L+ + H AGLNPR FR N R I+D E++ + L L
Sbjct: 1341 EAVFKRLQIVQTRLIRHLDHFAGLNPRGFRTVR-NDLVSRAMNRGIIDGEIIERFGALKL 1399
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDL 1425
+EQ I G+ R+ IL NLN+L
Sbjct: 1400 DEQDSIGKLAGSDRNTILINLNNL 1423
Score = 104 bits (260), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 193/933 (20%), Positives = 350/933 (37%), Gaps = 171/933 (18%)
Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
V ++LHG V L ++ + D ++++F+DAKI++LE+ L S+H FE
Sbjct: 73 VLEHQLHGIVTGLQPITTIDT-HVDGLDRLLVSFKDAKITLLEWSHQQSDLVPISLHTFE 131
Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
+ F + ++ DPQ RC + + + +L Q + D +T S
Sbjct: 132 KLPQITQGDFPTIFDQ---LETDPQSRCAILKLPQSTIAVLPFFQENN---LDLETLFSN 185
Query: 224 GGFSA---RIES-----SHVINLRD------------------LDMKHVKDFIFVHGYIE 257
SA RI+S S +I+L +K + F F+ G+ +
Sbjct: 186 SNPSANNQRIQSFPYAPSFIIDLNQSQSFKSQTQTHSQTQTQQKSIKSIISFKFLPGFSQ 245
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + IL+ + TWAGR+ +C + +++ + +I+ NLP+ A+ ++A P
Sbjct: 246 PTLAILYTYQHTWAGRLENTTDSCSLIFITLDLSSNHFTIIFQIDNLPYHAHSIMACPKE 305
Query: 318 IGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP---------------RSSFSV 362
+GGVLV+ A+ I + QS+ N L + ++P V
Sbjct: 306 VGGVLVICADMILHIDQSSKLIGIATNGWSKLSTHLDVPTQQMVKIVTEDGQDQEERLKV 365
Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN-PSVLTSDITTIGNS 421
L+ + ++ D AL+ G + L + DGR + +L L K SV+ S I +
Sbjct: 366 RLENSKLVFVTIDRALMFLTDGQIFRLCLYQDGRTLIKLCLEKFPVVSVIPSVAVKISDH 425
Query: 422 LFFLGSRLGDSLLVQFTC------------------------GSGTSMLSSGLKEEFGDI 457
F+GS LGDS+++ G+ + + E +G
Sbjct: 426 SVFVGSMLGDSIVMGIEFEGEKEVEVVEEVEVEVEAEVVHQNGNEMEIDQAEEDEIYGKE 485
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
E D TK D + ++ + N + ++ S + DS+ GP++D
Sbjct: 486 EPDDKKTK-----DQDGIDSIIK----------ATNKKIHREIRSLRLHDSISGHGPIRD 530
Query: 518 FSYGLRINADASATGISKQSNYE-LVELPGCKGI-----WTVYHKS---SRGHNADSSRM 568
F+ +SK +E +E+ GC G T+++K + DS+
Sbjct: 531 FT-------------MSKIGGFEDSLEMVGCTGSGETGGLTIFYKEMPLMKRKKLDSTNE 577
Query: 569 A---------AYDDEYHA----YLIISLEARTMVLETADLLTEVTESVDY----FVQGRT 611
+ A++D + IS+ RT + E + D + T
Sbjct: 578 SMKITNLNSIAFNDPTGSPGCELAWISIHDRTKIFSMIKNPEEGNRTSDLKFMKTLNAST 637
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I F + +Q+ ++L + P +E ++ + ++ + Y+
Sbjct: 638 IYVAMFFDQTCFLQITSYEIKLLKVVGFGEVQVIRPIETE----NKKNKIIRAKVVQDYI 693
Query: 672 LLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
LL SD + L G + T+ +Q P KPV+ +L+ P + T+
Sbjct: 694 LLETSDHRVMLYKGQVDSLTIDRIQLPQL----SKPVTYASLFSAHLP-LYDHDDQTN-- 746
Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF-----VSGRT 785
G+G D P + V G L I +P VFTV +
Sbjct: 747 ---GIGLDNDEDAEKP------WLFVTDLGGVLHILSLPELEIVFTVKGIENLPDLLDED 797
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+ + A++ + + EE KEN + A +RP L+ L
Sbjct: 798 EDEEQQQQPAIEYEHEDGDVKMEEDEKVEPKENSSIQMIYGFVT---GAKVARPHLYVEL 854
Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF-SRTPLDAYTREE 904
+G + YQ + K DP ++ ++ +++ +F S P+ R
Sbjct: 855 NNGALAVYQISI----AYDRKPGDPSTSKPRRQALSIRLNKVLGYQFESSEPISNLDR-- 908
Query: 905 TPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
++ + K + G LSG P W +
Sbjct: 909 --------KVKVVKKNATFSGIHLSGLEPIWIV 933
>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. neoformans JEC21]
gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 1431
Score = 146 bits (368), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 130/530 (24%), Positives = 229/530 (43%), Gaps = 49/530 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
I F NI G G F++G +P W + HP L ++ H F
Sbjct: 931 IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
I + IC LP D P + +A + IT+ Y S+ V P
Sbjct: 987 IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTN-ITFDPTSAHYVGAASIEV--PF 1041
Query: 1030 NQVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1087
D+E Q+ D +L R+ T+E + + PW+
Sbjct: 1042 QAY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFD 1088
Query: 1088 SSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL- 1145
+E +++ V L + +A+GT + GED A RG +F +
Sbjct: 1089 QNEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGG 1148
Query: 1146 --------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPP 1196
+ + + ++A+ + G+LL +GPK+ + ++L G+AF D
Sbjct: 1149 PGSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQ- 1207
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
LY ++ + KNF+L+GD+ KS +F+S +E + ++KD + +FL+ ++
Sbjct: 1208 LYATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTF 1267
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1316
+ SD ++++ + P +S G++L+ R E+H G+ T + T+ + AP
Sbjct: 1268 ISSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTAEEE--FAPQ 1325
Query: 1317 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
+ +++ T DG++ + + + F+RLQ + +LV + HVAGLNPR+FR N
Sbjct: 1326 TQ------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-N 1378
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
RP I+D +LL+ + + P+ Q E+ Q GT + S+L L
Sbjct: 1379 DLLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428
Score = 112 bits (280), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
NLVV A V+ ++ +R + E K ++ E ++ V M+ +
Sbjct: 48 NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107
Query: 97 --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
+ L L+ + L+G + LA ++ D +I++F+DAK+++LE+ S +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +E ++ +S+ PL++ DP R + + + +L Q S L
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
D G A S V++L D+ + K+++D +F+ G+ P + +L TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277
Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
R+ + K C+ I +S+ +PL+ S LP D+ L+A PS +GG+++V + I
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336
Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
+ Q +A+C A + SL S + S + L+ + ++ LL + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394
Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ +GR V++ LD P SD+T G+ F+GS GDS L +
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
S K+E +++ D + L +DA D E L+G A+ +
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495
Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
+ D L +G + D +G+ IN I+K+ +
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
EL +G+W + G + + A +++S E L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
+ G+T++A F R +++V +LD + + Q + G G +
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
+++ SI+DP+V++ +D S+ VGD TV+ + P E + ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712
>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. grubii H99]
Length = 1431
Score = 144 bits (362), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 129/530 (24%), Positives = 228/530 (43%), Gaps = 49/530 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
I F NI G G F++G +P W + HP L ++ H F
Sbjct: 931 IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
I + IC LP D P + +A + IT+ Y S+ V P
Sbjct: 987 IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTN-ITFDPTSAHYVGAASIEV--PF 1041
Query: 1030 NQVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1087
D+E Q+ D +L R+ T+E + + PW+
Sbjct: 1042 QAY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWRVIDGYEFD 1088
Query: 1088 SSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL- 1145
+E +++ V L + +A+GT + GED A RG +F +
Sbjct: 1089 QNEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGG 1148
Query: 1146 --------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPP 1196
+ + + ++A+ + G+LL +GPK+ + +L G+AF D
Sbjct: 1149 PGSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDAQLMGLAFLDIQ- 1207
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
LY ++ + KNF+L+GD+ KS +F+S +E + ++KD + +FL+ ++
Sbjct: 1208 LYATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTF 1267
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1316
+ SD ++++ + P +S G++L+ + E+H G+ T + T+ + AP
Sbjct: 1268 ISSDRNGDMRMLDFDPTDPDSLNGERLMLKTEYHAGSAATVSKVIARRKTAEEE--FAPQ 1325
Query: 1317 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
+ +++ T DG++ + + + F+RLQ + +LV + HVAGLNPR+FR N
Sbjct: 1326 TQ------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-N 1378
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
RP I+D +LL+ + + P+ Q E+ Q GT + S+L L
Sbjct: 1379 DLLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428
Score = 111 bits (277), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 160/726 (22%), Positives = 301/726 (41%), Gaps = 107/726 (14%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
+ P + IG NLVV A V+ ++ +R + E +K ++ E ++ V M+ +
Sbjct: 39 DTPDVKVIG---NLVVAGAEVLRVFEIREESVPIIEKAKLEEDVAEGEKDVQMEEVGDGF 95
Query: 97 --------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKI 142
+ L L+ + L+G V LA ++ D +I++F+DAK+
Sbjct: 96 FDDGHAERAPLKYQTTRRLHLLTQHELNGTVTGLAA-TRTLESTIDGLDRLIVSFKDAKM 154
Query: 143 SVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
++LE+ S + S+H +E ++ +S+ PL++ DP R + + +
Sbjct: 155 ALLEW--SRGDIATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLWRLAVLTLPEDSLA 209
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+L Q S L D G A S V++L D+ +K+++D +F+ G+ P +
Sbjct: 210 VLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFLPGFHSPTI 265
Query: 261 VILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+L TW+GR+ + K C+ I +S+ +PL+ S LP D+ L+A PS +
Sbjct: 266 ALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSEL 324
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQNDV 376
GG+++V + I + Q A A N S +S + +S S + L+ + ++
Sbjct: 325 GGIVIVTSTGIVHVDQGGRVAAACVNAWWSRITSLKCSTASVSQKLTLEGSRCVFVTPHD 384
Query: 377 ALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
LL + G + + +GR V++ LD P SD+T G+ F+GS GDS
Sbjct: 385 MLLVLQNGAVHQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSW 442
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
L + + K+E +++ D + L +DA D E+ +G A+
Sbjct: 443 LAKVNVVRQVVERAEKKKDEM-EVDWD----EDLYGDINDAALDEKAQEQ---FGPAA-- 492
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSNYELV--- 542
+ + D L +G + D +G+ + + +G S+ S + +
Sbjct: 493 -------ITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTFNVFRRG 545
Query: 543 ----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMV 589
EL G+W + G + + A +++S E R
Sbjct: 546 IPITKRRRFNELLNADGVWFLPIDRQTGQ-----KFKDIPEAERATMLLSSEGNATRVFA 600
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L + ++ + G+T++A F R ++ V +LD + G
Sbjct: 601 LSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN--------GKII 647
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
+ +++ SI+DP+V++ +D S+ VGD TV + P E +
Sbjct: 648 QTVCPRGDGPKIVNASISDPFVIIRRADDSVTFFVGDTVARTVG-EAPIVSEGESPVCQA 706
Query: 710 CTLYHD 715
++ D
Sbjct: 707 VEIFTD 712
>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
Length = 1300
Score = 144 bits (362), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 128/528 (24%), Positives = 234/528 (44%), Gaps = 61/528 (11%)
Query: 918 KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+N+SG G F+SG P + + + + I++F N+ I++ +
Sbjct: 814 ENLSGLTGIFVSGDVPYYIVKTNHSIPRIFKFARIPIMSFGKF----ANNQLIFLDDKKN 869
Query: 978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
+IC++PS Y+N WP ++ I + T + Y N + ++S P N +
Sbjct: 870 TRICEIPSEFNYENNWPARQ-INIGETIKDVAYHETSNTF--VISTYKEIPYN-----CL 921
Query: 1038 DQE----VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENAL 1093
D+E VG D + S + ++++ P W + +E
Sbjct: 922 DEENVPIVGIMEDKPSALSY---------KGSIKLVSP----ISWTVIDEFELDDNEVGT 968
Query: 1094 TVRVVTL-FNTTT---KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QN 1144
V + L ++T K + IGT ++ ED+AA G + P +
Sbjct: 969 KVSSMVLDVGSSTRRFKSKREFVVIGTGKLRMEDLAANGSFKVLEIIDVIPEPGHPETNH 1028
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1204
E Y +E KGA++A++ + G L++ G KII+ + +AF D +YV
Sbjct: 1029 KFKEFYKEETKGAVTAVSDVSGRFLVSQGQKIIVRDLQDDGVVPVAFLDCS-VYVSESKS 1087
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
NF+LLGD KS++ + + ++ +L KD S+D +F++ L ++V D
Sbjct: 1088 YGNFVLLGDTLKSVWLAGFDAEPYRMIMLGKDLKSIDVNCADFIVKDEELYIIVGDNNNI 1147
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+ + Y P+ S GQ+L+ +A F++ A VT +L+ L D + + GS
Sbjct: 1148 LHLLKYDPEDPNSSNGQRLVEKAAFNLNAKVT---QLKQLPNLMDNSTSCIGS------- 1197
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAH 1380
T++GS + P++E ++RR+ LQ++L D H GLNPR R + +N +
Sbjct: 1198 ----TIEGSFFTVFPINESSYRRMYILQQQLTDKAYHHCGLNPRLNRFGGLKLTANESNN 1253
Query: 1381 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
+P I+D +++ Y L + + I + S+I ++ + G
Sbjct: 1254 KP----ILDYDVIKLYAKLNEDRRRNIGAKVSREGSEIWRDMLEFEAG 1297
Score = 106 bits (265), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/463 (22%), Positives = 201/463 (43%), Gaps = 45/463 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y+L G + S+ + + + D +++A + AKIS++ +D + H +R S+H
Sbjct: 51 LNLVDQYKLFGTITSIKPIR---TIENPKLDYLLVATQLAKISLVRWDHASHSIRTVSLH 107
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+E+ + + L+ V+P+ C V L + ++
Sbjct: 108 YYEN---VIQTSTFDKLNSAELI-VEPKNACLCVRYKNLLTFLPFTRLKTEEDEYADEED 163
Query: 221 GS-GGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G+ + +SS +IN ++LD + + D F+H Y +P + +L ++ WAG + +K
Sbjct: 164 GAVTNSYDGIYDSSFLINGQNLDSRIGTIVDADFLHNYRQPTVALLSSKDQVWAGNLFFK 223
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
LS+ K+ + +LP+D +L+++PSP+ G L+VGAN IH +
Sbjct: 224 KDNISYIVLSLDLNTKKSTTVLKIDDLPYDIDRLISLPSPLNGSLLVGANQLIHIDNGGI 283
Query: 337 SCALALNNYA--VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVY 393
+ +++N + + +S + S ++ L+ L N+ +L TG+ +LT
Sbjct: 284 TRKISVNPFTDLTTKNSKNYINYSHMNLRLENCSVVPLPNENKVLVILSTGEFYMLTFEI 343
Query: 394 DGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
DG+ ++RL S + N+L F+G++ G+S L+Q+
Sbjct: 344 DGKTIKRLTFEVVETSRYNGINVTFPGQFAALDNNLLFVGNKNGNSPLIQYKY------- 396
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
G KE+ ++ DA + D + S ++ F +
Sbjct: 397 -EGAKEK--AVKEDAKDEEDNDGDEELYEDDEEKVKSFS------------KEKLDFTLC 441
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
D L+N GP+ F++G N + I+ NY+ V + G
Sbjct: 442 DELINHGPISAFTFGFYSNEKFKSNLIN--PNYQEVSIFSNSG 482
>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
NRRL Y-27907]
Length = 1348
Score = 144 bits (362), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 133/592 (22%), Positives = 256/592 (43%), Gaps = 67/592 (11%)
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
H +L + G ++ Y+ Y F+G N + ++LR +
Sbjct: 783 HKEEYLTILTIGGEVIMYKLY-FDG-------------------ENYIFKKEKDLRITGA 822
Query: 896 PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
P +AY P G +R + F N++G+ F++G P M + Q
Sbjct: 823 PENAY-----PLGTTIERRLVYFPNLNGYTSIFVTGIIPYLIMKPMHSIPRIFQFSKIPA 877
Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
++ + + +G I++ + +IC+L TY+ WP+++ I + + ITY
Sbjct: 878 LSISAFSDSKIKNGLIFLDNSKNARICELSLDFTYEFNWPMRQ-IHIGDSIKSITYHETS 936
Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPD 1072
N Y +VS P + G D + +T Y+ ++++ P
Sbjct: 937 NTY--VVSTFREIPYD-----------GLDEDGKLIVGTLPDKTPRPVAYKGSIKMISPL 983
Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTT----KENETLLAIGTAYVQGEDVAARG 1128
W TI + +E A+ V+ + L ++ K + + IG+ + ED+ A G
Sbjct: 984 N----WTVIDTIELDDTEVAMNVQSMMLDVGSSMKKFKNKKEFIVIGSGKYRNEDLVANG 1039
Query: 1129 RVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+F P + EV+ ++ +GA++++ L G LLIA G K+I+
Sbjct: 1040 SFKIFEIVDIVPEPGKPETNHKFKEVFQEDTRGAVTSICGLSGRLLIAQGQKVIVRDVQD 1099
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+ +AF D +YV + N ++LGD KS + + + + ++ +L KD L+
Sbjct: 1100 DGVVPVAFLDTA-VYVSESKSLGNLLMLGDPLKSCWLVGFDAEPFRMIMLGKDLHHLNVS 1158
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1303
+F+ + ++++D + + Y P +S GQ+L+S++ F + + V+ +L
Sbjct: 1159 CGDFITKDEDIYMLIADNNNILHLIQYDPDDPQSLNGQRLISKSAFEIESTVSCMRKLPK 1218
Query: 1304 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1363
+ +S +++ + F ++ T DGS + P+DE ++RR+ LQ++L D H
Sbjct: 1219 IESSFEKSEIK--FSPIDEFQIIGSTSDGSFFNVFPVDESSYRRMYILQQQLTDKEYHYC 1276
Query: 1364 GLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
GLNPR R + N +P I+D L+ Y L + + +A +
Sbjct: 1277 GLNPRLNRFGGAIELRDNETNTKP----ILDFGLIKRYAQLNEDRKRNLASK 1324
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 100/500 (20%), Positives = 196/500 (39%), Gaps = 77/500 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L+V N+++I+ + ++ S +K L+++ ++L+G + L
Sbjct: 29 LIVAKGNLLQIFEPVLIKQQSTPTK--------------PKYKLQIIGQFKLNGLITDLH 74
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR--E 175
L +N D +I++ + AK S+++++ +H + S+H +E H R E
Sbjct: 75 PLRT--VENPHL-DYLIVSTKYAKFSIIKWNHHLHTISTVSLHYYE-----HAIRNSTFE 126
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE----------------DT 219
L+ V+P L + + L + D+ D
Sbjct: 127 KLGISELI-VEPTFNSCSCLRFKNLLCFLPFAVSDEEEEEDDEEDMDLDNKKEKKEKLDI 185
Query: 220 FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G + +SS +I+ + LD ++ V D F+H Y EP + IL + WAG +
Sbjct: 186 NGKPADAVSFYDSSFIIDAQTLDSSIETVVDIQFMHNYREPTIAILSSKSNVWAGNLLKV 245
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSA 336
+++ K ++ NLP++ +++ +PSP+ G L++G N I H +
Sbjct: 246 KDNVSFQVMTLDLVSKSTVSVFKIDNLPYEIDRIIPLPSPLNGCLLLGCNEIFHVDNGGI 305
Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
+A+N++ S S Q+ S S+E D + L+ TG +
Sbjct: 306 IKRIAVNSFTSLVTASTKSYQDQTDLSLSLE-DCCIIPIPGDHRVLMVLTTGQFFYINFE 364
Query: 393 YDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
DG+ ++++ + + ++ + ++ + ++L F + G+S LVQF
Sbjct: 365 LDGKAIKKVHIDTVDQALYSQIKLCYPGEVAVLDHNLLFFANENGNSPLVQF-------- 416
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ----KTF 501
+ D+ D + + +E LY N E Q
Sbjct: 417 -------RYTDV--DQKRITQEAAKEEKKEEKDDEEDEDDLYMDEENEEEQKQIISNSPI 467
Query: 502 SFAVRDSLVNIGPLKDFSYG 521
F D L+N GP+ F+ G
Sbjct: 468 EFIHHDELINNGPISSFTLG 487
>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
Length = 1620
Score = 143 bits (360), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 171/343 (49%), Gaps = 29/343 (8%)
Query: 1095 VRVVTLFNTTTKENET----LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVY 1150
+ VV N T +E+ +A+GT+ +GED+A RG +F P + +
Sbjct: 1291 INVVKSVNLETLSSESGFKDYIAVGTSTFRGEDLAVRGATYIFEVIEVVSYPDDPLPPYR 1350
Query: 1151 SK-----ELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNI 1204
K E K ++A+ L G+L+ + G K+ + + E L G+AF DA + V SL
Sbjct: 1351 LKLLCRDEAKAPVNAICGLNGYLVSSQGFKVFVRAFEQDERLVGVAFMDAG-VCVTSLTR 1409
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE--FLIDGSTLSLVVSDEQ 1262
+KN +L+GD +S+ F++++E +L + T+ FL D S++ +D++
Sbjct: 1410 LKNLLLIGDAKRSVSFVAFQEDPFKLR---------PTYVTDAAFLFDEGDFSILAADDE 1460
Query: 1263 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1322
+++F + P ++ + G L+ EF+ + T L + + R P +
Sbjct: 1461 GTLRLFEFDPNLTGATHGNPLICETEFNGQSEHTHILAI------AGRGREDPEEMQIPE 1514
Query: 1323 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
L+FGT+DG++G I+P+ + F+RLQ L +L+ SV H AGLNPR+FR N RP
Sbjct: 1515 AQLIFGTIDGTLGTISPVPDECFKRLQLLSGQLMRSVQHFAGLNPRAFRTVR-NDLLSRP 1573
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
++D +LL + L + Q I Q GT IL ++ L
Sbjct: 1574 LNKGMLDYDLLHAFRELDIRRQATITKQIGTDTITILRDIRSL 1616
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 162/667 (24%), Positives = 267/667 (40%), Gaps = 112/667 (16%)
Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L LV +R+HG V L + G D D ++++F+DAKI++LE+ D+I+ L S
Sbjct: 136 LHLVREHRMHGFVTGLEKVRTLASGEDGM---DRLLVSFKDAKIALLEWSDAIYDLSTVS 192
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-- 216
+H +E + E PL++ DP+ RC +L+ + IL Q + D
Sbjct: 193 LHTYERSSQVSTSEASE---HRPLLRADPESRCAALLLPKDALAILPFVQRTGLDLADPA 249
Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
D ++ S+V L D D ++HV DF F+ + P + IL++ W GR+
Sbjct: 250 RDKEREHQPYTP----SYVFPLSDADDTLRHVLDFCFLPSFHTPTLAILYQPAQNWTGRL 305
Query: 275 SWKHHTCMISALSISTTLK----------QHPLIWSAMNLPHDAYKLLAVP--SPIGGVL 322
S ++ +++ K +I LP+DA+ LL S GGV+
Sbjct: 306 SQTKDNTSLAIVTLDLVGKGAAAGGGAGGGGAVISRTHGLPYDAFSLLPAREGSTFGGVV 365
Query: 323 VVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--------LDAAHATWLQ 373
V+ N++ H LA + + S+ P +F+ E L+ + W
Sbjct: 366 VLAGNSVLHVDPAGRIVGLAASGWHAQ-SSALRFPLWAFTAEEGETEERKLEGSRLCWAG 424
Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLD----LSKTN-PSVLT-------SDITTIGNS 421
+L G L V +GR V L L +T+ P+VL + G
Sbjct: 425 EQQLILVGAQGWARELKVGVEGRNVSSLSAGRRLGRTSAPAVLCPVGEQSGRALKPTGRD 484
Query: 422 LFFLGSRLGDSLLVQFTCGSG--TSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQD 477
L +L S G S+L+Q G + +G ++E D+E DA S K +D L D
Sbjct: 485 LVWLASEAGQSVLLQVHKGEPRVEEVKPNGEEKEIEGEDMEIDADSDK------NDDLAD 538
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ L ++ A + V D+L G + D S+ L +G + +
Sbjct: 539 IYGDSGLPAAAASGVTAGPALPWLTLEVLDALQGHGQIADMSFALSFR-----SGPDRPT 593
Query: 538 NYELVELP-GCKGIWTVYHK--------------SSRG---------------HNADSSR 567
+ P G +G WTVY +RG +
Sbjct: 594 PKLVCSTPEGERGAWTVYENGLPIRVKRRVPAVAGTRGIWSLRVRRGDRARRGGRRERGE 653
Query: 568 MAAYDDEYHAYLIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
D E LI+S +A RT+ +++ L ++ + T+AAG F
Sbjct: 654 REWADGEERDNLIVSTDATPSPGISRTITVDSRGELQIISR-----LPALTLAAGVFFSH 708
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
V+QV +LDG ++L N E S ++ + DP+V++ +GS+
Sbjct: 709 TCVMQVTPDSLHLLDGD--GKELQVLKDNE---GNKEASPIIKACVEDPWVVVTRENGSV 763
Query: 681 RLLVGDP 687
L +GDP
Sbjct: 764 ALYLGDP 770
>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
Length = 1375
Score = 141 bits (355), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 132/521 (25%), Positives = 224/521 (42%), Gaps = 43/521 (8%)
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
P +S L N+ +L R D E + NI+G+
Sbjct: 836 PSRSSSDLWTHNLRWVKLSQQHVPRYMEDGAQEEAADEPGFESTLLALDNINGYSTVIQR 895
Query: 930 GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
G P + + L + + T H +C GF Y+ S L+I QLP + Y
Sbjct: 896 GRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHY 955
Query: 990 DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQID 1046
+ + +P+ A H + Y LY + P L P + L +E +
Sbjct: 956 GHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFHYELPKEETSFKPK 1014
Query: 1047 -NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT 1104
H + V +T+TV + +L+P E L ++ + L + T
Sbjct: 1015 VEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVILCIKTLNLEVSET 1055
Query: 1105 TKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAIS 1159
T + + ++A+GTA V GED+A +G + +F P + T + E+KG +S
Sbjct: 1056 THQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPDHPETNKRLKLIVKDEVKGTVS 1115
Query: 1160 ALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGD 1213
A++ L QG L++A G K ++ K GT L +AF D YV +L + N +L+GD
Sbjct: 1116 AISDLGTQGFLIMAQGQKSMVRGLKEDGTLLP-VAFMDMQ-CYVTTLKTLPNTGMLLMGD 1173
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+K +F + E+ ++ L + L+C +FL L ++V+D N+Q+ + P
Sbjct: 1174 AYKGAWFTGYTEEPYKMMLFGRSKHHLECITADFLPFEEQLHIIVADADMNLQVLQFDPD 1233
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGAAPGSDKTNRFALLFGTL 1330
+S G +LL ++ FH G + LQ + T+S+ T + S ++ +L +
Sbjct: 1234 HPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQ--ILCTSQ 1291
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
G++ I PL E ++RRL L L + GLN ++FR
Sbjct: 1292 SGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFR 1332
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)
Query: 57 NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NL+V ++++++ V V G E+ N+ E L + A L LV
Sbjct: 28 NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
W + +F + DP RC + + IL
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+A+Q G E G+ G + + SS V+ L +LD + H F+H Y EP
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
++ + T A ++ + + ++ K + S LP+D +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316
Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
L+VG+N IH + +A+N A + S ++ ++ L+ L D
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
++ G + +LT DGR V + + + ++L + T +G F+GS G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+S+L+ +T ++ +LRR S+ Q + E++S
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472
Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
N+T + K T++F V D L +I P++D
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532
Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
+ G + ++ A G N EL ELP G+W V+ K G
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592
Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
+A+++ A+ D +Y YL++S T+V E TE D+ +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
L +V+QV R D + + P E N +++ S ADPY+L+
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706
Query: 676 SDGSIRL 682
D S+++
Sbjct: 707 EDSSVKI 713
>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
Length = 1344
Score = 140 bits (353), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 133/608 (21%), Positives = 264/608 (43%), Gaps = 70/608 (11%)
Query: 824 VVELAMQRWSAHHSRPFLFAILT-DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
+ + HS+ ILT G ++ Y+ + F+G N
Sbjct: 774 IKNIVFNELGDEHSKDEYLTILTIGGEVIIYKLF-FDG-------------------DNF 813
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVFRE 941
+ ++L+ + P +AY P G +R ++ N++G+ F++G P +
Sbjct: 814 KFIKEKDLKITGAPDNAY-----PLGTTLERRLVYVPNVNGYSSIFVTGIIPYFITKTVH 868
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
+ + V+F+ + N +GFIY+ + ++C++P Y+N WP++K I +
Sbjct: 869 SVPRIFRFTKLPAVSFSSYSDSNIKNGFIYLDNSKNARMCEIPLDFNYENNWPIKK-IQM 927
Query: 1002 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061
T I Y N + V+ ++ +D+E G I +D +
Sbjct: 928 PETVKAIAYHELSNTF-------VVSSYEEIPYDCLDEE-GKPI-----VGIDKSKP-PA 973
Query: 1062 EEYE--VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL----FNTTTKENETLLAIG 1115
E Y+ +R++ P W TI + +E + V + L K + L+ +G
Sbjct: 974 ESYKGYLRLISPYN----WSVIDTIVLADNEIGMNVLSMVLDVGSSTKKFKSKKELIVLG 1029
Query: 1116 TAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
+ + ED+++ G +F G+ N + EV+ ++ +GA++++ + G L
Sbjct: 1030 SGKYRIEDLSSNGSFKIFEIIDIIPEPGKPETNHK--FKEVHIEDTRGAVTSICEVSGRL 1087
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
L+ G KII+ + +AF D +YV N ILLGD KS++ + +
Sbjct: 1088 LVTQGQKIIIRDLQDDGVVPVAFLDTA-VYVSEAKSFGNLILLGDSLKSVWLAGFDAEPF 1146
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
++ LL+KD +LD +F++ + ++ +D + + + P+ S GQ+L+ +
Sbjct: 1147 RMILLSKDIQTLDVSCADFIVKDEEIFILFADNNNVLHVVKFDPEDPLSSNGQRLVHKTS 1206
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1348
F++ + T F + + + + + T F + T+DGS + P++E T+RR+
Sbjct: 1207 FNINSAATCF---RTIPKNEENYPSL-----TTSFQSIGSTIDGSFFTVFPINESTYRRM 1258
Query: 1349 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
LQ++L D H+ GLNPR R N +++ +++ + L + + A
Sbjct: 1259 YILQQQLTDKEFHICGLNPRLNRFGGLNETNSDANSKPMLEYDVIKKFVNLNSDRKKNFA 1318
Query: 1409 HQTGTTRS 1416
+ G+ S
Sbjct: 1319 SKIGSKNS 1326
Score = 93.2 bits (230), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/502 (20%), Positives = 196/502 (39%), Gaps = 83/502 (16%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL- 116
LVV + +++++ + + SKE K L+LV ++LHG + L
Sbjct: 29 LVVGKSTLLQVFDIVQSNKKSKEYK------------------LKLVEQFKLHGLITDLK 70
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A+ + D D ++++ + AK+S++++D + + S+H +E+ E
Sbjct: 71 AVRTVENPD----LDYLLVSTKSAKMSLVKWDHHENSISTVSLHYYENSIQ---SSTYEK 123
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMII-------------LKASQGGSGLVGDEDTFGSG 223
L+ ++P C + L + + G SG G D +
Sbjct: 124 LTTTELI-MEPNNTCACLRFKNLLTFLPFEMPDEDDEEDGYENVDGASGSRGKHDNKATQ 182
Query: 224 GGFS-ARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
+ A SS VI+ ++LD + +V D F++ Y EP + I+ + TW G +
Sbjct: 183 QDENQALFYSSFVIDAQNLDSRIGNVIDMKFLYNYKEPTLAIISSKNHTWTGLLPLTKDN 242
Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSASCA 339
LS+ K + NLP D ++ +P P+ G L++G N IH +
Sbjct: 243 ISFIVLSLDLVTKTSTTVLKIDNLPFDIDTIVPLPKPLNGTLLIGCNEIIHVDHGGITRR 302
Query: 340 LALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGR 396
LA+N + S+ SS + R S +++L+ + ND L K GD + DG+
Sbjct: 303 LAVNQFTSSITSSIKNYRDQSELNLKLENCCVKPIPNDHRVFLILKNGDFYYINFAIDGK 362
Query: 397 VVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
++ L K N D+ + N+L F+ ++ G+S L++
Sbjct: 363 TIKNFYLEKVNSINQNEIGISYPEDVVHLDNNLMFICNKNGNSPLIELKF---------- 412
Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES----------AQK 499
+++ + + +QD NG ++
Sbjct: 413 ---------SESKDNQNAEQQKDTEMQDTENGTTDKNDNDDDDDIYEDDEDNEKVLIKNS 463
Query: 500 TFSFAVRDSLVNIGPLKDFSYG 521
F D L+N GP+ F++G
Sbjct: 464 VIEFTKHDELINNGPVSSFTFG 485
>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
Length = 1338
Score = 140 bits (352), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 124/474 (26%), Positives = 212/474 (44%), Gaps = 43/474 (9%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
NI+G+ G P + + L + + T H +C GF Y+ S
Sbjct: 846 LDNINGYSTVIQRGRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTD 905
Query: 977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVL 1033
L+I QLP + Y + + +P+ A H + Y LY + P L P +
Sbjct: 906 TLRISQLPPSTHYGHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFH 964
Query: 1034 SLLIDQEVGHQID-NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1092
L +E + H + V +T+TV + +L+P E
Sbjct: 965 YELPKEETSFKPKVEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVI 1005
Query: 1093 LTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1148
L ++ + L + TT + + ++A+GTA V GED+A +G + +F P + T
Sbjct: 1006 LCIKTLNLEVSETTHQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPDHPETNKRL 1065
Query: 1149 --VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSL 1202
+ E+KG +SA++ L QG L++A G K ++ K GT L +AF D YV +L
Sbjct: 1066 KLIVKDEVKGTVSAISDLGTQGFLIMAQGQKSMVRGLKEDGTLLP-VAFMDMQ-CYVTTL 1123
Query: 1203 NIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
+ N +L+GD +K +F + E+ ++ L + L+C +FL L ++V+D
Sbjct: 1124 KTLPNTGMLLMGDAYKGAWFTGYTEEPYKMMLFGRSKHHLECITADFLPFEEQLHIIVAD 1183
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGAAPGS 1317
N+Q+ + P +S G +LL ++ FH G + LQ + T+S+ T + S
Sbjct: 1184 ADMNLQVLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSS 1243
Query: 1318 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
++ +L + G++ I PL E ++RRL L L + GLN ++FR
Sbjct: 1244 LPLHQ--ILCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFR 1295
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)
Query: 57 NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NL+V ++++++ V V G E+ N+ E L + A L LV
Sbjct: 28 NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
W + +F + DP RC + + IL
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+A+Q G E G+ G + + SS V+ L +LD + H F+H Y EP
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
++ + T A ++ + + ++ K + S LP+D +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316
Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
L+VG+N IH + +A+N A + S ++ ++ L+ L D
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
++ G + +LT DGR V + + + ++L + T +G F+GS G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+S+L+ +T ++ +LRR S+ Q + E++S
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472
Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
N+T + K T++F V D L +I P++D
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532
Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
+ G + ++ A G N EL ELP G+W V+ K G
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592
Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
+A+++ A+ D +Y YL++S T+V E TE D+ +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
L +V+QV R D + + P E N +++ S ADPY+L+
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706
Query: 676 SDGSIRL 682
D S+++
Sbjct: 707 EDSSVKI 713
>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
Length = 1349
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 171/356 (48%), Gaps = 35/356 (9%)
Query: 1078 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
W ++ E A+ ++ + L + TKE L A+GTA ++GED+ RGR+L+F
Sbjct: 985 WNVIDEFELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGRILVFDVV 1044
Query: 1137 RNADNPQNLVTE-----VYSKEL-KGAISALASLQGHLLIASGPKIILHKWTGTELNGIA 1190
P T+ + +E+ +GA+++L + G L +W +A
Sbjct: 1045 HVIPQPDRPETDRKLKLIAKEEIPRGAVTSLC----EKCMVRG----LRRWHAA---AVA 1093
Query: 1191 FYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1248
D YVV+++ ++N + L+ D + ++F+ + E+ ++ L K L C +FL
Sbjct: 1094 LPDLS-TYVVAVHELRNTGYCLMADANMGVWFVGYSEEPYRMTLFGKSGTQLKCLTADFL 1152
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQMLAT- 1306
+ G+ LS+V SDE + I + P+ S +G LL+RA F V H L L T
Sbjct: 1153 VAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWVTLALPRTTTR 1212
Query: 1307 -----SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1361
S T AA ++T LL + G+I + P+ E +RRL SL L +++PH
Sbjct: 1213 PYLPQSEPATNAAGSQNRTQ--TLLLASASGAIASLNPITEHAYRRLTSLTTSLANALPH 1270
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1412
AG+NP++ R +G A P D +IVD LL+ + L ++ E A + G
Sbjct: 1271 AAGMNPKAHRLPPQDGAARPPAVDVSAGRTIVDGALLARWNELGARQRAEAAGKGG 1326
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 117/507 (23%), Positives = 199/507 (39%), Gaps = 75/507 (14%)
Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S V+ L LD + H F F+H Y EP + I+ H T + ++
Sbjct: 197 SFVLALPQLDPEILHPVHFAFLHEYREPTLGIISSSNRRLKMEPQMDHFTFKV--FTVDL 254
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
K I + NLP K++A+ P+GG L++G N IH + +A+N YA +
Sbjct: 255 LQKASTAILTVSNLPQSLKKVVALSKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKM 314
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL---- 403
+S + L+ + D LL T+ G++ ++T DGR V + +
Sbjct: 315 TKFPLADQSELKLRLEHCEVELMSPDNGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVA 374
Query: 404 SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
++ +L + +T + + F G+ GDS ++ + S + ++ K D +
Sbjct: 375 TENGGDILPFRAACLTKVSKNSMFYGTIGGDSQVIGW---SRQHVQTARKKARLLD---E 428
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ D D + GE ++ + F V DSL+++ P+ D +Y
Sbjct: 429 SLDYDLDEDELDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMAY 488
Query: 521 G----------------LRINAD-ASATGISKQSNYELV------------ELPGCKGIW 551
G +R D A G + + L+ E P +G W
Sbjct: 489 GKTAFFPGSEEAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFEFPEARGFW 548
Query: 552 TV-----YHKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTE 601
T KS +G + +A +D +Y ++I++ + ET+D+
Sbjct: 549 TTRVQKTIAKSLQGEKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGA 605
Query: 602 SVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
+ G TI AG + R+IQV R DG ++Q L + E+G
Sbjct: 606 GFEALSGTEFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETG 663
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSI 680
+ V+S SI DPY+LL D SI
Sbjct: 664 A---EPRVISASIVDPYLLLLREDSSI 687
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 30/138 (21%)
Query: 57 NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
NL+V+ ++++I+ V+ + +K S +GET R + D G+ +A
Sbjct: 28 NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSSSKAGETYDRRINDDDGLESAFLGGDGML 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
L LV Y +HG + LA + +SR +++++ A++S+L++D
Sbjct: 88 MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144
Query: 150 SIHGLRITSMHCFESPEW 167
HG+ S+H +E EW
Sbjct: 145 EKHGVEDVSIHFYEKEEW 162
>gi|33411764|emb|CAD58787.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 180
Score = 137 bits (346), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 72/184 (39%), Positives = 105/184 (57%), Gaps = 12/184 (6%)
Query: 1236 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1295
D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+FHVGAHV
Sbjct: 1 DAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHV 60
Query: 1296 TKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQS 1350
F R + GAA G K N+ F TLDG IG + P+ E T+RRL
Sbjct: 61 NTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLM 113
Query: 1351 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+ E+A +
Sbjct: 114 LQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKK 173
Query: 1411 TGTT 1414
GTT
Sbjct: 174 IGTT 177
>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Ovis aries]
Length = 819
Score = 137 bits (345), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 117/398 (29%), Positives = 179/398 (44%), Gaps = 82/398 (20%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAK--------------ISVLEFDDSIHGLRITSMHCF 162
A + GA +RD+++L+F+DAK + FD + + TSM
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 140
Query: 163 ESPEWLHL-----------------------------------KRGRESFARGPLVKVD- 186
E P +L L K+ R G +V
Sbjct: 141 E-PGYLFLGSRLGNSLLLKYTEKLQEPPASTTREAADKEEPPSKKKRVDATTGWAGRVRE 199
Query: 187 ---PQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSH---VINLRD 240
PQ + VYG + +Q G+ L T+ F R S +I++R
Sbjct: 200 GELPQDEVDEIEVYGSE------AQSGTQLA----TYS----FEVRWGSEWLPGIIDVRA 245
Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K HP+I
Sbjct: 246 LDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVI 305
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR 357
WS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ + +
Sbjct: 306 WSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQ 365
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
+ LD A A ++ D ++S K G++ +LT++ DG
Sbjct: 366 EGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDG 403
Score = 107 bits (268), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 94/340 (27%), Positives = 148/340 (43%), Gaps = 32/340 (9%)
Query: 907 HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
+G R+ + K H F+ G P W +V R LR+HP DG I +F HN+NC
Sbjct: 486 YGGRHHRLALHKPPLHH--VFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINC 543
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
GF+Y QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S
Sbjct: 544 PRGFLYFNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST 602
Query: 1026 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIP 1085
P +V + G + + + + + E + ++++ P W+
Sbjct: 603 --PCTRVPRM-----TGEEKEFETIERDERYVHPQQEAFCIQLISPVS----WEAIPNAR 651
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1145
++ E ++ + + G+ +GE+V RGR+L+ P
Sbjct: 652 IELEEXXXXX------XXGSRGHVYSVPAGSCLKEGEEVTCRGRILIMDVIEVVPEPGQP 705
Query: 1146 VTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTG-TELNGIAFYDAPPLYV 1199
+T+ +Y KE KG ++AL GHL+ A G K LN AF
Sbjct: 706 LTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKXXXXXLPPHAGLNPRAFRMLHVDRR 765
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1239
V N V+N +L G++ +LS E+G LAK G+
Sbjct: 766 VLQNAVRN-VLDGELLNRYLYLSTMERGE----LAKKIGT 800
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 76/160 (47%), Gaps = 22/160 (13%)
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDL-VLLTVVYDG-RVVQRLDLSKTNPSVLTSDI 415
S SV+L A + D LLS K +LT++ DG R V+ K SVLT+ +
Sbjct: 83 SMASVQLAGA-----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSM 137
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL-------- 467
T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 138 VTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTTREAADKEEPPSKKKRVDATTGWAG 195
Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR 506
R + QD V+ E+ +YGS A + T+ A T+SF VR
Sbjct: 196 RVREGELPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVR 231
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 40/71 (56%)
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
QK +P AGLNPR+FR H + + + +++D ELL+ Y L E+ E+A +
Sbjct: 739 QKXXXXXLPPHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKI 798
Query: 1412 GTTRSQILSNL 1422
GTT IL +L
Sbjct: 799 GTTPDIILDDL 809
>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
Length = 1363
Score = 134 bits (338), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 136/556 (24%), Positives = 248/556 (44%), Gaps = 71/556 (12%)
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
+L + P +AY+ HG +R I F ++SG ++G P M+ R R H
Sbjct: 837 DLPITGAPFNAYS-----HGTSIERRMIYFPDVSGTTCIMVTGVIPY--MITRSR---HS 886
Query: 948 QL-----CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLK 1002
Q+ IV+F +G IY+ ++ +I +LPS +YD WP++KV +
Sbjct: 887 QVKVFKFSKIPIVSFVPFSTDKIKNGLIYLDTKKNARIVELPSEFSYDYNWPIRKV-SIG 945
Query: 1003 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRT 1058
T + + N L+VS P N ID+E VG I + S++ +
Sbjct: 946 ETVKSVAFHEGSNT--LVVSTLKEIPYN-----CIDEEGNPIVG--IKPNKPSAISYKGS 996
Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAI 1114
++++ P W I + +E L V+ + L + TK + + +
Sbjct: 997 -------IKLISPVN----WSVIDNIELADNEVGLHVKSMPLDVGSETKRFKSKKEFVLV 1045
Query: 1115 GTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGH 1167
GT + ED+A G L G+ N + E ++ +GA++++ + G
Sbjct: 1046 GTGKYRLEDLACNGSYKLLEIIDIIPEPGKPETNHK--FKEFTQEDTRGAVTSICEVSGR 1103
Query: 1168 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
L+A G KII+ +AF D ++V N ++LGD KS++ + +
Sbjct: 1104 FLVAQGQKIIVRDIKDNSAVSVAFLDTS-VFVSESKSFGNLVVLGDTLKSVWLAGFDAEP 1162
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
++ +L KD LD + +FL+ + ++V+D +++ + Y P+ S GQ+LL ++
Sbjct: 1163 FRMIMLGKDLQGLDVSSADFLVKDEEIYILVADNNRSLHVLQYNPEDPASSNGQRLLHKS 1222
Query: 1288 EFHVGAHVT---KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
F T + + L+T D A P F + T++G++ + P+ E T
Sbjct: 1223 SFTTNYLTTCTKSVPKHEQLSTWFD-PQAIP-------FQTVGSTVEGAMYVVFPISEPT 1274
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
+RR+ +Q++L+D H GLNPR R + S A+ +++DCEL+ + L +
Sbjct: 1275 YRRMYIMQQQLIDKEYHHCGLNPRLNRIGRIESVNYANL---RAMLDCELIRRFSKLNED 1331
Query: 1403 EQLEIAHQTGTTRSQI 1418
+ ++ + T Q+
Sbjct: 1332 RKRTLSSKISTKNVQV 1347
Score = 86.7 bits (213), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 110/220 (50%), Gaps = 15/220 (6%)
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
SS ++ LD K + D F+H Y +P + +L +++ TWAG + + S LS+
Sbjct: 224 SSFILEASALDNKIGDIIDLQFLHHYRQPTIAVLSQQKSTWAGLLPQTKDNVIFSVLSLD 283
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVS 348
+ + NLP+D K++A+PSP+ G L++G N IH + + +A+N Y
Sbjct: 284 MQTRLTTTVLQIENLPYDLEKIIALPSPLNGSLLIGCNELIHVDTGGITRRIAVNQYTED 343
Query: 349 LDSSQE--LPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVYDGRVVQRLDLSK 405
+ +S + ++S ++L+ + ND LL +TG++ + DG+ ++R+ + +
Sbjct: 344 ITASLKNYADQTSLDLKLEDCSILPIPNDNKLLMVLRTGEMYFIVFEVDGKTIKRMSVEE 403
Query: 406 TNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLVQF 437
PS S I ++ N+L FL R +S LV+
Sbjct: 404 I-PSETYSQIKLMDPSSFASLDNNLLFLTGRSSNSHLVEL 442
>gi|147772179|emb|CAN73417.1| hypothetical protein VITISV_017053 [Vitis vinifera]
Length = 609
Score = 134 bits (337), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 69/122 (56%), Positives = 80/122 (65%), Gaps = 26/122 (21%)
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
F V DSL+N+GPLK F+Y LRINAD ATGI KQSN+EL
Sbjct: 430 FEVNDSLINVGPLKVFAYALRINADLKATGIVKQSNFELMCCSGHGKNGALCILQQSIRP 489
Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
VEL GC+ IWTVYHK++RGHNADS++M DDEY AYLIIS E+RTMVLET +LL
Sbjct: 490 EMITEVELSGCERIWTVYHKNTRGHNADSTKMVTKDDEYCAYLIISPESRTMVLETVELL 549
Query: 597 TE 598
E
Sbjct: 550 GE 551
>gi|68471460|ref|XP_720278.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
gi|46442138|gb|EAL01430.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
Length = 758
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 243/563 (43%), Gaps = 54/563 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 193 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 247
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
+ Q + ++ + + +G I++ +Q +IC+LP Y+ P++ V
Sbjct: 248 VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHV- 306
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI--------DNHNLS 1051
+ + I Y + VL Q+ +D+E G I D +S
Sbjct: 307 DIGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEE-GKPIAGIIKDIKDTPAMS 358
Query: 1052 ---SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1108
S+ L Y E LE + G ++ S + L +L K+
Sbjct: 359 FKGSIKLVSPYNWTVIETIELEDNEVGMTLKSMILDVGSESGSTLGSDPNSLIKKYNKKK 418
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISAL 1161
+ IG + ED+AA G ++ G+ N + E++ +E +GAI+++
Sbjct: 419 REYIPIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEETRGAITSI 476
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
L G L++ G K+I+ +AF D P +YV N ++LGD+ K + +
Sbjct: 477 CELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDLLKGCWLV 535
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+ + ++ +L KD + +F+I+ + ++V+D + + Y P +S G
Sbjct: 536 GFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGT 595
Query: 1282 KLLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNRFALLFGTLD 1331
KLL++A F + + ++ L ++ S +T A P + +N F ++ T D
Sbjct: 596 KLLTKASFELNSTISCLRSLPLIDIEESVQTDALTNIAVPPPLPPNTTSNYFQVIGSTQD 655
Query: 1332 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSI 1387
GS + P++E +RR+ LQ++L+D H GLNPR R + +N +P I
Sbjct: 656 GSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----I 711
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQ 1410
+D +L+ + L + + +A++
Sbjct: 712 LDYDLIRSFTKLSDDRKRNLANK 734
>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
Length = 1212
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/287 (33%), Positives = 143/287 (49%), Gaps = 23/287 (8%)
Query: 1121 GEDVAARGRVLLF-------STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1173
GED +++G + +F S G A + L + ++E++ ++AL L G L+ A G
Sbjct: 902 GEDRSSKGHMYVFDVVECVPSEGMAASDALRL-QLLCTEEMRAPVTALHDLNGFLVAAVG 960
Query: 1174 PKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
K+++ W E L +AF D +Y S+ VKNF+LL D ++S YF++++E A+L L
Sbjct: 961 QKLLIRSWEYCEWLVTVAFLDMG-MYTTSIQRVKNFLLLTDYYQSAYFVAFQEDPARLVL 1019
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1292
L +D+ FLID + LS+V D +++ Y P S GQ+LL+R E+H
Sbjct: 1020 LGRDYIPTSVTCGAFLIDRARLSIVTCDMNGCLRLMDYHPSNPTSLGGQRLLARCEYHAP 1079
Query: 1293 AHVT--KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1350
V + L LATS G T+ L +G++ + P+ E F LQ
Sbjct: 1080 GEVVRARMLHGPYLATS--------GECLTSEIVL--AKRNGAVDVLVPVTEKIFPTLQL 1129
Query: 1351 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
Q +LV V H AGLNPR FR N RP I+D LL E
Sbjct: 1130 FQSQLVRMVRHTAGLNPRGFRAVF-NQHISRPLAKGILDGTLLHTAE 1175
Score = 120 bits (302), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 152/591 (25%), Positives = 264/591 (44%), Gaps = 75/591 (12%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+RL G V + + Q A RD ++++F DAK++++E+DD L S+H FE
Sbjct: 22 HRLFGQVTGIQSV-QTLASQVDGRDRLLVSFRDAKLALMEWDDVYGDLNSISIHTFERAP 80
Query: 167 WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF 226
L + SF P + VDP RC +L+ + IL Q S L G +D +
Sbjct: 81 QL-VDGLPPSFV--PRLLVDPASRCAALLLPQDALAILPFVQEASEL-GADDPRDAALLD 136
Query: 227 SARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
A S +++ D +++V+D +F+ G+ +P++ +L+E ELTW G +S T +
Sbjct: 137 QAPYAPSFILSFSEDVDASIRNVRDCVFLPGFQKPMLAVLYEPELTWTGSLSRARLTTRV 196
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALAL 342
+++ T+ ++P+ ++ LP+D L+A P +GGVLVV + + + Q+A L++
Sbjct: 197 CFITLDLTVTKYPVTVTSEALPYDTLYLVACPDSLGGVLVVTPSALLHLDQTARLVGLSV 256
Query: 343 NNYAVSLDSSQELPRSSFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
+ + S LP ++ ++ +L ++ T+ + + LL + G ++ +GR V
Sbjct: 257 SRWTDFTSSELMLPNATATLGDCDLQSSVLTFTEANGGLLVLRDGRMLTFQCALEGRTVT 316
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
L L+ VL + G + F L + L++ + T + + L E +I A
Sbjct: 317 SLSLN----VVLVPERQ--GGASFV--QALPERLILCASFQDDTYLYAMNLLEAPTEIAA 368
Query: 460 D-APSTKRLR-RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
P + L + DA + G+ + S + A V D L +GPL D
Sbjct: 369 STGPDQQSLEPDADVDADALDLYGDSFKPDVATSKQAQPA----GLDVLDVLPTLGPLND 424
Query: 518 FSYGLRINADASA---TGISKQSNYELVE----------LPGCKGIWTVYHKSSRGHNAD 564
+YG+ NA A + Q + ++E + IWTV N
Sbjct: 425 MTYGVVRNAHGKAHPHMVATMQHHLAVIEPRLRCDVVQNIAPAHAIWTV------SINGK 478
Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
+ A+D+E L+ SLE+ + T + +Q RTIA G+ + VI
Sbjct: 479 WLLLTAWDEE---CLVYSLESNS------------THFLSQHLQ-RTIACGS--TQAGVI 520
Query: 625 QVFERGARILD--GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
+V + A +LD G MT +F ++ + G SI D YV L
Sbjct: 521 RVTSKRAEVLDEHGRIMT---TFAECDANASYG-------DASIQDSYVAL 561
>gi|115490949|ref|XP_001210102.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196962|gb|EAU38662.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 908
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 176/731 (24%), Positives = 297/731 (40%), Gaps = 148/731 (20%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++++LAF +AK+S++E+D HG+ S+H +E + + + G ++ V+P R
Sbjct: 62 EAVLLAFRNAKLSLIEWDPERHGISTISIHYYERDDLTCSPWVPDLSSCGSILDVEPSSR 121
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGD-------------EDTFGSGGGFSARIESSHVI 236
C V +G++ + I+ Q G LV D ++T ++ SS V+
Sbjct: 122 CA-VFNFGIRNLAIIPFHQPGDDLVMDDYDSDLDERKHVDQETTRESPAYATPYASSFVL 180
Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L D + H F+H Y EP IL+ + T + + S ++ +
Sbjct: 181 PLTAFDPSILHPISLAFLHEYREPTFGILYSQVATSNALLHERKDVVFYSVFTLDLEQRA 240
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQ 353
+ S LP D + ++A+P P+GG L++G+N +H + A+ +N ++ + +
Sbjct: 241 STTLLSVARLPSDLFHVVALPPPVGGSLLIGSNELVHVDQAGKTNAVGVNEFSRQVSAFS 300
Query: 354 ELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRL---------- 401
+S ++ L+ L ++ +L TG++VL+ DGR V +
Sbjct: 301 MTDQSDLALRLEGCRVERLADNSGDMILILSTGNMVLIKFKLDGRSVSGISVHPVPVHAG 360
Query: 402 -DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
DL K+ S +GN FLGS DSLL+ G S LSSG
Sbjct: 361 GDLMKS----AASSSAFLGNGEVFLGSEDADSLLL------GWSDLSSG----------- 399
Query: 461 APSTKRLR------RSSSDALQDMVNGEEL---SLYGSASNNTESAQKT---------FS 502
TKRLR S D D ++ +++ LY ++ + T ++ ++
Sbjct: 400 ---TKRLRSHKNDANDSGDVSDDNMSDDDVYEDDLYSTSPDATADGRRVSADPSSFGLYN 456
Query: 503 FAVRDSLVNIGPLKDFSYG--------LRINADASATGISKQS---NYELVEL------- 544
F + D L+NI PL+D + G + N A ++ Q N L+ +
Sbjct: 457 FRINDRLLNIAPLRDITLGKPSTFDKDRKDNVSAELELVASQGSDRNGGLIAMRREIDPE 516
Query: 545 -------PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA---- 593
+WT +SS G ++ H Y+I+S + ET
Sbjct: 517 VLASFTIDSANCVWTACVESSGGKDS------------HQYVIVSKQTNIDKEETEIFRV 564
Query: 594 ---DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---P 647
DL V+ + TI G L + RV+QV + R D DL P
Sbjct: 565 DGLDLKPIKAPEVNP-NEEVTIDVGTLAKQSRVVQVLKNEVRCYDA-----DLGLAQIYP 618
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
E S+ +S S+ DPYV + D ++ LL D S V+ P + ++ K +
Sbjct: 619 VWDE--DTSDEHPAVSASVTDPYVAILRDDSTLLLLHVDDSGDVDEVEMPDNM-AAHKWL 675
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
SSC LY DK TGV + G Q D++ + + L I+
Sbjct: 676 SSC-LYLDK----------------TGVFASNTDTKGS--RQNDMFLFLLGQDCRLFIYR 716
Query: 768 VPNFNCVFTVD 778
+P+ V T+D
Sbjct: 717 LPDLLLVSTID 727
>gi|414587800|tpg|DAA38371.1| TPA: hypothetical protein ZEAMMB73_571351 [Zea mays]
Length = 108
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 68/98 (69%), Positives = 77/98 (78%)
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
MVL+T D L EVTE+VDY VQG TIAAGNLFGR RVIQV+ +GAR+LDGS+MTQ+L+F
Sbjct: 1 MVLQTGDDLGEVTETVDYNVQGSTIAAGNLFGRCRVIQVYAKGARVLDGSFMTQELNFSM 60
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
SES SE S SIADPYVLL MSDGSIRLL+G
Sbjct: 61 HTSESSLNSEPLAAASASIADPYVLLKMSDGSIRLLIG 98
>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 1349
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 124/487 (25%), Positives = 227/487 (46%), Gaps = 47/487 (9%)
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL 1019
+H C HGF+ + + ++ QLP + Y W +Q+V + H I Y AE+ +Y
Sbjct: 875 VHRPGCEHGFLTINADEEVQENQLPEKTWYGTGWSIQQVDIGEDVRH-IAYHAEREVY-- 931
Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1079
V+ + ++ H + D+ V +Y + ++ + Q
Sbjct: 932 -----VVATCRDIDFYFAGEDGRHPEQD------DIELRPQVPQYTIHLV----SAKSHQ 976
Query: 1080 TRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---T 1135
++ + E ++V++L + T E + L+ + TA +GED+ A+G V+L+
Sbjct: 977 RLQSVELGYLETVTALKVMSLEVSENTHEQKDLVVVSTAAQRGEDMPAKGAVILYDIIDV 1036
Query: 1136 GRNADNPQN--LVTEVYSKELKGAISALAS-LQGHLL-IASGPKIILH--KWTGTELNGI 1189
+ D P++ + ++ ++ +GAI+++A L G L A G K+++ K GT L +
Sbjct: 1037 VPDPDVPESGFQLHQLAREQARGAITSIAGPLPGGFLGTAQGLKLMVRGLKEDGTCLP-V 1095
Query: 1190 AFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS-LDCFATE 1246
AF DA Y +L ++ + L GD K ++F + E+ +L ++ K S ++ E
Sbjct: 1096 AFLDAQS-YTHTLKVLPGRGMWLAGDAWKGLWFGGFTEEPYKLTVMGKSPKSKMEVMTAE 1154
Query: 1247 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QML 1304
FL L +++ D ++ + Y P+ +S G +LL R+ FH+G VT L + +
Sbjct: 1155 FLPFDGALYILIMDADNDLHVLQYDPENPKSVGGMRLLHRSTFHIGHLVTNMLLVPSSLK 1214
Query: 1305 ATSSDRTGAAPGSDKTNRFA---------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1355
S A G++ N A +L + GS+G I PLDE +RRL +LQ L
Sbjct: 1215 PFESQDRDMANGTNGNNEEATRAPPSLHHILATSRSGSVGLITPLDEAAYRRLSALQTHL 1274
Query: 1356 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1415
+ H AGLNPR++R + G +VD L++ L ++ ++ + G+
Sbjct: 1275 TAILEHAAGLNPRAYRAVEAESFG---GARGVVDGSLVNRIGELGAAKRADVLGRAGSDG 1331
Query: 1416 SQILSNL 1422
+ S+L
Sbjct: 1332 WVVRSDL 1338
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 144/680 (21%), Positives = 266/680 (39%), Gaps = 93/680 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV ++++++ G K + N G ++ VL V Y L G V S+
Sbjct: 28 NLVVAKTSLLQVF-------GVKAAGNDGGNEKLVL-----------VGEYSLAGTVTSI 69
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + D ++++L+F+DAK+S++E+D + + S+H +E + G
Sbjct: 70 ARVKT--LDTKSGGEAVLLSFKDAKLSLVEWDPENYRISTISLHFYEGDNVISAPFGPPL 127
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQ---------------GGSGLVGDEDT-- 219
++ VDP RC + Q+ IL Q L + T
Sbjct: 128 ADCDSILTVDPSSRCAALKFGARQLAILPFRQFGDELAGEEEEGEFDADHALATSKRTES 187
Query: 220 --FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+G ++S + L LD + H F+H Y EP IL +
Sbjct: 188 VPHANGDTEHTPYKASFTLALTALDPSVSHAVHLAFLHEYREPTFGILSATVEPSYSLLE 247
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
+ + L++ + + S LP ++++ +P P+GG L++G N + + QS
Sbjct: 248 ERKDILTYTVLTLDLEQRASTNLISVPKLPSTLWEVVPLPLPVGGALLLGTNELVHVDQS 307
Query: 336 ASC-ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVV 392
A A+N +A +S +++L+ L + L+ T G L +L+
Sbjct: 308 GKANATAVNEFAKLESDFGMADQSHLNLKLEDCRVEVLDSKTGELLIVTNDGSLAILSFQ 367
Query: 393 YDGRVVQRLDLSKTNPSVLTSDITT-------IGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
GR + L++ + ++ I T + S F+GS G S L+ ++ TS
Sbjct: 368 MHGRSISALNVKRATSENGSTTIHTAPSCMARLEGSKIFIGSEDGASSLLGWS--RPTSA 425
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
L+ K + + D E S + T +AQ TFS +
Sbjct: 426 LNR--KRSHAQMLDKEADDEDEEMEEDDDDLYDAAPEPKKRASSETAVTSTAQYTFS--I 481
Query: 506 RDSLVNIGPLKDFSYG--------LRINADASATGISKQS--NYELV-------ELPGCK 548
D L++ GP+ + G L I A A S+ + + ++V +L +
Sbjct: 482 IDELLSTGPIHEVCLGRSGPWKDRLEIAAGAGRKQASRLTLMHRDIVPTVRRKCKLGAAR 541
Query: 549 GIWTVYHKSSRGHNADSSRMA-AYD-DEYHAYLIISL--EARTMVLETADLLTEVTESVD 604
W + K + + +D D+ Y I S + + +A E++D
Sbjct: 542 ATWALRPKQRNAALPEYDNLLFVFDGDDTKVYDIPSQDEDGSSYTERSAPEFESAGETLD 601
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARI-LDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
T+A G + + R ++ A++ LD P E E+ +++
Sbjct: 602 M----ATVADGTIVVQTRRTELRTYNAKLGLD--------QIIPMTDE--ETDEDLSIVH 647
Query: 664 VSIADPYVLLGMSDGSIRLL 683
++++DPYVL+ D S+++L
Sbjct: 648 IAVSDPYVLVIRGDNSVQVL 667
>gi|380488833|emb|CCF37111.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
Length = 1062
Score = 130 bits (327), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 232/1047 (22%), Positives = 400/1047 (38%), Gaps = 173/1047 (16%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
Y R+ ++ ES G V D L LV Y + G V LA + S+ G
Sbjct: 66 YDHRLNDDDGLESSFLGGDGMLVRADRAINTKLVLVAEYPIFGIVTGLAKIKLQYSKSGG 125
Query: 125 DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
+ ++++A A++S++++D H L S+H +E E S GPL
Sbjct: 126 E------ALLIATRVARLSLVQWDPEKHALEDISIHYYEKEEL------EGSPFDGPLNN 173
Query: 183 ----VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES------ 232
+ DP RC + + L Q + D+ G A+ S
Sbjct: 174 YRTHLAADPGSRCAALRFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPSATAATN 233
Query: 233 ------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+V+ L LD + H F+H Y EP I+ + H
Sbjct: 234 GTSNIADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSTQRRSNTLPRKDH 293
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSAS 337
+ + L + + I S NLP D +K++A+P P+GG L+VG N IH
Sbjct: 294 FSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPVGGALLVGTNELIHIDQSGKP 351
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDG 395
+A+N + + +S + L+ + + +N L+ G L ++T DG
Sbjct: 352 NGVAVNPFTKETTNFPLADQSDLDLRLEHCYIELMSAENGELLMILSDGRLAIITFKIDG 411
Query: 396 RVVQ----RLDLSKTNPSVLTSDITTIGN---SLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
R V +L ++ ++ ++TI ++FF+G+ DSL++ +T +
Sbjct: 412 RTVSGVGVKLVPTEVGGGIVQCSVSTISRLSRNVFFVGTTGSDSLVLGWTRKQAQNARK- 470
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVR 506
K D D+ D D + GE + + A+ N S +F V
Sbjct: 471 --KTRLVD---DSFEYDLEDEDMEDDDDDDLYGETTTTMIQPGATANGVSKGGDLTFRVH 525
Query: 507 DSLVNIGPLKDFSYGLRI-------------------------NADASATGISKQSNYEL 541
DSL++I P+KD + G + +A A I Q+
Sbjct: 526 DSLLSIAPVKDMTSGKQAFIPDSEEEKNSVGVVADLQLACVVGRGNAGAVAIVNQNIQPK 585
Query: 542 V----ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMV 589
V E P +G WT+ KS +G ++ +A+ D +Y ++I+S +
Sbjct: 586 VIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVASEFDASSKYDKFMIVS-KVDLDG 644
Query: 590 LETADLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQ 641
ET+D+ + G T+ AG + R+IQV + R DG ++Q
Sbjct: 645 YETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQ 704
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
L + E+G+ V+S SI DPY+LL D SI + D + V+
Sbjct: 705 ILPM--LDEETGA---EPRVISASITDPYLLLVRDDSSIMVAQIDNNCELEEVEKQDDTI 759
Query: 702 SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
S K ++ C LY D +TG+ + G P Q + + + +G
Sbjct: 760 LSTKWLAGC-LYTD----------------TTGLFAPMQTDKGTPEGQ-NTFMFLLSAAG 801
Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
AL I+ +PN + V +G T+ V ++ + + GT Q E +
Sbjct: 802 ALYIYALPNLSKPVYV---AAGLTY-VPPFL---------SADYAVRRGTVQ---ETLTE 845
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L + P+L + + Y+ E + T + +++L
Sbjct: 846 LLVADLG----DTTATSPYLIVRHANDDLTIYEPIRLESQDKT------LGLAKTLHFQK 895
Query: 882 VSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ N +++P++ E E P P + NI+G+ FL G+ P +
Sbjct: 896 IT-----NPALAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIV-- 945
Query: 940 RERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
+ + P+ L + + H C GFIY S+G ++ QLP+ S +
Sbjct: 946 -KSAKSSPKVVGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSV 1004
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVSV 1023
+ IP+ I Y Y + S+
Sbjct: 1005 RKIPIGDAVGLIAYHPPMETYAVACSI 1031
>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1423
Score = 129 bits (324), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 126/563 (22%), Positives = 242/563 (42%), Gaps = 54/563 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 858 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 912
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
+ Q + ++ + + +G I++ +Q +IC+LP Y+ P++ V
Sbjct: 913 VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHV- 971
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI--------DNHNLS 1051
+ + I Y + VL Q+ +D+E G I D +S
Sbjct: 972 DIGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEE-GKPIAGIIKDIKDTPAMS 1023
Query: 1052 ---SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1108
S+ L Y E LE + G ++ S + L +L K+
Sbjct: 1024 FKGSIKLVSPYNWTVIETIELEDNEVGMTLKSMILDVGSESGSTLGSDPNSLIKKYNKKK 1083
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISAL 1161
+ IG + ED+AA G ++ G+ N + E++ +E +GAI+++
Sbjct: 1084 REYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEETRGAITSI 1141
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
L G L++ G K+I+ +AF D P +YV N ++LGD K + +
Sbjct: 1142 CELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDPLKGCWLV 1200
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+ + ++ +L KD + +F+I+ + ++V+D + + Y P +S G
Sbjct: 1201 GFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGT 1260
Query: 1282 KLLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNRFALLFGTLD 1331
KLL++A F + + ++ L ++ S +T A P + +N F ++ T D
Sbjct: 1261 KLLTKASFELNSTISCLRSLPLIDIEESVQTDAFTNIVVPPTLPPNTTSNYFQVIGSTQD 1320
Query: 1332 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSI 1387
GS + P++E +RR+ LQ++L+D H GLNPR R + +N +P I
Sbjct: 1321 GSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----I 1376
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQ 1410
+D +L+ + L + + +A++
Sbjct: 1377 LDYDLIRRFTKLSDDRKRNLANK 1399
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 110/238 (46%), Gaps = 19/238 (7%)
Query: 216 DEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
+ED G+ R+ +SS +I+ LD + V D F+H Y EP + +L ++ WA
Sbjct: 203 EEDKNGTTTNQEPRLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWA 262
Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
G + L++ K ++ NLP++ +++ +PSP+ G L+VG N IH
Sbjct: 263 GNLIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIH 322
Query: 331 YHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGD 385
+ +A+N + S S Q+ +S +++L+ + +D LL +TG+
Sbjct: 323 VDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGE 380
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ DG+ ++R+ + + ++ + ++ F+ + G+S L+Q
Sbjct: 381 FYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 438
>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1380
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 121/522 (23%), Positives = 236/522 (45%), Gaps = 68/522 (13%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS---IVAFTVLHNVNCNHG 968
+R+ F N++G+ F++G P + + L P++ S V+ + + +G
Sbjct: 876 RRLVYFPNLNGYTCIFVTGVIP---FIIIKSLHSIPRIFQFSKIPAVSISAFSDSKIKNG 932
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
I + + +IC+L TY+ P+++V L + Y + + ++ S P
Sbjct: 933 LICLDNNKNARICELSLDYTYEFNLPIKRV-DLGELVRSLAYHEQSD--TVVASTFKEVP 989
Query: 1029 LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1088
N +D+E G+ I + T+ + ++++ P W + ++
Sbjct: 990 YN-----CVDEE-GNIIPGVYKEKLPHALTF---KSSIKLISPHN----WTVIDSFDLED 1036
Query: 1089 SENALTVRVVTLFN-------TTTKENETLLAIGTAYVQGEDVAARGRVLLFST------ 1135
+E +TV+ + L K + IGT ++ ED+AA G ++
Sbjct: 1037 NEVGMTVKSMILDRGSGAASLKKFKSKREYIVIGTGKLRMEDLAANGSFKIYEIIDIIPE 1096
Query: 1136 -GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
G+ N + EV+ ++ +GA++A+ L G L++ G KII+ + +AF D
Sbjct: 1097 PGKPETNHK--FKEVFQEDARGAVTAICDLSGRLMVGQGQKIIVRDIEDDGVVPVAFLDT 1154
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
+Y+ N ++LGD KS++ + ++ + ++ +L KD LD +F+I +
Sbjct: 1155 S-VYISEAKSFGNLLILGDPLKSVWLVGFEAEPYRMVMLGKDRQHLDVECADFIIKDEDI 1213
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
++V+D I + Y P +S G LL++A F + + AT+ R +
Sbjct: 1214 FILVADNNNCIHLIQYDPDDPKSINGTILLNKASFELNS-----------ATTCLR--SI 1260
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR------ 1368
P +K + + ++ TLDG++ + P++E T+RR+ LQ+++ D V H GLNPR
Sbjct: 1261 PKGEKGD-YQIIGSTLDGALYNVFPVNEFTYRRMYILQQQISDKVYHFCGLNPRLNRFGG 1319
Query: 1369 --SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+ R +N K I+D L+ + L L+ Q ++A
Sbjct: 1320 SVTLRDRETNTKP-------ILDYGLIRRFSKLNLDRQQQLA 1354
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 71/328 (21%), Positives = 139/328 (42%), Gaps = 60/328 (18%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I +LD + + D F+H Y +P + +L R +WAG + + +S+
Sbjct: 223 DSSFIIEAGNLDSSIDTIIDLQFLHNYRDPTIALLSSRSHSWAGSLLKSKDNVHLEVMSL 282
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNY-- 345
K I+ NLP++ +++ + +P+ G L+VG N I H + + +++N++
Sbjct: 283 DLLTKLSTSIFKIENLPYEVDRIVPLSAPLNGCLLVGCNEIMHVDNGGIAKRISVNDFTS 342
Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
S+ S+Q+ +S+ ++L+ + +D L+ T+ G DG+ ++R+
Sbjct: 343 LTTASVKSNQD--QSNLGLKLENCSVVQIPDDHRVLIVTEQGSFYFANFELDGKSIKRVF 400
Query: 403 LSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
+ + ++ +I + +L F+ + GDS LVQ +K
Sbjct: 401 IDVVDKNMYDKIKFTFPGEIAVLSKNLLFMSNLNGDSPLVQ-------------VKYRNS 447
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA----------------QK 499
I D T+R+ + G E + +SN + QK
Sbjct: 448 KILEDTRGTRRVEKGK---------GAEKNKNNVSSNEVDDDDDDDDDLYKEEEEEEQQK 498
Query: 500 TFS-----FAVRDSLVNIGPLKDFSYGL 522
S F ++D L+N P+ F+ GL
Sbjct: 499 VLSKSHIEFILQDRLINNSPISTFTLGL 526
>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
Length = 1420
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 121/573 (21%), Positives = 246/573 (42%), Gaps = 74/573 (12%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 855 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 909
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
+ Q + ++ + + +G I++ +Q +IC+LP Y+ P++ V
Sbjct: 910 VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHV- 968
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDL 1055
+ + I Y + VL Q+ +D+E G D + ++
Sbjct: 969 DIGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEEGKPIAGIIKDIKDTPAMSF 1021
Query: 1056 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV----------------- 1098
+ ++++ P W TI + +E +T++ +
Sbjct: 1022 KGS-------IKLVSPYN----WTVIETIELGDNEVGMTLKSMILDVGSESGSTLGSDPN 1070
Query: 1099 TLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYS 1151
+L K+ + IG + ED+AA G ++ G+ N + E++
Sbjct: 1071 SLIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFK 1128
Query: 1152 KELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILL 1211
+E +GAI+++ L G L++ G K+I+ +AF D P +YV N ++L
Sbjct: 1129 EETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLIL 1187
Query: 1212 GDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1271
GD+ K + + + + ++ +L KD + +F+I+ + ++V+D + + Y
Sbjct: 1188 GDLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYD 1247
Query: 1272 PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTN 1321
P +S G KLL++A F + + ++ L ++ S +T A P + +N
Sbjct: 1248 PDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALTNIAVPPPLPPNTTSN 1307
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNG 1377
F ++ T DGS + P++E +RR+ LQ++L+D H GLNPR R + +N
Sbjct: 1308 YFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNE 1367
Query: 1378 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
+P I+D +L+ + L + + +A++
Sbjct: 1368 TNTKP----ILDYDLIRSFTKLSDDRKRNLANK 1396
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 104/221 (47%), Gaps = 17/221 (7%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I+ LD + V D F+H Y EP + +L ++ WAG + L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY-- 345
LK ++ NLP++ +++ +PSP+ G L+VG N IH + +A+N +
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336
Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
S S Q+ +S +++L+ + +D LL +TG+ + DG+ ++R+
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394
Query: 403 LSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ + ++ + ++ F+ + G+S L+Q
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 435
>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides brasiliensis Pb18]
Length = 1343
Score = 128 bits (321), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 182/745 (24%), Positives = 303/745 (40%), Gaps = 109/745 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++Y + GS ++ +T+ + + L LV Y L G V L
Sbjct: 28 NLIVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGTVTDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++I++A +AK+S++E+D H + TS+H +E + +H+ +
Sbjct: 81 GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GDEDTFGSGGGFSARIESS 233
A P + VDP RC VL +G + + IL Q G LV GD F S +I+++
Sbjct: 138 LAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGD---FDSDHDEERQIDTN 193
Query: 234 HVINLRD--------------------------LDMKHVKDFIFVHGYIEPVMVILHERE 267
H RD M H F++ Y EP IL+ +
Sbjct: 194 HTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQV 253
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
+ + + S ++ + + S LP+D +K++ +P P+GG L+VG+N
Sbjct: 254 AASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSN 313
Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
+H + A+ +N +A S +S + L+ L +N LL G
Sbjct: 314 ELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNG 373
Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ +++ DGR V + L +T PS +G F GS GDS+
Sbjct: 374 VMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEEGDSM 429
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
L+ ++ S + + E D A+ + DA +D + ++ G S N
Sbjct: 430 LIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGINSRN 487
Query: 494 TESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELVELPG 546
T S T + F + D L N+GP++D + G + D + S + ELV G
Sbjct: 488 TTSVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELVTTQG 547
Query: 547 --------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
G+ +V+ K + S Y YL+
Sbjct: 548 YGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYDHYLL 607
Query: 581 ISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARIL 634
+S + +++V + + E T + ++ + RTI G L G RV+QV + R
Sbjct: 608 LSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGEVRSY 667
Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
D + + L P E SE +V+ S ADPYVL+ D SI LL D S
Sbjct: 668 DSANLHLGLGLAQIYPVWDE--DTSEERSVVHASFADPYVLIIRDDSSILLLQADESGDL 725
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
++T IES+ S +LY DK
Sbjct: 726 DEIETDGIIESTT--WISGSLYQDK 748
Score = 119 bits (299), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 122/524 (23%), Positives = 221/524 (42%), Gaps = 95/524 (18%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV + ++
Sbjct: 902 DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 961
Query: 979 KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1038
++C+ P + +D W +K I L + Y + Y L S L D
Sbjct: 962 RMCRFPRNTHFDGSWAARK-IGLGEQVDSVEYSSSSETYVLGTS------QKADFKLPED 1014
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1098
E+ + N +S +++ V++L P W + ++++E + V+ +
Sbjct: 1015 DEIHPEWRNEVISFFP-----QIDKGSVKLLNPRT----WSIIDSYQLRTAERVMCVKCL 1065
Query: 1099 TL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSK 1152
L + T E + ++A+GTA +GED+AARG + +F + D P+ + + +
Sbjct: 1066 NLEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKE 1125
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
E+KGAI T L+GI + F++
Sbjct: 1126 EVKGAI-------------------------TSLSGIGG--------------QGFLIAA 1146
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
K I KE G+ LL F + C+ + V E K
Sbjct: 1147 QGQKCI-VRGLKEDGS---LLPVAFMDMQCYVS------------VLKELKGTD------ 1184
Query: 1273 KMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF-ALLFG 1328
S KG +LL R+ FH G + +T R +L+ + A D + +L
Sbjct: 1185 --PGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPEAEANAMDLDSSGPLHQVLVT 1242
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
+ GSI I P+ E+ +RRL +LQ ++++++ H GLNPR+FR S+G R +V
Sbjct: 1243 SETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR----GMV 1298
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
D +L+ + L + + EIA + G +I ++L A+G + L
Sbjct: 1299 DGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1340
>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
Length = 1420
Score = 128 bits (321), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 123/561 (21%), Positives = 242/561 (43%), Gaps = 51/561 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 856 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 910
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
+ Q ++++ + + +G I++ +Q +IC+LP Y+ P++ V
Sbjct: 911 IHSIPRIFQFSKIAVMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHV- 969
Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE-------VGHQIDNHNLS- 1051
+ + I Y + VL Q+ +D+E + + D +S
Sbjct: 970 DIGESIKSIAYHETSD-------TVVLSTFKQIPYECLDEEGKPIAGIIKNIKDTPAISF 1022
Query: 1052 --SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1109
SV L Y E L + G ++ S++ + +L K+
Sbjct: 1023 KGSVKLVSPYNWTVIENIELGDNEVGMTIKSMILDVGSESKSTVGTDPNSLIKKYNKKKR 1082
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALA 1162
+ IG + ED+AA G ++ G+ N + E++ ++ +GAI+++
Sbjct: 1083 EYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEDTRGAITSIC 1140
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
L G L++ G K+I+ +AF D P +YV N ++LGD K + +
Sbjct: 1141 ELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLVILGDPLKGCWLVG 1199
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
+ + ++ +L KD + +F+I+ + ++V+D + + Y P +S G K
Sbjct: 1200 FDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTK 1259
Query: 1283 LLSRAEFHVGAHVTKFLRL------QMLATSSDRTGAA--PGSDKT-NRFALLFGTLDGS 1333
LL++A F + + ++ L + + +D AA P + T N F ++ T DGS
Sbjct: 1260 LLTKASFELNSTISCLRSLPLKDIDEKVQNETDAAAAATIPLPNNTQNNFQVIGSTQDGS 1319
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSIVD 1389
+ P++E +RR+ LQ++L+D H GLNPR R + +N +P I+D
Sbjct: 1320 FFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----ILD 1375
Query: 1390 CELLSHYEMLPLEEQLEIAHQ 1410
+L+ + L + + A++
Sbjct: 1376 YDLIRRFTKLSDDRKRNFANK 1396
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 55/236 (23%), Positives = 110/236 (46%), Gaps = 17/236 (7%)
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+EDT G+ +SS +I+ LD + V D F+H Y EP + +L ++ WAG
Sbjct: 197 EEDTNGTNKESHLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGN 256
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ L++ K ++ NLP++ +++ +PSP+ G L+VG N IH
Sbjct: 257 LIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIVPLPSPLNGTLLVGCNELIHVD 316
Query: 333 SQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
+ +A+N + S+ S Q+ +S +++L+ + +D LL +TG+
Sbjct: 317 NGGVLKRIAVNKFTRLITASIKSFQD--QSDLNLKLENCSIVPIPDDHRVLLILQTGEFY 374
Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ DG+ ++R+ + + ++ + ++ F+ + G+S L+Q
Sbjct: 375 FINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAVLDKNMLFIANSNGNSPLIQ 430
>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Ogataea parapolymorpha DL-1]
Length = 1309
Score = 125 bits (314), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 129/548 (23%), Positives = 226/548 (41%), Gaps = 60/548 (10%)
Query: 837 SRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP 896
S+ +L A+ G +L Y+ + DP+ + L N + P
Sbjct: 750 SKDYLVALTFGGEVLIYETFF-----------DPIERTYKLMKIN---------EMCQFP 789
Query: 897 LDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV 955
+ H +R I N G++ ++G+ + + Q S +
Sbjct: 790 IVGAPDNSYAHATKIERYLISVDNFQGYKAVLVTGASAFVILKEYNSIPRMLQFTKRSSL 849
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
F + C +G I + +ICQL S TY N P+ K T ++I Y + N
Sbjct: 850 YFAEYNTDRCPNGVISIDETKACRICQLDSSYTYSNRLPIAKYKIGDKTINKIRYHSLSN 909
Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQI----DNHNLSSVDLHRTYTVEEYEVRILEP 1071
Y I+S P N V E G + D+ L S L T V ++ P
Sbjct: 910 TY--IISTLEEGPYNPV------DEDGEPLPGLRDDRKLKSTSLKGT-------VHLVSP 954
Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
W TI ++ +E ++ V+ L + T +T++ IGTA + ED+A G
Sbjct: 955 ----ANWTIIDTIELEDNEYVTSIEVIELKVSETIATKTVVLIGTARCRNEDLATHGSWK 1010
Query: 1132 LFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE- 1185
++ P +N + + S+ +G + ++ ++ G I G ++++ +
Sbjct: 1011 IYEVIDIVPEPGRPEAKNRLKMITSETARGPVLSICNVSGRFAIVQGQRMLVRTLQKDDN 1070
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
+ +AF D +Y + KN +L+GD +S+ + ++ KD +++ A
Sbjct: 1071 VAPVAFTDTS-IYSKEVKTFKNLVLIGDSFQSVSLYGFDAAPYRMLHFGKDEQNVELRAA 1129
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1305
+FL+ L L+V+DE + Y P S KG KLL R+ A TK M++
Sbjct: 1130 DFLVHDGNLHLLVADEDSVFHLLQYDPYDGNSMKGLKLLRRSLLRSNALTTK-----MIS 1184
Query: 1306 TSSDRTGAAPGSDKTNR----FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1361
+ DR+ + S + + ++ +DGS + P++E +RRL S+Q L D H
Sbjct: 1185 VARDRSLFSMVSTLNHEDDLGYEIIGSNIDGSFYKVMPVNEYQYRRLYSIQNYLYDKELH 1244
Query: 1362 VAGLNPRS 1369
GLNP+S
Sbjct: 1245 WLGLNPKS 1252
Score = 105 bits (261), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/464 (23%), Positives = 212/464 (45%), Gaps = 46/464 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+L+ YRL+G + ++ + ++ + D +I++ + AK+SV+++D +H + S+H
Sbjct: 51 LQLIGEYRLNGQIINI---DKFRSNENESLDYLIVSTKLAKLSVIKWDSQLHAISTVSLH 107
Query: 161 CFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII--LKASQGGSGLVGDE 217
+++ + L +++ ++ + + DP C + + L + K L D
Sbjct: 108 YYDTALDALTVEKLEKTSVQH---RTDPNSLCTCLRLNELFTFLPFYKEYLDEEELKDDA 164
Query: 218 DTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERE-LTWAGRV 274
+ S ++N L D+K++ D+ F+H Y +P M IL+ E +TWAG +
Sbjct: 165 EEAKDIKKRKKLFTESFILNASSLYPDIKNIVDYQFLHSYRDPTMAILYAPETMTWAGHL 224
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
T + LS+ K+ I NLP+D + + SP G L+VG+N IH +S
Sbjct: 225 PKAKDTLKVIVLSLDLENKKASAIMELTNLPYDVDYIYPLESPTNGFLLVGSNEIIHVNS 284
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
+ + N Y + + + +SS + L+ + ++ D L+ T++G+ L
Sbjct: 285 LGSVRGIYTNEYFTDISNLKLKDQSSLGLMLENSRVGLVKEDQVLIITESGEFYQLNFEK 344
Query: 394 DG-----RVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
G +Q+++ S N ++ + + ++ LFF+ + GDS L++ + SG
Sbjct: 345 IGGNSTITGLQKVETSNYKGIIVNHPIMITSVPSL--DLFFVCCQGGDSSLIRISSKSG- 401
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
+L KE+ GD + L + E+ + S+ N++ F
Sbjct: 402 -VLPQETKEQNGDTKETKDDDDWL-----------YDEEDQKSHKSSLVNSQ-------F 442
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
D+++N GPL DF+ G R++ + G+ + E V + C
Sbjct: 443 KKMDNILNCGPLVDFTLG-RVSIEQKIMGLPNPNYNEDVLVAAC 485
>gi|312077399|ref|XP_003141287.1| hypothetical protein LOAG_05705 [Loa loa]
Length = 316
Score = 125 bits (313), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 132/265 (49%), Gaps = 34/265 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + G+ L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSF 360
C ++LN+ +D + P F
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDF 297
>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 1351
Score = 125 bits (313), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/498 (22%), Positives = 216/498 (43%), Gaps = 54/498 (10%)
Query: 888 RNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
++L + P +A+ P+G +R + F N++G FL+G P + +
Sbjct: 831 KDLTITGAPENAF-----PYGTSIERRLVYFPNLNGFTCIFLTGVIPYLILKTIHAIPRI 885
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPH 1006
Q V+ + + +G I++ ++ +IC+LP Y+ P++ V P+ +
Sbjct: 886 FQFTKIPAVSISAFSDSKIKNGLIFLDNEQNARICELPLDYNYEFNLPMKHV-PIGESIK 944
Query: 1007 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1066
+ Y + ++VS P N +D+E G I + ++ T + +
Sbjct: 945 AMAYHEASDC--VVVSTFKEIPYN-----CVDEE-GKLI----VGVMEDKPAATSFKGSI 992
Query: 1067 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL------FNTTTKENETLLAIGTAYVQ 1120
+++ P W TI + +E ++++ + L K + +GT +
Sbjct: 993 KLISPYN----WSVIDTIELDDNEVGMSLKSMVLDIGSSSLIKKFKNKREYIVVGTGKYR 1048
Query: 1121 GEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1173
ED+AA G +F G+ N + E + + +KGA++++ L G L++ G
Sbjct: 1049 MEDLAANGAFKIFEIIDIIPEPGKPETNHK--FKETFQENIKGAVTSVCELSGRFLVSQG 1106
Query: 1174 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1233
K+I+ +AF D P +YV N ++LGD K + + + + ++ +L
Sbjct: 1107 QKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDPLKGCWLIGFDAEPFRMIML 1165
Query: 1234 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1293
KD L +F+I + ++V+D + + Y P +S G KLL++A F + +
Sbjct: 1166 GKDTQHLSVECADFIIKDDEVYILVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELAS 1225
Query: 1294 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
++ L P D N F ++ DGS + P++E T+RR+ LQ+
Sbjct: 1226 PISCLRTL-------------PIDD--NNFQIIGSCQDGSFFNVFPINESTYRRMYILQQ 1270
Query: 1354 KLVDSVPHVAGLNPRSFR 1371
+L + H GLNPR R
Sbjct: 1271 QLTEKEYHYCGLNPRLNR 1288
Score = 78.6 bits (192), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 29/318 (9%)
Query: 222 SGGGFSAR--IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
+G F R +SS +I+ LD + V D F+H Y EP + +L + WAG +
Sbjct: 198 NGNSFEPRQFYDSSFIIDATTLDSTVGTVIDMQFLHNYREPTIGVLSSKSEVWAGNLLKS 257
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
L++ K ++ NLP++ +++ +PSP+ GV++VG N IH +
Sbjct: 258 KDNIQFQVLTLDLNSKSTVSVFKIDNLPYEIDRVIPLPSPLNGVILVGCNELIHVDNGGV 317
Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTV 391
+A+N + S+ S Q+ +S +++L+ + + ND LL KTG+ +
Sbjct: 318 MKRIAVNKFTGLTTASIKSFQD--QSDLNLKLEDSTIVPIPNDHRVLLVLKTGEFYYINF 375
Query: 392 VYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
DG+ ++R+ + + + ++ + +L F + G+S LVQ S
Sbjct: 376 ELDGKSIKRVHIDVIDKKLYEKVKLTYPGEVAVLDKNLLFFANSSGNSPLVQVKYRDSLS 435
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
G E D E + ++ E+ +L ++ F
Sbjct: 436 DAKIGAPIEESDEEDETQKADEDDDEDDLYKEEEEEEEQKNL----------SKTHIEFV 485
Query: 505 VRDSLVNIGPLKDFSYGL 522
D L+N GP F+ G+
Sbjct: 486 YHDELINNGPSSSFTLGV 503
>gi|343962533|dbj|BAK62854.1| cleavage and polyadenylation specificity factor 160 kDa subunit [Pan
troglodytes]
Length = 269
Score = 123 bits (308), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 111/198 (56%), Gaps = 7/198 (3%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
A I +Q E+ ++ V+L + T + +A GT +QGE+V RGR+L+
Sbjct: 41 ARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVP 100
Query: 1141 NPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
P +T+ +Y KE KG ++AL GHL+ A G KI L +EL G+AF D
Sbjct: 101 EPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ 160
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
LY+ + VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L
Sbjct: 161 -LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLG 219
Query: 1256 LVVSDEQKNIQIFYYAPK 1273
+VSD +N+ ++ Y P+
Sbjct: 220 FLVSDRDRNLMVYMYLPE 237
>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
Length = 1361
Score = 120 bits (301), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 157/337 (46%), Gaps = 23/337 (6%)
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKG 1156
+T TK + +G Y EDV G ++ P T E++ ++++G
Sbjct: 1026 DTRTKRKREYIIVGIGYATMEDVPPTGEFHIYDITEVVPEPGKPNTNFKLKEIFKEDIRG 1085
Query: 1157 AISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
+S + + G LI+ KI++ + +AF D P ++V SL N I++GD
Sbjct: 1086 IVSVVNGISGRFLISQSQKIMVRDVQQDNSVIPVAFLDVP-VFVTSLKTFGNLIVIGDAM 1144
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+ I F+ + + ++ L + + EFL++ + +V+D + + YAP
Sbjct: 1145 QGIQFVGFDAEPYRMITLGSSITKFEVISVEFLVNNGDIYFLVTDRDSIMHVLKYAPDQP 1204
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR-FALLFGTLDGSI 1334
+ GQ+L+ + F++ + ML +D P + +R F + +DGSI
Sbjct: 1205 NTLSGQRLVHCSSFNLHS----LNNCTMLLPKNDE---FPRDQRYSRSFQTITAQVDGSI 1257
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ---FHSNGKAHRPGPDSIVDCE 1391
I P+ E T+RRL +Q++++D P +AGLNPR RQ ++ G + RP ++D
Sbjct: 1258 SKIVPVKEETYRRLYFIQQQIIDKEPQLAGLNPRMERQDNKYYHLGHSLRP----MLDFN 1313
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRS-QILSNLNDLAL 1427
++ ++ + + + I + G + ++ +L DL
Sbjct: 1314 IIKRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDLEF 1350
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 132/674 (19%), Positives = 265/674 (39%), Gaps = 122/674 (18%)
Query: 96 ISAASLELVCHYRLHGNVESLAIL---SQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
I + L L+ ++L G + +A++ S G N ++L+ AK+S+L +++
Sbjct: 43 IRSGRLYLMEEHKLSGRINDVALIPKHSNGSNGNGINLSYLLLSTGVAKLSLLMYNNMTS 102
Query: 153 GLRITSMHC----FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
+ S+H FES L L AR ++++P G +++ ++ +
Sbjct: 103 SIETISLHFYEDKFESATMLDL-------ARNSQLRIEPNGNYA--MLFNNDVLAILPFY 153
Query: 209 GGSGLVGDED----------------TFGSGGGFSARIESSH---VINLRDL--DMKHVK 247
G DED F G + + +H +IN +L +K++K
Sbjct: 154 TGINEDEDEDYINNDKSKINDNSKKSLFKRKKGKTQNNKVTHPSIIINCSELGPQIKNIK 213
Query: 248 DFIFVHGYIEPVMVILHERELTWAGR---VSWKHHTCMIS---ALSISTTLKQHPLIWSA 301
D F+ G+ + + +L++ +L W G V + +IS SI T +I
Sbjct: 214 DIQFLCGFTKSTIGVLYQPQLAWCGNSQLVPLPTNYAIISLDMKFSIDATTFDKAIISEI 273
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVS-LDSSQELPRS 358
LP D + + + G L++G N I + + L LN+Y+ L + + +S
Sbjct: 274 SQLPSDWH---TIAPTLSGSLILGVNEIAFLDNTGVLQSILTLNSYSDKVLPKVRVIDKS 330
Query: 359 SFSVELDAAHATWL----QNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLS------ 404
S V + L +N+ + LL + G + + + +GR++ + +++
Sbjct: 331 SHEVFFNTGSKFALIPSNENERSVENILLFDENGCIFNVDLKSEGRLLTQFNITKLPLGE 390
Query: 405 -----KTNP---SVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFG 455
K+NP S++ +D + F+G + GD+ +++ S + +++
Sbjct: 391 DVLSQKSNPSSVSIIWAD-GRLDTYTIFIGFQSGDATMLKLNHLHSAIEVEEPTFMKDYV 449
Query: 456 DIEADAPSTKRLRRSS-------SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
+ +A A SD D VN + +G+ SN +AQ+
Sbjct: 450 NKQASAAYNNEDDDDDDDDFNLYSDEENDQVNNKNDRTFGTNESNEPFTAQELM------ 503
Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
L NIGP+ G + + + G+ + E+ + T + NA +
Sbjct: 504 ELRNIGPINSMCVGKVSSIEDNVKGLPNPNKQEI------SIVCTSGYGDGSHLNAILAS 557
Query: 568 MAAYDDEYHAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR----- 610
+ ++ ++ I+ ++ + L T D + E+ + QGR
Sbjct: 558 VQPRVEKALKFISITKIWNLHIKGKDKFLITTDSTQSQSNIYEIDNNFSQHKQGRLRRDA 617
Query: 611 -TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TI + +R++QV + D ++ + + V+ VS+ DP
Sbjct: 618 TTIHIATIGDNKRIVQVTTNHLYLYDLTF-----------RRFSTIKFDYEVVHVSVMDP 666
Query: 670 YVLLGMSDGSIRLL 683
YVL+ +S G I++
Sbjct: 667 YVLITLSRGDIKVF 680
>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
Length = 1287
Score = 120 bits (301), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 111/458 (24%), Positives = 216/458 (47%), Gaps = 50/458 (10%)
Query: 969 FIYVTSQGILKICQLPSGSTYDNY---WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
F+Y+ +IC LP G + NY P++ ++ L TP+++TY L+ +
Sbjct: 844 FMYIDIDKTARICSLPIGENF-NYSQNLPIE-IVSLGQTPNKVTYHETSGLF-------I 894
Query: 1026 LKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
+ ++ ID++ VG S + + + + ++++ P W
Sbjct: 895 VSTFEEISYNAIDEDGVPIVG--------SESEKPKAKNFKGF-LKLINPIN----WTII 941
Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR---VLLFSTGR 1137
I M+ +E VR + L ++ +K+ + + G + ED++ G + + S
Sbjct: 942 DEIEMEENEIINDVRSINLTISSRSKKKKEFIIFGIGKYRLEDLSVFGEFKIIDIISIVP 1001
Query: 1138 NADNPQNL--VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDA 1194
+ P+ + E++ + +KGA++ + + G L + G KII+ +AF D
Sbjct: 1002 DPTKPEAIYKFKEIFQEVVKGAVTTINEISGRFLTSQGQKIIIRDLQQDNSTVPVAFMDC 1061
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
Y+ N +L+ D KSI+FL + + +L LL KD + T+F++D +
Sbjct: 1062 AT-YLSDSKSFGNLLLISDSMKSIWFLGFDAEPYRLLLLGKDQQRFNAITTDFIVDDGEI 1120
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
+V+D+++++ + Y P +S GQKLL ++ F + +T L+L D+
Sbjct: 1121 YFLVADDEESLHLLTYQPDDPKSLSGQKLLQKSTFTTNS-ITTCLKLVPKFNEFDQGSIT 1179
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
+ + +DGSI + P+DE+++RRL LQ++L D + H GLNPRS R F
Sbjct: 1180 -------SYQNIGVNVDGSIFKMIPIDEISYRRLYILQQQLSDKIAHYVGLNPRSNR-FS 1231
Query: 1375 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+N + +P I++ LL + L ++++ + + + G
Sbjct: 1232 ANEQGQKP----IIEFGLLKWFINLNVDKRKQFSAKVG 1265
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 129/613 (21%), Positives = 261/613 (42%), Gaps = 62/613 (10%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
I + + +L+ ++ N + + I S D+ + +I+ + AK+S++ FD ++ ++
Sbjct: 42 IDSKNDKLILNHEFKLNGKIIGIKSIKLPDSQYDQLAILTSL--AKLSIVSFDHDLNTIQ 99
Query: 156 ITSMHCFESPEWLH-LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +ES + + + ES +K+DP + ++VY + L Q ++
Sbjct: 100 TNSLHYYESEFYTKSISKINES-----QLKIDPNNQTS-LVVYNDLLAFLPFKQDDDEII 153
Query: 215 GDEDTFGSGGGFSARIESSH--VI---NLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
D+ S IE H +I N + + ++ D F+H Y +P + ILH +E T
Sbjct: 154 DDDHHTQSNDQQQQNIELFHNSIILPANKLESTVSNIIDCDFLHSYRDPTLAILHNKEQT 213
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-T 328
WA +S K T LS+ I NLP+D + + +P PI G L++G N
Sbjct: 214 WASDLSIKKDTVNFVVLSLDLLNDSSTAILLVENLPYDLWFVKPLPDPINGTLLIGCNEI 273
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
IH + + + LN Y + + +S ++ L+ + L + L+ + G+
Sbjct: 274 IHIDNSGNTKGIGLNKYYQDITDFKLKDQSDLNIFLEHSKVEILNDKNILIIDQFGESYN 333
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L DG+ V+ L ++K + IT I F+G + DS+L+++
Sbjct: 334 LQFFIDGKSVKDLLITKFEKDLQIRSPISITNIDEQNIFIGCQSSDSILIKY-------- 385
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
LK+E + + P+ + D + N F+ +
Sbjct: 386 --EKLKQETNEAKPTTPAATKTNNDDDDEDLYEDEDLNNNNDDELIN--------FNLQI 435
Query: 506 RDSLVNIGPLKDFSYGLRINADASATGIS--KQSNYELVEL--PGCKGIWTVYHKSSRGH 561
+D L N GPL F+ G +IN ++ G++ Q++ +V G +G T++++S +
Sbjct: 436 KDKLFNAGPLSSFTLG-KINPNSLIQGLTNPNQNDVSIVGTSGEGKQGKLTLFNQSIQPK 494
Query: 562 NADSSRMAAYDDEY---HAYLIIS-LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
S + + + + YLI + L+ + + + +S D+ TI +
Sbjct: 495 IHSSLKFNNINKTWNILNKYLITTDLQNFKSEIFLINENFKNFQSFDFKNNNITINIDTI 554
Query: 618 FGRRRVIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
++R++Q+ + D ++ + + F +++ I DP++++
Sbjct: 555 QSQKRILQITSNNVYLFDLNFKKLLQINFDF--------------EIINGKIFDPFIIIT 600
Query: 675 MSDGSIRLLVGDP 687
S G +++ DP
Sbjct: 601 SSKGEVKIFEMDP 613
>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
[Cyanidioschyzon merolae strain 10D]
Length = 1880
Score = 120 bits (301), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/324 (27%), Positives = 162/324 (50%), Gaps = 12/324 (3%)
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFST----GRNADNPQNLVTEVYS---KELKGAISALAS 1163
+L +GT +++GED + RGR+L+F GR Q + ++ + E+KGA+SA+A
Sbjct: 1546 VLVVGTCFLRGEDTSIRGRLLVFEISRQEGRQHHQHQRTLYQMQTLAATEVKGAVSAVAP 1605
Query: 1164 LQGHLLIAS-GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
++G + S GP++ ++K E++ I+FY L+ + +K +IL D+ + FL
Sbjct: 1606 VKGGFVCCSAGPRLEVYKLIEDEMSCISFYPGINLFFSHVGTLKQYILASDMRYGVSFLF 1665
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN-IQIFYYAPKMSESWKGQ 1281
W+ + N L +D + A+E+L+ G+ +++ +D N I++ +P ES G
Sbjct: 1666 WRSRNVSQNFLCRDEAQRELVASEWLMHGTKANVLSADMLGNIIELSIPSPVDPESAGGT 1725
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
++ A FHVG+ R+++ S++ S N +L GT+DG I ++PL
Sbjct: 1726 RMTFEAGFHVGSRPNAVRRVRIDDPSAETPPPNEPSSLWNTHVILLGTVDGMITMVSPLL 1785
Query: 1342 ELTFRRLQ-SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH--RPGPDSIVDCELLSHYEM 1398
++L+ + Q +++ L RS+R S A R SI+D ++L Y
Sbjct: 1786 RGVAKKLELAAQDLMLEPELRKWCLYARSWRVMRSLTVAAGLRKPKRSILDGDVLQLYGS 1845
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNL 1422
L + EIA + G + + +
Sbjct: 1846 LDTPRRKEIARRIGMPQEALFEAI 1869
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 112/499 (22%), Positives = 194/499 (38%), Gaps = 128/499 (25%)
Query: 243 MKHVK--DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI----STTLKQHP 296
+ HV+ D F+ G P MV+L+E TWAGRV ++C ++A+ + + + P
Sbjct: 345 LGHVRILDCCFLTGTALPTMVMLYEERPTWAGRVEAVSNSCALAAIVLPPLPAGAAGEEP 404
Query: 297 LI-WSAMNLPHDAYKLLAVPS------PIGGVLVVGANTIHYHSQSASCALAL--NNYA- 346
L+ W LP DA K++ +PS G+L++ AN + + + +L N++
Sbjct: 405 LVAWRIQGLPFDAEKVVPLPSVEWDRAAEQGLLLIAANVLFWIRGNGQIGASLSGNHFGD 464
Query: 347 --VSLDSSQELP---------------RSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
+ LD Q LP R+S + A ++ L G++ L
Sbjct: 465 TFMELDGCQ-LPGALYGGTDSDIISRCRTSQVLHFRGACIAPVRLHRYGLFLADGNVYQL 523
Query: 390 TVVYDGRVVQRLDL------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ D RL+ S+ P+ L D + L F+ + LG S+L + T
Sbjct: 524 ALHADAEYPLRLEALRVRGESRLAPAPL--DAKLLSRDLLFVAAHLGSSVLYRMT----- 576
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
P +R R S+++ G+ N + + +
Sbjct: 577 ---------------QVHPHGRRTRTSAAE-------------NGTLHKNATTKEAQWEL 608
Query: 504 AVRDSLVNIGPLKDF---------SYGLRINADA--SATGISKQS------------NYE 540
RD++ +GP+ D G ++ +ATG QS ++
Sbjct: 609 QQRDTIFQLGPIVDLVVIPPRYSPPAGTLLDPGEILAATGHQHQSCLARCTYQVQTREWQ 668
Query: 541 LVELPGCKGIWTVY--HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
+ GC+ +W++Y H + H + A+ + + L+ R + AD T
Sbjct: 669 RIPSAGCRRVWSLYADHDGTGMHQEEQ----AFLLLSLSKSSVILDIRRGFEQAAD--TR 722
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQDL---SFGPSNSES 652
V + TIAAGNL RR + QV G R+LD + +D+ + P + S
Sbjct: 723 V------LLPSPTIAAGNLAQRRLIAQVHRTGIRLLDANLDVVYEEDMLLAALEPGTAVS 776
Query: 653 GSGSENSTVLSVSIADPYV 671
G+ S+ DPY+
Sbjct: 777 GA----------SVVDPYI 785
>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
Length = 1307
Score = 120 bits (300), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 148/320 (46%), Gaps = 21/320 (6%)
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKG 1156
N+ T+ L +G +V+ ED+ G L+ P T +++ +E +G
Sbjct: 978 NSRTRRKREYLVVGNTFVRDEDIGTMGSFCLYDITEVVPEPGKPDTNYKLKQIFYEEFRG 1037
Query: 1157 AISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
A+S++ + G LI+ K+++ + +AF D P ++V N +++GD
Sbjct: 1038 AVSSVCEISGRFLISQSQKVLVRDVQEDNSVVPVAFLDVP-VFVTDSKSCGNLLIIGDAM 1096
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+ F+ + + ++ L K + + EFL++ ++ +VSD + I YAP
Sbjct: 1097 QGFQFVGFDAEPYRMIPLGKSVSKFEVMSLEFLVNNGSIYFLVSDRSNILHILKYAPDEP 1156
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
S GQKL+ F++ H T +L T P + F + DGS+
Sbjct: 1157 NSLSGQKLVHCTSFNL--HSTNTCMKLLLKNDEFPTLGEPPA-----FQAIGAQTDGSLF 1209
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCEL 1392
+ PL E ++RRL +Q++L++ H+ GLNP+ R F+ G RP ++D +
Sbjct: 1210 NVVPLSESSYRRLYMVQQQLIEKDVHLCGLNPKMERLQNDFYQLGHLMRP----MLDFTV 1265
Query: 1393 LSHYEMLPLEEQLEIAHQTG 1412
+ + LPL ++ +IA + G
Sbjct: 1266 IKSFATLPLNKRKQIAAKAG 1285
Score = 84.0 bits (206), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 127/651 (19%), Positives = 262/651 (40%), Gaps = 131/651 (20%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI-HGLRITSM 159
L L+ ++LHG + +A++ Q D ++++ AK+S++ FD S+ L S+
Sbjct: 47 LVLLHEFKLHGQITGMALVPQMEGP----LDCLVVSTGKAKLSLVRFDPSMPMCLETLSL 102
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG---LQMIILKASQGGSGLVGD 216
H +E+ ++ A+ +++DP+ RC VL++ L ++ L ++ D
Sbjct: 103 HYYEAE---FTRKNLIELAKTSKLRLDPERRC--VLLFNSDVLALLPLNINEEDE----D 153
Query: 217 EDTFGSGGGFSARIES---------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
++ + ++E+ S V+++ DL ++K+V D F++ + +P + +L++
Sbjct: 154 DNQEPTHQAKKRKVENGDARRLAKQSSVLHVSDLSAELKNVVDIQFLNSFSQPTLAVLYQ 213
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
L W+G M ++I+ K++ I+ LPHD + ++ + + ++VG
Sbjct: 214 PRLAWSGNDKVAGKGSM-RLMAITPHEKKNTTIYQVKELPHDVHTIIPLAN---SCVLVG 269
Query: 326 ANTIHY--HSQSASCALALNNYAVSLDSSQELPRSSFSVELD-----AAHATWLQNDVAL 378
N I ++ + + LN+++ S+++ SS V A+ ++ +
Sbjct: 270 VNEIVSVDNTGAIQSTIQLNSFSPKFTGSKQIDNSSLEVMFTEPIVWASAMVSKDREILI 329
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV--------LTSDITTIGNSL------FF 424
L D+ +T+ +GR++ L + P V L + I + + FF
Sbjct: 330 LMDHKADMYSITLQSEGRLLIDFTLVRL-PIVNDIFKDQNLPTCIVALSGGIRLKTCQFF 388
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+G GD+++V+ S+ L+ F +A DAL G++
Sbjct: 389 IGFSSGDAVVVK----------SNNLRSAFESQYREAIELPNDEDEDYDALY----GDDE 434
Query: 485 SLYGSASNNTESAQKTFSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYEL- 541
L ++N + + F + DSL+N+GP+ G + +A+ G+ + EL
Sbjct: 435 DLARPVNDNKATVETAVPFEIELMDSLINVGPITSICTGRVSSINATIEGLPNPNRNELA 494
Query: 542 ---------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
++ IW + + + + A D
Sbjct: 495 IVSTSGHDSGTYLNVMEPSVRPLVQQALKFTSVTKIWNLKIRKKDKYLVTTDSGAEKSDV 554
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
Y + A+ ++ VT T+ L G +R++QV + +
Sbjct: 555 YE------IGAKIASIKPKHFKRNVT----------TVEIAILGGGKRIVQVTTKAVYLF 598
Query: 635 DGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ + MT F V+ VSI DP++LL S G I++
Sbjct: 599 NLGFKKLMTISFDF--------------EVVHVSILDPFILLTNSKGEIKI 635
>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 119 bits (299), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 115/220 (52%), Gaps = 2/220 (0%)
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1265
KN+ L+GDI + I L + + ++ +++ + + A L+DG+ + LV +D Q+N+
Sbjct: 4 KNYALVGDIQQGITLLRHQGERNCISQISRARRAGEVTAVGILLDGNQVGLVSTDMQRNL 63
Query: 1266 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1325
Q++ Y P ES G++L+ +A+ ++G V L +D ++ R
Sbjct: 64 QVYMYKPDQKESNGGKQLVRQADINLGKRVISIW--NSLGRQNDTFTKVALTENDARHVT 121
Query: 1326 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1385
+ LDGSIG I P+ E FRRL+ LQ + +PH GLNPR +R + +
Sbjct: 122 FYAGLDGSIGDIVPVSEKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAK 181
Query: 1386 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+I+D +LL + L EQ +++ + G TR +L ++ D+
Sbjct: 182 NIIDGDLLERFNGLSFTEQTDLSRKIGVTREALLDDMMDV 221
>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
90-125]
gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
Length = 1327
Score = 119 bits (297), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/513 (21%), Positives = 228/513 (44%), Gaps = 54/513 (10%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
+R+ F N++G+ F++G P + + Q V+ + + +G I+
Sbjct: 827 RRLVYFPNLNGYTSIFVAGVIPFLIIKSCHSIPRIFQFSKIPAVSISAFSDSKIKNGLIF 886
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
+ + +IC+L Y+ P+++V + + + Y + + +++S P N
Sbjct: 887 LDNNQNARICELSLDYNYEFNLPIRRV-HIGESIRSVAYHEQSD--TVVISTFKEIPYN- 942
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1091
+D+E G I + T + ++++ P W+ TI +Q +E
Sbjct: 943 ----CVDEE-GKPI----AGVLKDKPPATSFKGSIKLVSPFN----WKVIDTIELQDNEV 989
Query: 1092 ALTVRVVTL-FNTTTKENET---LLAIGTAYVQGEDVAARGRVLLFST-------GRNAD 1140
+ ++ + L ++ K+ +T + +GT ++ ED+AA G ++ G+
Sbjct: 990 GMAIKSMVLDVGSSMKKFKTKREYIVVGTGKLRMEDLAANGSFKIYDIIDIIPEPGKPET 1049
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
N + E++ ++ +GA++++ L G L+ G K+I+ + +AF D P +YV
Sbjct: 1050 NHK--FKEIFQEDTRGAVTSVCDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTP-VYVS 1106
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
N LLGD KSI+ + ++ ++ +L KD L +F++ + ++V+D
Sbjct: 1107 EAKSFGNLFLLGDPLKSIWLVGFEADPFRMVMLGKDRQHLRVECADFIVKDEEIFILVAD 1166
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1320
++ + + P +S G L+++A F + T LR + P + T
Sbjct: 1167 VNNSLHLIQFDPDDPKSINGTILINKASFETNSQTT-CLR------------SVPKGE-T 1212
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHS 1375
+ + T+DG+ + P++E T+RR+ +Q+++ D H GLNPR R Q
Sbjct: 1213 GDYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEYHYCGLNPRLNRFGGAVQIRD 1272
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
N +P I+D L+ + L L+ Q I
Sbjct: 1273 NDTNAKP----ILDYNLIKEFAKLNLDRQKNIT 1301
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 83/375 (22%), Positives = 157/375 (41%), Gaps = 57/375 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+LV ++L G V L L + D ++++ + AK S++ ++ +H + S+H
Sbjct: 57 LKLVEQFKLQGTVSGLKALRTSECPH---LDYVVVSTKYAKFSIIRWNHQLHNISTVSLH 113
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
+E+ E A L V+P L Y + L
Sbjct: 114 YYEN---CIQHSTFEKLAISDLT-VEPTYSSVSCLRYKNLLCFLPFEGVHEEDDEDDTDD 169
Query: 205 ----KASQGGS----GLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHG 254
+GGS GL + F ++S +I+ LD + V D F+H
Sbjct: 170 EDIDNDKKGGSITKNGLSYENQPF---------YDASFIIDAGILDSTIDTVLDVQFLHN 220
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y EP + IL + +WAG + +++ K +++ NLP+D +++ +
Sbjct: 221 YQEPTIAILSAKSNSWAGNLIKNKDNVQFQVMTLDVQSKSTLPVFNIDNLPYDIDRVIPL 280
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHA 369
P+P+ G L++G N IH + + +A+N + S+ S Q+ S +++L+
Sbjct: 281 PNPLNGCLLIGCNELIHVDNGGIAKRIAVNAFTSLITASVKSYQD--ESDLNLKLENCAI 338
Query: 370 TWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNS 421
+ +D LL TG+ L DG+ ++++ L + + S + ++ +
Sbjct: 339 VPIPDDHRVLLILATGEFYYLNFDLDGKSIKKIHLELVDQKMYDSIRLTYPGQVASLDKN 398
Query: 422 LFFLGSRLGDSLLVQ 436
L F + GDS LV+
Sbjct: 399 LLFFANLNGDSSLVE 413
>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
Length = 1334
Score = 118 bits (296), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/513 (21%), Positives = 229/513 (44%), Gaps = 54/513 (10%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
+R+ F N++G+ F++G P + + Q V+ + + +G I+
Sbjct: 834 RRLVYFPNLNGYTTIFVTGVIPFLIIKSCHSIPRIYQFSKIPAVSVSAFSDSKIKNGLIF 893
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
+ + +IC+L +Y+ P++KV + + + Y + + V+ +
Sbjct: 894 LDNNQNARICELSWDYSYEFNLPIRKV-HIGESIKSVAYHEQSD-------TVVISTFKE 945
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1091
+ +D+E G I ++ T + ++++ P W+ T+ + +E
Sbjct: 946 IPYDCVDEE-GKPI----AGALKDKPPATSFKGSIKLVSPYN----WKVIDTVELSDNEV 996
Query: 1092 ALTVRVVTL-FNTTTKENET---LLAIGTAYVQGEDVAARGRVLLFST-------GRNAD 1140
++++ + L ++ K+ +T + IGT+ ++ ED+AA G ++ G+
Sbjct: 997 GMSIKSMVLDVGSSLKKFKTKREYIVIGTSKLRMEDLAANGSFKIYDIIDIIPEPGKPET 1056
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
N + E++ ++ KGA++++ L G L+ G K+I+ + +AF D P +YV
Sbjct: 1057 NHK--FKEIFQEDTKGAVTSICDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTP-VYVS 1113
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
N LLGD KSI+ + ++ ++ +L KD L +F++ + ++V+D
Sbjct: 1114 EAKSFGNIFLLGDALKSIWLVGFEADPFRMVMLGKDRQHLHVECADFIVKDEEIFILVAD 1173
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1320
+ + + P +S G L+++A F + T LR + P D+
Sbjct: 1174 INNGLHLIQFDPDDPKSINGTILVNKASFETNSQTT-CLR------------SVP-KDEA 1219
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHS 1375
+ + T+DG+ + P++E T+RR+ +Q+++ D H GLNPR R Q
Sbjct: 1220 GDYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEFHHCGLNPRLNRFGGAIQIRD 1279
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+ +P I+D L+ + L L+ Q IA
Sbjct: 1280 SDTNAKP----ILDYNLIREFAKLNLDRQRNIA 1308
Score = 87.4 bits (215), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 98/458 (21%), Positives = 181/458 (39%), Gaps = 64/458 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+LV ++L G V L L + + D II++ + AK S+++++ +H + S+H
Sbjct: 57 LKLVEQFKLQGTVTGLKPLR---TSENPQLDYIIVSTKYAKFSIIKWNHQLHSISTVSLH 113
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
+E+ E A L+ V+P L Y + L
Sbjct: 114 YYEN---CIQHSTFEKLAISDLI-VEPTYSSVSCLRYKNLLCFLPFEGVNDHDDDDDDDD 169
Query: 205 -----KASQG-GSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
+G + G + + G+ +SS +I+ L+ + V D F+H Y
Sbjct: 170 DDDDTDDEKGVAENVAGVDKSNGASNDNQPFYDSSFIIDAGTLESSVDSVLDLQFLHHYQ 229
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
E + IL + +WAG + +++ K +++ NLP+D +++ +
Sbjct: 230 ETTIAILSSKSNSWAGNLIKNKDNVQFQVMTLDIQSKSTLPVFTIDNLPYDIDRIIPLSK 289
Query: 317 PIGGVLVVGAN-TIHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATW 371
P+ G L++G N IH + + +A+N + S+ S Q+ + +E D +
Sbjct: 290 PLNGCLLLGCNEIIHVDNGGIAKRIAVNAFTSLITASVKSYQDESELNLKLE-DCSIVPI 348
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFF 424
++ LL TG+ L DG+ ++R+ L + ++ T+ N+L F
Sbjct: 349 PEDHRVLLILATGEFYFLNFELDGKSIKRIHLEAVEQKAYDAIKLTYSGEVATLDNNLLF 408
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+ GDS LV+ S + +E K D + GEE
Sbjct: 409 FANMNGDSPLVEIKYSSSAKV-----------VEKQVLDKKEEDSDEEDLYNEDEEGEEQ 457
Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
+ + F + DSL+N GP+ F+ GL
Sbjct: 458 KVMRKSH---------IEFKLHDSLINNGPVSSFTLGL 486
>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
Length = 1542
Score = 117 bits (294), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 131/577 (22%), Positives = 252/577 (43%), Gaps = 60/577 (10%)
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
SA + L IL+ G ++ Y+ + GP K + + + V +R +
Sbjct: 930 SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKR 989
Query: 890 LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVH 946
L+ R L + T ++ H + +R+ F+ + H+G ++ G P + + +L
Sbjct: 990 LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCT 1046
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYDNY-WPVQKVIPLKAT 1004
++ F H+ + + GF+Y +G + + P G + W +++V L T
Sbjct: 1047 RHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGSSGWWLERV-RLGCT 1104
Query: 1005 PHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSV 1053
PHQ+ Y + ++ S P P + L ++ D+E V H I+ +L S+
Sbjct: 1105 PHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSAT 1164
Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------ 1107
T E YEV+ + WQ + + +E L+ ++ + TT +
Sbjct: 1165 SGSPVPTNERYEVQFF----STLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRST 1220
Query: 1108 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QG 1166
+ A+ TAY GEDV RGR+LL +T + + ++ + +KG ++A+ + +
Sbjct: 1221 TAPVCALATAYPLGEDVTTRGRILLLTTSQQGGQGMQQLRTLHEEPMKGPVTAITRVGED 1280
Query: 1167 HLLIASGPKIILHKW----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+ +A G + ++++ + E I + A YV L +N++++GD+ S+ F
Sbjct: 1281 CVAVAVGGTVRVYRYDTNKSTMETMAILYAGA---YVTCLQAFRNYLVIGDLFNSVLFAR 1337
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQ 1281
+ E+ + +L +D ++ + + L + L+V+D+ +N+ Y P+ + E K
Sbjct: 1338 YSEEIHTITILGRDTNAISVVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPP 1397
Query: 1282 KLLSR-----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
K+L E+ + V L++ L +S R N ++ T G IG
Sbjct: 1398 KILESLLTVTGEYRLAGGV--LLKMMRLRAASTR----------NSSVAIYVTNMGEIGY 1445
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
+ PL + T R Q + ++L V H GL PR F F
Sbjct: 1446 LVPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMFLGF 1482
Score = 41.2 bits (95), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
+ A+ G G + GG S + V + R D+K +++D FV E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321
Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
P++ L E++ TWAGRV W+ TC I ++++ + H L S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381
Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
P+D + +P+ P V +H ++S A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441
Query: 351 ------SSQELPRSSFSVELDAAHAT 370
+SQ L V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467
>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
Length = 1542
Score = 117 bits (294), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 132/578 (22%), Positives = 254/578 (43%), Gaps = 62/578 (10%)
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
SA + L IL+ G ++ Y+ + GP K + + + V +R +
Sbjct: 930 SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKR 989
Query: 890 LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRV 945
L+ R L + T ++ H + +R+ F+ + H+G ++ G P + +V+ +L
Sbjct: 990 LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRHKGIYVCGQTPVF-LVYHAATNQLVC 1045
Query: 946 HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYDNY-WPVQKVIPLKA 1003
++ F H+ + + GF+Y +G + + P G + W +++V L
Sbjct: 1046 TRHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGSSGWWLERV-RLGC 1103
Query: 1004 TPHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SS 1052
TPHQ+ Y + ++ S P P + L ++ D+E V H I+ +L S+
Sbjct: 1104 TPHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSA 1163
Query: 1053 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE----- 1107
T E YEV+ + WQ + + +E L+ ++ + TT +
Sbjct: 1164 TSGSPVPTNERYEVQFF----STLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRS 1219
Query: 1108 -NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-Q 1165
+ A+ TAY GEDV RGR+LL +T + + ++ + +KG ++A+ + +
Sbjct: 1220 TTAPVCALATAYPLGEDVTTRGRILLLTTSQQGGQGMQQLRTLHEEPMKGPVTAITRVGE 1279
Query: 1166 GHLLIASGPKIILHKW----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
+ +A G + ++++ + E I + A YV L +N++++GD+ S+ F
Sbjct: 1280 DCVAVAVGGTVRVYRYDTNKSTMETMAILYAGA---YVTCLQAFRNYLVIGDLFNSVLFA 1336
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKG 1280
+ E+ + +L +D ++ + + L + L+V+D+ +N+ Y P+ + E K
Sbjct: 1337 RYSEEIHTITILGRDTNAISVVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKP 1396
Query: 1281 QKLLSR-----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
K+L E+ + V L++ L +S R N ++ T G IG
Sbjct: 1397 PKILESLLTVTGEYRLAGGV--LLKMMRLRAASTR----------NSSVAIYVTNMGEIG 1444
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
+ PL + T R Q + ++L V H GL PR F F
Sbjct: 1445 YLVPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMFLGF 1482
Score = 41.2 bits (95), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
+ A+ G G + GG S + V + R D+K +++D FV E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321
Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
P++ L E++ TWAGRV W+ TC I ++++ + H L S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381
Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
P+D + +P+ P V +H ++S A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441
Query: 351 ------SSQELPRSSFSVELDAAHAT 370
+SQ L V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467
>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Ustilago hordei]
Length = 1568
Score = 117 bits (292), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 148/312 (47%), Gaps = 29/312 (9%)
Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFS-----TGRNADNPQNLVTEVYSKELKGA-ISALA 1162
+ +A+GT GED +G V LF + R ++L ++ ++ A ++ALA
Sbjct: 1153 KQFIAVGTTTYHGEDRTCKGSVYLFEIIQVVSSRRFQVGRDLRLKLICRDGSNAPVTALA 1212
Query: 1163 SLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
L G LL SG K+ + E L +AF D P Y+ S+ +VKNF+LL D K ++FL
Sbjct: 1213 ELHGFLLSTSGQKLYVRALEKEEWLISVAFLDCP-FYITSIRVVKNFVLLSDAKKGLWFL 1271
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS------------DEQKNIQIFY 1269
+++E + L EFL+ LSLV + + I+++
Sbjct: 1272 AFQEDPYRFVDLGSALDGHCANLGEFLVYNDKLSLVSTSGVALGGFSGFGQDSGVIRLYE 1331
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFALLF 1327
Y P S GQ+LL R E+ + T L + L+ S R G ++ R LL
Sbjct: 1332 YNPSSPTSLGGQRLLLRTEYSTPSSTTCSLSAPGRWLSDSELR-----GREQL-RNKLLL 1385
Query: 1328 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
+GS+ +A ++E +RL LQ +LV SV H A LNPR+FRQ N RP +
Sbjct: 1386 SKSNGSLDSLASVEEKVAKRLHLLQGQLVRSVLHTAALNPRAFRQVR-NDFVSRPLYKGV 1444
Query: 1388 VDCELLSHYEML 1399
+D LL ++ L
Sbjct: 1445 LDARLLDAFKGL 1456
Score = 85.9 bits (211), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/402 (24%), Positives = 178/402 (44%), Gaps = 67/402 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
LV +++ IY V + S N + ++ L L+ + L G V L
Sbjct: 46 QLVTARDDLLSIYDVYERPSASANPSNGHSSPDS------TSHRLVLIRKHSLFGTVTGL 99
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWL----HLK 171
+ Q + + RD ++++F DAK+++LE++ + L S+H +E +P+ L HL
Sbjct: 100 QRI-QTLSTSKDSRDRLLVSFTDAKLALLEWNHTTDDLETVSIHTYERAPQLLNGIPHLF 158
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS------GLVGDEDTFGSGGG 225
+ P + +DP RC +L+ + IL + + GL D + +G
Sbjct: 159 Q--------PNLNIDPLSRCAALLLPHDALAILPFYRDAAEFEFDHGLHLDLNLDFAGED 210
Query: 226 FSA-----RIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+A +IES S V+ +R++D ++++KDF F+ G+ +P + +L T G
Sbjct: 211 KAAMQAAVQIESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVALLFAHSPTCTGL 270
Query: 274 VSWKHHT----------------CMISALSIS----TTLKQHPLIWSAMNLPHDAYKLLA 313
++ + M+ + S S T HP++ ++ +LP+D +L
Sbjct: 271 LAERKDNFSVYLFTLDLAASLDGAMLGSASYSFDDATLRSMHPVLTTSSSLPYDCLYMLP 330
Query: 314 VPSPIGGVLVVGANTIHYHSQSAS-CALALNNY-----AVSLDSSQELPRSSFSVELDAA 367
P +GGVLVV ++I + QS A ALN + A+ +S +LP + +L +
Sbjct: 331 CPQTLGGVLVVCMSSILHVDQSGRVVATALNGWFNLVSAIQPESLLDLPEIA---DLQGS 387
Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ +L+ GD+ T DGR +Q L + S
Sbjct: 388 QLVFTAETEGVLTLVHGDVYTFTCQMDGRNIQGFRLERMQQS 429
>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
cruzi]
gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 116 bits (291), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 124/544 (22%), Positives = 228/544 (41%), Gaps = 68/544 (12%)
Query: 912 QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
+RI F I G+ G ++ G P W RE L + G + F +N
Sbjct: 910 RRIVPFDAIGGNTGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968
Query: 965 -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C GF+ S C+ P+G + + + I L TPH + Y ++ S
Sbjct: 969 YCCEGFVDFASMDTY--CR-PTGQGW-----LTRRIHLGVTPHFVVYHPPARSCFVVTSK 1020
Query: 1024 -----PVLKPLNQVLSLLIDQEVG------HQIDNHNLSSVDLH---RTYTVEEYEVRIL 1069
P P + L+++ D+E G + N+ + + R + +E+R++
Sbjct: 1021 KEPFRPQRAPFDVQLNIVYDEESGGVQSITTEAPVSNMPPIAPNAGIRVPMADRFEIRLM 1080
Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE---NETLLAIGTAYVQGEDVAA 1126
+ W T+ ++ +E L +++ + E + + TA+ GED+
Sbjct: 1081 ----STTDWACTDTLLLEENERVLGAQMMEIQCERDAEGLHTAPVCVVSTAFPLGEDITC 1136
Query: 1127 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WTGT 1184
RGR+LL +T + + +S+ L G +A+ ++ H+ +A G I L + W+
Sbjct: 1137 RGRILLLAT--ICTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWSNR 1194
Query: 1185 ELN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L G Y YV ++ +N+++ GD+ +S + E+ L++L KD ++
Sbjct: 1195 KLVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVSVV 1252
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GAHV 1295
+ + L+ SD+++N+ + Y P++ E+ G K+L E+ + G +
Sbjct: 1253 HCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGGCL 1312
Query: 1296 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1355
K LR + LA +S T L+ T G IG I P+ E R L ++L
Sbjct: 1313 VKSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMRRL 1358
Query: 1356 VDSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
+PH AGL PR F + + + +V LL+ + L + + IA T
Sbjct: 1359 QIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSRKTIASAAYTQ 1418
Query: 1415 RSQI 1418
++
Sbjct: 1419 LERV 1422
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
NN + D E + F V L A+ T + N++ +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
C + M S + + F IE+
Sbjct: 480 CIAKVDMFHSDVAKRFQIIES 500
>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 116 bits (291), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 127/546 (23%), Positives = 228/546 (41%), Gaps = 72/546 (13%)
Query: 912 QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
+RI F I G+ G ++ G P W RE L + G + F +N
Sbjct: 910 RRIVPFDAIGGNAGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968
Query: 965 -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C GF+ S C+ P+G + + + I L TPH + Y ++ S
Sbjct: 969 YCCEGFVDFASMDTY--CR-PTGQGW-----LTRRIHLGVTPHFVVYHPPARSCFVVTSK 1020
Query: 1024 -----PVLKPLNQVLSLLIDQEVG--HQIDNH----NLSSVDLH---RTYTVEEYEVRIL 1069
P P + L ++ D+E G I N+ + + R + +E+R++
Sbjct: 1021 KEPFRPQRSPFDVQLKIVYDEESGGVQSITTEAPVCNMPPIAPNAGIRVPMADRFEIRLM 1080
Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL-----LAIGTAYVQGEDV 1124
+ W T+ ++ +E L +++ + K+ E L + TA+ GED+
Sbjct: 1081 ----STTDWACTDTLLLEENERVLGAQMMEI--QCEKDAEGLHTAPVCVVSTAFPLGEDI 1134
Query: 1125 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WT 1182
RGR+LL +T + + +S+ L G +A+ ++ H+ +A G I L + W
Sbjct: 1135 TCRGRILLLAT--MCTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWN 1192
Query: 1183 GTELN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
+L G Y YV ++ +N+++ GD+ +S + E+ L++L KD ++
Sbjct: 1193 NRKLVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVS 1250
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GA 1293
+ + L+ SD+++N+ + Y P++ E+ G K+L E+ + G
Sbjct: 1251 VVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGG 1310
Query: 1294 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
+ K LR + LA +S T L+ T G IG I P+ E R L +
Sbjct: 1311 CLVKSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMR 1356
Query: 1354 KLVDSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+L +PH AGL PR F + + + +V LL+ + L + + IA
Sbjct: 1357 RLQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSRKTIASAAY 1416
Query: 1413 TTRSQI 1418
T ++
Sbjct: 1417 TQLERV 1422
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 61/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
NN + D E + F V L A+ T + N++ +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I +G+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTDCYCSGIARLGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
C + M S + + F IE+
Sbjct: 480 CIAKVDMFHSDVAKRFRIIES 500
>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Pseudozyma antarctica T-34]
Length = 1543
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 148/316 (46%), Gaps = 28/316 (8%)
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-----ELKGAISALASLQ 1165
+A GT+ GED ++G V LF L ++ K + + ++A+A L
Sbjct: 1169 FIAAGTSTFHGEDRTSKGSVYLFEVIEVVSGKYQLGRDLRLKLVCRDDARAPVTAIAELN 1228
Query: 1166 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
G LL G K+ + E L +AF D P Y+ SL ++KNF+L+ D KS+ L+++
Sbjct: 1229 GFLLSTCGQKLYVRALEKEEWLISVAFLDGP-FYMTSLRVLKNFVLVSDAKKSLCLLAFQ 1287
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE------------QKNIQIFYYAP 1272
E+ + L ++ + +FL+ LSLV + + I+++ YAP
Sbjct: 1288 EEPYRFVDLGREINDHNASMAQFLVYNDRLSLVSTSDVPLGGISGFGASAGVIRLYEYAP 1347
Query: 1273 KMSESWKGQKLLSRAEFHVGAHV--TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
++ + G +LL R+EF A + R + L+ S R G G K L+
Sbjct: 1348 HVATTLGGHRLLLRSEFQTPAAAVGSTVCRGRWLSDSELR-GREEGRSK-----LVLAKA 1401
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
+G++ ++ LD+ +RL LQ +LV SV H A LNPR+FR N R I+D
Sbjct: 1402 NGALDSLSALDDKVAKRLHLLQGQLVRSVQHTAALNPRAFRAVR-NDFVPRSLAKGILDA 1460
Query: 1391 ELLSHYEMLPLEEQLE 1406
LL + L + LE
Sbjct: 1461 RLLDRFVWLSRPKMLE 1476
Score = 94.7 bits (234), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 101/406 (24%), Positives = 170/406 (41%), Gaps = 58/406 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
LV +++ IY V + + S T D +L + + L G V L
Sbjct: 46 QLVTARDDLLTIYDVYDRSSSQSAASTSNGTANGTAGDAKPRHTLIVTRRHSLFGTVTGL 105
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ +D R ++++F DAK+++LE++D+ L S+H +E L G
Sbjct: 106 QRVDTLASDKDARH-RLLVSFADAKLALLEWNDTTDDLETVSIHTYERAT--QLLNGTPP 162
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS--------- 227
R P + VDP RC +L+ + IL + + E F G GF
Sbjct: 163 LFR-PNLNVDPLSRCAALLLPHDALAILPFYRDNA-----EFDFDDGLGFDLANDALDAS 216
Query: 228 --------ARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
A +ES S V+ +R++D ++++KDF F+ G+ +P + +L + TW G
Sbjct: 217 DAAAMAAAAHMESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFDHSPTWTG 276
Query: 273 RVSWKHHTCMIS--ALSISTTL------------------KQHPLIWSAMNLPHDAYKLL 312
++ + + + L +S +L HP++ ++ LP+D +L
Sbjct: 277 LLTHRKDSFAVYLFTLDLSASLDGATLGSAAALLDDGNMRSAHPVVTTSSQLPYDCLYML 336
Query: 313 AVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV----ELDAAH 368
P +GGVLVV + I + QS + N S+ E P S V +L A+
Sbjct: 337 PCPQSLGGVLVVCMSAILHVDQSGRVVVTALNRWFKTTSAIE-PESVLDVPGLADLQASQ 395
Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+ + A+LS GDL L DGR V+ L + + SD
Sbjct: 396 LVFTTDTDAVLSLSNGDLYRLRCHMDGRSVEGFRLERIDQLTAGSD 441
>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Sporisorium reilianum SRZ2]
Length = 1567
Score = 115 bits (289), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 139/562 (24%), Positives = 226/562 (40%), Gaps = 58/562 (10%)
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP--CQRITIFKNISGHQGFF 927
P +RSL ++ + R P + TR E HGA C + + Q
Sbjct: 965 PAGGARSLDADGIATTGYRAGAVKLEPFEIGTRAEK-HGADGDCNALAVLGGGREAQASV 1023
Query: 928 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG-FIYVTSQGILKICQLPSG 986
L CW RL P+ G + ++ N F Y G L + + P G
Sbjct: 1024 L-----CWTEQGGYRLLDWPE---GDLCCIASIYTPRANDADFAYCDRAGQLWLARAPHG 1075
Query: 987 STYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1046
+ W V T + T + +V+ + Q ++ E G I
Sbjct: 1076 LYAETSWMSSVV----RTGREYTRVVAHDATHTVVAASI-----QPCRFVLFDEDGEPIA 1126
Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1105
+ T E+ I E DR +++E + +VTL +T
Sbjct: 1127 DPGADEALPSTTAQRGALELFISE-DRTTAA----DGYEFEANETVTALEIVTLDAPSTA 1181
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLF------STGRNADNPQNLVTEVYSKELKGAIS 1159
+ +A GT GED A+G V LF ++ R + V + +G ++
Sbjct: 1182 SGRKQFVAAGTTTFHGEDRTAKGCVYLFEVIEVVASARYQVGRDLRLKLVCRDDSRGPVT 1241
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
A+A L G L+ G K+ + E L IAF D P LYV + +VKNF+LL D KS+
Sbjct: 1242 AIAQLNGFLVSTCGQKLYVRALEKEEWLISIAFLDCP-LYVTGIRVVKNFVLLSDARKSL 1300
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD------------EQKNIQ 1266
+ L+++E+ + L +D ++L+ L+LV + + ++
Sbjct: 1301 WLLAFQEEPYRFVDLGRDIHDHHATLGQYLVYNERLALVSTSGAALGGSTAFGRDAGVVR 1360
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTGAAPGSDKTNRFA 1324
++ YAP ++ + +L+ R EF + T + R + L+ S R G G +K
Sbjct: 1361 LYEYAPHVASA--NTRLVLRTEFQTASPATASVACRGRWLSDSELR-GREHGRNK----- 1412
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
L+ +G++ +A D+ +RL LQ +LV SV H A LNPR+FR N R
Sbjct: 1413 LVLAKANGALETLAAADDRVAKRLHVLQGQLVRSVLHTAALNPRAFRAVR-NDFVSRALG 1471
Query: 1385 DSIVDCELLSHYEMLPLEEQLE 1406
++D LL + L + LE
Sbjct: 1472 KGVLDARLLDSFVYLSRPKMLE 1493
Score = 89.0 bits (219), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 158/349 (45%), Gaps = 48/349 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + L G V L + Q A + RD ++++F+DAK+++LE++D L S+H
Sbjct: 92 LVLVRRHTLFGVVTGLQRV-QTLATDKDARDCLLVSFKDAKLALLEWNDLTDDLETVSIH 150
Query: 161 CFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-------KASQGGSG 212
+E +P+ L+ G + P++ VDP RC +L+ + +L
Sbjct: 151 TYERAPQLLN---GTPNLFH-PILNVDPLSRCAALLLPHDALAVLPFYRDAADFDFDLDD 206
Query: 213 LVGDEDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
+ + +A +E+ S V+ +R++D ++++KDF F+ G+ +P + +L
Sbjct: 207 RLDLAKDDAAAVAAAAEMETLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFS 266
Query: 266 RELTWAGRVSWKHHTCMI-------------------SALSISTTLKQHPLIWSAMNLPH 306
TW G ++ + T + AL T HP++ ++ LP+
Sbjct: 267 HTPTWTGLLAERKDTFSVYLFTLDLSASLDGTLSSAADALDDGTVRSAHPVVTTSTALPY 326
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSF 360
D +++ P +GGVLVV +++ + QS + ALN + A+ +S +LP
Sbjct: 327 DCLYMVSCPQTLGGVLVVCMSSVLHVDQSGRVVVTALNGWFKTISAIEPESVLDLPEIP- 385
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+L + + +L+ GDL DGR V+ L + + S
Sbjct: 386 --DLQGSQLVFTAETAGVLALVDGDLYRFRCQMDGRSVEGFRLERMDQS 432
>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
Length = 1340
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 155/326 (47%), Gaps = 23/326 (7%)
Query: 1097 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYS 1151
++ L + + K+ E ++A G ++ ED+ + G ++ P T E++
Sbjct: 1003 LIQLDSKSRKKREYIVA-GITFIGTEDLPSTGAFHIYDLTEVIPEPGKPDTNFKLKEIFK 1061
Query: 1152 KELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1210
++++G+++++ + G LI KI++ + +AFYD P ++V NF++
Sbjct: 1062 EDIRGSVNSVCDISGRFLINQSQKIMVRDVQEDNSVVPVAFYDTP-IFVSDAKSFGNFLI 1120
Query: 1211 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1270
LGD + FL + + ++ L + S + + EFLI+ ++ ++D + + + Y
Sbjct: 1121 LGDSMQGFQFLGFDAEPYRMIPLGRSVSSFETVSVEFLINAGEINFAITDREDILHVLKY 1180
Query: 1271 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGT 1329
AP + GQKL+ + F++ + T L L R SDK +F + G
Sbjct: 1181 APDEPNTLSGQKLVHCSSFNLYSSNTCMLMLP-------RNDEFETSDKAPPKFQAIGGQ 1233
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDS 1386
+DG I I PL E T+RRL +Q++++D + GLNPR R F+ RP
Sbjct: 1234 VDGGIFKIIPLKEDTYRRLYVVQQQIIDKEVQLGGLNPRMERLDNDFYQLTHVMRP---- 1289
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTG 1412
++D ++ + L +E + A + G
Sbjct: 1290 MIDFNIIRRFSELSIERRTHFAQKAG 1315
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 83/386 (21%), Positives = 165/386 (42%), Gaps = 64/386 (16%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + L+ + +A++ Q + S +++A AKIS++ FD + L S+H
Sbjct: 48 LNLVEEFNLNAKITDIALIPQEKSPLS----CLVIASGVAKISIVRFDAVTNSLETLSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
+E + A+ ++VDP R +L++ I L G+
Sbjct: 104 YYEDKLS---DISLVTLAKTSKLRVDPMNR--ALLLFNNDSIALLPLFSGNHEDEDEDDE 158
Query: 212 ----GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
+ E T + S + ++++L ++++V D F++ + +P + +L++
Sbjct: 159 EDDYDVTRGEVTTKRSKKNEKHVGQSKIFHVKELHQELQNVLDIQFLNDFTKPTLAVLYQ 218
Query: 266 RELTWAGRVSWKHH--TCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+LTW G + MI L++ T T +I + +L D ++LL +
Sbjct: 219 PKLTWVGNTELNPQPTSFMIFTLNLRTNELETAFDVVIIATLHDLSWDWFQLLPISR--- 275
Query: 320 GVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSFSVELDA---AHATW--- 371
G +V+G N + Y + + LN++A D S + R EL+ T+
Sbjct: 276 GCVVMGNNEMAYIDNTGVLQSIIHLNSFA---DKSLQRARIIDETELEVFFNEKVTYFWS 332
Query: 372 -------LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----------TNPS-VLT 412
+ ++ L+ + +L + + +GR++ + DL K +NP+ V
Sbjct: 333 ASTDKKNIDDETLLIIDASANLYYVRLEAEGRLLTKFDLIKLPIVNDALKDTSNPTCVAR 392
Query: 413 SDITTIGNSL-FFLGSRLGDSLLVQF 437
D + +S+ F+G GDSL+V+
Sbjct: 393 VDPNSSNSSMDLFIGYLSGDSLVVRL 418
>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi marinkellei]
Length = 1436
Score = 115 bits (287), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 126/544 (23%), Positives = 229/544 (42%), Gaps = 68/544 (12%)
Query: 912 QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
+RI F +I G+ G ++ G P W RE L + G + F +N
Sbjct: 910 RRIVPFDSIGGNAGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968
Query: 965 -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C GF+ S C+ P+G + + + I L TPH + Y ++ S
Sbjct: 969 YCCEGFVDFASMDTY--CR-PTGQGW-----LTRRIHLGVTPHFVVYHPPARSCFVVTSK 1020
Query: 1024 -----PVLKPLNQVLSLLIDQEVG--HQIDNH----NLSSVDLH---RTYTVEEYEVRIL 1069
P P + L+++ D+E G I N+ + + R + +E+ ++
Sbjct: 1021 KEPFRPQRAPFDVQLNIVYDEESGGVQSITTEAPVCNMPPIPPNAGIRVPMADRFEICLM 1080
Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE---NETLLAIGTAYVQGEDVAA 1126
+ W T+ ++ +E L +++ + E + + TA+ GED+ +
Sbjct: 1081 ----STTDWACTDTLLLEENERVLGAQMMEIHCEKDAEGLHTAPVCVVSTAFPLGEDITS 1136
Query: 1127 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WTGT 1184
RGR+LL ST + L+ +S+ L G +A+ ++ H+ +A G I L + W
Sbjct: 1137 RGRILLLSTMCTKKKRKILL--FHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWENR 1194
Query: 1185 ELN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L G Y YV ++ +N+++ GD+ +S + E+ L++L KD ++
Sbjct: 1195 KLVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVSVV 1252
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GAHV 1295
+ + L+ SD+++N+ + Y P++ E+ G K+L E+ + G +
Sbjct: 1253 HCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGGCL 1312
Query: 1296 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1355
K LR + LA +S T L+ T G IG I P+ E R L ++L
Sbjct: 1313 VKSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMRRL 1358
Query: 1356 VDSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
+PH AGL PR F + + + +V LL+ + L + + IA T
Sbjct: 1359 QMDLPHNAGLTPRMFLGLSQGSPRTALRAKEMLVSASLLNEFFFLDIHSRKTIASAAYTQ 1418
Query: 1415 RSQI 1418
++
Sbjct: 1419 LERV 1422
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 105/260 (40%), Gaps = 54/260 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
NN + D E + F V L A T + N+ + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVSWYDPKMETSTALFKVNLSLASCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S IT IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQDIRIAILGTGCYCSGITRIGDQIVFLGSAFGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIE 458
C + M S + F IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499
>gi|389602597|ref|XP_001567507.2| cleavage and polyadenylation specificity factor-like protein
[Leishmania braziliensis MHOM/BR/75/M2904]
gi|322505515|emb|CAM42945.2| cleavage and polyadenylation specificity factor-like protein
[Leishmania braziliensis MHOM/BR/75/M2904]
Length = 1536
Score = 115 bits (287), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 123/529 (23%), Positives = 236/529 (44%), Gaps = 66/529 (12%)
Query: 912 QRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRVHPQLCDGSIVAFTVLHNVNCNH 967
+R+ F + G H+G ++ G P + +V+ +L ++ F H+ + +
Sbjct: 1003 ERLVPFCALQGRHKGIYVCGQTPVF-LVYHYATNQLVCTRHHATSAVRGFAPFHSRHVHG 1061
Query: 968 GFIYVTSQGILKICQL-PSGSTYDNY-WPVQKVIPLKATPHQITYFAEKNLYPLIVS--- 1022
GF+Y +G + + P G + W +++V L TPHQ+ Y + ++VS
Sbjct: 1062 GFVYC-GEGFVHFATMQPFGELLGSSGWWLERVR-LGCTPHQVIYSPAAHGCFVVVSRPQ 1119
Query: 1023 --VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVDLHRTYTVEEYEVRILEPDRA 1074
P P + L ++ D+E V H ++ +L S+ T YEV++ +
Sbjct: 1120 PFSPKRAPFDVQLRMVEDEEGNRVPHVVEPVSLPPLSATSGSPVPTNGRYEVQLF----S 1175
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------NETLLAIGTAYVQGEDVAARG 1128
WQ + + +E L+ ++ + TT + + A+ TAY GEDV RG
Sbjct: 1176 TLDWQRVDCLALDVNEKVLSATLMQVSRDTTMDVAYRSATAPVCALATAYPLGEDVTTRG 1235
Query: 1129 RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGT--- 1184
RVLL +T + + ++ + +KG ++A+ + + + +A G + ++++ +
Sbjct: 1236 RVLLLATSQQGGQGMQKLRILHEEPMKGPVTAITRIDEDCIAVAVGGTVRVYRYDASKGV 1295
Query: 1185 -ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
E I + A YV L +++++++GD+ S+ F + E+ + +L +D ++
Sbjct: 1296 METTAILYAGA---YVTCLQALRDYLVIGDLFHSVLFARYSEEIHTITILGRDTNAISVV 1352
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQKLLSR-----AEFHV-GAHVT 1296
+++ L + L+V+D+ +N+ Y P+ + E K K+L E+ + G +
Sbjct: 1353 SSDMLYHDTRFGLLVADDARNLMCMSYKPRLLEEPGKPPKVLESLLSVTGEYRLAGGVLL 1412
Query: 1297 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1356
K +RL+ A S ++ T G IG + PL + T R Q + ++L
Sbjct: 1413 KMMRLRASAARSSSV-------------AIYVTNMGEIGYLVPLGDQTSRTGQWVVRRLQ 1459
Query: 1357 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1405
V H GL PR F F + S+ E + H+ PL EQL
Sbjct: 1460 SEVAHAGGLPPRMFLGFPQDDPLR-----SLKGDEWMLHF---PLLEQL 1500
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 68/138 (49%), Gaps = 20/138 (14%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSIST 290
+++++D FV EP++ L E++ TWAGRV W+ TC I ++++
Sbjct: 298 LRNIRDVQFVASAGEPLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLAN 357
Query: 291 TLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSASCALALNNY 345
T H L S ++ LP+DA + +P+ P VL V N + + S + + +N
Sbjct: 358 TAAPHMLSLSEVDGLPYDATSMTPLPAFQDVP-SAVLCVSRNMMVHVSTKSGYGVYVN-- 414
Query: 346 AVSLDSSQELPRSSFSVE 363
A+ + ++ L S+ S E
Sbjct: 415 AMGEEQARSLKSSAVSCE 432
>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
Length = 1357
Score = 114 bits (285), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 155/325 (47%), Gaps = 19/325 (5%)
Query: 1097 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYS 1151
++ N+ T + L+ +G + V ED+ G +++T +P T +V+
Sbjct: 1018 MIIQLNSKTNFKKELIVVGISNVGTEDLPPTGSFYIYNTNEVVPDPSKPDTNYRFKDVFH 1077
Query: 1152 KELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFIL 1210
+++KG I+ + + G ++ K+++ E + +AF+D P ++V + N +
Sbjct: 1078 EQVKGTINNVCEISGRFMVNQSQKLLVRDIQEDESVVPVAFHDVP-VFVADIKSFGNLFI 1136
Query: 1211 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1270
+GD + F+ + + ++ +L + A +F++ + VVSD + I Y
Sbjct: 1137 VGDSMQGFQFVGFDAEPYRMIMLGRSVSKFKTMALDFVVRNGEIYFVVSDTDDILHILKY 1196
Query: 1271 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
+P S GQ+L + F++ + T L ++D G + ++ F + L
Sbjct: 1197 SPDEPNSLSGQRLAHYSSFNIHSTNTSM----HLLPANDEFIENKG-NGSSIFQTIGANL 1251
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSI 1387
DGSI I PL E +FRRL +Q++++D+ H AGLNPR R +++ RP +
Sbjct: 1252 DGSIFKILPLSEDSFRRLYVIQQQIIDTEVHAAGLNPRMERLSNEYYQLTNVTRP----L 1307
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTG 1412
+D L+ Y L ++++ IA + G
Sbjct: 1308 LDFNLIRRYSNLSIKKRKSIAQKAG 1332
Score = 44.3 bits (103), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 70/365 (19%), Positives = 153/365 (41%), Gaps = 71/365 (19%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S L L ++L+G V +A++ Q + + D +I+ AK+S++ F+ + L
Sbjct: 45 STNKLHLNYEFKLNGRVSDIALIKQVDS----KLDYLIILTATAKLSLVNFNVFTNSLET 100
Query: 157 TSMHCFESPEWLH--LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +E + LK +ES R +D C VL++ I + +
Sbjct: 101 ISLHYYEDKFRQNSILKLAKESKLR-----IDQAKNC--VLLFNNDNIAILPISSTTDEF 153
Query: 215 GDED-----------------TFGSGGGFSARIESSHVINLRDLDM----KHVKDFIFVH 253
DED F S +I +S +I L+ ++ +++ D F+
Sbjct: 154 EDEDLGQESSAKTVKRGNMSIKFPSQSQKKNKITNSSII-LKSTELNSKIQNIIDIQFLS 212
Query: 254 GYIEPVMVILHERELTWAGR-----VSWKHHTCMISAL-------------SISTTLKQH 295
+ +P + +L++ +L W G + ++ ++ L S++ L +
Sbjct: 213 NFSKPTLSVLYQPKLAWIGNSNLVTLPTQYMILTLNILERENIKSQENGENSLNQDLIET 272
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--HSQSASCALALNNY--AVSLDS 351
+I LP++ + ++ + + G +VG+N I Y H+ + +N + +L
Sbjct: 273 TIIGQVSELPYELHTIIPLNN---GSTLVGSNEIIYIDHTGVLQSLIIINQFQDKETLKK 329
Query: 352 SQELPRSSFSVELDA------AHATWLQNDV-----ALLSTKTGDLVLLTVVYDGRVVQR 400
+ + +S ++ L+ A + N+V L+ + ++ L+ + +GR++
Sbjct: 330 GRVIDKSKQNIILNKPIKFINAGSRVESNNVDDKNNVLIFDENNNIYLVNITLEGRLLIN 389
Query: 401 LDLSK 405
D++K
Sbjct: 390 FDINK 394
>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei gambiense DAL972]
Length = 1452
Score = 113 bits (283), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 134/574 (23%), Positives = 251/574 (43%), Gaps = 64/574 (11%)
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ ++ A ++R L+ RT ++ T + H + +RI F ++G G ++ G P + M
Sbjct: 895 IESIEAKKMR-LQSERTMIENDT-QSVRHCS--RRIIPFAAVAGQSGAYVCGQHPLFLM- 949
Query: 939 FRERLR---VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS-GSTYDNYWP 994
+ R R + G + F +++ GFIY +G + + + S N W
Sbjct: 950 WDNRTRQLVAYRHQAPGPVRGFVPFTSMS--GGFIYCC-EGFVDFAVMNTYCSPGGNGW- 1005
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
+++ I + ATPH I Y ++ S V +P Q S + ++ + D++ + SV
Sbjct: 1006 LRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRP--QRASFDVQLKIQYDEDSNTVQSV 1063
Query: 1054 DLH---------------RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1098
R E +EVR+ + G W + + +E L ++V
Sbjct: 1064 TTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKG--WDCTDKLMLDENEKVLGAQMV 1121
Query: 1099 TLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1155
+ N + + TA+ GEDV RGR++L ++ RN +++V +++S+ L
Sbjct: 1122 EIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLAS-RNIKGRRSIV-QLHSEPLN 1179
Query: 1156 GAISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
G +A+A + + +A G KI + W +L AF A +Y L++ +N+I+ GD
Sbjct: 1180 GPATAVAGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-MYATRLSVFRNYIIYGD 1238
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ +S + E+ L +L +D ++ + + ++ SD+++N+ I Y P+
Sbjct: 1239 LCRSCSMARFNEENHTLTVLGRDRSAVSVVHCDMMYHDRAFGILCSDDERNVLIMGYTPR 1298
Query: 1274 MSESWKG------QKLLS-RAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1325
+ E+ G + +LS E+ + + K LR + A +S T
Sbjct: 1299 VQETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSSVT-------------- 1344
Query: 1326 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH-SNGKAHRPGP 1384
L+ + G IG I P+ E R + ++L +P AGL PR F + S+ + G
Sbjct: 1345 LYVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQSSPRNSLRGK 1404
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+ +V LL L L + IA T ++
Sbjct: 1405 EMLVPAPLLRGLFSLDLRSRKAIARAAYTQLDRV 1438
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ + LS IS T
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
P L+ +P++ +L V S + GV+ G NTI +
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373
Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
S+S++ + N+ L+SS L R + S+ A ++D +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
G +++L + G + + ++ S IT I L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487
>gi|443919095|gb|ELU39366.1| cleavage factor protein [Rhizoctonia solani AG-1 IA]
Length = 788
Score = 113 bits (283), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 198/870 (22%), Positives = 335/870 (38%), Gaps = 167/870 (19%)
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-IA 667
G T+ AG F + V QV R+L+ + L G++ + V+ +A
Sbjct: 33 GLTMVAGAFFQQTCVAQVTTNSIRLLEPDGAERQLYL------DAEGNKPRPKIKVAHVA 86
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
DP++++ D + L VGD + V + + + +S + + D +RK
Sbjct: 87 DPFIVVLREDDTFGLFVGDTAKGRVRRKDVSHFGENGTICASASFFTDHTDLFQIRKPGD 146
Query: 728 DAWLSTGVGE--------------AIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
+G G A + GGP + ++V E G + V N
Sbjct: 147 VVATHSGPGSHRRRPGESSSSKRRASRKSRGGPAASTTLENIVDAEHGT-QWLVVLRKNG 205
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
F V S R H + + L D ++S + +I + + + + R
Sbjct: 206 FFEVSTRTSTRQHQL-----QGLPD--VLVDSGQAHTVCTDGETDIEHVIIAPIGITR-- 256
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
+P L I T+ Y+ P ++S++ PV ++ V FS
Sbjct: 257 ---PKPHLVVITKSRTLAIYEPVPAPPPPDSSENSAPVRDQLTVQFVKV---------FS 304
Query: 894 RT-PLDAYT------REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRV 945
R PLD + R P +P N+SG F++G P W + LR+
Sbjct: 305 RALPLDMHDTKRVAGRSLVPFKSP--------NLSG---IFVTGDHPFWLLRTDASALRI 353
Query: 946 HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATP 1005
+P H YV S G + LP + IP +
Sbjct: 354 YP-------------------HAAQYVNSFGTTVVEWLPDVD-------ISHEIPCR--- 384
Query: 1006 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD-LHRTYTVEEY 1064
+Y ++ V+ V S L + D++ L + D H +
Sbjct: 385 ---SYASDDGRVYTSVAYDVSTRHILAASALRTTFAYYDEDSNELYTPDATHPNPEIHCS 441
Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGED 1123
+ ++ PD W T +E V + L +T+ + + +GT +GED
Sbjct: 442 ALELITPDT----WTTVDGYEFAQNEFVNAVESIPLETLSTERGLKDYVVVGTTISRGED 497
Query: 1124 VAARGRVLLFSTGR-----NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIIL 1178
+A +G +F + Q + + ++ KGA++AL + G+L+ + G KI +
Sbjct: 498 LAVKGATYVFEVVEVVPEPGSKTRQYRLRLLCREDSKGAVTALCGMNGYLVSSMGQKIFV 557
Query: 1179 HKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1237
+ E L GIAF D + V SL +KN +L+GD+ KS++F++++E+ +L L KD
Sbjct: 558 RAFDLDEKLTGIAFMDVG-VCVTSLRPLKNLLLVGDMVKSVWFVAFQEEPFKLVPLGKDR 616
Query: 1238 GSLDCFATEFLIDG-STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1296
L +F + LS V D + +F G +L+ +EFH HVT
Sbjct: 617 QQLSVTHADFFFGSQAQLSFAVLD---DFGVF-----------GLRLICSSEFH--THVT 660
Query: 1297 ---------------KFLRLQMLATSSD----RTGAAPGSDKTNRFALLFGTLDGSIGCI 1337
+ +Q L T S T P + + G+ DG+I +
Sbjct: 661 HRGVLSVSRKADFDSDVMSIQSLGTESSLIFGETKPYPFHQHNSILTMCGGSSDGTIASL 720
Query: 1338 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
PL+E F RLQ LQ +L+ R H NG ++D LL+ +E
Sbjct: 721 TPLNESEFGRLQLLQGQLI--------------RNVH-NG---------VLDGNLLAAFE 756
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
LP+ +Q+E+ Q G R +IL++L L +
Sbjct: 757 ELPVSKQVEMTQQIGAEREKILNDLLKLRI 786
>gi|401426989|ref|XP_003877978.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322494225|emb|CBZ29522.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 1542
Score = 113 bits (282), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 113/496 (22%), Positives = 220/496 (44%), Gaps = 56/496 (11%)
Query: 912 QRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
+R+ F+ + H+G ++ G P + + +L ++ F H+ + + G
Sbjct: 1009 ERLVPFRGLQDRHKGMYVCGQTPVFLVYHAATNQLVCTRHHATNAVRGFAPFHSRHVHGG 1068
Query: 969 FIYVTSQGILKICQL-PSGSTYDNY-WPVQKVIPLKATPHQITYFAEKNLYPLIVS---- 1022
F+Y +G + + P G + W +++V L TPHQI Y + ++ S
Sbjct: 1069 FVYC-GEGFVHFATMQPFGELLGSSGWWLERV-RLGCTPHQIIYSPAAHGCFVVASRPQP 1126
Query: 1023 -VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVDLHRTYTVEEYEVRILEPDRAG 1075
P P + L ++ D+E V H I+ +L S+ T E YEV+ +
Sbjct: 1127 FSPKRAPFDVQLRMVEDEEGNRVPHVIEAVSLPPLSAASGSPVPTNERYEVQFF----ST 1182
Query: 1076 GPWQTRATIPMQSSENALTVRVVTLFNTTTKE------NETLLAIGTAYVQGEDVAARGR 1129
WQ + + ++E L+ ++ + TT + + A+ TAY GEDV RGR
Sbjct: 1183 LDWQCMGRLVLDANEKVLSATLMQVTRDTTMDAANRSTTAPVCALATAYPLGEDVTTRGR 1242
Query: 1130 VLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS-GPKIILHKW----TGT 1184
+LL +T + + + ++ + +KG ++A+ + + A+ G + ++++ +
Sbjct: 1243 ILLLTTTQQGGHGMQHLRTLHEEPMKGPVTAITRVGEDCVAAAVGGTVRVYRYDTYKSTM 1302
Query: 1185 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1244
E I + A YV L ++++++GD+ S+ F + E+ + +L +D ++ +
Sbjct: 1303 ETMAILYAGA---YVTCLQAFRDYLVIGDLFNSVLFARYSEEIHTITILGRDTNAISVVS 1359
Query: 1245 TEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQKLLSR-----AEFHV-GAHVTK 1297
+ L + L+V+D+ +N+ Y P+ + E K K+L E+ + G + K
Sbjct: 1360 NDMLYHDTRFGLLVTDDARNLMCMSYKPRVLEEPGKPPKVLESLLTVTGEYRLAGGVLLK 1419
Query: 1298 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1357
+RL+ + S ++ T G IG + PL + T R Q + ++L
Sbjct: 1420 MMRLRAASAHSSS-------------VAIYVTNMGEIGYLVPLGDQTSRTGQWVVRRLQS 1466
Query: 1358 SVPHVAGLNPRSFRQF 1373
V H GL PR F F
Sbjct: 1467 EVAHAGGLPPRMFLGF 1482
Score = 40.4 bits (93), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 84/185 (45%), Gaps = 37/185 (20%)
Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
GGG S + V + R D+K +++D FV EP++ L E++ TWAGRV
Sbjct: 283 GGGTSLLLRIGTVTHWRLQDVKTALRNIRDIQFVESAGEPLLAFLFEKQPTWAGRVKLLE 342
Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLA------VPSPIG 319
W+ TC I ++++ + H L S ++ LP+D + VPS +
Sbjct: 343 WRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGLPYDVTSMTPLTAFQDVPSAVF 402
Query: 320 GV---LVVGANT-----IHYHSQSASCALALNNYAVSLD------SSQELPRSSFSVELD 365
V ++V +T ++ ++ A +L + AVS + +SQ L V L+
Sbjct: 403 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSFEAVQWRSASQALSTDLVKVNLN 462
Query: 366 AAHAT 370
++AT
Sbjct: 463 FSNAT 467
>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
Length = 1388
Score = 112 bits (281), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 122/562 (21%), Positives = 242/562 (43%), Gaps = 63/562 (11%)
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF---FLSGSRPCWCMV-FRERLR 944
N+ + P +AY P G +R I N G F+ G + W +
Sbjct: 862 NITITGAPDNAY-----PQGTKLERRLIKLNNIGDSKLSTLFVVGVKSFWITKRHSSSIN 916
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKAT 1004
+H Q S ++ + C +G + + + ++ ++PS P+++V P+ T
Sbjct: 917 IH-QFTKLSTISCARFNTSRCKNGLMIIDTNKAARMVEIPSNLELSQRLPIRRV-PVGCT 974
Query: 1005 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRTYT 1060
+ + K +VS P N +D+E VG +DN +++ +
Sbjct: 975 IKCVAF--HKASRTFVVSTVEETPYN-----CVDEEGNPIVG--VDN------TINKPAS 1019
Query: 1061 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT----KENETLLAIGT 1116
+ ++++ P W + ++ ++++ +TL NT+ K + L +G
Sbjct: 1020 SFKSSIKLISP----ISWTVIDSFDLEDEHVCMSLKSMTL-NTSRIPMFKNLKEYLVLGI 1074
Query: 1117 AYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
+ + ED+A+ G++ + G+ N + +++ KGA+++++ + G +
Sbjct: 1075 SNYRMEDLASNGQIRIVDVVDIIPEPGKPETNHK--FKDIFQDATKGAVTSVSDISGRFV 1132
Query: 1170 IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
I G KII+ + T L + F D P YV +N +L+GD S+ + + +
Sbjct: 1133 IGQGQKIIVRDLQEDNTAL-PVGFVDTP-FYVSETKSFQNLLLVGDSMHSVILVGFDAEP 1190
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
++ L KD +D A +F++ L ++++DE + + Y P+ S +GQ+LL R+
Sbjct: 1191 YRMISLGKDVAHVDVCAADFVVFEGNLFIIIADEDGMLHLIQYDPEDPASMQGQRLLRRS 1250
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN---RFALLFGTLDGSIGCIAPLDELT 1344
F + T M P + TN F ++ DGS + P+ E T
Sbjct: 1251 IFKTNQYTT-----CMKMRERKYVIKPPKNQFTNFSEAFEVVAANSDGSFYKVTPISEAT 1305
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
+RRL +Q+++ D H GLNPR R + + + P I+D + + + ++
Sbjct: 1306 YRRLYVIQQQIFDQENHKCGLNPRENR--YLSDQYSIPNQRLILDFDNIRRFLEFDEIKK 1363
Query: 1405 LEIAHQTG-TTRSQILSNLNDL 1425
++ H+ G T S+ +L +L
Sbjct: 1364 RDLVHKLGRNTYSEFYRDLLNL 1385
Score = 90.1 bits (222), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 105/444 (23%), Positives = 177/444 (39%), Gaps = 51/444 (11%)
Query: 93 MDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
+D L LV Y+L G V L + D +++A + K S++++D S +
Sbjct: 80 IDFSQNVKLSLVAEYKLDGLVTDLCKIR---TIEDSHHDYVLVATKGVKFSMIKWDQSSN 136
Query: 153 GLRITSMHCFESPEWLHLKRGRES-----FARGPLVKVDPQGRCGGVLVYGLQMIILKAS 207
+ S+H H K+ E+ F + DP C +L + + L
Sbjct: 137 SISTVSLH--------HYKKIVENSLIDKFNVDTKLIADPNNHCSCLLANEI-LFFLPFL 187
Query: 208 QGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
Q DE+ G ++ + DL ++K + D F+HGY EP + +L+
Sbjct: 188 QHEV----DEELDGKFVENKKLYSNTFLQFSNDLQPNIKTIIDIEFLHGYSEPTLAVLYT 243
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T G + T + S++ K I NLP+D ++L + SP+ G L++G
Sbjct: 244 SFPTCTGALPKAKDTVSLQVFSLNLQNKASTSIIEVNNLPYDTDRILPLSSPLNGCLLIG 303
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
AN IH +S + ++ N +A + + +S+ + L+ + ND +L T+ G
Sbjct: 304 ANQIIHLNSMGTAKGISCNLFAAKCSNFKLSDQSNLDLRLEKCVLGQVYNDKVILITEKG 363
Query: 385 DLVLLTVVYDGRV-----VQRLDLSKTNPSVLT--SDITTIGNSLFFLGSRLGDSLLVQF 437
+ G V +Q++ K VL+ + T I FF+G + DS+L
Sbjct: 364 AFYAFSFDIVGGVSSINEIQKIAAEKYQGLVLSLPTMFTNIDGKTFFIGCQGSDSVLF-- 421
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
G K D ++ + DAL E LY N
Sbjct: 422 -----------GSKARLNTQNVDVNGKSKV-ITEEDALY------EEDLYADDIQNVAQG 463
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYG 521
F DSL+NIGP+ +F+ G
Sbjct: 464 IDHIDFVKLDSLLNIGPITNFTTG 487
>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 1452
Score = 112 bits (281), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 136/579 (23%), Positives = 248/579 (42%), Gaps = 74/579 (12%)
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ ++ A ++R L+ RT ++ T + H + +RI F ++G G ++ G P + M
Sbjct: 895 IESIEAKKMR-LQSERTMIENDT-QSVRHCS--RRIIPFAAVAGQSGAYVCGQHPLFLM- 949
Query: 939 FRERLRV-------HPQLCDGSIVAFTVLHN--VNCNHGFIYVTSQGILKICQLPSGSTY 989
+ R R P L G V FT + + C GF+ ++ P G
Sbjct: 950 WDNRTRQLVAYRHQAPGLVRG-FVPFTSMPGGFIYCCEGFV---DFAVMNTYCSPGG--- 1002
Query: 990 DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNH 1048
N W +++ I + ATPH I Y ++ S V +P Q S + ++ + D++
Sbjct: 1003 -NGW-LRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRP--QRASFDVQLKIQYDEDSN 1058
Query: 1049 NLSSVDLH---------------RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENAL 1093
+ SV R E +EVR+ + G W + + +E L
Sbjct: 1059 TVQSVTTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKG--WDCTDKLMLDENEKVL 1116
Query: 1094 TVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVY 1150
++V + N + + TA+ GEDV RGR++L ++ RN +++V +++
Sbjct: 1117 GAQMVEIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLAS-RNIKGRRSIV-QLH 1174
Query: 1151 SKELKGAISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
S+ L G +A+A + + +A G KI + W +L AF A +Y L++ +N+
Sbjct: 1175 SEPLNGPATAVAGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-MYATRLSVFRNY 1233
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
I+ GD+ +S + E+ L +L +D ++ + + ++ SD+++N+ I
Sbjct: 1234 IIYGDLCRSCSMARFNEENHTLTVLGRDRSAVSVVHCDMMYHDRAFGILCSDDERNVLIM 1293
Query: 1269 YYAPKMSESWKG------QKLLS-RAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKT 1320
Y P++ E+ G + +LS E+ + + K LR + A +S T
Sbjct: 1294 GYTPRVQETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSSVT--------- 1344
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
L+ + G IG I P+ E R + ++L +P AGL PR F + +
Sbjct: 1345 -----LYVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQRSPRN 1399
Query: 1381 R-PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
G + +V LL L L + IA T ++
Sbjct: 1400 SLRGKEMLVPAPLLRGLFSLDLRSRKAIARAAYTQLDRV 1438
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ + LS IS T
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
P L+ +P++ +L V S + GV+ G NTI +
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373
Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
S+S++ + N+ L+SS L R + S+ A ++D +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
G +++L + G + + ++ S IT I L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487
>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
Length = 1300
Score = 112 bits (281), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 164/338 (48%), Gaps = 30/338 (8%)
Query: 1089 SENALTVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP--- 1142
SEN++ + T+ N+ T+ L+ IG+++V+ ED + G +L+ P
Sbjct: 954 SENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVVAEPGKP 1013
Query: 1143 -QNL-VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG---IAFYDAPPL 1197
N +++ +E++G+++A+ + G +I K ++ E N +AF D P +
Sbjct: 1014 DSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQ--EDNSAVPVAFLDMP-V 1070
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
++ N +++GD + F+ + + ++ +L K EFL++ ++ +
Sbjct: 1071 FITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNINFI 1130
Query: 1258 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1317
V+D Q ++ + YAP + S GQ+L+ F++ +++L R GS
Sbjct: 1131 VTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMKLV-------RKHVEFGS 1182
Query: 1318 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFH 1374
+N AL T DGSI + PL+E ++RR +Q++L+D +AG N + R +++
Sbjct: 1183 KTSNYIALGCQT-DGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYY 1241
Query: 1375 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
G + RP DS ++L Y LP+ ++ I ++ G
Sbjct: 1242 HKGHSLRPTLDS----QVLKKYIHLPITKRTTIENRVG 1275
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 136/639 (21%), Positives = 257/639 (40%), Gaps = 111/639 (17%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
A L L ++L G + + +L Q G S + IL+ +K+S++ FD L
Sbjct: 45 AQKLVLAYEWKLAGKIIDMQLLPQIG---SPLKMLAILS-SKSKVSLVRFDPVAESLETL 100
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD 216
S+H + ++++L S ++ VDP RC +LV+ ++ IL + D
Sbjct: 101 SLHYYHD-KFVNL--STSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEID 155
Query: 217 EDTFGSGGGFSARIESSHVINLRDLDM---------KHVKDFIFVHGYIEPVMVILHERE 267
ED G + R++ + I + M KHV D +++ + +P + IL++
Sbjct: 156 EDENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPV 215
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
L W G +T LS+ ++ +I +LP+D + L VP G VL +G N
Sbjct: 216 LAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTL--VPLKRGYVL-IGVN 272
Query: 328 TIHYHSQSA---SCALALNNYAVSLDSSQELPRSSFSVELDAA----HATWLQNDVALLS 380
+ Y S S SC + LN +A S +++ S ++ L + + ++D+ +L
Sbjct: 273 ELLYISASGALQSC-IRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILI 331
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL-----GD---- 431
+ + + +G ++ + D + I N + F SRL GD
Sbjct: 332 DENCRMYNIITESEGNLLTKFDCVQ----------VPIVNEI-FKNSRLPLSVCGDLNLE 380
Query: 432 --SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
+L+ F G + LK F + ++L + D E +LYG
Sbjct: 381 TGRVLIGFLSGDAMFLQLKNLKVAFA-------AKRQLVETVDDDDD-----EYSALYGE 428
Query: 490 ASNNTES----AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
+ NNT + Q+ F ++ DS+ NIGPL + G + + + + + E +
Sbjct: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEF-SIV 487
Query: 546 GCKGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
G+ T H + + H + + + ++ + ++ + L T D E +
Sbjct: 488 ATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWN----LKIKGKDKYLVTTDADKEKS 543
Query: 601 E------------SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDL 643
+ + D+ RTI + +R++QV G + D + +T D+
Sbjct: 544 DVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI 603
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
++ I DPY+L + G+I++
Sbjct: 604 E----------------IVHACIIDPYILFTDARGNIKI 626
>gi|401624207|gb|EJS42273.1| cft1p [Saccharomyces arboricola H-6]
Length = 1356
Score = 112 bits (280), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 154/346 (44%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T K+ E ++A G A ED G ++
Sbjct: 1000 WKVIDKIDFPNNSVVNEMRSSMIQVNSKTKKKREYIIA-GVANATTEDTPPTGAFHIYDV 1058
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1059 TEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1118
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +L+GD + F+ + + ++ LL + + EFL+
Sbjct: 1119 AFLDIP-VFVTDSKSFGNLLLIGDAMQGFQFIGFDAEPYRMILLGRSISKFQTMSLEFLV 1177
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + + + L L
Sbjct: 1178 NGGDMYFSATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTLHSINSCMLLLP------- 1230
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGSI I PL E T+RRL +Q++++D + GLNPR
Sbjct: 1231 -KNEEFGSSQVPSFQNVGGQVDGSIFKIVPLSEETYRRLYVIQQQIIDREIQLGGLNPRM 1289
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + A + G
Sbjct: 1290 ERLANDFYQMGHSMRP----MLDFNVIRRFSELAIDRRKNTAQKAG 1331
>gi|156847699|ref|XP_001646733.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
70294]
gi|156117413|gb|EDO18875.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
70294]
Length = 1337
Score = 112 bits (279), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 156/330 (47%), Gaps = 24/330 (7%)
Query: 1092 ALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT---- 1147
++T++V N+ TK+ LL +G A + ED+ + G + P T
Sbjct: 998 SMTIQV----NSKTKKKRELLVVGVASIGTEDLPSAGSFHVIDINEVVPEPGKPDTNYKF 1053
Query: 1148 -EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIV 1205
E++ + ++G ++++ + G +I K+++ E + +AF D P +YV
Sbjct: 1054 KEIFQETVRGNVNSVCEISGRFMINQSQKLLVRDIQEDESVVPVAFLDVP-VYVTDTKSF 1112
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1265
N +++GD + F+ + + ++ L + A EFL++ + +VSD +
Sbjct: 1113 SNLMIVGDSMQGFQFVGFDAEPYRMIPLGRSVSKFKTVALEFLVNNGDIFFIVSDRNDIL 1172
Query: 1266 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1325
+ YAP S GQ+L + F++ + T + L S++ ++P T F
Sbjct: 1173 HVLKYAPDEPNSLSGQRLAHYSSFNIHSTNTSMI----LLPSNNEFQSSPNGQAT--FQS 1226
Query: 1326 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRP 1382
+ +DGSI + PLDE +FRRL +Q++++D+ GLNPR R +++ RP
Sbjct: 1227 VGSCVDGSIFKVIPLDEDSFRRLYVIQQQVIDTEIQAGGLNPRMERLSNEYYQLVHLMRP 1286
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
++D ++ + L + ++ +IA + G
Sbjct: 1287 ----MLDFNIIRRFSNLSITKRTKIAQKAG 1312
>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1456
Score = 112 bits (279), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 120/519 (23%), Positives = 227/519 (43%), Gaps = 51/519 (9%)
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ ++ A + R L+ RT ++ T + T H A +RI F ++G G ++ G P + +
Sbjct: 899 IESIEARKCR-LQRERTMIENDT-QSTRHCA--RRIIPFACVAGQSGAYVCGQHPVFLLW 954
Query: 939 FRERLRV--HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
+ + R+ + G++ F + GFIY +G + ++ + + +
Sbjct: 955 DKRKRRIAAYRHQSPGAVRGFVSFPQMA--GGFIYC-CEGFVDFARMNTYCAPNGQGWLT 1011
Query: 997 KVIPLKATPH-----------------QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
+ I + ATPH + T+ ++ + + + + + LN V S+ +
Sbjct: 1012 RRIAIGATPHFLVYDPPGKSCFVVTSEKKTFRPQRAFFDVQLKIHYDEELNTVQSVTAEP 1071
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
V H + + V R VE++EVR+L G W+ ++ +E L + V
Sbjct: 1072 PVCHMPPINPGAGV---RVPMVEQFEVRLL--STTGEQWECTHKFALEENEKVLGAQAVE 1126
Query: 1100 LFNTTT---KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
L + + + TA+ GEDV RGR++L ++ + + +++S+ L G
Sbjct: 1127 LRQDEAIAGAPSAPVCVLCTAFPLGEDVTCRGRIILLAS--KTVKKKRAIVQLHSEPLNG 1184
Query: 1157 AISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
+A+ + + +A G KI + W +L AF A +Y L+ +N+I+ GD+
Sbjct: 1185 PATAVTGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-VYATRLSAFRNYIIYGDL 1243
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+S + EQ L +L KD ++ + + T ++ S++Q+++ + Y P++
Sbjct: 1244 CRSCAMARFNEQNHTLTVLGKDHNAVSVVHCDMMYHDRTFGILCSNDQRDLLLMGYTPRV 1303
Query: 1275 SESWKGQKLLSR---AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1331
ES G+ SR + F + L LA S AA S T ++ +
Sbjct: 1304 QES--GEHTPSRVLESPFSLDGEYR--LPSGCLAKSLRFRSAAGNSSVT-----VYISNY 1354
Query: 1332 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1370
G +G I PL E R + ++L +P AGL PR F
Sbjct: 1355 GEVGFIVPLGEQANRTALWITRRLQVDLPCDAGLTPRMF 1393
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 130/334 (38%), Gaps = 77/334 (23%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D FV EP++ +L ER TWAGRV W+ + LS IS L
Sbjct: 254 LRYVRDLQFVGSSGEPLLGVLCERRPTWAGRVKLVEWRTKAVDTNTLSMQVAWVQISGAL 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPSP---IGGVLVVGANTIHYHSQSASCALALNNYA 346
HP L+ ++P++ ++ V S GV+ G NT+ + + + N+
Sbjct: 314 TTHPKLLLVGEVDSVPYNVTHMIPVESSSQTPSGVICFGINTVMHITTKRGYGVYFNSTG 373
Query: 347 V---------------------SLDSSQELPRSSFSVELDAAHATWLQN------DVALL 379
+ L+SS L R +FS L AT + +
Sbjct: 374 MEECGSNKSSAMSYGKMSWCDAKLESSTALFRVNFS--LANCTATIFSPRSSDSLQILAV 431
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
S + G + +L + G V + +S S +T I ++LFFLGS V F+C
Sbjct: 432 SEEDGVVAVLEFLSQGANVHDIQISVLASGCYCSSLTPISDNLFFLGS------AVSFSC 485
Query: 440 GSGTSMLSSGLKEEFGDIE--------------------ADAPSTKRLRRSSSDALQDMV 479
+ + +SG +F +E AD S R +S+S L+D
Sbjct: 486 IASITPTNSGAIGKFKVVESIEAIGSIRDVDVVDCSNDAADCISGPRGNQSNSSWLEDTP 545
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIG 513
E A N T S A R +++++
Sbjct: 546 FAE------LAGNTTLDPMPNLSVAQRRAIMDLA 573
>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
Length = 1331
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 151/321 (47%), Gaps = 20/321 (6%)
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1155
++ T+ + + +G A+V+ ED+ G + +F P T EV+ + ++
Sbjct: 999 LDSRTRRRKEYVIVGVAHVETEDLPPSGSLSVFDITEVVPEPGKPDTNFKLGEVFKENIR 1058
Query: 1156 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
G +S++ + G LI K+I+ + +AF D P ++V + NF+++GD
Sbjct: 1059 GTVSSVCDISGRFLINQSQKVIVRDVQEDNSVVPVAFLDVP-VFVTDVKSFGNFLIIGDS 1117
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+ F+ + + ++ L + L+ A EFL++G + V+D + IF YAP
Sbjct: 1118 MQGFQFIGFDAEPYRMIPLGRSVSKLETVALEFLVNGGDIFFAVTDTSNILHIFKYAPDE 1177
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
S GQ+L+ F++ + T + L + G + ++ G DGS+
Sbjct: 1178 PNSLSGQRLVHCTSFNLHSTNTCMVLL------PKNEEFSVGEKSLSPVQVVGGQTDGSL 1231
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1391
+ PL E T+RRL LQ++L + + GLNPR R +++ A RP +++
Sbjct: 1232 FKLVPLREDTYRRLYVLQQQLTEKEVQLGGLNPRMERLSNEYYHLTHAVRP----MLEFN 1287
Query: 1392 LLSHYEMLPLEEQLEIAHQTG 1412
++ + L +E++ + A + G
Sbjct: 1288 VIRRFNTLSVEKRKQTAQKAG 1308
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 140/656 (21%), Positives = 270/656 (41%), Gaps = 124/656 (18%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L ++ G + LA++ Q + D ++L AKISV+ +D++ + + S+H
Sbjct: 48 LILTHEFKFEGRITDLAVVPQKDSP----LDCLLLCTSIAKISVVRYDEASNSIETLSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
+E R A+ ++VDP RC L++ +I L Q S
Sbjct: 104 YYEDS---FKDRSILELAKESTMRVDPGKRCA--LLFNNDVIALLPLQTTSLNDGEEEDE 158
Query: 212 ---GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
D+ + G +A S + N ++L DM +V D F+ + P + ++ E
Sbjct: 159 DMDDERPDKRQKNNKGRITA---PSAIFNAKELHQDMNNVIDVTFLRNFTRPTLAVIFEN 215
Query: 267 ELTWAGR-------VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+ WAG V++ T +++ ST +K +I + L D + ++ + +
Sbjct: 216 KPVWAGTSQVLPLPVTYMAFTLEVTSNEQSTDIKS-TVIATVKELSWDFHTMIPIAN--- 271
Query: 320 GVLVVGANTIHYHSQSASCA--LALNNYA-VSLDSSQELPRSSFSVELDAAHA-TWLQND 375
G ++VG+N + Y + S + LN+YA ++ ++ + RS + L W +D
Sbjct: 272 GCIIVGSNEMAYIDNTGSLQSIIFLNSYANKNMKKARIVDRSKSKILLHKPTTYNWSVSD 331
Query: 376 VALLSTKTGDLVLLT----------VVYDGRVVQRLDL-----------SKTNPSVLTSD 414
++TG+ +L+ + Y+GR++ + D+ + +N + ++
Sbjct: 332 ---QKSETGETLLIMDHQAAFYYIQLEYEGRLLTKFDIINLPIVNDTLKNNSNATCISRL 388
Query: 415 ITTI-GNSL-FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+T+ GN + F+G R GD+ +++ + L + ++ E +P + +
Sbjct: 389 NSTLSGNYVDLFVGFRSGDASVLRL------NNLKAAIESRDEHKEITSPPENDIEKFED 442
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADAS 529
+D + EE S N E +T F V SL NI P+ + G + D
Sbjct: 443 ---EDDLYSEEASDADKEKENKEVVVETVLPFDIEVLSSLRNIAPITSLTPGKICSVDKF 499
Query: 530 ATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-EARTM 588
G+S + E V L G T G + +M+ + A IS+ + +
Sbjct: 500 VEGLSNPNRNE-VSLVATSGNGT-------GSHLTEIQMSVRPEVQLALKFISITQMWNL 551
Query: 589 VLETADLLTEVTES---------VD----YFVQGR-----TIAAGNLFGR-RRVIQVFER 629
++ D T+S +D + +GR T + ++FG +R++QV
Sbjct: 552 KIKNKDKYLITTDSNKNKSDIYLIDKNFALYKEGRFRRDATTVSISMFGSDKRIVQVTTN 611
Query: 630 GARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ D ++ T F V+ VS+ DPY+L+ +S G I++
Sbjct: 612 HLYLYDTNFKRLTTMKFEF--------------EVVHVSVMDPYILITVSRGDIKV 653
>gi|430810872|emb|CCJ31592.1| unnamed protein product [Pneumocystis jirovecii]
gi|430814599|emb|CCJ28188.1| unnamed protein product [Pneumocystis jirovecii]
Length = 203
Score = 111 bits (278), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/192 (34%), Positives = 101/192 (52%), Gaps = 2/192 (1%)
Query: 1232 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1291
L KD SL + +FL+D L V+ D+ NI +F Y P+ +S+ GQKLL R +FHV
Sbjct: 3 LFGKDHSSLSVSSADFLVDDEHLYFVIGDDDGNIHVFNYDPENPQSFSGQKLLKRGDFHV 62
Query: 1292 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL-LFGTLDGSIGCIAPLDELTFRRLQS 1350
G+H+ L L A + N+ +L L + DGS+G + L E T+RRL
Sbjct: 63 GSHIKSILMLPKEAFPQNVNDKEETRASKNQDSLCLCASQDGSMGVLISLPEKTYRRLYF 122
Query: 1351 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
+Q +L+++ VAGLNP S+R K P I+D +LL Y L +Q ++A +
Sbjct: 123 IQGQLINTEDKVAGLNPISYRTSTYVSKTSNPAR-GILDGKLLYQYNNLERNKQKDMARK 181
Query: 1411 TGTTRSQILSNL 1422
+G I+ +L
Sbjct: 182 SGMPVETIIYDL 193
>gi|401841121|gb|EJT43641.1| CFT1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 1355
Score = 111 bits (277), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 151/346 (43%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + N+ TK + G A ED G ++
Sbjct: 999 WKVIDKIDFPKNSVVNEMRSSMIQI-NSKTKRKREYIVAGVANATTEDTPPTGSFYIYDV 1057
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1058 IEVVPEPGKPDTNYKLKEIFQEEVNGTVSTVCEISGRFMISQSQKVLVRDIQEDNSVIPV 1117
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ LL + + EFL+
Sbjct: 1118 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMILLGRSVSKFQTMSLEFLV 1176
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ I YAP S GQ+L+ + F V + + + L
Sbjct: 1177 NGGDMYFAATDADRNVHILKYAPDEPNSLSGQRLVHCSSFTVHSINSCMMLLP------- 1229
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGS+ I PL E T+RRL +Q++++D + GLNPR
Sbjct: 1230 -KNQEFGSSQVPSFQNVGGQVDGSVFKIVPLSEETYRRLYLIQQQIIDRELQLGGLNPRM 1288
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + A + G
Sbjct: 1289 ERLANDFYQMGHSMRP----MLDFNVIRRFSGLSIDRRKNTAQKAG 1330
>gi|356527660|ref|XP_003532426.1| PREDICTED: disease resistance response protein 206-like [Glycine
max]
Length = 281
Score = 110 bits (275), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 76/106 (71%), Gaps = 8/106 (7%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
MSFAAYKMM PTGI NC GF+THSR+D+VP +Q +++D E PS+ +G +PNL
Sbjct: 1 MSFAAYKMMQCPTGIDNCAVGFLTHSRSDFVP----LQPDDIDVEWPSRPCHHVGSLPNL 56
Query: 59 VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELV 104
+VT ANV+E+Y VR+QE+ S K + +++ L+DGI ASLELV
Sbjct: 57 IVTVANVLEVYAVRLQEDQSP--KAAIDSRSDTLLDGIVGASLELV 100
>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
Length = 1597
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 158/341 (46%), Gaps = 30/341 (8%)
Query: 1086 MQSSENALTVRVVTLFN-TTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
+++E ++ +VTL + +T + +A GT GED A+G V LF
Sbjct: 1190 FEANETVTSLEIVTLDSPSTVSGRKQFVAAGTTTFHGEDRTAKGSVYLFEIISVVSAASE 1249
Query: 1145 LVTEVYSK-----ELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLY 1198
L +++ K + + ++A++ + G+L+ G K+ + E L IAF D P Y
Sbjct: 1250 LGSDLRLKLVCRDDSRAPVTAISHINGYLISTCGQKLYVRALEKQEWLISIAFLDCP-FY 1308
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD---------FGSLDCFATEFLI 1249
+ S+ +VKN +LLGD + + +++E + LAK F D + I
Sbjct: 1309 ITSIEVVKNLVLLGDCKRGLGLWAFQEDPYKFVELAKAEDGCVGVGAFLVRDEKVSMLSI 1368
Query: 1250 DGSTLSLVVSDEQKN--IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLA 1305
GS L S E I+++ YAP ++ G+KL+ R+EF + + + L+
Sbjct: 1369 SGSRLGGDASMEASAGVIRLYEYAPHLAVG--GKKLVLRSEFQTTSEAVARVECSGRWLS 1426
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
S R +T R ++F +GS+ +A +DE +RL LQ +LV SV H A L
Sbjct: 1427 DSELR------GRETLRNKVVFAKANGSVESVAAVDEKVGKRLHLLQGQLVRSVMHTAAL 1480
Query: 1366 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
NPRSFR N R ++D LL + L + LE
Sbjct: 1481 NPRSFRMVR-NDYVPRALVKGVLDARLLDEFMRLSRPKMLE 1520
Score = 87.4 bits (215), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 95/399 (23%), Positives = 177/399 (44%), Gaps = 54/399 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRLHG 111
LV +V+ IY V Q S S+++ + S +L + ++ L G
Sbjct: 46 QLVTARDDVLTIYDVYGQPHASASTIPGISRHTATSSVSSNTSACSHKNLVISRNHTLFG 105
Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHL 170
V L + Q A + RD ++++F+DAK+++LE++D+I L S+H +E +P+ L+L
Sbjct: 106 AVTGLQRV-QTLASDKDNRDRLLVSFKDAKLALLEWNDAIDDLETISIHTYERAPQLLNL 164
Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-------EDTFGSG 223
P++ VDP RC +L+ + IL + + D +
Sbjct: 165 A----PHLFHPILNVDPLSRCAALLLPHDSLAILPFYRDAADFDFDLDDHLEIAKDDVAA 220
Query: 224 GGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
+A ++S S V+ +R++D ++++K F F+ G+ +P + +L TW G +S
Sbjct: 221 VVAAADLQSLPYSPSFVLTMREVDPKIRNLKHFCFLPGFQKPTVAVLFSHNPTWTGLLSE 280
Query: 277 KHHTCMI--------------------SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+ T + AL T HP++ ++ LP+D ++A P
Sbjct: 281 RKDTFSVYLFTLDLSASLDGATFSSSAEALDDGTARSAHPVVTTSTPLPYDCLYMVACPQ 340
Query: 317 PIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSFSVELDAAHAT 370
+GGV+VV +++ + QS + ALN + A+ +S EL S +L +
Sbjct: 341 TLGGVIVVCMSSLLHVDQSGRVMVTALNQWFKTTSAIEPESILEL---SDIADLQGSQLV 397
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ +L+ G++ DGR V+ + L + S
Sbjct: 398 FTSKTQGVLTLVNGEIYRFRCQTDGRSVEGIRLERMQES 436
>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
Length = 1459
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/327 (23%), Positives = 149/327 (45%), Gaps = 22/327 (6%)
Query: 1097 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS-------TGRNADNPQNLVTEV 1149
+V N+ T+ + G A + ED+ G ++ TG+ N + E+
Sbjct: 1120 MVIQLNSRTRAKREYIVAGLANIGSEDLPPTGSFYIYDISPVLPETGKPDTNYK--FKEI 1177
Query: 1150 YSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
++++++G ++++ + G I KI++ + +AF D P +YV NF
Sbjct: 1178 FTEDVRGLVTSVCEISGRFTINQSQKIMVRDVQEDNSVVPVAFLDIP-VYVTDTKSFGNF 1236
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
+L+ D + + F+ + + ++ LL K L EF+ + + +D + IF
Sbjct: 1237 LLISDSMQGLQFVGFDAEPFRMILLGKSIPDLKISTVEFIANNGNIYFAATDYDNILHIF 1296
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
YAP S GQKL+ + F++ + + + L +D + F L G
Sbjct: 1297 KYAPDEPNSLSGQKLVHCSSFNLHSSTSCMIML----PGNDEFSENEQDNFIPSFQTLGG 1352
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPD 1385
+DGSI + PL+E +RRL +Q+++ D V GLNP+ R +++ +P
Sbjct: 1353 QVDGSIFKVIPLEESPYRRLYVIQQQITDYEVQVGGLNPKMERLSNEYYQKSNMLKP--- 1409
Query: 1386 SIVDCELLSHYEMLPLEEQLEIAHQTG 1412
++D ++ + MLP++++ A + G
Sbjct: 1410 -MLDFNIIRRFSMLPIDKRRRTAQKAG 1435
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 58/116 (50%), Gaps = 17/116 (14%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC----F 162
++ G + + ++ Q G++ D ++L +AKIS+++FD+ ++ L+ S+H F
Sbjct: 54 FKFSGKITDIVLIPQRGSE----LDCLLLVTPNAKISIIKFDEELNTLKTISLHYYTDEF 109
Query: 163 ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
E L L AR ++V+P+ +C VL++ + I + + DED
Sbjct: 110 EKLSMLQL-------ARTSQLRVEPKKKC--VLLFNTESIAILPFTQQFNIDNDED 156
>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
Length = 1327
Score = 108 bits (271), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/321 (22%), Positives = 144/321 (44%), Gaps = 20/321 (6%)
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN-----LVTEVYSKELK 1155
++ T+ + + +G A V ED+ G +F P ++EV+ +E++
Sbjct: 995 LDSRTRRKKEYVIVGVAVVGTEDLPPSGSFFVFDITEVVPEPGKPDTNFKLSEVFQEEIR 1054
Query: 1156 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
G +S + + G LI K+++ + +AF D P ++V NF+++GD
Sbjct: 1055 GTVSTVCEISGRFLINQSQKVLVRDVQDDNSVVPVAFLDIP-VFVTDAKSFGNFMIIGDA 1113
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+ F+ + + ++ L + ++ + EFL++G + ++D + +F YAP
Sbjct: 1114 MQGFQFVGFDAEPYRMIPLGRSIAKMETVSVEFLVNGGDIFFAITDTDDILHVFKYAPDE 1173
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
S GQ+LL F++ + T +A P F + G +DGS+
Sbjct: 1174 PNSLSGQRLLHCTSFNLHSTNT------CMALLPKNEEFEPAQANMKNFQAIGGQVDGSV 1227
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1391
+ PL E +RRL +Q+++ + + GLNPR R + + RP ++D
Sbjct: 1228 FKLLPLREDVYRRLYVVQQQITEKELQLGGLNPRMERLSNEHYKTTHVLRP----MLDFN 1283
Query: 1392 LLSHYEMLPLEEQLEIAHQTG 1412
++ ++ L + + +I+ + G
Sbjct: 1284 VIQRFKRLSTDRRKQISQKVG 1304
Score = 73.6 bits (179), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 130/638 (20%), Positives = 269/638 (42%), Gaps = 86/638 (13%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
+A L L ++ HG + LA++ Q + D ++L AK+S+++FD + +
Sbjct: 45 SAKLFLTNEFKFHGKITDLALIPQVNSS----LDCLLLCTSIAKVSIVKFDPLSNSIETA 100
Query: 158 SMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG-LQMIILKASQGGSGLV 214
S+H +E + L+ ++S+ R +DP RC +L L ++ +A+
Sbjct: 101 SLHYYEDKFRDLSLLEIAQQSYFR-----LDPSKRCAIILNNDVLALLPFRAA------T 149
Query: 215 GDEDTFGSGGGFSARIES--------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILH 264
D++ + R+++ S + ++L ++++V D F++ + +P + IL
Sbjct: 150 DDDEEADAENNDVKRMKTSSDKVTYPSKIFVAKELHSEIRNVIDVQFLNNFSKPTIAILF 209
Query: 265 ERELTWAG--RVSWKHHTCMISALSISTTLKQHPL----IWSAMNLPHDAYKLLAVPSPI 318
E L WAG +++ + + MI L IS+T I L D + L+ + +
Sbjct: 210 EPTLIWAGNRQLNPQPISYMIFTLEISSTDNTTKFGATTIGKLTGLSWDFHSLVPISN-- 267
Query: 319 GGVLVVGANTIHYHSQSASC--ALALNNYA-VSLDSSQELPRSSFSVELDAAHA-TW--- 371
G ++VGAN + + S + + LN+++ +L + + S + + L + A W
Sbjct: 268 -GCMIVGANELAFADNSGALQSVILLNSFSDRNLRQGRIIDNSKYEILLPQSIARCWSPP 326
Query: 372 ----LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFF 424
+ ++ LL ++ + + +GR++ + D+ K N ++ + T + L
Sbjct: 327 TSDKVNDETLLLMDANSNVYYVQLESEGRLLIKFDIIKLPIVNDTLKNNQGCTCMSRLNS 386
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-SSDALQDMVNGEE 483
S LL+ F G + + LK + ++ + S D +D + +E
Sbjct: 387 RSSNNNMDLLMGFKSGDALVVRLNNLKSAAESRDEHKIFSEAMESSFDKDEDEDNLYSDE 446
Query: 484 LSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGI--SKQSN 538
S G A +N E +T F + ++ NIGP+ + G + + G+ ++
Sbjct: 447 ASDAGKADDNKEVIVETVTPFDIELLSTIKNIGPITSLAVGKVCSVEKYVKGLLNPNRNE 506
Query: 539 YELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL- 595
Y +V G T S R + + + ++ + ++ R L T D
Sbjct: 507 YSMVATSGNGSGSHLTEIQGSVRPTVEVALKFISVTQIWN----LKIKNRDKYLVTTDSN 562
Query: 596 -----LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
+ E+ + +GR T+ G +R++QV + D ++ + L+
Sbjct: 563 KAKSDIYEIDNNFALHKEGRFRRDATTVCISMFGGDKRIVQVTTNNLILYDTNF--RRLT 620
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ E V+ VS+ DPY+L+ +S G I++
Sbjct: 621 TMKFDYE---------VVHVSVMDPYILITVSRGDIKI 649
>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
Length = 1388
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 148/322 (45%), Gaps = 24/322 (7%)
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1156
N+ + + G ++ ED+ G ++ P + EV+ +E++G
Sbjct: 1055 NSKARRKREYIIAGVTHIGTEDLPPTGAFHIYDITEVVPEPGKPDTNYRLKEVFKEEVRG 1114
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGI---AFYDAPPLYVVSLNIVKNFILLGD 1213
+S + + G L+ K+++ E N + AF D P +++ +F++LGD
Sbjct: 1115 IVSTVCEISGRFLVNQSQKVMVRD--AQEDNSVVPVAFLDIP-VFINDAKSFGDFLILGD 1171
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ ++F+ + + ++ L K + + EF+++G L ++D + + YAP
Sbjct: 1172 AMQGLHFIGFDAEPYRMINLGKSVTKFETVSVEFVVNGGDLYFALTDRNNILHVLKYAPD 1231
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
S GQKL+ + F++ + + L L D T AP + F + G +DGS
Sbjct: 1232 ELNSLSGQKLVHCSSFNLFSGNSSLLLLPKNEEFED-TKNAPLT-----FQTIGGQVDGS 1285
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDC 1390
I + PL E T+RRL +Q+ + D P + GLNPR R +++ RP ++D
Sbjct: 1286 IFKVIPLREDTYRRLYVIQQHMNDKEPQLGGLNPRMERLSNEYYQLCHVMRP----MLDF 1341
Query: 1391 ELLSHYEMLPLEEQLEIAHQTG 1412
++ + LP++ + +A + G
Sbjct: 1342 NIIRRFSELPIDRRTRVAKRAG 1363
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 148/725 (20%), Positives = 286/725 (39%), Gaps = 145/725 (20%)
Query: 58 LVVTAANVIEIYVVR-VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
L+V N++ IY + + S S + ET + A L L+ ++L+G V+ +
Sbjct: 29 LLVIRTNILSIYHLETILSPRSNTSSSQLETIEDATVTTSKQAKLFLINEFKLNGKVQDI 88
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + G NS + I+L+ AK+S+L FD SI+ S+H +E S
Sbjct: 89 ASIPLG---NSSSLECILLSTGTAKLSILNFDPSINSFETLSLHYYEEK---FKDISLVS 142
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL--------------KASQGGSGLVGDEDTFGS 222
A+ +++DP RC +L++ ++ L + ++ + + S
Sbjct: 143 LAKKSQLRMDPLNRC--LLMFNNDVMALLPLHSNNEDEEEEEEDENEEDEVLDNYEANLS 200
Query: 223 GGGFSARIE--------SSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ RI+ S + N+ L D+K++ D F++ + +P + +L++ LTWAG
Sbjct: 201 KTSPNKRIKYNNNQFEGKSKIFNINKLHEDVKNISDIQFLNNFNKPTIAVLYQPTLTWAG 260
Query: 273 RVSWKHHTC--MISALSI----STTLKQHP-----------LIWSAMNLPHDAYKLLAVP 315
V MI L I ST H +I L D +K++ +
Sbjct: 261 NVQLNPLPTHFMIFTLDILSENSTNNANHTTENNNNDLNLIIIAKLKELAWDWFKIIPIS 320
Query: 316 SPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQELPRSSFSVELD------- 365
+ G +V+G N I Y + + LN++A +L ++ + S F + +
Sbjct: 321 N---GCVVIGNNEIAYIDNTGVLQSIILLNSFADKNLKKTRIIDESKFQIFFNENVTHVW 377
Query: 366 ----AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-----------SKTNPSV 410
+ + T ++ LL +L + + +GR++ + D+ NP+
Sbjct: 378 SPSTSKNKTTEDDETLLLMDAQSNLYYVRLEAEGRLLTKFDIINLPIVNDVLRENCNPTC 437
Query: 411 LTSDITTIGNSL--FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
++ + NS F+G GDSL+V+ + LK + + S + +
Sbjct: 438 ISRLDSNATNSTMDLFIGFLSGDSLVVRL----------NNLKSAIDTRDEHSESNEHTQ 487
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQ------------KTFSFAVRDSLVNIGPLK 516
+ D +E +LY + E A+ + F SL NIGP+
Sbjct: 488 LNGFDE------EDEDNLYSDDEVDVEDARSKRDMETIIHTVQPFDIEYLTSLKNIGPIT 541
Query: 517 DFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH-NADSSRMAAYDDEY 575
+ G + D + G+ + E I T S+ H N + ++
Sbjct: 542 SLTVGKVSSLDLNVKGLQNPNKNEF-------SIVTTSGNSTGSHLNVIQQTVQPIVEKA 594
Query: 576 HAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR-----TIAAGNLF 618
++ ++ ++ + L T D + ++ + +GR T +F
Sbjct: 595 LKFISVTQIWNLKIKNKDKYLVTTDSTKSKSDIYDIDNNFSLHKEGRLRRDATTVYIAMF 654
Query: 619 GR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
G +RV+Q+ + D ++ + L+ + E V+ VS+ DPY+L+ +S
Sbjct: 655 GDGKRVVQITTNHLYLFDTNF--RRLTAIKFDFE---------VVHVSVMDPYILITVSR 703
Query: 678 GSIRL 682
G I++
Sbjct: 704 GDIKI 708
>gi|76157351|gb|AAX28300.2| SJCHGC08809 protein [Schistosoma japonicum]
Length = 225
Score = 107 bits (268), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 91/154 (59%), Gaps = 5/154 (3%)
Query: 242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
+ +V D F+HG+ EP +++L+E TWAGRVS + TC I ALS + + +P+IW
Sbjct: 52 KINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQ 111
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VSLDSSQELPR 357
+LP D ++ VP PIGGV+++ AN+I Y Q+ SC+L LN YA + Q++P
Sbjct: 112 ESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNCYAQISTNFPMRQDVP- 170
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
S + +D L L+ T++G+L LL++
Sbjct: 171 SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSL 204
>gi|157873900|ref|XP_001685450.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania major strain Friedlin]
gi|68128522|emb|CAJ08654.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania major strain Friedlin]
Length = 1541
Score = 107 bits (268), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 136/611 (22%), Positives = 264/611 (43%), Gaps = 72/611 (11%)
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
SA + L IL+ G ++ Y+ + GP K + + + V +R +
Sbjct: 929 SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVMESIKARKKR 988
Query: 890 LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRV 945
L+ R L + T ++ H + +R+ F+ + ++G ++ G P + +V+ +L
Sbjct: 989 LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRYKGIYVCGQTPVF-LVYHAATNQLVC 1044
Query: 946 HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYD-NYWPVQKVIPLKA 1003
++ F H+ + + GF+Y +G + + P G + W +++V L
Sbjct: 1045 TRHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGCSGWWLERV-RLGC 1102
Query: 1004 TPHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SS 1052
TPHQ+ Y + ++ S P P + L ++ D+E V H I+ +L S+
Sbjct: 1103 TPHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSA 1162
Query: 1053 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE----- 1107
T E YEV+ WQ + + +E L+ ++ + TT +
Sbjct: 1163 TSGSPVPTNERYEVQFFSTLN----WQCMGRLVLDGNEKVLSATLMQVTRDTTMDAANRS 1218
Query: 1108 -NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-Q 1165
+ A+ TAY GEDV RGR+LL +T + + + ++ + ++G ++A+ + +
Sbjct: 1219 TTAPVCALATAYPLGEDVTTRGRILLLTTSQQSGQGMQQLRTLHEEPMEGPVTAITRVGE 1278
Query: 1166 GHLLIASGPKIILHKWTGT----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
+ +A G + ++++ E I + A YV L + ++++GD+ S+ F
Sbjct: 1279 DCVAVAVGGTVRVYRYDANKSTMETMAILYAGA---YVTCLQAFREYLVIGDLFNSVLFA 1335
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKG 1280
+ E+ + +L +D ++ + + L + L+V+D+ +N+ Y P+ + E K
Sbjct: 1336 RYSEEIHTITILGRDTSAISVVSNDMLYHDTRFGLLVTDDARNLMCMSYKPRVLEEHGKP 1395
Query: 1281 QKLLSR-----AEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
K+L E+ + G + K +RL+ + S ++ T G I
Sbjct: 1396 PKVLESLLTVTGEYRLAGGVLLKMMRLRAASARSSSV-------------AIYVTNMGEI 1442
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1394
G + PL + T R Q + ++L V H GL PR F F + S+ E +
Sbjct: 1443 GYLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMFLGFPQDDPLR-----SLKGEEWML 1497
Query: 1395 HYEMLPLEEQL 1405
H+ PL EQL
Sbjct: 1498 HF---PLLEQL 1505
Score = 43.9 bits (102), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 81/185 (43%), Gaps = 37/185 (20%)
Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
GGG S + V + R D+K +++D FV EP++ L E++ TWAGRV
Sbjct: 282 GGGTSLLLRVGTVTHWRLQDVKSALRNIRDVQFVQSAGEPLLAFLFEKQPTWAGRVKLLE 341
Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGV 321
W+ TC I ++++ + H L S ++ LP+D + +P+ P
Sbjct: 342 WRSKTVESHMLTCSIEWMKVTLANSATPHMLSLSEVDGLPYDVTSMTPLPAFQDLPSAVF 401
Query: 322 LVVGANTIHYHSQSA----------SCALALNNYAVSLD------SSQELPRSSFSVELD 365
V +H ++S A +L + AVSL+ +SQ L V L+
Sbjct: 402 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLEAVQWRSASQALSTDLVKVNLN 461
Query: 366 AAHAT 370
A+AT
Sbjct: 462 FANAT 466
>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 1 [Ciona intestinalis]
Length = 1150
Score = 107 bits (267), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 148/316 (46%), Gaps = 28/316 (8%)
Query: 1112 LAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTA+V E+ + GR+L+F DN LV E KE+KGA+ L GH+L
Sbjct: 836 FVVGTAFVYMEETEPKHGRILVF---HYIDNKLTLVAE---KEVKGAVFCLCQFNGHVLA 889
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
A + +++WT + + + + L +F+L+GD+ +S+ L++K L
Sbjct: 890 AINTSVSIYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNL 949
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1290
+ +AKD+ A E L D + L ++ N+ I + + KL A FH
Sbjct: 950 DEIAKDYSPNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFH 1006
Query: 1291 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1350
VG + F ++ + T + ++ +LFGT+ GSIG I +DE + L S
Sbjct: 1007 VGDSINTFRHGSLVMQNVGETAVS------SKGHILFGTVHGSIGVITTVDEDLYAFLHS 1060
Query: 1351 LQKKLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+Q +L + V ++ S+R F +N K AHR VD +L+ + L E+ E+A
Sbjct: 1061 IQNRLAKVIKSVGNIDHESWRSFCTNEKTEAHR----GFVDGDLIECFLDLNREKMAEVA 1116
Query: 1409 HQTGTTRSQILSNLND 1424
+ ++ N ND
Sbjct: 1117 ------KGLMVKNFND 1126
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 92/438 (21%), Positives = 167/438 (38%), Gaps = 109/438 (24%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F+ RIE VI+ + F+HGY P +VI+++ +H I
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---- 341
+ + W N+ +A ++ VP P+ G +++G +I YH+ +A
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIAPPQI 253
Query: 342 ---LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYD 394
+N YA +D + +L D+A G L +L + + D
Sbjct: 254 KDTINCYA----------------PVDKDGSRYLLGDLA------GHLFILLLESDEMMD 291
Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
G V+ L + + I+ + N + ++GSRLGDS L++
Sbjct: 292 GTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIR----------------- 334
Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVN 511
+ D+ R + S L N G + + + Q T S A ++
Sbjct: 335 ---LPTDSSMEGRPKPSLISVLDTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKE---- 387
Query: 512 IGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
G L+ G+ I AS ++LPG KG+W + D+SR +Y
Sbjct: 388 -GSLRIIRNGIGIQEHAS------------IDLPGIKGLWPL-------RVFDTSR--SY 425
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
D L+IS + +L+ + E T+ + + +T N+ +++Q+ E+
Sbjct: 426 DT-----LVISFVGHSRILQLSGEEVEETDLPGFDDESQTFYCSNVC-HNQLVQITEKSI 479
Query: 632 RILDGSYMTQDLSFGPSN 649
R++ + Q + P N
Sbjct: 480 RLISHTERRQVHEWKPKN 497
>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 2 [Ciona intestinalis]
Length = 1142
Score = 107 bits (267), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 149/319 (46%), Gaps = 27/319 (8%)
Query: 1112 LAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTA+V E+ + GR+L+F DN LV E KE+KGA+ L GH+L
Sbjct: 832 FVVGTAFVYMEETEPKHGRILVF---HYIDNKLTLVAE---KEVKGAVFCLCQFNGHVLA 885
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
A + +++WT + + + + L +F+L+GD+ +S+ L++K L
Sbjct: 886 AINTSVSIYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNL 945
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1290
+ +AKD+ A E L D + L ++ N+ I + + KL A FH
Sbjct: 946 DEIAKDYSPNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFH 1002
Query: 1291 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1350
VG + F ++ + T + ++ +LFGT+ GSIG I +DE + L S
Sbjct: 1003 VGDSINTFRHGSLVMQNVGETAVS------SKGHILFGTVHGSIGVITTVDEDLYAFLHS 1056
Query: 1351 LQKKLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+Q +L + V ++ S+R F +N K AHR VD +L+ + L E+ E+A
Sbjct: 1057 IQNRLAKVIKSVGNIDHESWRSFCTNEKTEAHR----GFVDGDLIECFLDLNREKMAEVA 1112
Query: 1409 -----HQTGTTRSQILSNL 1422
+ GT R + +L
Sbjct: 1113 KGLMVKEHGTKREATVDDL 1131
Score = 48.5 bits (114), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 91/217 (41%), Gaps = 40/217 (18%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F+ RIE VI+ + F+HGY P +VI+++ +H I
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
+ + W N+ +A ++ VP P+ G +++G +I YH+ +A
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIA---- 249
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDG-RVVQR 400
L Q+ V+ D + +L D+A G L +L + + DG V+
Sbjct: 250 --PLCFFQDTINCYAPVDKDGSR--YLLGDLA------GHLFILLLESDEMMDGTNTVRD 299
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L + + I+ + N + ++GSRLGDS L++
Sbjct: 300 LKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRL 336
>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
Length = 1318
Score = 107 bits (266), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 146/321 (45%), Gaps = 16/321 (4%)
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1155
N+ TK L +G YV+ ED++ G L+ P T E++ ++++
Sbjct: 979 LNSNTKRKREYLVVGNTYVRDEDISGTGSFYLYDITEVVPEPGKPDTNYKFKEIFQEDIR 1038
Query: 1156 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
G +S + + G +I+ K ++ + +AF D P +++ N +++GD
Sbjct: 1039 GTVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMP-VFITDAKSFGNLMIIGDA 1097
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
F+ + + ++ L K L+ + EFL++ + +++D + + + YAP
Sbjct: 1098 MHGFTFVGFDAEPYRMITLGKSVTKLETMSLEFLVNNGDMYFIITDRSQVMHVLKYAPDE 1157
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
S GQ+L+ F++ + + +RL GS + F + +DGSI
Sbjct: 1158 PNSLSGQRLVYCTSFNLHS-INTCMRLIQKNNEFVDLRRNYGSHMST-FQCIGCHIDGSI 1215
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1391
+ PL E ++RRL +Q++++D + GLNPR R ++ G RP ++D
Sbjct: 1216 FKVVPLTESSYRRLYLVQQQIIDKEVQLCGLNPRMERLQNPYYQLGHLLRP----MLDFT 1271
Query: 1392 LLSHYEMLPLEEQLEIAHQTG 1412
+L + L + ++ +A + G
Sbjct: 1272 ILKKFSTLSISKRRSMASKAG 1292
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 135/646 (20%), Positives = 266/646 (41%), Gaps = 105/646 (16%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ L L ++L G+V S+A++ Q G++ +++ K+S+L+FD L
Sbjct: 44 AKGQLVLSYEWKLSGHVHSMALIPQPGSE----LYCLVILTGCGKLSILKFDHMSQSLDT 99
Query: 157 TSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGV----LVYGLQMIILKASQGGS 211
S+H +E + L L + P + VD RC V + L + + K +
Sbjct: 100 LSLHYYEDKFKELSLLE----ISNTPSLIVDRSFRCLLVRNNDCIAILPLNVTKEEEEEE 155
Query: 212 GLVGDEDTFGSGGGFSAR------------IESSHVINLRDL--DMKHVKDFIFVHGYIE 257
++ +GG FS + + SS ++ L D+K+V D F+HG+ +
Sbjct: 156 EDNEKDEDRSNGGRFSFKRHKLNGGSVKQFVNSSTIMPASHLHSDIKNVLDVQFLHGFNK 215
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + IL++ L W+G + T + LS+ ++ +I LP+D + L+ + +
Sbjct: 216 PTLAILYQPILAWSGNEKLRSQTVKVIILSLDFEDEKSTVINIIQGLPNDLHTLIPLSN- 274
Query: 318 IGGVLVVGANTIHYHSQSASC--ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ-- 373
+VVG N + Y + + ++LN+++ ++ +++ SS + +
Sbjct: 275 --ASIVVGVNELIYIDNTGALQGTVSLNSFSKTVLNTKVKDNSSLQAFFNRPVCQYTTIS 332
Query: 374 --NDVALLSTKTGDLVLLTVVYDGRVVQ-----RL----DLSKTN--PSVLTSDITTIGN 420
D+ LL + + + + +GR+V RL D+ K N P+ + D+
Sbjct: 333 KGKDIMLLMDEKSQMYNVIIESEGRLVTAFNCVRLPIVNDIFKNNHLPTCICGDVDLETG 392
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
+L F+G + GD++ V+ +S+ S G E +EAD +
Sbjct: 393 NL-FIGFKSGDAMRVRLN-NLRSSLASKGNVVE--TMEADEDYDE--------------- 433
Query: 481 GEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
LYG ++ + T F D+L+NIGPL + G + + + ++ +
Sbjct: 434 -----LYGGSTEVEKKNMDTETPFDIETLDNLINIGPLTSLAVGKVSSIEPTIAKLTNPN 488
Query: 538 NYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD------------DEYHAYLIISLEA 585
EL + G T H + + + A + YL+ + +
Sbjct: 489 RCEL-SIVATSGNSTGSHLTVFENTIVPTVEKALKFISVTQIWNLKIKDKDKYLVTTDSS 547
Query: 586 RTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQ 641
++ + + D + +S D+ T++ +R++QV +G + D ++ MT
Sbjct: 548 QSKSDIYSIDRDFKPFKSFDFKKNDTTVSTAVTGAGKRIVQVTSKGVYLFDINFKRMMTM 607
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
+ F V+ V I DP++LL S G I++ +P
Sbjct: 608 NFDF--------------EVVHVCINDPFLLLTNSKGDIKIYELEP 639
>gi|151942273|gb|EDN60629.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
YJM789]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|323309632|gb|EGA62840.1| Cft1p [Saccharomyces cerevisiae FostersO]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|349577352|dbj|GAA22521.1| K7_Cft1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|323338222|gb|EGA79455.1| Cft1p [Saccharomyces cerevisiae Vin13]
gi|365766372|gb|EHN07870.1| Cft1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|207346484|gb|EDZ72967.1| YDR301Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|6320507|ref|NP_010587.1| Cft1p [Saccharomyces cerevisiae S288c]
gi|74583567|sp|Q06632.1|CFT1_YEAST RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|849213|gb|AAB64737.1| Ydr301wp [Saccharomyces cerevisiae]
gi|256271799|gb|EEU06830.1| Cft1p [Saccharomyces cerevisiae JAY291]
gi|285811316|tpg|DAA12140.1| TPA: Cft1p [Saccharomyces cerevisiae S288c]
gi|392300415|gb|EIW11506.1| Cft1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|190404756|gb|EDV08023.1| 150 kDa protein associated with polyadenylation factor 1
[Saccharomyces cerevisiae RM11-1a]
gi|259145538|emb|CAY78802.1| Cft1p [Saccharomyces cerevisiae EC1118]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1136 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|302652141|ref|XP_003017930.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
gi|291181516|gb|EFE37285.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
Length = 844
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 183/741 (24%), Positives = 288/741 (38%), Gaps = 110/741 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTGTQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHL 170
+ NS+ D+I+++ DAK+S++E+D HG+ S+H +E E W+
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRDAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVP- 140
Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDEDT- 219
S + G + VDP G C + +G+ + IL Q G LV GD+ T
Sbjct: 141 --DLGSCSSG--LTVDPNGNC-AIFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTD 195
Query: 220 FGSGGGFSARIESSH--------VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
S SA +SH V+ + LD + H F+H Y EP IL+ +
Sbjct: 196 MVSDAQKSAPGNTSHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVAR 255
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
+ S ++ K + + LP D +K++ +P P+GG L++G N
Sbjct: 256 STSLTIDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNEL 315
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
+H + A+ +N +A + +S + L+ L + LL G +
Sbjct: 316 VHVDQAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRM 375
Query: 387 VLLTVVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
+L+ DGR V + L +K PS S +G + F GS GDS+L+
Sbjct: 376 SILSFKVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLL 431
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
++ S T+ S K G E A D D + ++L AS E
Sbjct: 432 GWSRPSSTTKRPS--KSVDGVDENGAADLSDEADQDDDGDDDDMYEDDLYSVNPASTRQE 489
Query: 496 S------AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
+ F+F D L ++GP +D + G + + S +EL +G
Sbjct: 490 KQVVNGDSPADFTFRAYDRLWSLGPYRDITLGKPSKSKSKDQQDSVPEIAAPLELVAARG 549
Query: 550 I-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-----------MVLETA 593
TV + + DS +M DD Y + I ++ ++ +L
Sbjct: 550 FGKSGGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAK 606
Query: 594 DLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+ ESV Y V + TI G L RV+QV R D
Sbjct: 607 SKGEDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD-- 664
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQ 695
Y P E SE TV+ S A+PY+L D S+ +L D + V VQ
Sbjct: 665 YNLGLAQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQ 722
Query: 696 TPAAIESSKKPVSSCTLYHDK 716
AA S K +S C LY DK
Sbjct: 723 GSAA---SGKWISGC-LYEDK 739
>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
Length = 1305
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 79/343 (23%), Positives = 150/343 (43%), Gaps = 33/343 (9%)
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1155
N+ TK L +G YV+ ED+ G L+ P T +++ ++++
Sbjct: 967 LNSNTKRRREYLVVGNTYVRDEDIGGTGSFYLYDITEVVPEPGKPDTNYKFKDIFQEDIR 1026
Query: 1156 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
G +S + + G +I+ K ++ + +AF D P +++ N +++GD
Sbjct: 1027 GTVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMP-VFITDAKSFGNLMIIGDS 1085
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+ FL + + ++ L K L+ EFL++ + +V+D + + YAP
Sbjct: 1086 MQGFSFLGFDAEPYRMLTLGKSVSKLETMCVEFLVNNGDVYFLVTDRNNLMHVLKYAPDE 1145
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR--------FALL 1326
S GQ+L+ F++ + T L +D G K +R F +
Sbjct: 1146 PNSLSGQRLVHCTSFNLHSTNT----CMRLIKKNDEFG------KVSRGFGIYMPSFQCI 1195
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPG 1383
DG+I + PL E ++R L +Q++L+D + GLNPR R F+ G RP
Sbjct: 1196 GSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEVQLCGLNPRMERLENPFYQMGHILRP- 1254
Query: 1384 PDSIVDCELLSHYEMLPLEEQLEIAHQTG-TTRSQILSNLNDL 1425
++D +L + L + ++ +A + G ++I +L D+
Sbjct: 1255 ---MLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIWRDLIDI 1294
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 120/624 (19%), Positives = 248/624 (39%), Gaps = 124/624 (19%)
Query: 140 AKISVLEFDDSIHGLRITSMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
++S++ FD L S+H +++ E L G P ++ +P RC +LV+
Sbjct: 82 GRVSIVRFDAENQTLETESLHYYDAKFEELSALTVGA-----APRLEQEPAARC--LLVH 134
Query: 198 GLQMIILKASQGGSGLV-------------GDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+ + +G D G G S + +SH+ + D+K
Sbjct: 135 NGDCLAVLPLRGHEEEGEEAEEEEEHPAKRARTDADGRLVGASTVMPASHLHS----DIK 190
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
+VKD F+ G + + +L++ +L+W G T LS+ ++ +I L
Sbjct: 191 NVKDMRFLRGLNKSAVGVLYQPQLSWCGNEKLTRQTMKFIILSLDLDDEKSTVINMLQGL 250
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC--ALALNNYAVS-----------LDS 351
P+ + ++ + + G ++ G N + Y + + A++LN ++ S L +
Sbjct: 251 PNTLHTIIPLSN---GCVLAGVNELLYVDNTGALQGAISLNAFSNSGLNTRIQDNSKLQA 307
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL---------D 402
E P F+ + + D+ LL + + + + +GR++ +
Sbjct: 308 FFEQPLCYFATQSNG-------RDILLLMDEKARMYNVIIEAEGRLLTTFNCVQLPIVNE 360
Query: 403 LSKTN--PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ K N P+ + ++ SL F+G + GD++ V+ + L S L+
Sbjct: 361 IFKRNMMPTSICGNMNLETGSL-FIGFQSGDAMHVRL------NNLKSSLEH-------- 405
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGP 514
+ + S+ L+ + + + LYG NN E +K F D L+NIGP
Sbjct: 406 -------KGTVSETLE--TDEDYMELYG---NNAEKEKKNLETESPFDIECLDRLLNIGP 453
Query: 515 LKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADSSRMAA 570
+ + G + + + ++ + EL + G T+ + + + +
Sbjct: 454 VTSLAVGKASSIEHTVAKLANPNKDELSIVATSGNGTGSHLTILENTIVPTVQQALKFIS 513
Query: 571 YDDEYH-------AYLIISLEARTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
++ YL+ + ++T + + D + ++ D+ T++ G +R
Sbjct: 514 VTQIWNLKIKGKDKYLVTTDSSQTRSDIYSIDRDFKPFKAADFRKNDTTVSTAVTGGGKR 573
Query: 623 VIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
++QV +G + D ++ MT + F V+ V I DP++LL S G
Sbjct: 574 IVQVTSKGVHLFDINFKRMMTMNFDF--------------EVVHVCIKDPFLLLTNSKGD 619
Query: 680 IRLLVGDPSTCTVSVQT--PAAIE 701
I++ +P V+T P A++
Sbjct: 620 IKIYELEPKHKKKFVKTVLPDALK 643
>gi|390358537|ref|XP_001201130.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 283
Score = 105 bits (261), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 126/273 (46%), Gaps = 51/273 (18%)
Query: 3 FAAYKMMHWPTGIANC-GSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVT 61
+A Y+ +H PTG+ +C F + P ++ NLVV
Sbjct: 2 YAFYREIHPPTGVEHCVYCHFFS----------------------PDQQ------NLVVA 33
Query: 62 AANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ 121
+ + +Y + + K+S + LE + + G V S+ Q
Sbjct: 34 KGSELTVYSMITVDSNKPTDKDSKPKNK-----------LEEAATFHIFGKVMSM----Q 78
Query: 122 GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGP 181
RD+++L+F +AK+S++E+D ++H L+ SMH FE E K G P
Sbjct: 79 SAQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHP 135
Query: 182 LVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL 241
+VKVDP RC +L YG ++++L + GLV D D S + S+VI L ++
Sbjct: 136 VVKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEM 193
Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
D + +V D F+HGY EP ++IL+E TWAG
Sbjct: 194 DDPICNVLDIQFLHGYYEPTLLILYEPLRTWAG 226
>gi|71413583|ref|XP_808925.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|70873226|gb|EAN87074.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 444
Score = 104 bits (260), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 104/449 (23%), Positives = 193/449 (42%), Gaps = 51/449 (11%)
Query: 999 IPLKATPHQITYFAEKNLYPLIVSV-----PVLKPLNQVLSLLIDQEVG--HQIDNH--- 1048
I L TPH + Y ++ S P P + L+++ D+E G I
Sbjct: 4 IHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQRAPFDFQLNIVYDEESGGVQSITTEAPV 63
Query: 1049 -NLSSVDLH---RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTT 1104
N+ + + R + +E+R++ + W T+ ++ +E L +++ +
Sbjct: 64 CNMPPIAPNAGIRVPMADRFEIRLM----STTDWACTDTLLLEENERVLGAQMMEIHCEK 119
Query: 1105 TKE---NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1161
E + + TA+ GED+ RGR+LL +T + L+ +S+ L G +A+
Sbjct: 120 DAEGLHTAPVCVVSTAFPLGEDITCRGRILLLATMCTKKKRKILL--FHSEPLNGPATAV 177
Query: 1162 ASLQGHLLIASGPKIILHK--WTGTELN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
++ H+ +A G I L + W +L G Y YV ++ +N+++ GD+ +S
Sbjct: 178 VGIRHHIAVAVGGTIKLFRFDWEKRKLVVGALLYAGT--YVTRMSSFRNYLIYGDLSRSC 235
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
+ E+ L++L KD ++ + + L+ SD+++N+ + Y P++ E+
Sbjct: 236 AIARFNEENHTLSVLGKDRNAVSVVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETE 295
Query: 1279 KG--QKLLSR-----AEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
G K+L E+ + G + K LR + LA +S T L+ T
Sbjct: 296 AGSPNKVLESVLSLDGEYRLSGGCLVKSLRFRSLAGNSSVT--------------LYVTN 341
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVD 1389
G IG I P+ E R L ++L +PH AGL PR F + + + +V
Sbjct: 342 YGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVS 401
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
LL+ + L + + IA T ++
Sbjct: 402 ASLLNEFFFLDIHSRKTIASAAYTQLERV 430
>gi|348679451|gb|EGZ19267.1| hypothetical protein PHYSODRAFT_492468 [Phytophthora sojae]
Length = 736
Score = 104 bits (259), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 111/427 (25%), Positives = 169/427 (39%), Gaps = 102/427 (23%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+L++ V D F+ GY+EP +++LHE + + GR++ T I+ +SI+
Sbjct: 261 LLRLRELEITGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 320
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D +KL+ +P+GGV+V+ AN Y +Q+ LA N L
Sbjct: 321 TRLHPKIWTVKNLPSDCFKLIPCRAPLGGVVVLSANAFLYFNQTQFHGLATN----VLRE 376
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL-DLSKTNPSV 410
+ + ++ L +L LL+ GD +L++ Y+ R V+ + KT
Sbjct: 377 QDDHEMAQLNIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDRSVRFWRSIKKT---- 432
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF------TCGSGTSMLSSG----LKEEFGDIEAD 460
F+GSR GDS+L + G S L +KEE E
Sbjct: 433 ------------LFVGSRSGDSVLYALDQKKLTSAGGEASKLQEDEEMLIKEEVVKEEVT 480
Query: 461 -----------------------APSTKRLRRSSSDALQDMVNGEELSLYGSASN----N 493
AP+ + S S + VNG S N
Sbjct: 481 AEVKAEPAEEEEEDEDDLFLCGAAPTKEEPTTSGS---TEAVNGTNGSAVKKEENGHAVE 537
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV----------- 542
ES + D L +IG + + NAD S + ELV
Sbjct: 538 EESGPYDYVLHQIDVLPSIGQITSIELSIENNAD------SNEKREELVISGGYEHSGAI 591
Query: 543 ---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
EL GC+ +WTV + R Y+AYLI+S+ RT
Sbjct: 592 SVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR------SYNAYLILSVAHRT 645
Query: 588 MVLETAD 594
MVL T +
Sbjct: 646 MVLRTGE 652
>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
Length = 1135
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 157/339 (46%), Gaps = 25/339 (7%)
Query: 1085 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFST-GRNADNP 1142
P+ + EN ++ V +F KE L +GT YV+ ++ A GR+L+FS G+ A+
Sbjct: 789 PLDAFENGSSM-VSCVFANDKKE---YLVVGTGYVREDECEPAVGRLLVFSVEGQGAERK 844
Query: 1143 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY---V 1199
+L EV E +GA+ L G LL K+ L +W + +GI Y +
Sbjct: 845 VDLAAEV---ETRGAVYVLNGFNGKLLACINSKVQLFRWIEKD-DGIQELQTECGYHGHI 900
Query: 1200 VSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
++L++ +FI++GD+ +S+ L +K + +A+D+ + A E L D +
Sbjct: 901 LALHMQSRGDFIIVGDLMRSVSLLVYKAVDGAIEEVARDYHANWMTAVEMLNDDVYIG-- 958
Query: 1258 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG- 1316
+ NI + + +L + EFH+G V KF R +L SS+ +PG
Sbjct: 959 -GEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKFCRGSLLMQSSEVN--SPGG 1015
Query: 1317 --SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
S LLFGT++G +G I L E R L LQ + V V G + +R F
Sbjct: 1016 MDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTKVVKGVGGFSHDEWRSF- 1074
Query: 1375 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1413
+NG+ P + +D +L+ Y +P Q E+ T
Sbjct: 1075 TNGRRTSPSSN-FIDGDLVESYLDMPRHNQEEVLRHVDT 1112
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/418 (22%), Positives = 166/418 (39%), Gaps = 94/418 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
A+ + N+R L+ V D F+ G + + +L++ + + +H I
Sbjct: 143 MDAKGQLKDAFNIR-LEELEVLDIQFLSGCPKATIAVLYQDQR------NARH----IKT 191
Query: 286 LSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
+IST K+ W+ +N+ H+A +L+ VP+P GGVL++G TI YHS A + + N
Sbjct: 192 YTISTRDKEFDTGPWAQLNVEHNASELIPVPAPFGGVLILGHQTICYHSGKAFITIPIQN 251
Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL--VLLTVVYDGRVVQR 400
+ A+ W+ D + L+S +G L V+LT V+
Sbjct: 252 TRM------------------CAYG-WVDADGSRLLVSDHSGGLHVVILTPDATNTAVET 292
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ + S I+ + N + F+GS GDS L++ + E D
Sbjct: 293 AHIEALGETSCASSISYLDNGVVFIGSASGDSQLIKL------------------NPEKD 334
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
A T + D L +++ + T S +D G L+
Sbjct: 335 AQGTYIQVLETYDNLGPILD----MCVADLDRQGQGQAVTCSGCSKD-----GSLRIIRN 385
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
G+ IN A+ +EL G KG+W++ R N + + YL+
Sbjct: 386 GIGINEHAA------------IELAGIKGMWSL-----RPSNTNHDK----------YLV 418
Query: 581 ISLEARTMVL---ETADLLTEVTE-SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + T VL E D ++ E + F +G T+ G G +QV +RG ++
Sbjct: 419 QAFISETRVLAFEEDEDGDHQLAEGEIAGFQEGCTLFCG-CVGGNMAVQVTKRGVVLI 475
>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
Length = 1481
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 114/508 (22%), Positives = 213/508 (41%), Gaps = 64/508 (12%)
Query: 899 AYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRER--LRVHPQLCDGSIV 955
A ET C R I F +++G+ G ++ G P + + R L + G +
Sbjct: 939 AMIESETQLTRHCSRSIIPFDSLAGNVGAYVCGRHPLFLLWDRRTGLLSGYRHQIQGPVR 998
Query: 956 AFTVLH-----NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
F V C GF T ++ P G + + + I + ATPH I+Y
Sbjct: 999 GFAPFPLMEGGFVYCGEGF---TDFAVMNTYCRPIGHGW-----LGRRIDVGATPHFISY 1050
Query: 1011 FAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQEVG--HQIDNHNLS-------SVDLH 1056
++ S P P + L + ++E G I L+ S
Sbjct: 1051 NMPGRGCFVVTSHKQPFRPQRAPFDVQLKISYNEETGAIQSIATEPLTCSMPPIASSAGV 1110
Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETL--LA 1113
R + +EVR + A W T ++ +E L++++V + + K N T+
Sbjct: 1111 RVPMADWFEVRFMST--AHVDWPCEDTFKLEENERVLSIQMVQIDGDRGMKINGTVPVCV 1168
Query: 1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1173
+ TA+ G+DV RGR+ L +T + + + ++++ L G +A+A ++ H+ +A G
Sbjct: 1169 VSTAFPLGDDVTCRGRIHLLAT--KSLRRGHKIVHLHAEALNGPATAVAEIRHHIAVAVG 1226
Query: 1174 PKIILHKW---TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
I ++++ +G + + Y +Y L++++N+I+ GD+ S + E+ L
Sbjct: 1227 GTIKIYRYDWQSGKLVVSVLLYAG--IYATKLSVIRNYIVYGDLIHSCAMARFNEENHTL 1284
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK-------L 1283
+L ++ S+ + + ++ SD+Q+N+ + Y P++ E+ G+ L
Sbjct: 1285 TVLGRNRNSISVVDCNMMYHDRSFGILCSDDQRNVLVMGYTPRVQEAGAGRPAKTLESLL 1344
Query: 1284 LSRAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1342
E+ + + K LR SD N +L+ + G +G I P+ E
Sbjct: 1345 TLDGEYRLPSGCLAKSLRFS--------------SDFGNSSVMLYTSNYGEVGFIVPIGE 1390
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSF 1370
R + ++L VP AGL PR F
Sbjct: 1391 QANRTALWVTRRLQTDVPCDAGLTPRMF 1418
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 103/231 (44%), Gaps = 41/231 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMISALSISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ T ++ + IS ++
Sbjct: 268 IRYVRDLQFIGSSGEPLLAILCERQPTWAGRVKLVEWRTKVVESNTLTMHVTWVQISASM 327
Query: 293 KQHP---LIWSAMNLPHDAYKLLAV---PSPIGGVLVVGANTIHYHSQSASCALALNNY- 345
HP LI +P++ +L V + GV+ G N I + + N+
Sbjct: 328 TAHPKLLLIGEVEGVPYNVTHMLPVEPFSQTMSGVVCFGTNVIMHITTKRGYGAYFNDTG 387
Query: 346 ----------AVS----------LDSSQELPRSSFSVELDAAHATW--LQNDVALLSTKT 383
AVS LD S L R + S+ AA + + +++ +L+
Sbjct: 388 REECINSKFSAVSFGKAVWSDPQLDKSSALARVNMSLANCAATSMVGKMGDELQVLALLE 447
Query: 384 GDLVLLTV--VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
D V++T+ V G V+ + ++ S ++ IG L FLGS +GDS
Sbjct: 448 EDGVVITLHFVARGSSVEEVRITMLGSGCYCSSVSRIGRQLVFLGSTVGDS 498
>gi|242208344|ref|XP_002470023.1| predicted protein [Postia placenta Mad-698-R]
gi|220730923|gb|EED84773.1| predicted protein [Postia placenta Mad-698-R]
Length = 696
Score = 100 bits (250), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 95/313 (30%), Positives = 141/313 (45%), Gaps = 33/313 (10%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDV 1124
+ ++ P+ G W T E + VTL T+T + +GT ED+
Sbjct: 396 LELISPEPEG--WVTMDGFESAQKEFVTCLDCVTLETTSTGSGMMDFIIVGTTINCREDL 453
Query: 1125 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1184
A +G V +FS + Q + KG ++AL L L+ + G KI + +
Sbjct: 454 AVKGAVYIFSIVEVVPDLQ------CRDDAKGPVAALCGLNNSLVSSMGQKIFVRAFDLN 507
Query: 1185 E-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
E L G+AF D +Y+ SL VKN +++ D KS + +L +L KD + C
Sbjct: 508 ERLVGVAFLDVG-VYITSLRAVKNLLVISDAVKS-------KDPYKLVILGKDPYQV-CV 558
Query: 1244 ATE--FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1301
T F DG L++ DE I+I+ Y P ES GQ LL R EFH
Sbjct: 559 TTADLFFADGQVF-LLIGDEDGVIRIYEYDPHDPESRGGQHLLRRTEFHG---------- 607
Query: 1302 QMLATSSDRTGAAPGSD-KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1360
QM + S G D + L+ G+ +GS+ +DE+ +RL LQ +L +V
Sbjct: 608 QMESRMSILIIRRRGKDTDIPQARLISGSTNGSLSMFTYVDEVASKRLHLLQGQLTRNVQ 667
Query: 1361 HVAGLNPRSFRQF 1373
HV GLNP+ FR +
Sbjct: 668 HVVGLNPKVFRPY 680
>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS 8797]
Length = 1345
Score = 100 bits (248), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 65/321 (20%), Positives = 147/321 (45%), Gaps = 18/321 (5%)
Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1155
N+ TK + +GT++V ED+ A G ++ P + + + +EL+
Sbjct: 1010 LNSRTKRKIEYVVVGTSFVGTEDLPATGSFQMYDIAEVVPEPGKPDTNYKIKQFFKEELR 1069
Query: 1156 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
A++++ + G +I+ K+++ + +AF D P L+ + N +++GD
Sbjct: 1070 SAVTSVCDISGRFVISQSQKLMVRDAQEDNSVVPVAFLDIP-LFTADMKSFGNLLIIGDA 1128
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+ I + + + ++ L + + + EFL++G L + D + + YAP
Sbjct: 1129 MQGIQLVGFDAEPYRMIPLGRSVLKFETLSLEFLVNGGDLYFTLIDRNDILHVLKYAPDE 1188
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
S GQ+L+ + F++ + + L ++ P + + ++ G DGS+
Sbjct: 1189 PNSLSGQRLIHCSSFNMYSTTS----CTRLIPKNELFVDGPLNPAIQSYQVIGGQADGSL 1244
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1391
+ P+ E +RRL +Q++++D +AG+NP+ R ++ RP ++D
Sbjct: 1245 FKVMPVPETVYRRLYVVQQQIIDKETPLAGINPKMERLSNDYYQTSHLLRP----MLDYN 1300
Query: 1392 LLSHYEMLPLEEQLEIAHQTG 1412
++ + + + ++ +AH+ G
Sbjct: 1301 VVKQFCAMSIPKRTTLAHKLG 1321
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 149/708 (21%), Positives = 285/708 (40%), Gaps = 150/708 (21%)
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
T A+ +E+ +VR + SG+ L L ++L + LA++
Sbjct: 22 TTADYVELLIVRTNLLSIYKVTESGK--------------LLLTHEFKLQARITDLALV- 66
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF--- 177
G +N+ + ++L + K+S+++F+ + L S+H +E K SF
Sbjct: 67 -GSVENTGL-NYLLLGIGNCKLSIVKFNSLNNSLETISLHYYEE------KFKANSFIEL 118
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA--------R 229
A+ +++DPQ RC +L ++IL SQ E+ +
Sbjct: 119 AKKTELRIDPQNRCA-LLFNNDNIVILPFSQQQEEEDYGEEEEEEDNYNMEDGPNVKKLK 177
Query: 230 IES--------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH- 278
+ES S + + + LD +++V D F+ + P + IL++ +LTWAG +
Sbjct: 178 LESASTNLTLPSIITDSKKLDSTIENVVDIQFLRNFSRPTLGILYQPKLTWAGNLQLNPL 237
Query: 279 -HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
++ +L+I+ + + +I LP D++ L +P+ G VL+ G+N + Y +
Sbjct: 238 PTKFLVISLNIAVSELEGTVITKLEGLPWDSHTL--IPTWNGCVLL-GSNEVSYIDNTGV 294
Query: 337 -SCALALNNYA-VSLDSSQELPRSSFSVEL--DAAHATWLQ--------NDVALLSTKTG 384
A+ LN+YA SL + + + + L D + W +++ LL ++
Sbjct: 295 LQSAIFLNSYADASLRKVRVVDHTDQQITLNKDLVKSLWSAPTKESGGADEILLLMDESS 354
Query: 385 DLVLLTVVYDGRVVQRLDL-----------SKTNPSVLT--SDITTIGNSLFFLGSRLGD 431
+L + + ++GR++ + D+ +P+ +T + N F+G + GD
Sbjct: 355 NLYYIQLEFEGRLMTKFDMINLPIVNDIFVHNLHPTCITRIDESKHNININLFIGFQTGD 414
Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---LYG 488
SL+V+ +I + + +++SS++ V E+ LYG
Sbjct: 415 SLVVRL-----------------NNIRSAIETRHEYKQTSSESGLGKVEDEDEDEDDLYG 457
Query: 489 -------SASNNTESAQ----KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK-- 535
+AS N ++A + F + L NIGP+ G + G+
Sbjct: 458 DDGAHDKNASVNNDNAVVHTVQPFDIEMMSCLRNIGPVTSLVIGEASSVQPVIKGLPNPN 517
Query: 536 QSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--------EART 587
+ Y LV G + G N +++ + A IS+ + R
Sbjct: 518 KGEYSLVATCG----------NGTGSNLMVGQISVQPEVELALKFISVTQIWNLKVKNRD 567
Query: 588 MVLETADL------LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILD 635
L T D + E+ + + QGR T+ G +R++QV + D
Sbjct: 568 KYLITTDSTKTKSDIYEIENNFALYKQGRLRRDATTVYISMFGGEKRIVQVTTNHLYLYD 627
Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
++ + L N E V+ VS+ DPY+L+ +S G I +
Sbjct: 628 TNF--RRLFLNKFNYE---------VVHVSVMDPYLLITLSRGDIMIF 664
>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
Length = 1350
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/340 (22%), Positives = 153/340 (45%), Gaps = 25/340 (7%)
Query: 1086 MQSSENALTVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1142
++ EN+L + ++ ++ TK + G + V ED+ G L+ P
Sbjct: 996 VEFEENSLVNDIRSMIIQIDSRTKRKREYIVAGFSAVGTEDLPPSGSFHLYDITAVVPEP 1055
Query: 1143 QNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPP 1196
T + +E++G+++++ + G I+ KI++ + +AF D P
Sbjct: 1056 GKPDTNYKFERFFKEEVRGSVTSVCEISGRFAISQSQKIMVRDAQEDGSVVPVAFLDIP- 1114
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
++V + N +++ D F+ + + ++ L K + EFL++ +
Sbjct: 1115 IFVTDMKSFGNLMIISDAMHGFQFVGFDAEPYRMIQLGKSVSKFKTMSVEFLVNNGDIYF 1174
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1316
V+D + + YAP S+ GQKL+ + F++ A + +L +D
Sbjct: 1175 AVTDRDNILHVLKYAPDEPNSFSGQKLVHCSSFNLYADNS----CMVLLAKNDEFNKV-- 1228
Query: 1317 SDKTNR-FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---Q 1372
D TNR + ++ G DGS+ I PL E ++RRL +Q++++D + GLNPR R Q
Sbjct: 1229 -DDTNRTYQVVGGQTDGSMFKIVPLSEESYRRLYVIQQQIIDKETQLGGLNPRMERLSNQ 1287
Query: 1373 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+ RP ++D ++ + +P+ ++ +A + G
Sbjct: 1288 YLPLCHVMRP----MLDFNVIRKFSAMPISKRQALAQKLG 1323
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 144/669 (21%), Positives = 270/669 (40%), Gaps = 133/669 (19%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L ++ G + +A+L + A D ++L AKIS+++FD + + S+H
Sbjct: 48 LFLTNEFKFDGRITDIALLPRQDA----ALDYLLLCTAVAKISIVKFDLESNSIETVSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC------GGVLVYGLQMIILKASQGGSGLV 214
+E ++ L R +++DP RC + V M + G
Sbjct: 104 YYED-KFKDLSLAE--LTRESKLRLDPASRCLVLFNEDNIAVLPFVMKEDEEDDDEEGEE 160
Query: 215 GDEDTFGSG-GGFSARIES-----SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
DEDT+ F A I S +++ + + D++++ D F++ Y +P + IL++
Sbjct: 161 EDEDTYEPRIKRFRANINGRVTFPSTILSAKTIHEDIQNIIDIEFLNNYSKPTVAILYQP 220
Query: 267 ELTWAGRVSWKH--HTCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
+LTW G + +I L +T T H +I LP D ++L+ V + G
Sbjct: 221 KLTWVGNLQLHPLPTKLLIVTLECNTNGFETSLSHIVIARLNELPWDWHRLIPVTN---G 277
Query: 321 VLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQND 375
+++VG N + Y + + LN++A + L +S S E + + +++
Sbjct: 278 IVIVGINELAYVDNTGVLQTVILLNSFA-----DRNLKKSRIIDHSKEESVFNHSAMKH- 331
Query: 376 VALLSTKTGD---------------LVLLTVVYDGRVVQRLDLSK--------------T 406
+ +L T G+ L + ++ +GR++ + D+ K T
Sbjct: 332 ICILKTTDGNEDDADLLLLMDDRSNLYYVQMISEGRLMTQFDIIKLPIINNIFINNLNPT 391
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
+ S L S + + LFF G + GD+ F C + ++E D+ D PS
Sbjct: 392 SISRLDSSSSRVNLDLFF-GFQSGDA----FVCRLNNIKSAVETRKEHKDV-LDYPS--- 442
Query: 467 LRRSSSDALQDMVNGEEL----SLYGSASNNTESAQ-------------KTFSFAVRDSL 509
++D + +G +L LY + +T+ A + F A+ SL
Sbjct: 443 ----NADEYDE--DGADLYGDDDLYSDEATSTQRANSKENGRSNMIETVEPFDIALLSSL 496
Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADS 565
NIGPL + G D + G+S +N EL + G T S R +
Sbjct: 497 NNIGPLTSLTSGKVSAVDQNNKGLSNPNNNELSIVATSGNGTGSHLTAVLPSVRPEIELA 556
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADL------LTEVTESVDYFVQGR-----TIAA 614
+ + ++ + + + L T D + E+ + +GR T +
Sbjct: 557 LKFISITQIWN----LKFKGKDKFLVTTDSTKSKSDIYEIDNNFALHREGRLRRDATTVS 612
Query: 615 GNLFGR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
+FG +R++QV +LD ++ + + V+ VS+ DPY+L+
Sbjct: 613 IAMFGSDKRIVQVTTNHLYLLDTTF-----------RRLNTIKFDYEVVHVSVMDPYILI 661
Query: 674 GMSDGSIRL 682
+S G I++
Sbjct: 662 TVSRGDIKV 670
>gi|9794906|gb|AAF98387.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 279
Score = 97.4 bits (241), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 141/300 (47%), Gaps = 65/300 (21%)
Query: 425 LGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEFGDIEADAPSTKRLRRSSS 472
LGSRLG+SLL+ FT +++++ L++E ++E + +L + +
Sbjct: 1 LGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLE-EIFDVDQLEMAPT 59
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD----- 527
A + EEL +YGS + + + F F V DSL+N+ P+ G R+ +
Sbjct: 60 QAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERVEFEEDGVT 119
Query: 528 ---------------ASATGISKQS---------NYELV---ELPGCKGIWTVYHKSSRG 560
+ATG SK N +++ EL GC +WTV+
Sbjct: 120 LRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFD----- 174
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
D+++ ++ +D+ H ++++S T+VL+T + E+ E+ + V TI GNL +
Sbjct: 175 ---DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQ 229
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
R ++QV R R+L G+ + Q++ S V+ VSIADPYV L + +G +
Sbjct: 230 RFIVQVTTRHVRLLQGTRLIQNVPIDVG----------SPVVQVSIADPYVCLRVLNGQV 279
>gi|159470707|ref|XP_001693498.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283001|gb|EDP08752.1| predicted protein [Chlamydomonas reinhardtii]
Length = 366
Score = 96.3 bits (238), Expect = 1e-16, Method: Composition-based stats.
Identities = 69/217 (31%), Positives = 93/217 (42%), Gaps = 54/217 (24%)
Query: 921 SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILK 979
+ H G F++G+RP W + R L H +G + A T HNVNC GFI S +G LK
Sbjct: 164 ANHSGVFVAGARPLWLVAGRGGLAAHAMWSEGPVAALTPFHNVNCPLGFITACSARGQLK 223
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
+C LP + D W ++V PLK TPH++ +F E IV+ +P
Sbjct: 224 VCCLPPHTRLDGAWATRRV-PLKVTPHRLAWFREAG----IVAAITSRPAPS-------- 270
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
R EE GG E AL ++ V
Sbjct: 271 -----------------RPRPAEE----------PGG-------------EQALCLKFVY 290
Query: 1100 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
L N TT + +TLLA+GT GED GR+LL+S
Sbjct: 291 LRNATTGDTDTLLAVGTGTPLGEDYPCLGRLLLYSVA 327
Score = 56.6 bits (135), Expect = 9e-05, Method: Composition-based stats.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL------SFGPSNSESGSGSE 657
+Y TIAAGNLF ++Q G R+L+G + QDL + G + S G
Sbjct: 4 EYITDQPTIAAGNLFHNAVIVQACPGGVRLLEGMSLVQDLPLSELQALGGVAAASRPGVA 63
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPA-AIESSKKPV 707
T+ + +ADPYVL+ +S+G+ LL D + T+ + A + ++PV
Sbjct: 64 PPTITHMQVADPYVLVSLSNGTACLLEADLLSMTLGLGGCAGGVPDCERPV 114
>gi|301093651|ref|XP_002997671.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110061|gb|EEY68113.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 101/387 (26%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 106 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 165
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 166 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 225
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
+ +L +V L +LQ LL+ G + +L++ Y+ + L
Sbjct: 226 TAQL-----NVVLYDCQFEYLQEKELLLTMPCGQVYVLSLPYEDTSSRGL---------- 270
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE----------ADA 461
G F+GSR GDS+L L + +EE D E A
Sbjct: 271 ---YGFGGKQTLFIGSRSGDSVLFVLD----KKKLVTATEEEPKDEEMPIKEVVIKQESA 323
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--------------------- 500
P K S A ++ + ++L LYG+A E A +
Sbjct: 324 PEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAP 378
Query: 501 ------FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV---------- 542
+ + +R D L +IG + G+ NAD S + ELV
Sbjct: 379 EQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGA 432
Query: 543 ----------------ELPGCKGIWTV 553
EL GC+ +WTV
Sbjct: 433 ISVLHNGLRPIVGTEAELNGCRAMWTV 459
>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1090
Score = 95.9 bits (237), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 164/355 (46%), Gaps = 35/355 (9%)
Query: 1061 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ 1120
+E + VR++E ++ + P+ EN ++ + T ++ +GTAY
Sbjct: 743 METHYVRLIEDQ----TFEIISGFPLDPYENGCSIITCSF----TDDSNVYYCVGTAYAL 794
Query: 1121 GEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1179
E+ ++GR+L+FS D LV E KE+KGA+ L + G LL KI L+
Sbjct: 795 PEESEPSKGRILVFSV---EDGKIQLVAE---KEVKGAVYNLNAFNGKLLAGINQKIALY 848
Query: 1180 KWT----GTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLL 1233
KWT GT + + + ++++L + +FI++GD+ KSI L +K + +
Sbjct: 849 KWTLRDDGTR--ELQYESSHHGHILALYVQSRGDFIVVGDLMKSISLLIYKPEEGAIEER 906
Query: 1234 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1293
A+D+ + A E L D + L ++ N+ + + +L E+H+G
Sbjct: 907 ARDYNANWMTAVEILDDDTYLG---AENSFNLFTVRKNNDAATDEERGRLEVVGEYHLGE 963
Query: 1294 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
V +F ++ P S+ + ++FGT++G IG IA L + F LQ LQ+
Sbjct: 964 FVNRFRHGSLVMR-------LPDSEASQIPTVIFGTVNGVIGVIASLPQDQFLFLQKLQQ 1016
Query: 1354 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
LV + V GL+ +R F + K + +D +L+ + L + EIA
Sbjct: 1017 ALVKVIKGVGGLSHEQWRSFSNERKT--VDARNFLDGDLIESFLDLSRNKMEEIA 1069
Score = 71.2 bits (173), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 126/557 (22%), Positives = 220/557 (39%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A+ L+ + ++G + +L + G +D + ++FE K VL++D
Sbjct: 39 RIEIHLLTASGLQPMLDVPIYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
GL +T S + GR + G + VDP R G+ +Y GL +I ++
Sbjct: 95 ET-GLLVTRAMGDVSD-----RIGRPT-DNGQIGIVDPDCRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCAKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P W NL + A L+ VP P+GG +++G T
Sbjct: 186 D-------ARHVKTYEVQLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y++ S A+ + + ++ V+ D + LLS G L L
Sbjct: 237 IVYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + +D V L++ + S ++ + N + F+GS GDS L++
Sbjct: 280 LVISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ +ADA + S + L+ VN G + L Q T S A +
Sbjct: 329 -------NHQADA------KNSYVEVLESYVNLGPIVDLCVVDLERQGQGQVVTCSGAFK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++ SS
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSLRASSS-------- 410
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
D Y +L++S E R + + T D L E TE + + +T+ N +++
Sbjct: 411 ------DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCYNAV-HDQLV 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D Q
Sbjct: 463 QVTAGSLRLVDAKTRRQ 479
>gi|159470705|ref|XP_001693497.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283000|gb|EDP08751.1| predicted protein [Chlamydomonas reinhardtii]
Length = 461
Score = 95.5 bits (236), Expect = 2e-16, Method: Composition-based stats.
Identities = 58/164 (35%), Positives = 95/164 (57%), Gaps = 7/164 (4%)
Query: 230 IESSHVINLRDL-DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+ +S+V+NL + ++ V+D +F+HGY EPV+++LHE + TWAGR+ + TC ++A+S+
Sbjct: 128 VGNSYVLNLHKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWAGRLRERKDTCCLTAISV 187
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYAV 347
S LK+H ++W A LP+D Y+LL +P LV+ + + SQ S A ALN+ A+
Sbjct: 188 SLRLKRHTVLWRAAGLPYDCYRLLPLPQR-PAALVLSPSLVMLTSQASQPQAAALNSTAL 246
Query: 348 SLDSSQEL----PRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
++ L R + SV A + ND A ++ LV
Sbjct: 247 PGEAPPPLVFDPAREAPSVTAARMAAEFALNDCAPALGRSAALV 290
>gi|301103688|ref|XP_002900930.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262101685|gb|EEY59737.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 613
Score = 94.7 bits (234), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 159/383 (41%), Gaps = 93/383 (24%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 241 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 300
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 301 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 360
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
+ +L +V L +LQ LL+ +G + +L++ Y+ + L
Sbjct: 361 TVQL-----NVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSRGL---------- 405
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI------EADAPSTK 465
G F+GSR GDS+L + K+E I + AP K
Sbjct: 406 ---YGFGGKQTLFIGSRSGDSVLFVLDKKKLVTATEEEPKDEEMPIKEVVIKQESAPEIK 462
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------------------------- 500
S A ++ + ++L LYG+A E A +
Sbjct: 463 -----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAPEQDT 517
Query: 501 --FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV-------------- 542
+ + +R D L +IG + G+ NAD S + ELV
Sbjct: 518 GPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGAISVL 571
Query: 543 ------------ELPGCKGIWTV 553
EL GC+ +WTV
Sbjct: 572 HNGLRPIVGTEAELNGCRAMWTV 594
>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
Length = 1150
Score = 94.4 bits (233), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 84/318 (26%), Positives = 145/318 (45%), Gaps = 33/318 (10%)
Query: 1108 NETLLAIGTAYVQGEDVAA-RGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
N + +GTAY+ E+ +GR+L+F+ TG + + LVTE KE+KGA+ L S
Sbjct: 802 NASYFVVGTAYIHEEEAEPHQGRILVFAVTGIHGERKLQLVTE---KEVKGAVYCLNSFN 858
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFILLGDIHKSIYFL 1221
G +L K L+KW+ N Y V+ + +FI++GD+ KSI L
Sbjct: 859 GKVLAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSISLL 918
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
S+K+ + +AKD S + + ++D T + S+ N+ + +
Sbjct: 919 SYKQLDGTIEEIAKDLNS-NWMSAVGIVDDDTY--IGSETDFNLFTVQRNSGAASDEERG 975
Query: 1282 KLLSRAEFHVGAHVTKF----LRLQMLATSS---------------DRTGAAPGSDKTNR 1322
+L + EFH+G V +F L +Q +++S D +AP +
Sbjct: 976 RLETVGEFHLGEFVNRFRYGSLVMQNSSSTSQTPSGVVSTGPTAMVDVGESAPAAPVVQN 1035
Query: 1323 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
++LFGT+ G IG I P+ + + L +Q+ L V V G + + +R F +
Sbjct: 1036 QSMLFGTVSGMIGVILPISKDQYSFLLRVQQALTHVVKGVGGFSHKDWRTFENRRSVSE- 1094
Query: 1383 GPDSIVDCELLSHYEMLP 1400
+ +D +L+ + LP
Sbjct: 1095 -ARNFIDGDLVESFLDLP 1111
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 111/467 (23%), Positives = 183/467 (39%), Gaps = 108/467 (23%)
Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
R+S R + +DP+GR G+ +Y ++ G L DTF
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---RDTF--------- 154
Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
N+R LD V D F+HGY +P + +L+E + A V H L
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYNKPTICVLYE-DYKAARHVKTYH------ILLKE 199
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
+ P WS N+ A L+ VP+P GGVL+V TI YH+ S A+ + + + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPTGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ + S F LL+ + G L ++ + + G+ V + L +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVSGVHLEVLGET 300
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
+ S ++ + N + F+GS GDS L++ + AD T
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIK--------------------LNADRDETG---- 336
Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S + L VN + + + + + T S A +D G L+ G+ IN
Sbjct: 337 SYIEVLDSYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
ASA ELPG KG+W + AA D++ +S E R
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKFLLQSYVS-EVRI 427
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + D + E + + F +T+ N++G +QV E R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDYW-LQVTESEVRLI 471
>gi|401828022|ref|XP_003888303.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon hellem ATCC 50504]
gi|392999575|gb|AFM99322.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon hellem ATCC 50504]
Length = 1155
Score = 94.4 bits (233), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 164/346 (47%), Gaps = 31/346 (8%)
Query: 1083 TIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1141
T ++ +E V+ + L + ++ L I T +++GED A+GR+ + +
Sbjct: 823 TYELEENEYVFDVKYLILDDMQGNYGKSPFLLICTTFIEGEDKPAKGRLHVLEIISVVPS 882
Query: 1142 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGI---AFYD 1193
P++ + K L KG+I + ++G + + G KI+++K + NGI FYD
Sbjct: 883 PESPFKDCKLKVLGIEKTKGSIVQCSEIRGKIALCLGTKIMIYKIDRS--NGIIPIGFYD 940
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
++ S+++VKN+IL DI++ + F ++ + +L+L++ + +TE LI G+
Sbjct: 941 LH-IFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNVTSTELLIAGNE 999
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
LS+V D + I + Y+P S G KL+ RAE + T+ R +
Sbjct: 1000 LSMVCCDSKGTIHAYTYSPNNIISMDGAKLVKRAE---------------MKTNLGRLFS 1044
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
+ + N + + + SI +A +D+L + +L +Q ++ + V GLN R +
Sbjct: 1045 SGIGFRKNSI-MFYSKTNLSIH-LAGIDDLNYPKLLEIQTSIMVHLKSVLGLNQRDY--L 1100
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
+S+ H S + +L+ + L Q I+ +R +IL
Sbjct: 1101 NSDIHLHSLSLKSPIVMHILNLFSYFDLNTQKLISSSVRMSRREIL 1146
>gi|159470709|ref|XP_001693499.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283002|gb|EDP08753.1| predicted protein [Chlamydomonas reinhardtii]
Length = 279
Score = 94.0 bits (232), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 123/262 (46%), Gaps = 28/262 (10%)
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
L+ AF+D P L L VK+++L D+H+ ++FL + + L ++KDF D
Sbjct: 29 LDKRAFFDLPSL-ATGLVTVKDYLLASDVHQGLFFLRYSDASRVLEFMSKDFDGRDVLTC 87
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMS---ESWKGQKLLSRAEFHVGAHVTKFLRLQ 1302
+I L + +D +Q+ + K E W GQ+L HV V +Q
Sbjct: 88 GVVIAEPKLHFLAADAAGTLQMMEFYGKRDTNPEFWAGQRLAPMGLLHVARRVGVAASVQ 147
Query: 1303 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPH 1361
+ + D NR ALL G+ +G + +AP+ D RL +LQ + ++PH
Sbjct: 148 LAS-----------RDGRNRHALLCGSAEGGLSFVAPVPDPQAAARLAALQAHMSATLPH 196
Query: 1362 VAGLNPRSFRQFH-------SNGKAHR-PGP----DSIVDCELLSHYEMLPLEEQLEIAH 1409
VAGLNPRSFR G+ HR P P ++D +LL + L ++Q E A
Sbjct: 197 VAGLNPRSFRHRFIRIPKALGGGEHHRAPLPPRNNSGLLDGQLLLGFPHLSRQQQAEAAE 256
Query: 1410 QTGTTRSQILSNLNDLALGTSF 1431
G++ Q+L +L +A +F
Sbjct: 257 AVGSSPQQLLEDLRAIAAAATF 278
>gi|298715583|emb|CBJ28136.1| cleavage and polyadenylation specificity factor CG10110-PA
[Ectocarpus siliculosus]
Length = 1906
Score = 94.0 bits (232), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 77/303 (25%)
Query: 215 GDEDTFGSGGGFSARIESSHVINLR-------DLDMKHVKDFI----FVHGYIEPVMVIL 263
G+ED G G G +A+ + NL DL+ + FI F+ G+ EP + +L
Sbjct: 265 GEEDG-GLGNGATAKGDGGAGGNLAVSKPFTIDLEEAGITGFIKAAAFLEGFHEPALALL 323
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
+E T AGR++ K TC ++ LSI+ T + P+IW NLPHD++ L+ VPSPIGG+ V
Sbjct: 324 YEPIQTCAGRLASKRSTCRLALLSINLTQGRAPVIWQVENLPHDSWDLVPVPSPIGGLQV 383
Query: 324 VGANTIHYHSQS-ASCALALNNYA-VSLDSS-QELP------------------------ 356
+ N + + +QS LA+N YA ++D + E P
Sbjct: 384 ISTNAVMHVNQSEVRSILAVNGYARATVDPALLECPLRGGDSDWGWTSFRRSHPEREVVD 443
Query: 357 RSSFSV--ELDAAHATWLQNDVALLSTKTGD-----LVLLTVVYD--------------- 394
SS+ V ELD +L LLS +TG+ L L TV
Sbjct: 444 LSSYDVCIELDVVRCAFLTPTSMLLSLRTGEVYALRLHLTTVTAAAADAAGCSRPPGGAA 503
Query: 395 ----GRVVQR--LDLSKTNP-SVLT---------SDITTIGNSLFFLGSRLGDSLLVQFT 438
RVV + + + +P SVL + L F+GSR+GDSLLV ++
Sbjct: 504 FGTPNRVVGQSMRPVGRASPCSVLAVAASGGSGGDGGSGASKGLVFMGSRVGDSLLVDYS 563
Query: 439 CGS 441
S
Sbjct: 564 VAS 566
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 50/225 (22%), Positives = 91/225 (40%), Gaps = 60/225 (26%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+ Y+ +H PTG+ + G +T + + +LVV
Sbjct: 7 YTCYRQLHPPTGVDHAVFGSVTAAGSR---------------------------DLVVAK 39
Query: 63 ANVIEIYVVRVQEEGS----------KESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
A+ +E+Y V + S +++ N E D S LEL + L GN
Sbjct: 40 ASTLELYRVHRDDHSSTAAAAAAAAARDTSNGDERDD----DDASGYYLELAGTFPLAGN 95
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ +LA++ D ++++F AK++++ +D + L S+H F++
Sbjct: 96 ITALAVIP----------DILVVSFGVAKMALVAYDSVLGRLETISIHNFDAGAIGPGAG 145
Query: 173 GRES-FARGPLVK--------VDPQGRCGGVLVYGLQMIILKASQ 208
G ES + +K DP GRC +V G Q+++L A +
Sbjct: 146 GVESGYGLAAALKDRPRTISSSDPAGRCLAAVVAGCQLVVLPARR 190
>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
Length = 1128
Score = 93.2 bits (230), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 121/550 (22%), Positives = 229/550 (41%), Gaps = 93/550 (16%)
Query: 840 FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
+ A L DG +L Y+ L + ++T + R +S+ AS L
Sbjct: 612 YFLAALGDGRLLTYR--LDKSAKDTDSEKKFLYDQRQMSIGTQPAS-----------LSI 658
Query: 900 YTREETPH-GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
+ + H A C R T+ + SG G C + RE RV C S AF
Sbjct: 659 FETQNALHVFAACDRPTVIHSSSG------GGKLLCSNVNLREVTRV----CSFSSEAFP 708
Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ-KVIPLKATPHQITYFAEKNLY 1017
+C + + ++G L + T DN + + IPL P +I + +++
Sbjct: 709 -----DC----LALVTEGSLLL------GTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVF 753
Query: 1018 PLIVS---VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1074
++ + V + + N+ LS ++ ID+ + +++ +Y +E++E
Sbjct: 754 AVLTTKQVVTISEDGNEALSETTEEGYVRLIDD---TMMEIVHSYKLEQFETPC------ 804
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLF 1133
+ I + ++A K+N+ +GTAY E +RGR+L+F
Sbjct: 805 -------SVITVNFGDDA-----------AAKDNQDYFVVGTAYSYADEPEPSRGRMLVF 846
Query: 1134 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1193
+ + +T V + KGA+ ++ + G +L + + L +W+ TE +
Sbjct: 847 AV------REQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVRWSETESGARTLTE 900
Query: 1194 A----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
++++ + + +FIL+GD+ +S+ L++K + +A+D E L
Sbjct: 901 ECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARDIDPSWITVIEML- 959
Query: 1250 DGSTLSLVVSDEQK-NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
L +S E N+ S + +L E+H+G V + +++
Sbjct: 960 ---DLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRLVL--- 1013
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
P S + +LL+GT +G++G IA +DE TF+ L SLQ L + + V G+
Sbjct: 1014 ----QIPESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQHE 1069
Query: 1369 SFRQFHSNGK 1378
+R+F S +
Sbjct: 1070 DWRRFTSERR 1079
Score = 41.2 bits (95), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L +LD V D F++G+ +P + +L + +H +L
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
PL NL A L+ VP+PIGGV+V+G T+ Y S S L Y S+ S +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
R+ ++ D LL + G L LL T + L +
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ L S I + N ++GS GDS L++
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335
>gi|303391353|ref|XP_003073906.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon intestinalis ATCC 50506]
gi|303303055|gb|ADM12546.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon intestinalis ATCC 50506]
Length = 601
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 106/458 (23%), Positives = 206/458 (44%), Gaps = 62/458 (13%)
Query: 974 SQGILKICQLPS---GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK--P 1028
S+G L +C++PS G + + + + K IP+ TP I Y + Y ++ S ++ P
Sbjct: 185 SKGYLMVCRVPSIENGYVFGSGF-IGKKIPVLRTPKHIEY---ADRYMVVASCEEVEFSP 240
Query: 1029 LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1088
N ++ G ++ + VDL+ E+YE +T ++
Sbjct: 241 KN-------GKDCGVPVNTYRFY-VDLYS----EKYE--------------HISTYELEE 274
Query: 1089 SENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1147
+E ++ + L + ++ L + T +++GED A+GR+ + + ++
Sbjct: 275 NEYVFDIQYLVLDDMQGNYGKSPFLLVCTTFIEGEDRPAKGRLHVLEIISVVPSLESPFK 334
Query: 1148 EVYSKEL-----KGAISALASLQGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVS 1201
+ K L KG+I + ++G +++ G KI+++K G+ + I F+D + S
Sbjct: 335 DCKLKVLGIEKTKGSIVQCSEVRGKIVLCLGTKIMIYKIDRGSGIIPIGFHDLHT-FTSS 393
Query: 1202 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1261
+++VKN+IL DI++ + F ++ + +L+L++ + +TE LI G+ LS+V D
Sbjct: 394 ISVVKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNATSTELLITGNELSMVCCDI 453
Query: 1262 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
+ I + Y+P S G KL+ R E T RL SS R G S
Sbjct: 454 KGTIHAYTYSPNNIISMDGAKLVKRVEMK-----TNLGRL-----SSSRVGFRKNS---- 499
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
++ + + + +D ++ +L +Q ++ V GLNPR + +S+ H
Sbjct: 500 ---IMLYSRSNQLVHVNGVDNSSYLKLLGIQTSIMGCFKAVFGLNPRDY--LNSDIHLHS 554
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
S + +L+ + L Q I+ +R +IL
Sbjct: 555 LSLKSPIVLHILNLFSYFDLSIQESISSSVVMSRREIL 592
>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
Length = 1138
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/374 (24%), Positives = 167/374 (44%), Gaps = 50/374 (13%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ E G +++ HNL +D + + ++ MQ+ E AL++
Sbjct: 778 NTEFGQEVEVHNLLIIDQNTFEVLHAHQF-------------------MQT-EYALSLMS 817
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
L N + T +GT V E+ + GR++++ R ADN +V++ KE+KG
Sbjct: 818 AKLGN----DPNTYFIVGTGLVNPEEPEPKTGRIIIY---RYADNELKMVSD---KEVKG 867
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
A +L G +L + L++WT + L F + LY + +FIL+GD
Sbjct: 868 ACYSLVEFNGRVLACINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGD 924
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ +SI L +K+ +A+D+ A E L D + L +D N+ +
Sbjct: 925 LMRSITLLQYKQMEGSFEEIARDYQPNWMTAVEILDDDAFLG---ADNSNNLFVCLKDSA 981
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDG 1332
+ + Q++ A+FH+G V F ++ + S+R+ G +LFGT+ G
Sbjct: 982 ATTDEERQQMPEVAQFHLGDMVNVFRHGSLVMQNISERSTPTTG-------CVLFGTVSG 1034
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
+IG + + + L+ LQ+ L +++ V ++ +R FH+ K R + +D +L
Sbjct: 1035 AIGLVTQIQSDFYEFLRKLQENLTNTIKSVGKIDHSYWRSFHTETKMER--CEGFIDGDL 1092
Query: 1393 LSHYEMLPLEEQLE 1406
+ + L E+ E
Sbjct: 1093 VESFLDLSREKMRE 1106
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 117/519 (22%), Positives = 195/519 (37%), Gaps = 126/519 (24%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D+DT +L
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DKDT-----------NELKATSL 157
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
R +M HV+D F++G P ++++H+ ++ +H I IS K+ I
Sbjct: 158 RMEEM-HVQDVEFLYGTTHPTLIVIHQD-------INGRH----IKTHEISLKDKEFTKI 205
Query: 299 -WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAVSL 349
W N+ +A L+AVP P+GG +V+G +I YH + A+A +N YA
Sbjct: 206 AWKQDNVETEATMLIAVPMPLGGAIVIGQESIVYHDGDSYVAVAPAIIKQSTINCYA--- 262
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLS 404
+D+ +L ++A G+L ++ + + V+ + +
Sbjct: 263 -------------RIDSKGLRYLLGNMA------GNLFMMFLETEENAKGQTTVRDIKVE 303
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+ IT + N + F+GSR GDS LV+ +G + L E F ++ AP
Sbjct: 304 LLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVMLMETFTNL---APIV 360
Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
L+ G+ ++ GS G L+ G+ I
Sbjct: 361 DMCVVD----LERQGQGQMITCSGSFKE--------------------GSLRIIRNGIGI 396
Query: 525 NADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
A ++LPG KG+W + R+ D Y LI+S
Sbjct: 397 QEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLILSFV 431
Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQD 642
T VL + E TE +T N+ +++QV AR++ D M +
Sbjct: 432 GHTRVLMLSGDEVEETEIAGILGDQQTFYCANV-SHGQILQVTPSSARLISCDNKAMICE 490
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
P N G N+T + + A + + DG +
Sbjct: 491 WK-PPDNKRIGVVGANTTQIVCASAQDVYYVEIGDGKLE 528
>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1089
Score = 91.7 bits (226), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 140/308 (45%), Gaps = 22/308 (7%)
Query: 1105 TKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
T ++ +GTAY E+ +GR+L+F D LV E KE+KGA+ L +
Sbjct: 779 TDDSNVYYCVGTAYALPEESEPTKGRILVFLV---EDGKLQLVAE---KEMKGAVYNLNA 832
Query: 1164 LQGHLLIASGPKIILHKWT---GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
G LL KI L+KWT GT + I + + + +FI++GD+ KSI
Sbjct: 833 FNGKLLAGINQKIALYKWTLRDGTRVLEIESSHHGHILALYVQSRGDFIVVGDLMKSISL 892
Query: 1221 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
L +K + + A+D+ + A E L D + L ++ N+ + +
Sbjct: 893 LIYKPEEGAIEERARDYNANWMTAVEILDDDTYLG---AENSFNLFTVRKNNDAATDEER 949
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
+L E+H+G V +F ++ P S+ + ++FGT++G IG IA L
Sbjct: 950 GRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSEASLIPTVIFGTVNGVIGVIASL 1002
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
+ F LQ LQ+ LV + V GL+ +R F + K + +D +L+ + L
Sbjct: 1003 PQDKFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKT--VDARNFLDGDLIESFLDLS 1060
Query: 1401 LEEQLEIA 1408
+ EIA
Sbjct: 1061 RNKMEEIA 1068
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 122/556 (21%), Positives = 214/556 (38%), Gaps = 119/556 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A+ L+ + L+G + +L + G +D + ++FE K VL++D
Sbjct: 39 RIEIHLLTASGLQSMLDVPLYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
G IT S + GR + G + VDP R G+ +Y ++
Sbjct: 95 ET-GSPITRAMGDVSD-----RTGRPT-DNGQIGIVDPDCRLIGLHLYDGMFKVIPIDNK 147
Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
G F+ R+E V++++ F++G P + +L++
Sbjct: 148 GQ----------LKEAFNIRLEELQVLDIK-----------FLYGCANPTIAVLYQDNKD 186
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
H + P W NL + A L+ VP P+GG +++G TI
Sbjct: 187 -------ARHVKTYEVNLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQTI 237
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
Y++ S A+ + + ++ V+ D + LLS G L LL
Sbjct: 238 VYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYLL 280
Query: 390 TVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
+ +D V L++ + S ++ + N + F+GS GDS L++
Sbjct: 281 VISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL------------ 328
Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRD 507
+ +AD ++ S + L+ VN G + L Q T S A +D
Sbjct: 329 ------NHQAD------VKGSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAFKD 376
Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
G L+ G+ IN AS VEL G KG+W++ SS
Sbjct: 377 -----GSLRIVRNGIGINEQAS------------VELQGIKGMWSLRASSS--------- 410
Query: 568 MAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
D Y +L++S E R + + T D L E TE + + +T+ N +++Q
Sbjct: 411 -----DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCHNAV-HDQLVQ 463
Query: 626 VFERGARILDGSYMTQ 641
V R+++ Q
Sbjct: 464 VTAGSLRLVNAKTRKQ 479
>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
Length = 1137
Score = 90.9 bits (224), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 142/312 (45%), Gaps = 22/312 (7%)
Query: 1112 LAIGTAYVQGEDVAA--RGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
+GTAY+ ED A +GR+L+F+ TG + + LVTE KE+KGA+ L + G +
Sbjct: 806 FVVGTAYIH-EDEAEPHQGRILVFAVTGIHGERKLQLVTE---KEVKGAVYCLNAFNGKV 861
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFILLGDIHKSIYFLSWK 1224
L K L+KW+ N Y V+ + +FI++GD+ KS+ LS+K
Sbjct: 862 LAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSVSLLSYK 921
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
+ + +AKD S + + ++D T + S+ N+ + + +L
Sbjct: 922 QLDGTIEEIAKDLNS-NWMSALGIVDDDTY--IGSETDFNLFTVQRNSGAASDEERGRLE 978
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSS------DRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1338
+ EFH+G V +F + ++ D AP ++LFGT+ G IG I
Sbjct: 979 TVGEFHLGEFVNRFRYGSLTPAAAGPTDMVDVVEQAPIVPAAQNQSMLFGTVSGMIGVIL 1038
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
PL + + L +Q+ L V V G + + +R F + + +D +L+ +
Sbjct: 1039 PLTKDQYSFLLRVQQALTQVVKGVGGFSHKDWRMFENRRSVSE--ARNFIDGDLVESFLD 1096
Query: 1399 LPLEEQLEIAHQ 1410
LP + ++ +
Sbjct: 1097 LPKAQMTKVVDK 1108
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 109/467 (23%), Positives = 182/467 (38%), Gaps = 108/467 (23%)
Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
R+S R + +DP+GR G+ +Y ++ G L +DTF
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---KDTF--------- 154
Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
N+R LD V D F+HGY +P + +L+E H L
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYTKPTICVLYED-------YKAARHIKTYHILLKE 199
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
+ P WS N+ A L+ VP+P+GGVL+V TI YH+ S A+ + + + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPVGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ + S F LL+ + G L ++ + + G+ V + L +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVTGVHLEVLGET 300
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
+ S ++ + N + F+GS GDS L++ ++E G
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDENG-------------- 336
Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S + L VN + + + + + T S A +D G L+ G+ IN
Sbjct: 337 SYIEVLDTYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
ASA ELPG KG+W + AA D+Y +S E R
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKYLLQSYVS-EIRI 427
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + D + E + + F +T+ N++G +QV E R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDVW-LQVTESEVRLI 471
>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
Length = 1114
Score = 90.9 bits (224), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/325 (27%), Positives = 147/325 (45%), Gaps = 30/325 (9%)
Query: 1114 IGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GT Y + E RGR+L+F R D LV E KE+KGA+ L + G LL
Sbjct: 802 VGTGYSLPEEPEPTRGRILVF---RAEDGKLQLVAE---KEVKGAVYNLNAFNGKLLAGI 855
Query: 1173 GPKIILHKW---TGTELNGIAFYDAPPL-----YVVSLNIV--KNFILLGDIHKSIYFLS 1222
K+ L + G + G + Y+ ++V+L + FI++GD+ KS+ L+
Sbjct: 856 NSKVELFRGGDPVGADGAGGSTYELAKECSHHGHIVALYVAVRGEFIVVGDLMKSVSLLA 915
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
+K + + + A+D+ + A + L D + L ++ N+ + + +
Sbjct: 916 YKPEESVIEERARDYNANWMTAVDILDDDTYLG---AENNFNLFTLRRQSDAATDEERSR 972
Query: 1283 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1342
L E+HVG V +F R ++ D+ A + LLFGT+ G IG +A L
Sbjct: 973 LEVVGEYHVGEFVNRFRRGSLVMRLPDQENA-------DVPTLLFGTVSGVIGVLATLPR 1025
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP--GPDSIVDCELLSHYEMLP 1400
F L +LQ L +V V GL+ ++R F N HR G VD +L+ + L
Sbjct: 1026 EQFEFLSALQAALNKTVSGVGGLSHDAWRSFQ-NEHRHRAKDGARGFVDGDLIESFLDLR 1084
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDL 1425
E+ E+A + ++ + DL
Sbjct: 1085 PEKAREVAAAVKLSVDELTRRVEDL 1109
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 89/212 (41%), Gaps = 33/212 (15%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F R+E +V++++ F+HG P + +L+E H
Sbjct: 153 FDVRLEELNVVDVK-----------FMHGCATPTICVLYED-------TKEARHVKTYEV 194
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
TL+ P WS ++ + ++ VP+P+GG +VVG + I Y ++
Sbjct: 195 DVKEKTLRDGP--WSQSDVEGGSSLIIPVPAPLGGAIVVGESVIVYLNKDGG-------- 244
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ SV + A LLS TG L LL +V+D R V L L
Sbjct: 245 -----NGAGGAIATKSVNVMAHGVVDADGSRYLLSDSTGMLHLLVLVHDRRRVHALKLES 299
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ + S ++ + N + ++GS GDS LV+
Sbjct: 300 LGQTSIASTLSYLDNGVVYVGSAYGDSQLVRL 331
>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
Length = 1089
Score = 90.1 bits (222), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 140/304 (46%), Gaps = 38/304 (12%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQN 1144
+ + EN T+ + T + T +GTAY + E+ ++GR+L+F+ D
Sbjct: 764 LDTFENGCTIITCSF----TDDPATYYCVGTAYALPEENEPSKGRILIFTV---EDGKFQ 816
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GTELNGIAFYDAP--PLY 1198
LVTE KE KGA+ L + G LL KI L+KWT EL + LY
Sbjct: 817 LVTE---KETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGHHGHILALY 873
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTL 1254
V S +FI++GD+ KSI L +K + + A+D+ + A E L D G+
Sbjct: 874 VQSRG---DFIVVGDLMKSISLLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAEN 930
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
S + +KN + ++ +G +L E+H+G V +F ++
Sbjct: 931 SFNLFTVRKN------SDAATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMR-------L 976
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
P ++ + ++FGT++G IG +A L + F LQ LQ L + V GL+ +R F
Sbjct: 977 PDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFS 1036
Query: 1375 SNGK 1378
S K
Sbjct: 1037 SERK 1040
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 122/557 (21%), Positives = 217/557 (38%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++A L+ + ++G + +L + G +D + ++ E K VL++D
Sbjct: 39 RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + VDP+ R G+ +Y GL +I ++
Sbjct: 95 ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P W NL + A L+ VP+P+GGV+++G T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WLQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y+S SA A+ + + ++ V+ D + LLS TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + S ++ + N + ++GS GDS L++
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ + D+ R S + L+ VN G + L Q T S A +
Sbjct: 329 -------NAQVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
A D + +L++S E R + + D L E TE + + +T+ N ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D + Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479
>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
Length = 1089
Score = 90.1 bits (222), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 140/304 (46%), Gaps = 38/304 (12%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQN 1144
+ + EN T+ + T + T +GTAY + E+ ++GR+L+F+ D
Sbjct: 764 LDTFENGCTIITCSF----TDDPATYYCVGTAYALPEENEPSKGRILIFTV---EDGKFQ 816
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT----ELNGIAFYDAP--PLY 1198
LVTE KE KGA+ L + G LL KI L+KWT EL + LY
Sbjct: 817 LVTE---KETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGHHGHILALY 873
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTL 1254
V S +FI++GD+ KSI L +K + + A+D+ + A E L D G+
Sbjct: 874 VQSRG---DFIVVGDLMKSISLLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAEN 930
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
S + +KN + ++ +G +L E+H+G V +F ++
Sbjct: 931 SFNLFTVRKN------SDAATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMR-------L 976
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
P ++ + ++FGT++G IG +A L + F LQ LQ L + V GL+ +R F
Sbjct: 977 PDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFS 1036
Query: 1375 SNGK 1378
S K
Sbjct: 1037 SERK 1040
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 123/557 (22%), Positives = 218/557 (39%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++A L+ + ++G + +L + G +D + ++ E K VL++D
Sbjct: 39 RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + VDP+ R G+ +Y GL +I ++
Sbjct: 95 ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WSQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y+S SA A+ + + ++ V+ D + LLS TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + S ++ + N + ++GS GDS L++
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ + D+ R S + L+ VN G + L Q T S A +
Sbjct: 329 -------NAQVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
A D + +L++S E R + + D L E TE + + +T+ N ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D + Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479
>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
Length = 1089
Score = 90.1 bits (222), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 149/331 (45%), Gaps = 33/331 (9%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ ++GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 778 SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 831
Query: 1164 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 832 FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 890
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY---APKMS 1275
L +K + + + LA+D+ + A E L D + + N IF + +
Sbjct: 891 SLLVYKHEESAIEELARDYNANWMSAVEMLDDE-----IYIGAENNYNIFTVRKNSDAAT 945
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSI 1334
+ +G +L E+H+G V + ++ D G P ++FGT++G I
Sbjct: 946 DEERG-RLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP--------TVIFGTINGVI 996
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1394
G IA L + L+ LQ LV + V L+ +R FH++ K + +D +L+
Sbjct: 997 GIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIE 1054
Query: 1395 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L + E+A G ++ + +L
Sbjct: 1055 SFLDLSRNKMEEVAKGMGVPVEELSKRVEEL 1085
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + ++ +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G ++P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y + +++ ++ Q + R+ V+ D + LL G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N + ++GSR GDS LV+
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++AD S + L+ VN + + + + + T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++D Y YL++S + T L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429
>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
Length = 1090
Score = 89.7 bits (221), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 146/328 (44%), Gaps = 27/328 (8%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ ++GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 779 SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832
Query: 1164 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 833 FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 891
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + + LA+D+ + A E L D + ++ NI +
Sbjct: 892 SLLVYKHEESAIEELARDYNANWMSAVEMLDDEIYIG---AENNYNIFTVRKNSDAATDE 948
Query: 1279 KGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSIGCI 1337
+ +L E+H+G V + ++ D G P ++FGT++G IG I
Sbjct: 949 ERGRLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP--------TVIFGTINGVIGII 1000
Query: 1338 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
A L + L+ LQ LV + V L+ +R FH++ K + +D +L+ +
Sbjct: 1001 ASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIESFL 1058
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
L + E+A G ++ + +L
Sbjct: 1059 DLSRNKMEEVAKGMGVPVEELSKRVEEL 1086
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G ++P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCVKPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS NL + A L+ VP+P+GGV+++G TI Y + +++
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFR------ 247
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
++ Q + R+ V+ D + LL G L LL + ++ V L +
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHLLVLTHERERVTGLKIEY 297
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N + ++GSR GDS LV+ +++AD
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADPNG-- 337
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
S + L+ VN + + + + + T S A +D G L+ G+
Sbjct: 338 ----SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ KSS ++D Y YL++S
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDMYLVVSF 422
Query: 584 EARTMVL 590
+ T L
Sbjct: 423 ISETRFL 429
>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
Length = 1148
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 161/355 (45%), Gaps = 40/355 (11%)
Query: 1097 VVTLFNTTTKENETLLAIGTAYVQGEDVAA-RGRVLLFS-TGRNADNPQNLVTEVYSKEL 1154
++T T T +GTA+V E+ +GR+L+F+ +G + D LVTE KE+
Sbjct: 799 IITCIFTGDSSGGTYYVVGTAFVHEEEAEPHQGRILVFTVSGIHGDRRLQLVTE---KEV 855
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFIL 1210
KG++ L + G LL K+ L KW+ +E NG + V+ + +FI+
Sbjct: 856 KGSVYCLNAFNGKLLAGVNSKVYLFKWSESEENGEELVSECGHHGHTLVLYMESRGDFIV 915
Query: 1211 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQ 1266
+GD+ KSI L+ K+ + +A+D S A + D GS + Q+N
Sbjct: 916 VGDLMKSISLLNHKQLDGSIEEIARDLNSNWMTAVGIIDDDNYVGSETDFNLFTVQRN-- 973
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA-APG--------- 1316
+ S+ +G +L + E+H+G V +F ++ + GA APG
Sbjct: 974 ----SGAASDEERG-RLETIGEYHLGEFVNRFRYGSLVMQHNLSIGAEAPGISLSDDRPE 1028
Query: 1317 --SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
S + + ++LFGT+ G IG I P+ + L +Q L + V G + +R F
Sbjct: 1029 SLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQVIQGVGGFSHSEWRTFE 1088
Query: 1375 ---SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
S+ +AH + +D +L+ + L +E ++ + + + + L LA
Sbjct: 1089 NRRSSIEAH-----NFIDGDLIESFLDLSKDEMKQVVDELNRDQLEGKTTLEALA 1138
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 114/511 (22%), Positives = 192/511 (37%), Gaps = 115/511 (22%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I L + + VL +D ++ + + + R E G +DP G
Sbjct: 74 QDWIFLVTQRFQFCVLAYDTTLQQIITKANGSLRDT----IGRNSEILTNG---NIDPDG 126
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDF 249
R G+ +Y ++ L F+ R++ LR LD+K
Sbjct: 127 RLIGMNIYEGYFKVIPIDNHSKSL---------KAAFNIRLD-----ELRILDIK----- 167
Query: 250 IFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAY 309
F++GY +P + +L+E H L + P WS N+ A
Sbjct: 168 -FLYGYNKPTICVLYED-------FKAARHVKTYFILLKEKDFAEGP--WSQSNVEAGAN 217
Query: 310 KLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
L+ VP P GGVL++ TI YH+ + A+ + N + + + S F
Sbjct: 218 LLIPVPMPYGGVLIISNQTIVYHNGTYFHAIPMQNTMIQVYGAVGDDGSRF--------- 268
Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
LL+ + G L ++ + +G+ V + L + + S ++ + N + F+GS
Sbjct: 269 --------LLADQYGALHVVALQTEGKEVLDVYLEVLGQTSIASCVSYLDNGVVFVGSTF 320
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
GDS LV+ ++E G S + L VN + +
Sbjct: 321 GDSQLVKLNSK----------RDESG--------------SYIEVLDSYVNIGPIIDFCV 356
Query: 490 ASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
+ + + T S A +D G L+ G+ IN ASA ELPG
Sbjct: 357 MDLDRQGQGQIVTCSGADKD-----GSLRVIRNGIGINEQASA------------ELPGI 399
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDY 605
KG+W + + EY YL+ S E R M + +D + EV ++
Sbjct: 400 KGMWALRESLA--------------SEYDKYLVQSYLNEIRIMTIGDSDEMEEV--EIEA 443
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
F+ +T+ N+ +QV E RI+D
Sbjct: 444 FLDAKTLYCRNV-NEDGWLQVTETEVRIIDA 473
>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
Length = 1033
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)
Query: 1105 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+L+F D ++TE KE KGA+ +L S
Sbjct: 722 SDDNNLYYCVGTAYVMPEENEPTKGRILVFLV---EDGKLQVITE---KETKGAVYSLNS 775
Query: 1164 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 776 FNGKLLAAINQKIQLYKWMLRDDGSRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 834
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + A+D+ + A E L D L + N +F K SE
Sbjct: 835 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGA 888
Query: 1279 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 889 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTVNGVIG 941
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
IA L + L+ LQ L + V GL+ +R F++ K + +D +L+
Sbjct: 942 VIASLPHEQYIFLEKLQSNLRRVIKGVGGLSHEQWRSFNNEKKTVE--AKNFLDGDLIES 999
Query: 1396 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L EI+ G + ++ + +L
Sbjct: 1000 FLDLSRNRMDEISKAIGVSVEELCKRVEEL 1029
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 146/362 (40%), Gaps = 85/362 (23%)
Query: 231 ESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
E+S +I L+ V D F++G +P +V+L++ +H AL
Sbjct: 95 ETSELIT--RLEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVALK-DK 145
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
+ P W+ NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 146 DFGEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR------- 196
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV 410
+ R+ V+ D + LL G L LL + ++ V L + +
Sbjct: 197 --PSITRAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETS 246
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ S I+ + N++ ++GS GDS LV+ +++ DA + S
Sbjct: 247 IASTISYLDNAVVYIGSSYGDSQLVKL------------------NLQPDA------KGS 282
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADA 528
+ L+ VN + + + + T S A +D G L+ G+ IN A
Sbjct: 283 YVEVLESYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQA 337
Query: 529 SATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
S VEL G KG+W++ ++ DD + +L++S + T
Sbjct: 338 S------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETR 371
Query: 589 VL 590
+L
Sbjct: 372 IL 373
>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
Length = 1090
Score = 89.4 bits (220), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 149/331 (45%), Gaps = 33/331 (9%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ ++GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 779 SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832
Query: 1164 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 833 FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 891
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY---APKMS 1275
L +K + + + LA+D+ + A E L D + + N IF + +
Sbjct: 892 SLLVYKHEESAIEELARDYNANWMSAVEMLDDE-----IYIGAENNYNIFTVRKNSDAAT 946
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSI 1334
+ +G +L E+H+G +F ++ D G P ++FGT++G I
Sbjct: 947 DEERG-RLEVVGEYHLGEFGNRFRHGSLVMRLPDSEMGQIP--------TVIFGTINGVI 997
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1394
G IA L + L+ LQ LV + V L+ +R FH++ K + +D +L+
Sbjct: 998 GIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIE 1055
Query: 1395 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L + E+A G ++ + +L
Sbjct: 1056 SFLDLSRNKMEEVAKGMGVPVEELSKRVEEL 1086
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + ++ +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G ++P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y + +++ ++ Q + R+ V+ D + LL G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N + ++GSR GDS LV+
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++AD S + L+ VN + + + + + T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++D Y YL++S + T L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429
>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
Length = 1089
Score = 89.0 bits (219), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/396 (23%), Positives = 170/396 (42%), Gaps = 36/396 (9%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ + L+ +VEE E +R+L+ +++ P+ E ++
Sbjct: 717 RRICHQEQSRTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVYPLDQYECGCSI 772
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
+ ++ +GTAYV E+ +GR+L+F+ D L+ E KE
Sbjct: 773 ISCSF----ADDSNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSLQLIVE---KET 822
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL A KI L+KW EL + L + + +FI
Sbjct: 823 KGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYT-QTRGDFI 881
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD+ KSI L +K + + + A+D+ + A E L D V ++ N+
Sbjct: 882 VVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVGAENSYNLFTVR 938
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
+ + +L E+H+G V +F ++ P SD ++FGT
Sbjct: 939 KNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDIGQIPTVIFGT 991
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
++G IG IA L + L+ LQ LV + V L+ +R FH++ K + +D
Sbjct: 992 INGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTAE--ARNFLD 1049
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+L+ + L + E++ G ++ + +L
Sbjct: 1050 GDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVEEL 1085
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 151/367 (41%), Gaps = 83/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F+HG +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLHGCAKPTIVVLYQ------DNKDVRHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS N+ + A L+ VP+P+GGV+++G I Y + +++
Sbjct: 197 LK-DKDFVEGP--WSQNNVDNGAGLLIPVPAPLGGVIIIGEEQIVYCNANSTFK------ 247
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
++ Q + R+ V+ D + LL TG L LL + ++ V L +
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSRY--------LLGDNTGILHLLVLTHERERVTGLKIEY 297
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N + ++GSR GDS LV+ +++ADA
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADASG-- 337
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
S + L+ VN + + + + + T S A +D G L+ G+
Sbjct: 338 ----SFVEILERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ KSS +D + YL++S
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------INDPFDMYLVVSF 422
Query: 584 EARTMVL 590
+ T L
Sbjct: 423 ISETRFL 429
>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
Length = 1145
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 146/311 (46%), Gaps = 25/311 (8%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1144
++ SE A+++ L N +++ +GTA + ++ ++ GR+++F + ++ P+
Sbjct: 805 LEGSEMAMSLASCQLGN----DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPER 857
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1204
+ VY KE+KGA ++ S+ G L++A + L +WT + + D + + L
Sbjct: 858 MRL-VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKT 916
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
+ IL+GD+ +S+ LS+K + +A+DF + A E + S L + +
Sbjct: 917 KNDLILVGDLMRSLSLLSYKSVESTFEKVARDFMTNWMSACEIIDSDSFLG-----AENS 971
Query: 1265 IQIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1322
+F +K G +L F++G V F + AT D AP
Sbjct: 972 YNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYH 1024
Query: 1323 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
++L+GT DG IG I + + + L +QK+L D + ++ +R F + ++
Sbjct: 1025 SSILYGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHTQYRTFETEKRSEV- 1083
Query: 1383 GPDSIVDCELL 1393
P+ +D +L+
Sbjct: 1084 -PNGFIDGDLI 1093
Score = 47.0 bits (110), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A ++ VP P GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
+ +D +L D+A G+L +L + + G+ V+ + +
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
+ + + + N + F+GSR GDS L++ + GT +S L + + ++ AP
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362
Query: 465 KRLRRSSSDALQDM----VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
++DM NG++ L S + + + + + L +
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
VEL G K ++T+ + D E+ YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+S ++ T VL E T+ + V G T+ AG LF ++QV ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483
>gi|226510488|ref|NP_001145925.1| uncharacterized protein LOC100279448 [Zea mays]
gi|219884971|gb|ACL52860.1| unknown [Zea mays]
Length = 416
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/407 (23%), Positives = 174/407 (42%), Gaps = 41/407 (10%)
Query: 1028 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATI 1084
PLN+ + + HQ + L+ +VEE E +R+L+ +++
Sbjct: 38 PLNEQA-----RRICHQEQSRTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVY 88
Query: 1085 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1143
P+ E ++ + ++ +GTAYV E+ +GR+L+F+ D
Sbjct: 89 PLDQYECGCSIISCSF----ADDSNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSL 141
Query: 1144 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLY 1198
L+ E KE KGA+ +L + G LL A KI L+KW EL + L
Sbjct: 142 QLIVE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILA 198
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+ + +FI++GD+ KSI L +K + + + A+D+ + A E L D V
Sbjct: 199 LYT-QTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVG 254
Query: 1259 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
++ N+ + + +L E+H+G V +F ++ P SD
Sbjct: 255 AENSYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSD 307
Query: 1319 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
++FGT++G IG IA L + L+ LQ LV + V L+ +R FH++ K
Sbjct: 308 IGQIPTVIFGTINGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKK 367
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ +D +L+ + L + E++ G ++ + +L
Sbjct: 368 T--AEARNFLDGDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVEEL 412
>gi|402913617|ref|XP_003919276.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Papio anubis]
Length = 132
Score = 88.6 bits (218), Expect = 2e-14, Method: Composition-based stats.
Identities = 48/121 (39%), Positives = 72/121 (59%), Gaps = 12/121 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 18 LELAASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 73
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
FE PE L+ G P V+VDP GRC +LVYG ++++L ++ GLVG
Sbjct: 74 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVG 130
Query: 216 D 216
+
Sbjct: 131 E 131
>gi|396082420|gb|AFN84029.1| pre-mRNA cleavage and polyadenylation [Encephalitozoon romaleae
SJ-2008]
Length = 1156
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 110/480 (22%), Positives = 208/480 (43%), Gaps = 61/480 (12%)
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP---VQKVIPLKATPHQITYFAEKNL 1016
L++ N + S+G L +C++PS + V + IP+ P I Y +
Sbjct: 725 LNSATVNKNQLIQLSRGHLMVCKVPSVRDEQYVFGDGLVGRKIPILRIPKHIEY---ADR 781
Query: 1017 YPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1076
Y ++ S D E + + V+ +R Y V+ Y R
Sbjct: 782 YMVVASCK-------------DVEFSSKDEKDCGIPVNTYRFY-VDLYSER--------- 818
Query: 1077 PWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRV---LL 1132
++ +T + +E V+ + L + ++ L + T +++GED ARGR+ +
Sbjct: 819 -YEHISTYELDENEYIFDVKYLVLDDMQGNYGKSPFLLVCTTFIEGEDRPARGRLHVLEI 877
Query: 1133 FSTGRNADNP-QNLVTEVYSKE-LKGAISALASLQGHLLIASGPKIILHKWT-GTELNGI 1189
S + ++P ++ +V E KG+I + ++G + + G KI+++K T + I
Sbjct: 878 ISVVPSLESPFRDCKLKVLGIEKTKGSIVQCSEVRGKIALCLGTKIMIYKIDRSTGIIPI 937
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
FYD ++ S++++KN+IL DI++ + F ++ + +L+L++ + +TE L
Sbjct: 938 GFYDLH-IFTSSISVMKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNVTSTELLT 996
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
G+ LS+V D + I + Y+P S G KL+ R+E L L++S
Sbjct: 997 AGNELSMVCCDAKGTIHAYTYSPNNIISMDGAKLVKRSEMKTN--------LGRLSSSGI 1048
Query: 1310 --RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1367
R + KTN L G +D+ + +L +Q ++ + V GLN
Sbjct: 1049 GFRKNSIMFYSKTNLLIYLVG-----------MDDSYYLKLLKIQTSIMVHLKSVLGLNQ 1097
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
R + +S+ H S + +L+ + L Q I+ +R +IL L L +
Sbjct: 1098 RDY--LNSDIHLHSLSLKSPIIMHILNLFSYFDLNTQKLISTSVKMSRREILDVLASLNI 1155
>gi|413948669|gb|AFW81318.1| hypothetical protein ZEAMMB73_456332 [Zea mays]
Length = 674
Score = 88.2 bits (217), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 158/363 (43%), Gaps = 45/363 (12%)
Query: 1028 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATI 1084
PLN+ + + HQ + L+ +VEE E +R+L+ +++
Sbjct: 296 PLNEQA-----RRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVY 346
Query: 1085 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1143
P+ E ++ + + +N +GTAYV E+ +GR+L+F+ D
Sbjct: 347 PLDQYECGCSIISCSFVD----DNNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSL 399
Query: 1144 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLY 1198
L+ E KE KGA+ +L + G LL A KI L+KW EL + L
Sbjct: 400 QLIVE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILA 456
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+ + +FI++GD+ KSI L +K + + + A+D+ + A E L D V
Sbjct: 457 LYT-QTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVG 512
Query: 1259 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGS 1317
++ N+ + + KL E+H+G V +F ++ D G P
Sbjct: 513 AENGYNLFTVRKNSDAATDDERAKLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGKIP-- 570
Query: 1318 DKTNRFALLFGTLDGSIGCIA--PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
++FGT++G IG IA P D TF L+ Q LV + V ++ +R FH+
Sbjct: 571 ------TVIFGTINGVIGIIASLPHDHYTF--LEKFQSTLVKYIKGVGNMSHEQWRSFHN 622
Query: 1376 NGK 1378
+ K
Sbjct: 623 DKK 625
>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
Length = 655
Score = 87.8 bits (216), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 149/314 (47%), Gaps = 31/314 (9%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1144
++ SE A+++ L N +++ +GTA + ++ ++ GR+++F + ++ P+
Sbjct: 306 LEGSEMAMSLASCQLGN----DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPER 358
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVS 1201
+ VY KE+KGA ++ S+ G L++A + L +WT + L F + LY+ +
Sbjct: 359 MRL-VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKT 417
Query: 1202 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1261
N + IL+GD+ +S+ LS+K + +A+DF + A E + S L
Sbjct: 418 KN---DLILVGDLMRSLSLLSYKSVESTFEKVARDFMTNWMSACEIIDSDSFLG-----A 469
Query: 1262 QKNIQIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1319
+ + +F +K G +L F++G V F + AT D AP
Sbjct: 470 ENSYNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP---- 522
Query: 1320 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
++L+GT DG IG I + + + L +QK+L D + ++ +R F + ++
Sbjct: 523 LYHSSILYGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHTQYRTFETEKRS 582
Query: 1380 HRPGPDSIVDCELL 1393
P+ +D +L+
Sbjct: 583 EV--PNGFIDGDLI 594
>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
Length = 1095
Score = 87.8 bits (216), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1140
+T P+ E ++ L + + ++ IGTAYV E+ +GR+L+F D
Sbjct: 764 STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 816
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1195
L+ E KE KGA+ +L + G LL A KI L+KW E G
Sbjct: 817 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 873
Query: 1196 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
LYV + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 874 HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 930
Query: 1253 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
L + N +F K SE + +L E+H+G V +F ++
Sbjct: 931 YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 981
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
P SD ++FGT++G IG IA L + L+ LQ L + V GL+
Sbjct: 982 ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1037
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R F++ K + +D +L+ + L EI+ +++ + +L
Sbjct: 1038 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1091
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 113/502 (22%), Positives = 193/502 (38%), Gaps = 123/502 (24%)
Query: 95 GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
G+ L+ + ++G + +L + G +D + +A E K VL++D +
Sbjct: 49 GLQCICLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDTEASEV 104
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGL 213
+M + GR + G + +DP R G+ +Y GL +I ++G
Sbjct: 105 ITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK- 156
Query: 214 VGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
F+ R+E V++++ F++G +P +V+L++
Sbjct: 157 ----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ------DN 189
Query: 274 VSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
+H + +LK I W+ NL + A L+ VP P+ GVL++G TI
Sbjct: 190 KDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIGEETIV 243
Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
Y S SA A+ + + R+ V+ D + LL G L LL
Sbjct: 244 YCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGLLHLLV 286
Query: 391 VVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
+ ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 287 ITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL----------- 335
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDS 508
P TK S + L+ VN + + + + T S A +D
Sbjct: 336 ----------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD- 381
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
G L+ G+ IN AS VEL G KG+W++
Sbjct: 382 ----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--------------R 411
Query: 569 AAYDDEYHAYLIISLEARTMVL 590
+A DD Y +L++S + T VL
Sbjct: 412 SATDDPYDTFLVVSFISETRVL 433
>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
pigmentation protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
Length = 1090
Score = 87.8 bits (216), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1140
+T P+ E ++ L + + ++ IGTAYV E+ +GR+L+F D
Sbjct: 759 STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 811
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1195
L+ E KE KGA+ +L + G LL A KI L+KW E G
Sbjct: 812 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 868
Query: 1196 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
LYV + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 869 HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 925
Query: 1253 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
L + N +F K SE + +L E+H+G V +F ++
Sbjct: 926 YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 976
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
P SD ++FGT++G IG IA L + L+ LQ L + V GL+
Sbjct: 977 ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1032
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R F++ K + +D +L+ + L EI+ +++ + +L
Sbjct: 1033 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1086
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 113/507 (22%), Positives = 196/507 (38%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
+ +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EASEVITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK I W+ NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S SA A+ + + R+ V+ D + LL G
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L LL + ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL------ 330
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
P TK S + L+ VN + + + + T S
Sbjct: 331 ---------------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
+A DD Y +L++S + T VL
Sbjct: 406 ----RSATDDPYDTFLVVSFISETRVL 428
>gi|301093655|ref|XP_002997673.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110063|gb|EEY68115.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 176
Score = 87.8 bits (216), Expect = 4e-14, Method: Composition-based stats.
Identities = 56/175 (32%), Positives = 89/175 (50%), Gaps = 18/175 (10%)
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFA 1324
+AP+ ES GQ+LL ++FH+G V+ R ++ A+ S + AAP S+ N
Sbjct: 3 FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN--- 59
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
+ GT +G +G + P+ E FRRL +LQ +V+++P LNPR FR +N + P
Sbjct: 60 -VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRP 118
Query: 1385 DS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
D+ +D +L + L Q E+A GTT ++ NL ++ TS
Sbjct: 119 DAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATS 173
>gi|301124072|ref|XP_002909688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107255|gb|EEY65307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 176
Score = 87.4 bits (215), Expect = 5e-14, Method: Composition-based stats.
Identities = 56/175 (32%), Positives = 89/175 (50%), Gaps = 18/175 (10%)
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFA 1324
+AP+ ES GQ+LL ++FH+G V+ R ++ A+ S + AAP S+ N
Sbjct: 3 FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN--- 59
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
+ GT +G +G + P+ E FRRL +LQ +V+++P LNPR FR +N + P
Sbjct: 60 -VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRP 118
Query: 1385 DS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
D+ +D +L + L Q E+A GTT ++ NL ++ TS
Sbjct: 119 DAWSKKKWKKSFLDAFVLFRFLQLNYVAQKELARCIGTTPEVVMHNLLEVQHATS 173
>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
Length = 1088
Score = 87.4 bits (215), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 104/403 (25%), Positives = 176/403 (43%), Gaps = 48/403 (11%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ + S + EE E +R+L+ ++ +T P+ + E ++
Sbjct: 716 RRICHQEQSRTFSICSMKNQSNAEESEMHFIRLLDDQ----TFEFISTYPLDTFEYGCSI 771
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + + ++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KET 821
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD+ KSI L +K + + A+D+ + A E L D L + N +F
Sbjct: 881 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT 935
Query: 1270 YAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1326
K SE + +L E+H+G V +F ++ P SD ++
Sbjct: 936 -VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVI 987
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
FGT++G IG IA L + L+ LQ L + V GL+ +R F++ K +
Sbjct: 988 FGTVNGVIGVIASLPHEQYLFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKT--VDAKN 1045
Query: 1387 IVDCEL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
+D +L LS M + + +EI+ + R + L+ L+
Sbjct: 1046 FLDGDLIESFLDLSRSRMDEISKAMEISVEELCKRVEELTRLH 1088
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 109/507 (21%), Positives = 193/507 (38%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++ ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEINLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+HG +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLHGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + LK I WS NL + A L+ VP P GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S + A+ + + ++ V+ D + LL G
Sbjct: 234 EETIVYCSANVFRAIPIR---------PSITKAYGRVDADGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L LL + ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA T + L VN + + + + T S
Sbjct: 329 ----------NLHPDAKGT------YVEVLDRYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
+ DD + +L++S + T +L
Sbjct: 406 ----RSLTDDPFDTFLVVSFISETRIL 428
>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
Length = 1275
Score = 87.0 bits (214), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 97/391 (24%), Positives = 175/391 (44%), Gaps = 52/391 (13%)
Query: 1062 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-- 1119
EE+ VR+ + ++T A P++ +EN ++ + +++ +GTA+
Sbjct: 892 EEFFVRLFD----NKTFETLAKYPLEPNENDASIISCSF----DGDDDIYFVVGTAFADP 943
Query: 1120 QGEDVAARGRVLLF-------STGRNA-----DNP-----------QNLVTEVYSKELKG 1156
E ++RGR+L+F S G NA D+ Q +T V KE +G
Sbjct: 944 HSEPESSRGRILVFKVSNTSSSGGGNAVVNGNDHGDGRASASSSVLQKSLTLVCEKETRG 1003
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAF-YDAPPL-YVVSLNI--VKNFILLG 1212
A+ L + G LL + L W ++ N ++ + ++++L + N I++G
Sbjct: 1004 AVYNLNAFCGKLLAGINSLVKLFNWGVSKENKRELVHECSHMGHIIALKVETKDNLIVVG 1063
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ KSI L ++ + ++ +A DF S A E L D + L S +Q A
Sbjct: 1064 DLMKSITLLQYQRESGRIEEVAHDFSSNWMTAVEILDDNTYLGAESSYNLFTVQ--RNAD 1121
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LLFG 1328
+E +G L A FH+G V +F R ++ D SD T+ + LFG
Sbjct: 1122 ADTEDKRGTLELCGA-FHLGDSVNRFRRGSLVMRMPDL------SDDTSSLSEISTWLFG 1174
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G +G +A L + F L +Q+ + V V + FR FH+ ++ + +
Sbjct: 1175 TISGGLGVVATLPKRDFMLLNKVQEAMQKVVTGVGNFSHSDFRSFHNVQRSVE--MRNFI 1232
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1419
D +L+ + L E+Q+ ++ +G + S+ L
Sbjct: 1233 DGDLVEIFLDLSKEDQVAVSELSGVSNSEDL 1263
>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
Length = 1092
Score = 87.0 bits (214), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 156/365 (42%), Gaps = 31/365 (8%)
Query: 1061 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-V 1119
VEE +++ + ++ T +Q EN +V + + T +GTA V
Sbjct: 744 VEEQFIKLFDDQ----TFEILDTYQLQEFENTCSVECASFSDDPT----LYYIVGTATAV 795
Query: 1120 QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1179
E GR+L+F D +L SKE+KGA + G LL KI L
Sbjct: 796 PQESEPKEGRLLVFEV---IDRKLHLKA---SKEIKGAPYQIKPFNGKLLAGINSKIELF 849
Query: 1180 KWTGTELNGIAFYDA----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1235
+ + ++ + + V+ L +FI+ GD+ +SI L++K+ Q+ +A+
Sbjct: 850 RLSDSDTGHMELVSECCHRGHILVLYLQTRGDFIVAGDLMRSISLLTYKQVDGQIEEIAR 909
Query: 1236 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1295
DF + A + L D + L ++ N+ + + +L E+H+G V
Sbjct: 910 DFNANWMTAVDILDDDTFLG---AEGYFNLFTVRKNTDATSDEERARLEVVGEYHLGDMV 966
Query: 1296 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1355
+F R ++ SSD P +D ++FGT++G IG IA L + + L +Q L
Sbjct: 967 NRFQRGSLVLRSSD----TPTTD-----TIIFGTVNGMIGVIAVLSKEEYEFLLKVQDAL 1017
Query: 1356 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1415
+ V GL +R F + P +D +L+ + L E+ E+ H G+
Sbjct: 1018 NFVIKGVGGLRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRREKMEEVCHAIGSIT 1077
Query: 1416 SQILS 1420
+ LS
Sbjct: 1078 VEELS 1082
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 128/601 (21%), Positives = 236/601 (39%), Gaps = 127/601 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+++ ++ L+ V ++G + ++ + + GA+ R+S+ + E K ++E+D
Sbjct: 37 RLVIYTLTPEGLQPVLDTGIYGRIAAIELYTVAGAE----RESLYILTERLKFCIVEYDS 92
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY-GLQMIIL 204
S L +M + +S R GP+ +DP+ R G L+Y GL +I
Sbjct: 93 STGELITKAMGDVQ-----------DSVGRPVDGGPIAHIDPERRMIGFLLYDGLFKVIP 141
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
++ G F+ R+E V++++ F++GY +P +V+L
Sbjct: 142 IDTRNGQ----------LREAFNIRLEELQVLDVQ-----------FLYGYAQPTIVLL- 179
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGV 321
++ M + +++ I WS + A ++ VP+PIGG
Sbjct: 180 -----------YQDPKEMRHLKTYQVSIRDKDFIAGPWSQTGVEIGATMIIPVPTPIGGC 228
Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
+++G TI Y + + + +D + + R+ ++ D LL
Sbjct: 229 ILLGEQTISYLNGDKG-----DTKTIHMDMT--VIRAWGKIDEDGRR--------YLLGD 273
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
G L +L + +DG V L L + IT + + + F+GS GDS L++
Sbjct: 274 HLGQLYVLVLEFDGNKVLGLKLDTLGETSSAKTITYLDSGVVFIGSCFGDSQLIRL---- 329
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
K S+ + L+ N + + + +
Sbjct: 330 --------------------HPDKDENDSNIEVLESFTNLGPIQDFCVVDLERQGQGQVV 369
Query: 502 SFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH 561
+ + G LKD S LR+ + GI++Q+ VELPG KG+W++
Sbjct: 370 TCS--------GTLKDGS--LRVVRN--GIGINEQAA---VELPGIKGLWSLRE------ 408
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
+ D +Y YLI S T VLE AD TE + +TI N+ G
Sbjct: 409 --------SIDAQYDKYLIQSFVNETRVLEIADEELSETEIDGFDHNAQTIFCSNVLG-D 459
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSE--SGSGSENSTVLSVSIADPYVLLGMSDGS 679
++Q+ E R++ + P N E + +G V+ S + L +S+G
Sbjct: 460 CLLQITEVSLRLVSTKSKQLLKEWFPPNGERITVAGGNVQQVVLTSGKRTLIYLDVSNGD 519
Query: 680 I 680
+
Sbjct: 520 V 520
>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1138
Score = 86.7 bits (213), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 138/304 (45%), Gaps = 26/304 (8%)
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR++++ A +T+V KE+KGA +L G +
Sbjct: 826 TYYIVGTAMVNPEEREPKVGRIIIYHYADGA------LTQVSEKEIKGACYSLVEFNGRV 879
Query: 1169 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
L + L++WT + L F + LY + +FIL+GD+ +SI L +K+
Sbjct: 880 LATINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+A+D+ A E L D + L ++ N+ + + + Q++
Sbjct: 937 MEGSFEEIARDYQPKWMTAVEILDDDAFLG---AENSNNLFVCLKDSAATTDDERQQMPE 993
Query: 1286 RAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
A+FH+G V F ++ + +RT G +LFGT+ G+IG + +
Sbjct: 994 VAQFHLGDMVNVFRHGSLVMQNIGERTTPTSG-------CVLFGTVSGAIGLVTQIPPDY 1046
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
+ L+ LQ+ L +++ V ++ +R FH+ K + +D +L+ + L E+
Sbjct: 1047 YEFLRKLQENLTNTIKSVGRIDHTYWRSFHTEMKTE--NSEGFIDGDLVESFLDLTREKM 1104
Query: 1405 LEIA 1408
E A
Sbjct: 1105 HEAA 1108
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 105/473 (22%), Positives = 175/473 (36%), Gaps = 129/473 (27%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEMHVQDVEFLYGTAHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYA- 262
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+D+ + LL G L ++ + + +L +
Sbjct: 263 ---------------RVDSRGFRY------LLGNMIGHLFMMFLETEENTRGQLTVKDIK 301
Query: 408 PSVL-----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
+L IT + N + F+GSR GDS LV+ + S + E F ++ AP
Sbjct: 302 VELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAAASGAYVTVMETFTNL---AP 358
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
L+ G+ ++ GS G L+ G+
Sbjct: 359 IIDMCIVD----LERQGQGQMITCSGSYKE--------------------GSLRIIRNGI 394
Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
I A ++LPG KG+W + R+ D Y L++S
Sbjct: 395 GIQEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLS 429
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
T +L + E TE + +T N+ FG ++IQV AR++
Sbjct: 430 FVGHTRILMLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPMTARLI 480
>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
Length = 1138
Score = 86.3 bits (212), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/304 (25%), Positives = 141/304 (46%), Gaps = 26/304 (8%)
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR++++ AD NL T+V KE+KG+ +L G +
Sbjct: 826 TYYIVGTALVNPEEPEPKVGRIIIY---HYADG--NL-TQVSEKEIKGSCYSLVEFNGRV 879
Query: 1169 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
L + + L++WT + L F + LY + +FIL+GD+ +SI L +K+
Sbjct: 880 LASINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+A+D+ A E L D + L +D N+ + + + Q++
Sbjct: 937 MEGSFEEIARDYQPNWMTAVEILDDDAFLG---ADNSNNLFVCLKDGAATTDDERQQMPE 993
Query: 1286 RAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
A+ H+G V F ++ + +RT G +LFGT+ G+IG + +
Sbjct: 994 VAQVHLGDMVNVFRHGSLVMENIGERTTPTSG-------CVLFGTVSGAIGLVTQIPADY 1046
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
+ L+ LQ+ L D++ V ++ +R FH+ K R + +D +L+ + L E+
Sbjct: 1047 YEFLRKLQENLTDTIKSVGKIDHAYWRSFHTEMKTER--CEGFIDGDLVESFLDLSREKM 1104
Query: 1405 LEIA 1408
E A
Sbjct: 1105 HEAA 1108
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+ S L +N LLS K + LL + T
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
P IT + N + F+GSR GDS LV+ +G + + E F ++ AP
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L+ G+ ++ GS G L+ G+ I
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
A ++LPG KG+W + R+ D Y L++S T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
+L + E TE + +T N+ FG ++IQV AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480
>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
Length = 655
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 140/293 (47%), Gaps = 27/293 (9%)
Query: 1107 ENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
+++ +GTA + ++ ++ GR+++F + ++ P+ + VY KE+KGA ++ S+
Sbjct: 331 DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPERMRL-VYEKEIKGAAYSIQSMD 386
Query: 1166 GHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
G L++A + L +WT + L F + LY+ + N + IL+GD+ +S+ LS
Sbjct: 387 GKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKTKN---DLILVGDLMRSLSLLS 443
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK--G 1280
+K + +A+DF + A E + + L + + +F +K G
Sbjct: 444 YKSMESTFEKVARDFMTNWMSACEIIDSDNFLG-----AENSYNLFTVMKDSFTVFKEEG 498
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
+L F++G V F + AT D AP ++L+GT DG IG I +
Sbjct: 499 TRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYHSSILYGTSDGGIGVIVQM 551
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+ + LQ +QK+L + + ++ +R F + ++ P+ +D +L+
Sbjct: 552 PPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSE--APNGFIDGDLI 602
>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
Length = 1068
Score = 85.9 bits (211), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 139/306 (45%), Gaps = 33/306 (10%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1140
+T P+ + E ++ L + + ++ +GTAYV E+ +GR+L+F D
Sbjct: 738 STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 790
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1195
L+ E KE KGA+ +L + G LL A KI L+KW GT EL + +
Sbjct: 791 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 847
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
L + + +FI++GD+ KSI L +K + + A+D+ + A E L D L
Sbjct: 848 IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 906
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1312
+ N IF K SE + +L E+H+G V +F ++
Sbjct: 907 -----AENNFNIFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 954
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
P SD ++FGT++G IG IA L + L+ LQ L + V GL+ +R
Sbjct: 955 -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRS 1013
Query: 1373 FHSNGK 1378
F++ K
Sbjct: 1014 FNNEKK 1019
Score = 57.4 bits (137), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 88/391 (22%), Positives = 153/391 (39%), Gaps = 84/391 (21%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMV 261
+I +A S +G G F + + N+R L+ V D F++G +P +V
Sbjct: 99 VITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIV 157
Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
+L++ +H AL + P W+ NL + A L+ VP P+ GV
Sbjct: 158 VLYQ------DNKDARHVKTYEVALK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGV 208
Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
L++G TI Y S SA A+ + + ++ V+ D + LL
Sbjct: 209 LIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDADGSR--------YLLGD 251
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
G L LL + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 252 HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKI---- 307
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK-- 499
++ DA + S + L+ VN + + + +
Sbjct: 308 --------------HLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 347
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSR 559
T S A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 348 TCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL------ 384
Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 385 --------RSSTDDPHDTFLVVSFISETRIL 407
>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
Length = 1089
Score = 85.9 bits (211), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 139/306 (45%), Gaps = 33/306 (10%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1140
+T P+ + E ++ L + + ++ +GTAYV E+ +GR+L+F D
Sbjct: 759 STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 811
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1195
L+ E KE KGA+ +L + G LL A KI L+KW GT EL + +
Sbjct: 812 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 868
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
L + + +FI++GD+ KSI L +K + + A+D+ + A E L D L
Sbjct: 869 IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 927
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1312
+ N IF K SE + +L E+H+G V +F ++
Sbjct: 928 -----AENNFNIFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 975
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
P SD ++FGT++G IG IA L + L+ LQ L + V GL+ +R
Sbjct: 976 -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRS 1034
Query: 1373 FHSNGK 1378
F++ K
Sbjct: 1035 FNNEKK 1040
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P W+ NL + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL G L LL + ++ V L +
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ ++ DA
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421
Query: 584 EARTMVL 590
+ T +L
Sbjct: 422 ISETRIL 428
>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
Length = 1068
Score = 85.9 bits (211), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE KGA+ L +
Sbjct: 757 SDDNNVYYCVGTAYVLPEENEPTKGRILVFAV---EDGKLQLIAE---KETKGAVYCLNA 810
Query: 1164 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 811 FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 869
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + A+D+ + A E + D L +N + K SE
Sbjct: 870 SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 923
Query: 1279 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 924 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 976
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 977 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1019
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 85/367 (23%), Positives = 148/367 (40%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L L + P WS NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 176 LKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL TG L LL + ++ V L +
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLLSLLVITHEKEKVTGLKIEP 275
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ +++ DA
Sbjct: 276 LGETSIASTISYLDNAFVYIGSSYGDSQLIKL------------------NLQPDA---- 313
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 314 --KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400
Query: 584 EARTMVL 590
+ T +L
Sbjct: 401 ISETRIL 407
>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
Length = 1171
Score = 85.9 bits (211), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+L+FS + LV E KE KGA+ L +
Sbjct: 860 SDDNNVYYCVGTAYVLPEENEPTKGRILVFSV---EEGKLQLVAE---KETKGAVYCLNA 913
Query: 1164 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 914 FNGKLLAAINQKIQLYKWVLREDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 972
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + A+D+ + A E L D L +N + K SE
Sbjct: 973 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDVYLG------AENSFNLFTVRKNSEGA 1026
Query: 1279 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ +L E+H+G + +F ++ P SD ++FGT++G IG
Sbjct: 1027 TDEERGRLEVAGEYHLGEFINRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 1079
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 1080 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1122
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/352 (22%), Positives = 145/352 (41%), Gaps = 66/352 (18%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + L+G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQSILDVPLYGRIATLELFRPHG----ETQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L SM + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELVTRSMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + LK + WS +L + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVALKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S + A+ + + ++ V+ D + LL TG
Sbjct: 234 EETIVYCSANGFKAIPIR---------AAITKAYGRVDPDGSRY--------LLGDHTGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L LL + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 277 LSLLVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL 328
>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
Length = 1149
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 167/401 (41%), Gaps = 73/401 (18%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV--------EEYEVRILEPDRAGGPWQTRATIPMQSSENAL 1093
G +++ HNL +D H T+ V +EY + I+ +R GG
Sbjct: 791 GQEVEVHNLLIIDQH-TFEVLHAHQLMQQEYAMSIISTNRLGGDM--------------- 834
Query: 1094 TVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSK 1152
NE + +GTA V E+ + GR+L+F N +T+V K
Sbjct: 835 --------------NEYYI-VGTATVNPEESEPKQGRILIFQWNDNK------LTQVSEK 873
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
E+KGA +LA G LL + + L +WT + + + + L +FILLG
Sbjct: 874 EIKGACYSLAEFNGKLLASINSTVRLFEWTVEKELRLECSHFNNILTLFLKTKGDFILLG 933
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ A E L D L ++ NI +
Sbjct: 934 DLMRSMTLLQYKTMEGSFEEIARDYNPNWMTAVEILDDDIFLG---AENSFNIFVCQKDS 990
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLA-TSSDRTGAAPGSDKTNRFALLF 1327
+ + ++ FHVG + F L +Q L TS+ TG +LF
Sbjct: 991 AATTDEERSQMHEVGRFHVGDMINVFRHGSLVMQNLGETSTPTTG-----------CVLF 1039
Query: 1328 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
GT+ G+IG + + + + L LQ KL + V ++ +R F+++ K +
Sbjct: 1040 GTVSGAIGLVTQITQDFYDFLLELQNKLSTVIKSVGKIDHSQWRAFNTDIKTEP--SEGF 1097
Query: 1388 VDCEL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+D +L LSH +M + + L+I + G + + +L
Sbjct: 1098 IDGDLIESFLDLSHDKMKEVADGLQITGEGGMKQDCTVDDL 1138
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 104/469 (22%), Positives = 175/469 (37%), Gaps = 110/469 (23%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G L +DP+ R G+ +Y I+ + S L N+R
Sbjct: 119 GILAVIDPKARVIGLRLYDGLFKIIPLEKDNSELKAS--------------------NIR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG P ++++H+ V+ +H + IS K+ +
Sbjct: 159 -IDELQVHDVEFLHGCANPTLILIHQD-------VNGRH----VKTHEISLREKEFVKVP 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VPSP+GG +++G I YH +A + +S
Sbjct: 207 WRQDNVETEASMIIPVPSPLGGAIIIGQENILYHDGITPVVVA----------PAVIKQS 256
Query: 359 SFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVL 411
+ ++D +L D+A G L +L + D R VV+ L +
Sbjct: 257 TIVCYAKVDPGGLRYLLGDMA------GHLFMLFLEVDNRGDGNDVVKDLKVELLGEIAT 310
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
IT + N + F+GSRLGDS LV+ T S + E F ++ AP L
Sbjct: 311 PECITYLDNGVLFIGSRLGDSQLVKLTTKPNESGSYVTVMESFTNL---API---LDMCV 364
Query: 472 SDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
D L+ G+ ++ G+ G L+ G+ I AS
Sbjct: 365 VD-LERQGQGQLVTCSGAFKE--------------------GSLRIIRNGIGIQEHAS-- 401
Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
++LPG KG+W + A D Y L+++ +T VL
Sbjct: 402 ----------IDLPGIKGMWAL--------------QVASDGRYDNTLVLAFVGQTRVLS 437
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
E T+ + +T GN+ +++Q+ AR++ T
Sbjct: 438 LNGEEVEETDIAGFASDQQTFFCGNVI-HEQIVQITPISARLISAQNKT 485
>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1120
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/394 (23%), Positives = 175/394 (44%), Gaps = 31/394 (7%)
Query: 1039 QEVGHQIDNHNLS-SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ + HQ+D + + +V+ + +E +R+++ G + T ++ E A ++
Sbjct: 746 RRIAHQVDTNTFAVAVEHLMSKGDQELFIRLIDD----GSFDTLHQFRLEEHELASSLMS 801
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1157
+ F ++E ++ G AY Q ED +RGR+L+ +A LV+E KE++GA
Sbjct: 802 CS-FAGDSREY-YVVGTGFAYEQ-EDEPSRGRILVLRVEADA---LELVSE---KEVRGA 852
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA----PPLYVVSLNIVKNFILLGD 1213
+ L + +G LL K+ L KWT E + + S+ ++IL+GD
Sbjct: 853 VYNLNAFKGKLLAGINSKLELFKWTPREDDAHELVSECSHHGQIITFSVKTRGDWILVGD 912
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ KS+ L +K + ++ +A+DF + A L D T + ++ +F A
Sbjct: 913 LLKSMSLLQYKPEEGAIDEIARDFNANWMTAVAMLDDDETY----LGAENSLNLFTVARN 968
Query: 1274 MSESWKGQ--KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1331
M+ + +L E+H+G V F ++ + D D LLFGT +
Sbjct: 969 MNAMTDEERSRLEITGEYHLGEFVNVFSPGSLVMSLKD-------GDSLEVPTLLFGTGN 1021
Query: 1332 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
G IG +A L + + + LQ + + V GL +R F + + VD +
Sbjct: 1022 GVIGVLASLPKDAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRHTLRRKSDPSRNFVDGD 1081
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
L+ + L +E+ +A R++I+ + +L
Sbjct: 1082 LVESFLDLKVEQADVVAADMKCDRAEIIRRVEEL 1115
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 127/549 (23%), Positives = 224/549 (40%), Gaps = 113/549 (20%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
+ A L+ V ++G + ++++ G D R + L E +VL +D++ L+
Sbjct: 58 LHAEGLKPVLDVPINGRIATMSLCQTGSGDGKAR---LYLTTERYGFTVLSYDEANEELK 114
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLV 214
+ + GR + G + VD R G+ +Y GL +I +GG
Sbjct: 115 TEAFGDVQD------NIGRPA-DDGQIGIVDDTCRAIGLRLYDGLFKVIPCDEKGG---- 163
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
++ + I L +L V+D F+HG +P + +L+ R+ A V
Sbjct: 164 ---------------VKEAFNIRLEEL---RVEDIKFLHGTPKPTIAVLY-RDTKDA--V 202
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
K + I ++ P W+ +L + K++ VP+PIGGV+V+G I Y
Sbjct: 203 HIKTYEIGIREKEFVSS----P--WAQNDLEGGSNKIIPVPAPIGGVVVLGQEIIVY--- 253
Query: 335 SASCALALNNYAVSLD---SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
LN + D + +P + A LL G L LL +
Sbjct: 254 -------LNKFEDDADVFLKAINIPNIPDRTNITCYGAIDPDGSRYLLGDADGMLYLLVI 306
Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
++DG+ V+ L + + + + S ++ + N + F+GS GDS L++ L
Sbjct: 307 LHDGKRVRELKIERLGDTSIASTLSYLDNGVVFVGSTYGDSQLIK-------------LH 353
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGE--ELSLYGSASNNTESAQKTFSFAVRDSL 509
E I+ D T L +V+ +L +G T S
Sbjct: 354 AEKTSIDKDGNPTYVQILEEFTNLGPIVDFAFVDLERHGQGQVVTCS------------- 400
Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMA 569
G LKD S LR+ + GI +Q+ +++LPG KG++++ ++D S+M
Sbjct: 401 ---GALKDGS--LRVVRN--GIGIDEQA---VIQLPGVKGLFSL-------RDSDDSQM- 442
Query: 570 AYDDEYHAYLIISLEARTMVL----ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
YL+++ T +L + D L E TE + + +T+ GN+ G +Q
Sbjct: 443 ------DKYLVVTFINETRILGFVGDEGDTLDE-TEIAGFDAEAQTLCCGNMQG-NVFLQ 494
Query: 626 VFERGARIL 634
V RG R++
Sbjct: 495 VTHRGVRLV 503
>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
Length = 1088
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/397 (24%), Positives = 170/397 (42%), Gaps = 38/397 (9%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ + L + EE E VR+L+ ++ +T P+ + E ++
Sbjct: 716 RRICHQEQTRTFAICCLRNQPSAEESEMHFVRLLDAQ----SFEFLSTYPLDAFEYGCSI 771
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD+ KSI L +K + + A+D+ + A E L D L +D N+
Sbjct: 881 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMAAVEILDDDIYLG---ADNCFNLFTVK 937
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1328
+ + + ++ E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G IG IA L + + L+ LQ L + V GL+ +R F N + S +
Sbjct: 990 TVSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKSYL 1047
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
D +L+ + L + EI+ ++ + +L
Sbjct: 1048 DGDLIESFLDLSRGKMEEISKGMDVQVEELCKRVEEL 1084
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 112/513 (21%), Positives = 203/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATLELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK+ + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKEKDFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
+G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHSGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDATG------SYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
Length = 1089
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE KGA+ L +
Sbjct: 778 SDDNNVYYCVGTAYVLPEENEPTKGRILVFAV---EDGKLQLIAE---KETKGAVYCLNA 831
Query: 1164 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 832 FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 890
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + A+D+ + A E + D L +N + K SE
Sbjct: 891 SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 944
Query: 1279 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 945 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 997
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 998 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1040
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 110/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETAELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL L + P WS NL + A L+ VP P+ GVL++G T
Sbjct: 183 ---DNKDARHVKTYEVALKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + ++ V+ D + LL TG L L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLLSL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NLQPDA------KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|440492924|gb|ELQ75450.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
(CPSF subunit) [Trachipleistophora hominis]
Length = 1254
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/231 (26%), Positives = 123/231 (53%), Gaps = 25/231 (10%)
Query: 1095 VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST----GRNADNPQNLVTEVY 1150
+++++L ++ N+ ++ + + E RGR+L+F AD ++
Sbjct: 946 LKIMSLCDSNGDHNDFVVVVLSQRTSNE--ILRGRILVFEVIDVISDTADRKTKKALKLL 1003
Query: 1151 -SKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNF 1208
S+ KG IS A+++G + ++ +++++++ T + IAFYD +Y VSL ++KN+
Sbjct: 1004 GSERTKGPISCCAAVRGRIAVSLATRLMVYEFDRNTGIVAIAFYDLY-MYAVSLAVIKNY 1062
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAK-----DFGSLDCFATEFLIDGSTLSLVVSDEQK 1263
I++GDI ++F+ ++ + +L+LL+K + GSLD F G L + D+
Sbjct: 1063 IVVGDIMMGLHFVYFQSEPVKLHLLSKSGRVANLGSLDFFNA-----GDRLFITGIDKTG 1117
Query: 1264 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
+QIF ++P S +G+KL+ R +F AH Q + T++ R+ A+
Sbjct: 1118 EVQIFSFSPGNLYSNEGEKLVKRQQFETYAH------FQSIRTNTYRSYAS 1162
>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1067
Score = 85.1 bits (209), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 695 RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 750
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + T++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 751 ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 800
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 801 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 859
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 860 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 916
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1328
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 917 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 968
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 969 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1018
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 100/453 (22%), Positives = 178/453 (39%), Gaps = 108/453 (23%)
Query: 147 FDDSIHGLRITSMHCF----ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
D I+G RI ++ F E+ ++L + R F +++ DP+ +
Sbjct: 54 LDVPIYG-RIATLELFRPHGEAQDFLFIATERYKFC---VLQWDPES----------SEL 99
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I +A S +G G F + + N+R L+ V D F+ G +P + +
Sbjct: 100 ITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAV 158
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS +L + A L+ VP P+
Sbjct: 159 LYQ------DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLC 206
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S SA A+ + + ++ V++D + LL
Sbjct: 207 GVLIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLL 249
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 250 GDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-- 307
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
++ DA + S + L+ +N + + + +
Sbjct: 308 ----------------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQ 345
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 346 VVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KS 386
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 387 S------------IDEAFDTFLVVSFISETRIL 407
>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
Length = 1088
Score = 85.1 bits (209), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 152/350 (43%), Gaps = 36/350 (10%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 716 RRICHQEQTRTFGICSLGNQSNAEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 771
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + T + +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 821
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 881 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLVTVK 937
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1328
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 990 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S +A A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ +N + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428
>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
Length = 727
Score = 85.1 bits (209), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 355 RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 410
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + T++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 411 ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 460
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 461 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 519
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 520 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 576
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1328
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 577 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 628
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 629 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 678
>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
DNA-binding protein 1a; Short=DDB1a
gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1088
Score = 85.1 bits (209), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 716 RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 771
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + T++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 821
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 881 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 937
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1328
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 990 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDP 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS +L + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S SA A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGM 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ +N + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428
>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
Length = 1093
Score = 84.7 bits (208), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)
Query: 1105 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+L+F + L+ E KE KG++ +L +
Sbjct: 782 SDDNNVYYCVGTAYVMPEENEPTKGRILVFVV---EEGKLQLIAE---KETKGSVYSLNA 835
Query: 1164 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KWT GT EL + L + + +FI++GD+ KSI
Sbjct: 836 FNGKLLAAINQKIQLYKWTLRDDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 894
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + A+D+ + A E L D L +N + K SE
Sbjct: 895 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG------AENYFNLFTVRKNSEGA 948
Query: 1279 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ +L E+H+G V +F ++ P SD ++FG+++G IG
Sbjct: 949 TDEERSRLEVVGEYHLGEFVNRFQHGSLVMR-------LPDSDVGQIPTVIFGSVNGVIG 1001
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
IA L + L+ LQ L + V GL+ +R F N + + +D +L+
Sbjct: 1002 VIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKNFLDGDLIES 1059
Query: 1396 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L + EI+ + ++ + +L
Sbjct: 1060 FLDLNRSKMEEISRAMSVSAEELCKRVEEL 1089
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 107/504 (21%), Positives = 197/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSRPTIVVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + + P WS NL + A L+ VP P+ GV+++G T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + R+ V+ D + LL G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS LV+
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
Length = 1068
Score = 84.7 bits (208), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+++F+ D L+ E KE KGA+ L +
Sbjct: 757 SDDNNVYYCVGTAYVLPEENEPTKGRIIVFAV---EDGKLQLIAE---KETKGAVYCLNA 810
Query: 1164 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 811 FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 869
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + A+D+ + A E + D L +N + K SE
Sbjct: 870 SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 923
Query: 1279 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 924 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 976
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 977 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1019
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 148/367 (40%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 176 LK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL TG + LL ++++ V L +
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLVSLLVIIHEKEKVTGLKIEP 275
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ +++ DA
Sbjct: 276 LGETSIASTISYLDNAFVYVGSSYGDSQLIKL------------------NLQPDA---- 313
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 314 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400
Query: 584 EARTMVL 590
+ T +L
Sbjct: 401 ISETRIL 407
>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
Length = 1089
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+++F+ D L+ E KE KGA+ L +
Sbjct: 778 SDDNNVYYCVGTAYVLPEENEPTKGRIIVFAV---EDGKLQLIAE---KETKGAVYCLNA 831
Query: 1164 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 832 FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 890
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + A+D+ + A E + D L +N + K SE
Sbjct: 891 SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 944
Query: 1279 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 945 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 997
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 998 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1040
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 108/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETGELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP P+ GVL++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + ++ V+ D + LL TG + L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLVSL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L ++++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 280 LVIIHEKEKVTGLKIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
Length = 596
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)
Query: 1105 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+L+F + L+ E KE KG++ +L +
Sbjct: 285 SDDNNVYYCVGTAYVMPEENEPTKGRILVFVV---EEGKLQLIAE---KETKGSVYSLNA 338
Query: 1164 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KWT GT EL + L + + +FI++GD+ KSI
Sbjct: 339 FNGKLLAAINQKIQLYKWTLRDDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 397
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L +K + + A+D+ + A E L D L +N + K SE
Sbjct: 398 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG------AENYFNLFTVRKNSEGA 451
Query: 1279 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ +L E+H+G V +F ++ P SD ++FG+++G IG
Sbjct: 452 TDEERSRLEVVGEYHLGEFVNRFQHGSLVMR-------LPDSDVGQIPTVIFGSVNGVIG 504
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
IA L + L+ LQ L + V GL+ +R F N + + +D +L+
Sbjct: 505 VIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKNFLDGDLIES 562
Query: 1396 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L + EI+ + ++ + +L
Sbjct: 563 FLDLNRSKMEEISRAMSVSAEELCKRVEEL 592
>gi|385304556|gb|EIF48568.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
[Dekkera bruxellensis AWRI1499]
Length = 289
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/272 (22%), Positives = 126/272 (46%), Gaps = 14/272 (5%)
Query: 1107 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISAL 1161
+ + + +G+ + ED+A +G +++ +P +N + + S+ +G+I
Sbjct: 3 DTKNYVIVGSGKYRVEDLATKGSWMVYEIIDVVPDPNHPEAKNRLKLIKSESSRGSILGS 62
Query: 1162 ASLQGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
++ G + ++++ K G + +AF D LY + ++ +++GD +
Sbjct: 63 CNISGRFSLVQAQRMLVRTIKKDGNAV-PVAFXDTS-LYTKDVKSFEDMMIIGDAFDGLS 120
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
+ + ++ L K+ +L A +F++ L ++ +DE + + Y P ES K
Sbjct: 121 LYGFDAEPYRMLKLGKETQNLSLTACDFIVXEGGLYIIAADEDSVLHLLEYDPYDPESMK 180
Query: 1280 GQKLLSRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
G KLL+R+ F + T R + + D PG+D F ++ ++GS
Sbjct: 181 GXKLLTRSVFRFNGYTTAMRLCDRKNSIFSMLDTLAIPPGADLG--FEVIGCNIEGSFYK 238
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
+ P +E T+RRL +LQ + D H GLNP+
Sbjct: 239 VTPANEYTYRRLYALQNHISDKESHWLGLNPK 270
>gi|154421858|ref|XP_001583942.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
gi|121918186|gb|EAY22956.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
Length = 1297
Score = 84.0 bits (206), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 49/368 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + G + + GG DSII+ + +K+ VL+ D+ L+ T H
Sbjct: 48 LRLVWEKKFWGEIFGVYRHKSGG-----EYDSIIVGCDTSKVIVLQVIDN--DLKETEYH 100
Query: 161 CFESPEWLHLKRGRES--------FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
F P + ++ DP G C +L+ + +L +
Sbjct: 101 EFNRPGPPEPDPPKPERPFDISTRLRNKTIMDADPTGTCLALLLAQNILYVLPLANK--- 157
Query: 213 LVGDEDTFGSGGGFSAR---IESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHE 265
+ E T +G + + I+ + ++ D K ++D +F+ GY P + I+HE
Sbjct: 158 -IKIESTEKAGDEYHSSWKVIKDAFAYDVH-TDFKSPLYRIRDMVFLDGYKNPTLAIIHE 215
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLI---------WSAMNLPHDAYKLLAVPS 316
TW+ R+ + T +S +S K+ LI W++ LPH+++ L+ VP
Sbjct: 216 LIPTWSVRLPLQKSTVAVSIVSPPLKKKETVLISASIDKVTMWTSRALPHNSFGLVHVPD 275
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYA-----VSLDSSQELPRSSFSVELDAAHATW 371
PIGG LV+ N I Y + ALALN A V +D + P EL + T
Sbjct: 276 PIGGFLVLSKNAIIYMDHTNIVALALNKLAYLDDEVPVDITANGPGCH---ELYSKVGTA 332
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVV-----QRLDLSKTNPSVLTSDITTIGNSLFFLG 426
+ LL+ L +LT+ Y+G V + +PS S T SL F+G
Sbjct: 333 IDKSHILLTVDQHYLSILTLHYNGVKVTNLSLNVNLNLEFHPSCFLSLNYTNNRSLVFMG 392
Query: 427 SRLGDSLL 434
S DS L
Sbjct: 393 STTHDSTL 400
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 97/463 (20%), Positives = 174/463 (37%), Gaps = 49/463 (10%)
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQ----KVIPLKATPHQITYFAEKNLYPLIV 1021
NH I Q +++C L + N++ + + IP+ T +I Y N I
Sbjct: 810 NHFLIADEDQ--IRLCNLENIKPEHNFFIIDGCIVERIPVGMTVRRIAYCQNPNCVAFIA 867
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-VRILEPDRAGGPWQT 1080
S P +P ID EV + H + Y E+YE + +R T
Sbjct: 868 SHP--EPFTTENEKKIDVEVYENLQVHYQEPPSPAKVYPDEDYETIPKWNEERYSLFLYT 925
Query: 1081 RATIP-MQSSENALTVRVVTLFNTTTKENE------TLLAIGTAYVQGEDVAARGRVLLF 1133
+ + M N V V +TT + T LA+G+ ++ + RG + ++
Sbjct: 926 KDGLQQMVDYANHEIVNTVQFVHTTPMPEDGITLLNTYLAVGSGFLSQPEKMMRGVLYIY 985
Query: 1134 ST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1186
G N + L E +K K I + G++ I G + L ++
Sbjct: 986 QIRYMQNDEGFNEITLRPLYNET-NKIYKNPIIEITDNSGYMAIFCGNLLYLMRFFNENT 1044
Query: 1187 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1246
I + + S+ +KN++L D ++ W++ G +L +A+D + +
Sbjct: 1045 VKIEAFLVGRFFASSIVSLKNYLLYADSYEGFEVARWRKYGKKLISMARDTMTKLPLSAA 1104
Query: 1247 FLIDGSTLSLVVSDEQKNIQIF----YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1302
FL L VV D+ N IF Y P ++ ++ F++G +
Sbjct: 1105 FLQYEDCLGGVVFDDDGNAHIFDVDEYAIP-------ADAVVRKSIFYIGGRAISSGQFP 1157
Query: 1303 MLATSSDRTGAAPGSDKTNRFALL----------FGTLDGSIGCIAPLDELTFRRLQSLQ 1352
+ A + T P + L + T G IG P+DE +L +Q
Sbjct: 1158 IKAVTQ-ATQQNPNEEIDEELLQLQTKIGGHIAWYVTTHGKIGAFTPIDENDRHKLVGVQ 1216
Query: 1353 KKLVDSVPHVAGLNPRS--FRQFHSNGKAHRPGPDSIVDCELL 1393
S+ ++ L RS F+ ++ P +++DC++L
Sbjct: 1217 SAYEKSLCGLSHLEYRSGKFKNMIEQDIFNQ-SPKNVIDCDML 1258
>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
Length = 1150
Score = 83.6 bits (205), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 123/565 (21%), Positives = 230/565 (40%), Gaps = 101/565 (17%)
Query: 840 FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
+ A L DG +L Y+ L + ++T + R +S+ AS L
Sbjct: 612 YFLAALGDGRLLTYR--LDKSAKDTDSEKKFLYDQRQMSIGTQPAS-----------LSI 658
Query: 900 YTREETPH-GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
+ + H A C R T+ + SG G C + RE RV C S AF
Sbjct: 659 FETQNALHVFAACDRPTVIHSSSG------GGKLLCSNVNLREVTRV----CSFSSEAFP 708
Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ-KVIPLKATPHQITYFAEKNLY 1017
+C + + ++G L + T DN + + IPL P +I + +++
Sbjct: 709 -----DC----LALVTEGSLLL------GTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVF 753
Query: 1018 PLIVS---VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1074
++ + V + + N+ LS ++ ID+ + +++ +Y +E++E
Sbjct: 754 AVLTTKQVVTISEDGNEALSETTEEGYVRLIDD---TMMEIVHSYKLEQFETPC------ 804
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLF 1133
+ I + ++A K+N+ +GTAY E +RGR+L+F
Sbjct: 805 -------SVITVNFGDDA-----------AAKDNQDYFVVGTAYSYADEPEPSRGRMLVF 846
Query: 1134 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1193
+ + +T V + KGA+ ++ + G +L + + L +W+ TE +
Sbjct: 847 AV------REQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVRWSETESGARTLTE 900
Query: 1194 A----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
++++ + + +FIL+GD+ +S+ L++K + +A+D E L
Sbjct: 901 ECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARDIDPSWITVIEML- 959
Query: 1250 DGSTLSLVVSDEQK-NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF----LRLQML 1304
L +S E N+ S + +L E+H+G V + L LQ+
Sbjct: 960 ---DLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRLVLQIP 1016
Query: 1305 AT-----SSDRTGAAPGSDKT------NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1353
+ S G D +++ GT +G++G IA +DE TF+ L SLQ
Sbjct: 1017 ESGISILKSLLYGMYICFDDNLKELFMHKYRFNLGTANGALGVIASIDEKTFQFLHSLQT 1076
Query: 1354 KLVDSVPHVAGLNPRSFRQFHSNGK 1378
L + + V G+ +R+F S +
Sbjct: 1077 ALNEVIKGVGGIQHEDWRRFTSERR 1101
Score = 40.8 bits (94), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L +LD V D F++G+ +P + +L + +H +L
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
PL NL A L+ VP+PIGGV+V+G T+ Y S S L Y S+ S +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
R+ ++ D LL + G L LL T + L +
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ L S I + N ++GS GDS L++
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335
>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
Length = 1136
Score = 83.6 bits (205), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 165/380 (43%), Gaps = 62/380 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E+G +I+ HNL +D H T E L P +E AL+
Sbjct: 777 EIGQEIEVHNLLIIDQH---TFEVLHAHTLMP-----------------TEYALS----- 811
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA++ ++ + GR+LLF S G+ +++V KE+K
Sbjct: 812 LISTRLGEDSTSYYVVGTAFINPDETEPKMGRILLFHWSDGK--------LSQVAEKEIK 863
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 864 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKSDFVLVG 920
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ + E L D + L ++ N+ I
Sbjct: 921 DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFICQKDS 977
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 978 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1027
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G+IG + + + L++L+ KL + V + +R F ++ K + + +
Sbjct: 1028 TVSGAIGLVTQIPFGFYEFLRNLEDKLTSVIKSVGKIEHNFWRSFKTDLKIEQ--CEGFI 1085
Query: 1389 DCELLSHYEMLPLEEQLEIA 1408
D +L+ + L ++ E+A
Sbjct: 1086 DGDLIESFLDLSHDKMAEVA 1105
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEINLREKEFSKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLELEKKPDGTQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336
>gi|255316764|gb|ACU01763.1| putative DNA damage binding protein [Brachypodium distachyon]
Length = 384
Score = 83.6 bits (205), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 35/332 (10%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 73 SDDNNFYYCVGTAYVLPEENEPTKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 126
Query: 1164 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 127 FNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFT-QTRGDFIVVGDLMKSI 185
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQIFYYAPKM 1274
L +K + + + LA+D+ + A E + D G+ S + +KN +
Sbjct: 186 SLLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKN------SDAA 239
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGS 1333
++ +G +L E+H+G V +F ++ D G P ++FGT++G
Sbjct: 240 TDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIP--------TVIFGTINGV 290
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
IG IA L + L+ LQ L + V L+ +R FH+ K + +D +L+
Sbjct: 291 IGIIASLPHDQYVFLEKLQSILGKFIKGVGSLSHDQWRSFHNEKKT--AEARNFLDGDLI 348
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L + E++ G + + + +L
Sbjct: 349 ESFLDLNRSKMEEVSKGMGVSVENLSKRVEEL 380
>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1156
Score = 83.2 bits (204), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/339 (25%), Positives = 149/339 (43%), Gaps = 42/339 (12%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1144
+ S+E AL++ L + + T +GTA V ED + GR+L+F + D+ +
Sbjct: 816 LNSNEYALSIISAKLGD----DPATYYILGTAVVNPEDQDPKLGRILIF----HWDDSSS 867
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVS 1201
+T + KE+KGA +A G LL A + L +WT + L F + L+V +
Sbjct: 868 KLTPITEKEVKGACYGMAEFNGKLLAAVNCTVRLFEWTAEKELRLECSHFNNIVALFVKT 927
Query: 1202 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1261
+FI+ GD+ +S+ L +K +A+D+ A E + D L ++
Sbjct: 928 KG---DFIVCGDLMRSLTLLQYKTMEGSFEEIARDYNPKWSTAIEIIDDDVFLG---AEN 981
Query: 1262 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
KN+ I + ++ +L +FH G + F R G+ T+
Sbjct: 982 DKNLFIIHKDSTLTSDEARHQLQEIGQFHCGDLINVF-----------RHGSLVMQHFTD 1030
Query: 1322 RFA-----LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
+ +L+GT G++G + L F L L+K L V V +N + +R +H+
Sbjct: 1031 TYVSVQGGILYGTCSGALGLVTQLTPKMFDFLSDLEKSLATVVKGVGKINHQFWRSYHTE 1090
Query: 1377 GKAHRPGPDSIVDCEL------LSHYEMLPLEEQLEIAH 1409
+ +S VD +L LS EM+ + + L+ A+
Sbjct: 1091 IRTE--PSESFVDGDLIESFLDLSKREMIAVVDALQGAY 1127
Score = 48.1 bits (113), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 58/267 (21%), Positives = 109/267 (40%), Gaps = 59/267 (22%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G + +DP R G+ +Y GL II D G + R+E + +
Sbjct: 122 GAMAVIDPSARVIGLKLYDGLFKII------------PLDKEGELKAYCLRMEE---VEV 166
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH-PL 297
+D+D F++G P ++I+H+ + GR I A +S K+
Sbjct: 167 QDID--------FLYGCANPTIIIIHQDTM---GR--------HIKAKELSIKDKEFVKT 207
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
W N+ +A ++ VP P+ G +++G ++ YH+ S+ A+ S + +
Sbjct: 208 PWKQENVETEASMIIPVPEPLCGAIIIGRESVLYHNGSSFIAI----------SPPVIKQ 257
Query: 358 SSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL---- 411
S+ +D +L D+A G L +L + Y+ + +L
Sbjct: 258 STIVCYARIDPEGTRYLLGDMA------GHLFMLLLNYEKNPDGTFKIKDPKVDLLGEIS 311
Query: 412 -TSDITTIGNSLFFLGSRLGDSLLVQF 437
+T + N + ++ SR+GDS L++
Sbjct: 312 IPESLTYLDNKIIYVASRVGDSQLIKL 338
>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
Length = 1088
Score = 83.2 bits (204), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ + L + EE E VR+L+ ++ ++ P+ + E ++
Sbjct: 716 RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 771
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL + KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
+GD+ KSI L +K + + A+D+ + A E L D L +D NI
Sbjct: 881 AVGDLMKSISLLIYKHEEGAIEERARDYNANWMAAVEILNDDIYLG---TDNCFNIFTVK 937
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
+ + + ++ E+H+G V +F ++ P SD ++FGT
Sbjct: 938 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 990
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 991 VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + ++ + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA + S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
DNA-binding protein 1b; Short=DDB1b
gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
Length = 1088
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ + L + EE E VR+L+ ++ ++ P+ + E ++
Sbjct: 716 RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 771
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL + KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
+GD+ KSI L +K + + A+D+ + A E L D L +D NI
Sbjct: 881 AVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVK 937
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
+ + + ++ E+H+G V +F ++ P SD ++FGT
Sbjct: 938 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 990
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 991 VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + ++ + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA + S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
anatinus]
Length = 1045
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 145/333 (43%), Gaps = 42/333 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 728 KDANTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 781
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 782 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 836
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 837 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 893
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 894 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 943
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 944 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLIES 1001
Query: 1396 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +M + L+I +G R + +L
Sbjct: 1002 FLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1034
>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1090
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 35/332 (10%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 779 SDDNNFYYCVGTAYVLPEENEPTKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832
Query: 1164 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 833 FNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFT-QTRGDFIVVGDLMKSI 891
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQIFYYAPKM 1274
L +K + + + LA+D+ + A E + D G+ S + +KN +
Sbjct: 892 SLLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKN------SDAA 945
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGS 1333
++ +G +L E+H+G V +F ++ D G P ++FGT++G
Sbjct: 946 TDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIP--------TVIFGTINGV 996
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
IG IA L + L+ LQ L + V L+ +R FH+ K + +D +L+
Sbjct: 997 IGIIASLPHDQYVFLEKLQSILGKFIKGVGSLSHDQWRSFHNEKKTAE--ARNFLDGDLI 1054
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L + E++ G + + + +L
Sbjct: 1055 ESFLDLNRSKMEEVSKGMGVSVENLSKRVEEL 1086
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G + P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCLRPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS NL + A L+ VP+P+GGV+++G TI Y + +++
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFK------ 247
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
++ Q + R+ V+ D + LL TG L LL + + V L +
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNTGILHLLVLTQERERVTGLKIEH 297
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N + ++GSR GDS LV+ +++ADA
Sbjct: 298 LGETSVASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADATG-- 337
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
S + L+ VN + + + + + T S A +D G ++ G+
Sbjct: 338 ----SFVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSIRVVRNGIG 388
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ KSS ++D Y +L++S
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDTFLVVSF 422
Query: 584 EARTMVL 590
+ T L
Sbjct: 423 ISETRFL 429
>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
Length = 1103
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 159/346 (45%), Gaps = 44/346 (12%)
Query: 1104 TTKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1162
T +NE + +GTA +G E+ ++GR+L+ D+ L E K++KGA+ L
Sbjct: 775 TDDDNEYFI-VGTAITEGDEEEPSKGRILVLQV---QDDKLVLKAE---KDVKGAVMVLH 827
Query: 1163 SLQGHLLIASGPKIILHKWTGTEL--NGIAFYDAP---PLYVVSLNIVKNFILLGDIHKS 1217
S G LL +++L KW ++ N + +Y++ ++ +FIL+GD+ KS
Sbjct: 828 SFNGKLLAGVSGRLMLFKWAESDDGDNKDLVQECSCSGGIYILDIDSHGDFILIGDMMKS 887
Query: 1218 IYFLSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
++ ++ Q L L++KD+ + +++ S V D+Q N+
Sbjct: 888 VHLFVYENPEEQHVSGNLRLISKDY-QYSWLSCSLMLNES--EYVAVDQQGNMITLKKND 944
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTK----FLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + + ++L+ +++ V + F+ ++ +SSD P LFG
Sbjct: 945 EAASEEERKQLVRVGKYYCSDRVNRIQPGFIGMRFANSSSD-INTQPVK------TALFG 997
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI- 1387
T+ G IG +A L TF + +QK + V +A ++ ++RQ+ S R DS+
Sbjct: 998 TISGGIGVLAQLPPETFAFVTKIQKAMSSVVTGLANISRETYRQYRS----ERTREDSVG 1053
Query: 1388 -VDCELLSHYEMLPLE------EQLEIAHQTGTTRSQILSNLNDLA 1426
+D + + + E E+L HQ T +++ N+ DL+
Sbjct: 1054 FIDGDFVESFLEFDFETQQRVIEELSNNHQEQITLEELVKNIEDLS 1099
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 77/358 (21%), Positives = 151/358 (42%), Gaps = 60/358 (16%)
Query: 82 KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAK 141
KN R+ +G+S+ V + G ++++++ G ++D + + ED
Sbjct: 35 KNQYLQVNRLSEEGVSS-----VVEFEAPGRIDTMSLFRPSG----EKQDLLFITIEDTF 85
Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQ 200
++ D I L S+ + P GR S G + +DP R + +Y GL
Sbjct: 86 FTLGFIDGKIETLSSGSI---DDP------VGRRS-ESGSITTIDPLCRAVALSIYEGLL 135
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
II ++ F F+ R+E +VI++ L+ K P
Sbjct: 136 KII--------PFENNKHQFKEA--FNVRLEELNVIDIAFLESLGSK------SKSGPTF 179
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
+L++ + H ++ +++ L + +N+ H A L+ VP+P+GG
Sbjct: 180 ALLYQDHV-------GSRHVKTYEVKTLDKDMEESSL--NQLNVDHGANILIPVPAPLGG 230
Query: 321 VLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
V+ VG + Y ++S N++V+ ++ + S+ +LD + W D
Sbjct: 231 VICVGEAQVSYINESN------KNHSVASPANSRMAIRSYG-KLD--NTRWFLGD----- 276
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
++G L LL++ V L L + + ++S I+ + N F+GS GDS +++ +
Sbjct: 277 -QSGQLYLLSLQVSDSEVTGLTLKELGVTSISSCISYLDNGYVFIGSNYGDSQVIRIS 333
>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
Length = 1102
Score = 82.4 bits (202), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ + L + EE E VR+L+ ++ ++ P+ + E ++
Sbjct: 730 RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 785
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 786 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 835
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL + KI L+KW GT EL + L + + +FI
Sbjct: 836 KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 894
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
+GD+ KSI L +K + + A+D+ + A E L D L +D NI
Sbjct: 895 AVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVK 951
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
+ + + ++ E+H+G V +F ++ P SD ++FGT
Sbjct: 952 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 1004
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 1005 VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1053
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 85/370 (22%), Positives = 150/370 (40%), Gaps = 90/370 (24%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P + +L++ +H
Sbjct: 158 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQ------DNKDARH------V 204
Query: 286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
+ +LK + WS NL + A L+ VPSP+ GVL++G TI Y S +A A+ +
Sbjct: 205 KTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 264
Query: 343 NNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
+ ++ V+LD + LL G + LL + ++ V L
Sbjct: 265 R---------PSITKAYGRVDLDGSR--------YLLGDHAGLIHLLVITHEKEKVTGLK 307
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
+ + + S I+ + N++ F+GS GDS L++ +++ DA
Sbjct: 308 IELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL------------------NLQPDA- 348
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSY 520
+ S + L+ VN + + + + T S A +D G L+
Sbjct: 349 -----KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRN 398
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
G+ IN AS VEL G KG+W++ KSS D+ + +L+
Sbjct: 399 GIGINEQAS------------VELQGIKGMWSL--KSS------------IDEAFDTFLV 432
Query: 581 ISLEARTMVL 590
+S + T +L
Sbjct: 433 VSFISETRIL 442
>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
Length = 1140
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 127/286 (44%), Gaps = 34/286 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ L E + L +Q +L + V + +R FH+ K +
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ 1084
>gi|159155577|gb|AAI54419.1| Cpsf1 protein [Danio rerio]
Length = 400
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 51/240 (21%)
Query: 483 ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINADAS---- 529
E+ +YGS A + T+ A T+SF V DS++NIGP S G + N +
Sbjct: 54 EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVV 111
Query: 530 -ATGISKQSNYELV------------ELPGCKGIWTVYHKSSR---------GHNADSSR 567
+G K ++ ELPGC +WTV + + G + + +
Sbjct: 112 VCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEK 171
Query: 568 MAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
D + H +LI+S E TM+L+T + E+ S + QG T+ AGN+ + +I
Sbjct: 172 REPTIEDDKKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYII 230
Query: 625 QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
QV G R+L+G L F P + S ++ S+ADPYV++ ++G + + V
Sbjct: 231 QVSPMGIRLLEG---VNQLHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 280
>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
boliviensis]
Length = 1140
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
Length = 1239
Score = 82.0 bits (201), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 922 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 975
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 976 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 1030
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 1031 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1087
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1088 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1137
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1138 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1180
>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
C-169]
Length = 1101
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 143/321 (44%), Gaps = 27/321 (8%)
Query: 1114 IGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA V E +GR+L+F +LV E KE+KGA L QG L+
Sbjct: 796 VGTAITVAEEPEPTKGRILVFGA---KGGKLSLVCE---KEVKGAAYNLHPFQGKLIAGI 849
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPL--YVVSLNIVK--NFILLGDIHKSIYFLSWKEQGA 1228
++ L KWT +E + +V++L IV +F+++GD+ +S+ L ++
Sbjct: 850 NSRVQLFKWTQSEDGSRELTNECSHVGHVLALYIVTRGDFVIVGDLMRSLQLLIYRADEG 909
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
L + A+D+ + A E L D + L ++ NI + +L + +
Sbjct: 910 ILEVRARDYKTHWMTAVEVLDDDTYLG---AENSNNIFTLRKNTDAAADEDRNRLETVGQ 966
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1348
+H+G V +F ++ P S+ +LF T++GSIG IA L + F+ L
Sbjct: 967 YHLGVFVNRFRHGSLVMK-------LPDSEAAKIPTVLFVTINGSIGVIASLPQQQFQFL 1019
Query: 1349 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLEI 1407
LQ L + V GL+ ++R F H P + VD +L+ + L + +
Sbjct: 1020 SRLQDCLRKVIKGVGGLSHVAWRTFQDE---HTKMPSQNFVDGDLIEQFLDLKRDSMERV 1076
Query: 1408 AHQT--GTTRSQILSNLNDLA 1426
A + G T +L + +L+
Sbjct: 1077 AREMGEGVTSEDLLRMVEELS 1097
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 125/551 (22%), Positives = 206/551 (37%), Gaps = 113/551 (20%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ V ++G V ++ + G +D + L+ E K VLE+D
Sbjct: 44 RIEIHTLTPEGLKGVADVAIYGRVATMELFRPVG----ESKDLLFLSTERYKFCVLEYDS 99
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
L + E + GR G + VDP G +MI L G
Sbjct: 100 ETGELVTRANGDIED------QVGRPC-DNGQIGIVDP----------GCRMIGLHLYDG 142
Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
++ +D F+ RI+ +VI D IF+ G +P + +L++
Sbjct: 143 LFKVIPIDDKGQLHEAFNMRIDELNVI-----------DMIFLEGCAKPTIAVLYQDN-- 189
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
H + L + P W NL A +++AVP P+GG LVVG + I
Sbjct: 190 -----KDARHIKTYEVVLKEKDLTEGP--WRQSNLDAGASRVIAVPEPLGGALVVGESVI 242
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVL 388
Y Q Q + + + AH ++ LL G+L L
Sbjct: 243 AYMGQ-----------------GQAMKCTPIKATIIRAHGRVDEDGSRYLLGDYVGNLYL 285
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + +DG V L + + S +T + N + F+GS GDS LV+
Sbjct: 286 LVLQHDGEHVAGLKVEPLGRTSAPSTLTYLDNGVVFVGSSGGDSQLVRL----------- 334
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDS 508
P T + + + L+ M N + + + + +
Sbjct: 335 ----------HPTPVTPQEPSNFVEVLETMTNLGPIIDFVVVDLERQGQGQVVMCS---- 380
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
G + D S LRI + G+ +Q+ VELPG KG+W + +S M
Sbjct: 381 ----GIMADGS--LRIVRN--GIGMIEQAT---VELPGIKGMWALR----------ASHM 419
Query: 569 AAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
A+D +L+IS E R + + D L E E + +T+ GN ++QV
Sbjct: 420 DAFD----TFLVISFVGETRILAINADDELDE-AELPGFSADAQTLCCGNTVS-DHLVQV 473
Query: 627 FERGARILDGS 637
R++D S
Sbjct: 474 AGADVRLVDAS 484
>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
sapiens]
Length = 923
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 606 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 659
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 660 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 714
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 715 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 771
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 772 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 821
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 822 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 864
>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
Length = 1108
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 125/292 (42%), Gaps = 22/292 (7%)
Query: 1110 TLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA+ E +GR+ + + + V KE +GA+ +LA QG L
Sbjct: 798 TYYVVGTAFAPPNEPEPTKGRIFVLAAAGGK------LCVVCEKETRGAVYSLAEFQGRL 851
Query: 1169 LIASGPKIILHKWTGTELNGIAFYD----APPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
L ++ ++KW G A A + + L + +++GD+ KSI L+W
Sbjct: 852 LAGINSRVQMYKWLEQGEGGRALVPECSHAGHVLALYLATRGDLVVVGDLMKSIQLLAWG 911
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
E+ L L A+DF A L D + + ++ N+ + + +L
Sbjct: 912 EEEGALELRARDFHPNWMSAVTVLDDDTYMG---AENSYNLFTVRRNADAATDEERSRLE 968
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
+ +H+G V +F ++ P S+ + +LFGT++G IG +A L
Sbjct: 969 TVGRYHLGEFVNRFQPGSLVMR-------LPDSELSQIPTVLFGTINGVIGVVASLPHAQ 1021
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
++ L+SLQ+ + V V G + +R F + P VD +L+ +
Sbjct: 1022 YQLLESLQEAMRKVVKGVGGFDHAQWRAFSNQHMPATPA-RQFVDGDLIEQF 1072
>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|300707023|ref|XP_002995737.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
gi|239604943|gb|EEQ82066.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
Length = 1155
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/236 (25%), Positives = 116/236 (49%), Gaps = 12/236 (5%)
Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN--ETLLAIGTAYVQGEDVAARGR 1129
D ++ +T ++S E L ++ ++L N + N + I V+GED +RGR
Sbjct: 820 DLYTSAYKFISTFDLESDEYVLDIKELSL-NDSIGINGKNNFIVICVTKVEGEDKHSRGR 878
Query: 1130 VLLFSTGRNADNPQNL-----VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1184
+++F + N+ + + S+ +KG I+ ++G+L++A G K +++K +
Sbjct: 879 IIVFELIDIIVDKANVHKDKKLKVLASENIKGCITKCDEIKGNLIVALGIKTMIYKIDRS 938
Query: 1185 E-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
E L I +D L S+ +KNF+L DI++ + F ++ + +LNL+ +
Sbjct: 939 EGLIPIGIHDLYTL-TTSMITIKNFVLFSDIYRGLSFFYYQNKPVRLNLVCTSESIKNAV 997
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE--FHVGAHVTK 1297
+F++ L ++ +D NI + Y+P S G K + R E F++G V K
Sbjct: 998 HVDFIVKEPALGIICTDFAGNIHTYTYSPVNILSCNGTKFVKRCETNFNLGKLVIK 1053
>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
JAM81]
Length = 1098
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 148/333 (44%), Gaps = 27/333 (8%)
Query: 1085 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1143
P + + + +T+R T ++ +GT + ED RGR+L+F N
Sbjct: 765 PFEIASSLITIRF-------TDDDTLYYTVGTGFAFPHEDEPVRGRILVFKV-----NDM 812
Query: 1144 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVSL 1202
L+ V+ +++G+ + S+ G L+ +++ +W + T L + + + +SL
Sbjct: 813 RLLQLVHEYDIRGSAYSFVSVHGRLVAGVNSNVMVLRWNSDTSLLELQSMNHGHVLALSL 872
Query: 1203 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1262
+ +FIL+ D+ KSI L + L LA D S A E + D + L +D
Sbjct: 873 AVRGDFILVADLIKSITLLQFDLATDSLKELAYDADSNWMTAAELIDDDTFLG---ADSS 929
Query: 1263 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1322
NI + Q+L + FH G + +F + + ++D T A P +
Sbjct: 930 MNIFALSKQGDQVSEEERQRLRPKGWFHTGELINRFRKGSLTLHATDETLALPAIPE--- 986
Query: 1323 FALLFGTLDGSIGCIA--PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
+L+ T+ G+IG +A P DE T + L +LQ+ L V V GL +R++ + ++
Sbjct: 987 --ILYCTVHGAIGVVARIPSDE-TAKILSTLQEALKSVVQGVGGLIHSDWRRYRTERRSI 1043
Query: 1381 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1413
+ I+D +L+ + L Q + Q T
Sbjct: 1044 KSA--GIIDGDLIESFLELDRSMQDHVFTQVAT 1074
>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Hbx
gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Whx
gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Ddb2
gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdtc1
gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Iqwd1
gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr40a
gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr22
gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr21a
gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
Length = 1143
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 826 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 879
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 880 NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 934
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 935 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 991
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 992 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1041
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1042 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1084
>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
Containing Dna-Duplex
gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
Containing Dna-Duplex
gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
Length = 1158
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 841 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 894
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 895 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 949
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 950 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1006
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1007 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1056
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1057 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1099
>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
sapiens]
Length = 896
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 126/284 (44%), Gaps = 34/284 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 579 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 632
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 633 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 687
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 688 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 744
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 745 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 794
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 795 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT 838
>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 126/285 (44%), Gaps = 34/285 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE 1083
>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 830 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASI 883
Query: 1173 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 884 NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 938
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 939 GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 995
Query: 1288 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FH+G V F L LQ L SS T + +LFGT++G IG + L E
Sbjct: 996 VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1045
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1046 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1096
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG VV + L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479
>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
Length = 1193
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 876 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 929
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 930 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 984
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 985 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1041
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1042 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1091
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1092 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1134
>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
Length = 1088
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 151/350 (43%), Gaps = 36/350 (10%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ L EE E VR+L+ ++ +T P+ + E ++
Sbjct: 716 RRICHQEQTRTFGICSLGNQTNAEESEMHFVRLLDDQ----SFEFVSTYPLDAFEYGCSI 771
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
L + + +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFADDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KET 821
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KG++ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGSVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 881 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 937
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1328
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 990 TVNGVIGVIASLPQEQYMFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 109/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPMYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S +A A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ VN + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T VL
Sbjct: 409 -------IDEAFDTFLVVSFISETRVL 428
>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
Length = 1158
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 841 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 894
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 895 NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 949
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 950 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1006
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1007 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1056
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1057 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1099
>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
Length = 1122
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
cuniculus]
gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DDBa; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1144
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 827 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 880
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 881 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 935
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 936 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 992
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 993 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1042
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1043 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1085
>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Nomascus leucogenys]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Callithrix jacchus]
gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DNA damage-binding protein a;
Short=DDBa; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=HBV X-associated protein 1;
Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
AltName: Full=UV-damaged DNA-binding protein 1;
Short=UV-DDB 1; AltName: Full=XPE-binding factor;
Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
E-complementing protein; Short=XPCe
gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
sapiens]
gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
Length = 1150
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 833 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 886
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 887 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 941
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 942 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 998
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 999 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1048
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1049 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1091
>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 126/285 (44%), Gaps = 34/285 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE 1083
>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
Length = 1159
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 842 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 951 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 96/461 (20%), Positives = 172/461 (37%), Gaps = 116/461 (25%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E HVI+++
Sbjct: 143 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 187
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
F++G P + +++ GR H + P W N+
Sbjct: 188 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 231
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
+A ++AVPSP GG +++G +I YH+ A+A + + S V
Sbjct: 232 EAEASMVIAVPSPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 280
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+D + +L D+ G L +L + DG V ++ L + + + +T
Sbjct: 281 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 334
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ N + F+GSRLGDS LV+ S G+ +++ G I D R+
Sbjct: 335 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 393
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ T S A ++ G L+ G+ I+ AS
Sbjct: 394 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 420
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +R E L++S +T VL
Sbjct: 421 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 458
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + + +T GN+ +++IQ+ R++
Sbjct: 459 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 498
>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
(Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
Length = 1159
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 842 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 951 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100
>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQRDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1159
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 842 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 951 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100
>gi|224587439|gb|ACN58665.1| DNA damage-binding protein 1 [Salmo salar]
Length = 444
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 135 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMMEFNGKLLASI 188
Query: 1173 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 189 NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 243
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 244 GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 300
Query: 1288 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FH+G V F L LQ L SS T + +LFGT++G IG + L E
Sbjct: 301 VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 350
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 351 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 401
>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
Length = 1159
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 842 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 951 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100
>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
Length = 1125
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 808 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 861
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 862 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 916
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 917 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 973
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 974 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1023
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1024 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1066
>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
Length = 826
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 509 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 562
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 563 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 617
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 618 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 674
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 675 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 724
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 725 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 767
>gi|385865228|gb|AFI92852.1| DNA damage-binding protein 1 [Danio rerio]
Length = 1140
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 831 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASI 884
Query: 1173 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 885 NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 939
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 940 GSFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 996
Query: 1288 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FH+G V F L LQ L SS T + +LFGT++G IG + L E
Sbjct: 997 LFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1046
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1047 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1097
>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
Length = 1140
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 151/365 (41%), Gaps = 60/365 (16%)
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1101
G +++ HNL VD H +EV +P SE AL++ L
Sbjct: 783 GEEVEVHNLLVVDQHT------FEV-----------LHAHQFLP---SEYALSLVSCRL- 821
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1160
K+ +GTA V E+ + GR+++F D V E KE+KGA+ +
Sbjct: 822 ---GKDPSVYFIVGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYS 872
Query: 1161 LASLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
+ G L + + L++WT TE N + + LY L +FIL+GD+
Sbjct: 873 MVEFNGKFLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLM 927
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+S+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 928 RSVLLLAYKSMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAAT 984
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1331
+ Q L FH+G V F L LQ L SS T + +LFGT++
Sbjct: 985 TDEERQHLQEVGLFHLGEFVNVFCHGSLVLQNLGESSTPTQGS----------VLFGTVN 1034
Query: 1332 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
G IG + L E + L LQ +L + V + +R FH+ K + +D +
Sbjct: 1035 GMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGD 1092
Query: 1392 LLSHY 1396
L+ +
Sbjct: 1093 LIESF 1097
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 74/341 (21%), Positives = 129/341 (37%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479
>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKLGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 830 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMMEFNGKLLASI 883
Query: 1173 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 884 NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 938
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 939 GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 995
Query: 1288 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FH+G V F L LQ L SS T + +LFGT++G IG + L E
Sbjct: 996 VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1045
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1046 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1096
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 76/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG VV + L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + ++ R E L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479
>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
Length = 1138
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 161/378 (42%), Gaps = 58/378 (15%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
++ +I+ HNL +D H T E +L P+ E AL+
Sbjct: 779 DISQEIEVHNLLIIDQH---TFEVLHAHMLMPN-----------------EYALS----- 813
Query: 1100 LFNTTTKENETLL-AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGA 1157
L +T E+ T +GTA V ++ + GR+LL+ +T+V KE+KG+
Sbjct: 814 LISTKLGEDPTFYYVVGTALVNPDETEPKMGRILLYHWNDGK------LTQVAEKEIKGS 867
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
+L G LL + + L +WT + L F + LY L +F+L+GD+
Sbjct: 868 CYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVGDL 924
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 925 MRSLTLLQYKTMEGSFEEIARDYNPNWMTAVEILDDDTFLG---AENCFNLFVCQKDSAA 981
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
+ + Q++ +FH+G V F L +Q L SS T +LFGT+
Sbjct: 982 TSEDERQQMQEIGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFGTV 1031
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
G+IG + + + L+ L+ +L + + V + R +R F++ K + +D
Sbjct: 1032 SGAIGLVTQIPFTFYEFLRHLEYRLTEVIKSVGKIEHRFWRSFNTELKVE--NCEGFIDG 1089
Query: 1391 ELLSHYEMLPLEEQLEIA 1408
+L+ + L ++ E+A
Sbjct: 1090 DLIESFLDLSPDKMAEVA 1107
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 92/422 (21%), Positives = 166/422 (39%), Gaps = 90/422 (21%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 211 RQDNVEREATMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 260
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DG-RVVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + DG +VV+ L + +
Sbjct: 261 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKNPDGTQVVKDLKVELLGEISIP 314
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
IT + N + F+GSRLGDS L++ +AD + + +
Sbjct: 315 ECITYLDNGVIFVGSRLGDSQLIKLIT------------------KADENGSYCVPMETF 356
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG 532
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 357 TNLAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS--- 404
Query: 533 ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 ---------IDLPGIKGMWAL---RIGGGNFDNT------------LVLSFVGQTRILTL 440
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
E T+ + +T GN+ IQ+ AR++ T + P N +
Sbjct: 441 NGEEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRT 499
Query: 653 GS 654
S
Sbjct: 500 IS 501
>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
Length = 1140
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ +
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYPMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|19074861|ref|NP_586367.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
cuniculi GB-M1]
gi|19069586|emb|CAD25971.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
cuniculi GB-M1]
Length = 1156
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 72/323 (22%), Positives = 148/323 (45%), Gaps = 26/323 (8%)
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-----KGAISALASLQ 1165
L + T +++GED ARGR+ + + ++ + K L KG+I ++
Sbjct: 853 FLLVCTTFIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVR 912
Query: 1166 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
G + + G KI+++K + + I FYD ++ S+++VKN+IL DI++ + F ++
Sbjct: 913 GKIALCLGTKIMIYKIDRSSGIIPIGFYDLH-IFTSSISVVKNYILASDIYRGLSFFFFQ 971
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
+ +L+L++ + +TE L G+ LS++ D + I + Y+P S G +L+
Sbjct: 972 SKPIRLHLISSSEPLRNATSTELLSTGNELSMLCCDAKGTIHGYTYSPNNIISMDGARLV 1031
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1344
RAE + T+ R + K N +++F + + ++ +D+
Sbjct: 1032 KRAE---------------IKTNLGRLSSFGAGFKKN--SIMFYSRSNMLIHVSGIDDAH 1074
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
+ +L +Q ++ + V GLN R + +S+ H S + +L+ + + Q
Sbjct: 1075 YLKLLGVQTAIMAHLKSVFGLNQRDY--LNSDIHLHSLSLKSPIVLHILNLFSYFDMSTQ 1132
Query: 1405 LEIAHQTGTTRSQILSNLNDLAL 1427
++ R +I + L L
Sbjct: 1133 ESVSSSARIDRKEISDMIASLNL 1155
>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
Length = 1140
Score = 80.5 bits (197), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 163/380 (42%), Gaps = 62/380 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E+G +++ HNL VD H T E L P T ++
Sbjct: 781 EIGQEVEIHNLLIVDQH---TFEVLHAHTLVP----------------------TEYAMS 815
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA + ++ + GR+LL+ + G+ +T+V KE+K
Sbjct: 816 LISTKLGEDPTPYYIVGTAMINPDESEPKSGRILLYHWNDGK--------LTQVAEKEIK 867
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 868 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 924
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ + E L D + L ++ N+ +
Sbjct: 925 DLMRSVTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 981
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 982 AATSEEERQQMQEVGQFHLGDMVNVFRHGSLVMQHLGESSTPTHGC----------VLFG 1031
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G+IG + + + L++L+ +L + V + +R F+++ K + + +
Sbjct: 1032 TVCGAIGLVTQIPSTFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTDLKIEQ--CEGFI 1089
Query: 1389 DCELLSHYEMLPLEEQLEIA 1408
D +L+ + L E+ E+A
Sbjct: 1090 DGDLIESFLDLSHEKMAEVA 1109
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 90/420 (21%), Positives = 160/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D ++V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MDEQNVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + Y + + S
Sbjct: 211 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTT--------YVTVVPPIIKQSTIS 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L + D + V++ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDLA------GHLFMLFLEQDKKADGSMVIKDLKVELLGEVSIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS L++ + E F ++ AP A
Sbjct: 317 ITYLDNGVIFIGSRLGDSQLIKLNTKPDENGSYCSTMETFTNL---APIVDM-------A 366
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ D+ + T S A ++ G L+ G+ I AS
Sbjct: 367 VVDL------------ERQGQGQIVTCSGAFKE-----GSLRIIRNGIGIQEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + S N L++S +T +L
Sbjct: 405 -------IDLPGIKGMWALKVDSVNFDNT---------------LVLSFVGQTRILMLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E TE + +T GN+ +IQ+ AR++ + + P N + S
Sbjct: 443 EEVEETEIPGFVADEQTFHTGNV-TNDVIIQITPTSARLISNKSSSVISEWEPDNKRTIS 501
>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
Length = 451
Score = 80.5 bits (197), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 134 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 187
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 188 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 242
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 243 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 299
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 300 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 349
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 350 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 392
>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
Length = 1140
Score = 80.5 bits (197), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 42/333 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V ++ + GR+++F N L T V KE+KGA+ ++
Sbjct: 823 KDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKLQT-VAEKEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIES 1096
Query: 1396 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +M + L+I +G R + +L
Sbjct: 1097 FLDISRPKMQEVIANLQIDDGSGMKRETTVDDL 1129
>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
tropicalis]
Length = 1140
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 42/333 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V ++ + GR+++F N L T V KE+KGA+ ++
Sbjct: 823 KDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKLQT-VAEKEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIES 1096
Query: 1396 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +M + L+I +G R + +L
Sbjct: 1097 FLDISRPKMQEVIANLQIDDGSGMKRETTVDDL 1129
>gi|429965418|gb|ELA47415.1| hypothetical protein VCUG_01066 [Vavraia culicis 'floridensis']
Length = 1176
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 99/181 (54%), Gaps = 18/181 (9%)
Query: 1151 SKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFI 1209
S+ KG IS A+++G + ++ K+++++ + + IAFYD +Y VSL ++KN+I
Sbjct: 927 SERTKGPISCCAAVRGKIAVSLATKLMVYECDRNSGIVAIAFYDLY-MYAVSLAVIKNYI 985
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAK-----DFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
++GDI ++F+ ++ + +L+LL+K + GSLD F G +L + D+
Sbjct: 986 IVGDIMMGLHFVYFQSEPVKLHLLSKSDRIANLGSLDFFNA-----GESLFITGIDKTGK 1040
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+QIF ++P S G+KL+ R EF A Q + TSS R+ A+ S + A
Sbjct: 1041 VQIFSFSPSNLYSNGGEKLVKRQEFETYAC------FQSIKTSSYRSYASFFSSQNFLIA 1094
Query: 1325 L 1325
L
Sbjct: 1095 L 1095
>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
Length = 1140
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 145/333 (43%), Gaps = 42/333 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V ++ + GR+++F +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPDEAEPKQGRIVVF---HYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LYV + +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALYVKTKG---DFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEFGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLIES 1096
Query: 1396 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +M + L+I +G R + +L
Sbjct: 1097 FLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 1123
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 142/302 (47%), Gaps = 31/302 (10%)
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLF 1133
G ++ R + + S+E ++ + F + ++ L +GTA+V ED RGR+L+F
Sbjct: 840 GQTFEIRDSFQLPSTETIMSF-ISCSFANDSSDSTVYLVVGTAFVIPSEDEPKRGRILVF 898
Query: 1134 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW--TGTELNGIAF 1191
A +LVT +K++KG + +L + G LL K+ L KW TG + +
Sbjct: 899 DVAGGA---LHLVT---AKDVKGCVYSLNAFNGKLLAGINSKVNLFKWNLTGDGIRELVS 952
Query: 1192 YDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKD-----FGSLDCFA 1244
+ ++++L + +FI++GD+ +SI L +K + + +A+D ++D
Sbjct: 953 ECSHHGHILTLYLKSRGDFIIVGDLMRSISLLMYKSGTSSIEEIAQDTCPNWVTAVDMLD 1012
Query: 1245 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1304
+ I G + S + ++N++ S + ++L EFHVG + +F ++
Sbjct: 1013 DDVFIGGES-SFNIFTCRRNLE-------ASTDEERKRLEVVGEFHVGEFINQFRAGSLV 1064
Query: 1305 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1364
D ++ + + LFGT +G IG IA L + LQ +Q + + V G
Sbjct: 1065 MKLPDE------QEQPIQPSTLFGTGNGVIGVIARLTRSQYEFLQLVQAAMAKVIKGVGG 1118
Query: 1365 LN 1366
LN
Sbjct: 1119 LN 1120
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 75/318 (23%), Positives = 127/318 (39%), Gaps = 69/318 (21%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
N+R L+ V D F+ GY P +++L++ H L + P
Sbjct: 208 NIR-LEELQVFDIKFLRGYDRPTILVLYQD-------TKETRHVKTYQVLLKEKEFAEGP 259
Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
W+ N+ A L+ V P+GGVL+VG TI YHS SA ++A+ +
Sbjct: 260 --WAQNNVEGGASLLIPVLMPLGGVLIVGEQTITYHSGSAFRSVAMRPAII--------- 308
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
+SV + + LL+ G+L+ + + +D + V + + + + + S +
Sbjct: 309 -KCYSV---------IDTNRFLLADSEGNLLSVLLTHDRQDKVTAIKIDRLGVTSILSCL 358
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
T + N + F GS+ GDS L++ ++E G S L A+
Sbjct: 359 TYLDNGVVFGGSQFGDSQLLRLATE----------RDETGSFVRVLESFSNLGPICDMAV 408
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
D+ + T S A +D G L+ G+ GI +
Sbjct: 409 VDL------------ERQGQCQVVTCSGAFKD-----GSLRVVRNGV---------GIEE 442
Query: 536 QSNYELVELPGCKGIWTV 553
Q+ +ELPG KGIW++
Sbjct: 443 QAT---IELPGIKGIWSL 457
>gi|449328561|gb|AGE94838.1| cleavage and polyadenylation specific factor [Encephalitozoon
cuniculi]
Length = 1156
Score = 80.1 bits (196), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/325 (22%), Positives = 149/325 (45%), Gaps = 30/325 (9%)
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-----KGAISALASLQ 1165
L + T +++GED ARGR+ + + ++ + K L KG+I ++
Sbjct: 853 FLLVCTTFIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVR 912
Query: 1166 GHLLIASGPKIILHKWTGTELNGI---AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
G + + G KI+++K + NGI FYD ++ S+++VKN+IL DI++ + F
Sbjct: 913 GKIALCLGTKIMIYKIDRS--NGIIPIGFYDLH-IFTSSISVVKNYILASDIYRGLSFFF 969
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
++ + +L+L++ + +TE L G+ LS++ D + I + Y+P S G +
Sbjct: 970 FQSKPIRLHLISSSEPLRNATSTELLSTGNELSMLCCDAKGTIHGYTYSPNNIISMDGAR 1029
Query: 1283 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1342
L+ RAE + T+ R + K N +++F + + ++ +D+
Sbjct: 1030 LVKRAE---------------IKTNLGRLSSFGAGFKKN--SIMFYSRSNMLIHVSGIDD 1072
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
+ +L +Q ++ + V GLN R + +S+ H + + +L+ + +
Sbjct: 1073 AHYLKLLGVQTAIMAHLKSVFGLNQRDY--LNSDIHLHSLSLKNPIVLHILNLFSYFDMS 1130
Query: 1403 EQLEIAHQTGTTRSQILSNLNDLAL 1427
Q ++ R +I + L L
Sbjct: 1131 TQESVSSSARIDRKEISDMIASLNL 1155
>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
Length = 599
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+ +F + +D V E KE+KGA+ ++
Sbjct: 282 KDPNTYFIVGTAMVYPEEAEPKQGRIAVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 335
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 336 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 390
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 391 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 447
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 448 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 497
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 498 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 540
>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
Length = 1009
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 692 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 745
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 746 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 800
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N + + +
Sbjct: 801 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNSFVCQKDSAATTDEE 857
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 858 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 907
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 908 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 950
>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
Length = 1146
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 128/291 (43%), Gaps = 40/291 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS------FRQFHSNGKAH 1380
+ L E + L +Q +L + ++P S +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKRCFQISPNSLTDMSTWRSFHTERKTE 1089
>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
Length = 1134
Score = 79.3 bits (194), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/336 (23%), Positives = 152/336 (45%), Gaps = 35/336 (10%)
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED---VAARGRVLLFSTGR 1137
+ + +Q E +++ T N + E +GTA+V ED R+L + G+
Sbjct: 798 QCALQLQDCEWGMSLISCTFEN----DPEAYYCVGTAFVNLEDKEPTKGNIRILKYFEGK 853
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE--LNGIAFYDAP 1195
+ +V+SKE+ GA+ + + G LL + + +++WT + + +F++
Sbjct: 854 --------IQQVHSKEVSGAVYCMVAFNGRLLASVNSTVSVYEWTSNKELVEETSFHNN- 904
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
+ + L +FIL+GD+ +SI +++ ++ L+ K+ A E + D S L
Sbjct: 905 -VLALYLKTKGDFILIGDLMRSISLCAYRPMNNEIELICKNNDPNWMTAVEIIDDDSYLG 963
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQK-LLSRAEFHVGAHVTKFLRLQMLATSS-DRTGA 1313
+ + +F S S + QK L + +HVG V F + ++ ++ D +
Sbjct: 964 -----GENSHNLFTCQKNSSSSEEEQKHLPTVGVYHVGEFVNVFRQGSLVMQNTVDIPDS 1018
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
GS +LFGT+ G++G + L F + ++ KL V V + + +R F
Sbjct: 1019 VQGS-------ILFGTVSGAVGVVVTLAPAMFEFVSAIANKLSTVVKGVGKIEHQFWRSF 1071
Query: 1374 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1409
SN + P S VD +L+ + L E+ +A+
Sbjct: 1072 -SNDRKTEPC-QSFVDGDLVESFLDLSPEDMQRVAN 1105
Score = 47.4 bits (111), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 49/200 (24%), Positives = 91/200 (45%), Gaps = 29/200 (14%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
L+ V D F++G+ EP + +++E + R + + +A + + P W+
Sbjct: 158 LEELQVLDVKFLYGFTEPTIALIYE---SGQNRYLKTYEISLQNA-----DIHRQP--WN 207
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
+ +A+ +L VP P G++V+GA +I Y+ S L+ SL R +
Sbjct: 208 IGKVEEEAFMILPVPPPSCGMVVIGAGSISYYKGQDS----LHITPASLKD-----RITC 258
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----GRVVQRLDLSKTNPSVLTSDIT 416
+D+ +L D + G L +L +V + G V+ L L + + S IT
Sbjct: 259 FGRVDSNGCRYLLGDYS------GRLFMLILVQEHSQSGIKVKDLCLEYLGETSIPSCIT 312
Query: 417 TIGNSLFFLGSRLGDSLLVQ 436
+ N+ ++GS GDS L++
Sbjct: 313 YLDNAFAYIGSSCGDSQLIK 332
>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
mellifera]
Length = 1141
Score = 79.3 bits (194), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 92/380 (24%), Positives = 166/380 (43%), Gaps = 62/380 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
++ +I+ HNL +D H T E +L P +E AL+
Sbjct: 782 DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA V ++ + GR+LL+ S G+ +T+V KE+K
Sbjct: 817 LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY+ S +FIL+G
Sbjct: 869 GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKG---DFILVG 925
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 926 DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 983 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G+IG + + + + L++L+ +L + V + +R F++ K + + +
Sbjct: 1033 TVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090
Query: 1389 DCELLSHYEMLPLEEQLEIA 1408
D +L+ + L ++ E+A
Sbjct: 1091 DGDLIESFLDLSPDKMAEVA 1110
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKADGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501
>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
Length = 1140
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KG + ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGDVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
Length = 1138
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 164/380 (43%), Gaps = 62/380 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E+G +I+ HNL +D H T E L P +E AL+
Sbjct: 779 EIGQEIEVHNLLIIDQH---TFEVLHAHTLMP-----------------TEYALS----- 813
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA + ++ + GR+LL+ S G+ +T+V KE+K
Sbjct: 814 LISTRLGEDPTSYFVVGTALINPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 865
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 866 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 922
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ + E L D + L ++ N+ +
Sbjct: 923 DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 979
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 980 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTLG----------CVLFG 1029
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G+IG + + + L++L+ +L + V + +R F++ K + + +
Sbjct: 1030 TVSGAIGLVTQIPFAFYEFLRNLEDRLNSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1087
Query: 1389 DCELLSHYEMLPLEEQLEIA 1408
D +L+ + L ++ E+A
Sbjct: 1088 DGDLIESFLDLNHDKMAEVA 1107
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYIAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + F+GSRLGDS L++
Sbjct: 312 ECITYLDNGVIFVGSRLGDSQLIKL 336
>gi|224135035|ref|XP_002321967.1| predicted protein [Populus trichocarpa]
gi|222868963|gb|EEF06094.1| predicted protein [Populus trichocarpa]
Length = 60
Score = 78.6 bits (192), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/48 (75%), Positives = 41/48 (85%)
Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
+ DPSTC VSV TP+A +SSKK VS+CTLYHDKGPEP LRKTS +AWL
Sbjct: 1 MTDPSTCMVSVNTPSAFQSSKKSVSACTLYHDKGPEPLLRKTSPNAWL 48
>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
Length = 1135
Score = 78.2 bits (191), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 143/308 (46%), Gaps = 26/308 (8%)
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
K++E + +GTA E GR+ +FS + + +T V +K++ GA+ ++ +L
Sbjct: 821 KKDEQFIVVGTAITADEQECKNGRICVFSYSK-----EEKLTLVSTKQVNGAVYSVKALN 875
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFY--DAP---PLYVVSLNIVKN-FILLGDIHKSIY 1219
G+ +I + I + E+N +AP + V++++ KN FIL D+ +SI
Sbjct: 876 GNKIICA----INQQLKVFEMNEQTTLQSEAPIANHITCVAVDVSKNGFILSADLMRSIS 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
S+K L +A+D+ A + + D + + ++ +NI I + +
Sbjct: 932 VFSYKPLEGALEEIARDYHPNWMTAIKMIDDDNYIG---AENSENIFICTRNTEAPDEED 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q+LL +HVG H+ + ++ + P T F L G++ G +G +A
Sbjct: 989 RQQLLPTGYYHVGEHINTIVEGNLVMDVHVESSITP----TRTF--LMGSVSGYVGLLAI 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
E ++ L L+ K+ + V ++ S+R+F S+ + VD +L+ ++ L
Sbjct: 1043 FPEKQWQFLSKLEAKMRKVIRGVGKIDHESWRRFESDSRME--DCKGFVDGDLIEMFQDL 1100
Query: 1400 PLEEQLEI 1407
E+Q E+
Sbjct: 1101 RPEKQKEV 1108
Score = 49.3 bits (116), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 130/626 (20%), Positives = 228/626 (36%), Gaps = 173/626 (27%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++ + L+ V + L+G + + + + ++D + + E +LE+ D
Sbjct: 43 RIEVNLSTQTGLKPVTEFNLYGRIAVIEVFRY----KNEKKDCLFILTESCYACILEYVD 98
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
G IT + ++ S ++ G VDP+ RC + +Y + I+ +
Sbjct: 99 ---GKIITRA-------YGDMRDKNYSVSQSGMHACVDPEARCIALRLYDGVLKIINLNS 148
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
L E RIE V+ D F+H +P + +L++
Sbjct: 149 SSKHLTSAEQ----------RIEEILVV-----------DMCFLHTANKPTLALLYDDN- 186
Query: 269 TWAGRVSWKH-HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
S +H T I+ + + H + + D ++AVP P+ G+L++G
Sbjct: 187 ------SSRHLSTIAITLDNSGSGASIHKGPFRHTQVEQDTILIVAVPEPLAGILLLGHV 240
Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
I YH ++ N V+ T + L G+L
Sbjct: 241 NITYHDSKNRSTCSIENI----------------VKRTIECVTPIDKHRYLCGDSNGELF 284
Query: 388 LLTVVYDGRVV----QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
LL + Y+ + RL + L + ++ I N + F+GS GDS L++
Sbjct: 285 LLLLDYNENRIPEERMRLATKYLGRTTLPNTLSYIDNYVVFVGSTFGDSELIR------- 337
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
IE + NN S Q S
Sbjct: 338 -------------IEV------------------------------SDNN--SGQHFTSL 352
Query: 504 AVRDSLVNIGPLKDFSY--------GLRINADASATGISKQ--------SNYELVELPGC 547
D+L GP+KD G + A TG S + Y ++L G
Sbjct: 353 HQYDNL---GPIKDMCIVDFEKQGQGQLVTASGVGTGGSLRIIRNGVGIHEYASIDLEGV 409
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMV--LETADLLTEVTESVDY 605
KG+W + + SS S++ + L++S +T+ LE D +TEV E +
Sbjct: 410 KGLWALKYLSS------STKQDS--------LLLSFVGQTIFLRLEGQD-VTEV-EEIPG 453
Query: 606 FVQG-RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS----ENST 660
F G +T+ AGN+ ++ +Q+ E+ R++ ES GS EN+
Sbjct: 454 FTNGEQTMYAGNV-TDQQFLQITEKQVRLI--------------ADESLKGSWEPEENTQ 498
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGD 686
+ S+ VLLG+ +I L + D
Sbjct: 499 INLCSVNKNQVLLGVGSTAIYLEIND 524
>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 78.2 bits (191), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 125/285 (43%), Gaps = 38/285 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1162
K+ T +GTA V E+ + GR+++F S G+ + V KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSGGK--------LQTVAEKEVKGAVYSMV 874
Query: 1163 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 875 EFNGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 930 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986
Query: 1278 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+ Q L FH+G V F L +Q L +S T ++L GT++G
Sbjct: 987 EERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLLGTVNGM 1036
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
IG + L E + L +Q +L + V + +R FH+ K
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
Length = 1138
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 164/380 (43%), Gaps = 62/380 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E+G +I+ HNL +D H + + + ++E AL+
Sbjct: 779 EIGQEIEVHNLLIIDQHTFEVLHAH--------------------TLMATEYALS----- 813
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA++ ++ + GR+LL+ S G+ T+V KE+K
Sbjct: 814 LISTKLGEDPTSYFVVGTAFINPDETEPKMGRILLYHWSEGK--------FTQVAEKEIK 865
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 866 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 922
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ + E L D + L ++ N+ +
Sbjct: 923 DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 979
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 980 AATSEDERQQMQEIGQFHLGDMVNVFRHGSLVMQNLGESSTPTLG----------CVLFG 1029
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G+IG + + + L++++ +L + V + +R F++ K + + +
Sbjct: 1030 TVSGAIGLVTQIPVTFYEFLRNMEDRLNSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1087
Query: 1389 DCELLSHYEMLPLEEQLEIA 1408
D +L+ + L ++ E+A
Sbjct: 1088 DGDLIESFLDLNHDKMAEVA 1107
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFAKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGSQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336
>gi|326919947|ref|XP_003206238.1| PREDICTED: DNA damage-binding protein 1-like [Meleagris gallopavo]
Length = 1079
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1162
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 762 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 813
Query: 1163 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 814 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 868
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 869 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 925
Query: 1278 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+ Q L H+G V F L +Q L +S T + +LFGT++G
Sbjct: 926 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 975
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
IG + L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 976 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLI 1033
Query: 1394 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +M + L+I +G R + +L
Sbjct: 1034 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1068
>gi|224050582|ref|XP_002191856.1| PREDICTED: DNA damage-binding protein 1 [Taeniopygia guttata]
Length = 1140
Score = 77.4 bits (189), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1162
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 874
Query: 1163 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 875 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 930 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986
Query: 1278 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+ Q L H+G V F L +Q L +S T + +LFGT++G
Sbjct: 987 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 1036
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
IG + L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLI 1094
Query: 1394 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +M + L+I +G R + +L
Sbjct: 1095 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
Length = 1140
Score = 77.4 bits (189), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 123/296 (41%), Gaps = 26/296 (8%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ +GTA V E+ + GR+++F D V E KE+KGA+ ++
Sbjct: 823 KDPSVYFVVGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
G LL + + L++WT + + + L +FIL+GD+ +S+ L++K
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECSHYNNIMALYLKTKGDFILVGDLMRSVLLLAYK 936
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
+A+DF A E L D + L ++ N+ + + Q L
Sbjct: 937 PMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEDRQHLQ 993
Query: 1285 SRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
FH+G V F L LQ L +S T + +LFGT+ G IG + L
Sbjct: 994 EVGVFHLGEFVNVFCHGSLVLQNLGETSTPTQGS----------VLFGTVTGMIGLVTSL 1043
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
E L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1044 SEGWHSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--AKGFIDGDLIESF 1097
Score = 47.4 bits (111), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 73/341 (21%), Positives = 127/341 (37%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGD LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDPQLVKLNVDSNDQGSFVTVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 479
>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
sapiens]
Length = 1146
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 76/289 (26%), Positives = 128/289 (44%), Gaps = 40/289 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGL------NPRSFRQFHSNGK 1378
+ L E + L +Q +L + + +P ++R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKRCFLISTCSLTHPSTWRSFHTERK 1087
>gi|45383688|ref|NP_989547.1| DNA damage-binding protein 1 [Gallus gallus]
gi|82098863|sp|Q805F9.1|DDB1_CHICK RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|28375613|dbj|BAC56999.1| damaged-DNA binding protein DDB p127 subunit [Gallus gallus]
gi|53130071|emb|CAG31438.1| hypothetical protein RCJMB04_6h2 [Gallus gallus]
Length = 1140
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1162
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 874
Query: 1163 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 875 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 930 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986
Query: 1278 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+ Q L H+G V F L +Q L +S T + +LFGT++G
Sbjct: 987 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 1036
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
IG + L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLI 1094
Query: 1394 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +M + L+I +G R + +L
Sbjct: 1095 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
Length = 1140
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 129/301 (42%), Gaps = 36/301 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ +GTA V E+ + GR+++F D V E KE+KGA+ ++
Sbjct: 823 KDPSVYFIVGTAMVYPEEPEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L LQ L SS T + +LFGT++G IG
Sbjct: 989 RQHLQEVGVFHLGEFVNVFCHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ L E L LQ +L + V + +R F++ K + +D +L+
Sbjct: 1039 LVTSLSEGWHSLLLDLQNRLNKVIKSVGKIEHSFWRSFYTERKTEQ--ATGFIDGDLIES 1096
Query: 1396 Y 1396
+
Sbjct: 1097 F 1097
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 74/341 (21%), Positives = 130/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVTVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PILDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G +D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRESDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479
>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
Length = 1152
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 129/305 (42%), Gaps = 26/305 (8%)
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T IGTA V E+ + GR+++F + +V KE+KGA+ +L L
Sbjct: 834 TYFIIGTAMVYPEESEPKSGRIIVFQYTDGK------LQQVAEKEVKGAVYSLVQFNNKL 887
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
L + + L +WT + + + + L +FIL+GD+ +S+ L++K
Sbjct: 888 LASINSTVRLFEWTAEKELRVECNHYNNILALYLKTKGDFILVGDLMRSVTLLAYKPMEG 947
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW---KGQKLLS 1285
+A+DF A E L D + L +N F+ K S + + Q L
Sbjct: 948 CFEEIARDFNPNWMSAVEILDDDNFLG------AENSFNFFTCQKDSAATTDEERQHLQE 1001
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGTLDGSIGCIAPLDELT 1344
FH+G V F ++ PG T + ++LFGT++G++G + L
Sbjct: 1002 VGHFHLGEFVNVFRHGSLVMQH-------PGETSTPTQGSVLFGTVNGAVGLVTQLPADF 1054
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
F LQ +Q KL + V + +R F++ K +D +L+ + L ++
Sbjct: 1055 FNFLQEVQSKLTRVIKSVGKIEHSFWRSFNTERKTE--ACQGFIDGDLIESFLDLSRDKM 1112
Query: 1405 LEIAH 1409
E+
Sbjct: 1113 QEVVQ 1117
Score = 48.5 bits (114), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 46/228 (20%)
Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
F+ R+E +VI+++ F++G P +V +++ H +
Sbjct: 154 AFNIRLEELNVIDVK-----------FLYGCQVPTVVFVYQ-----------DPHGRHVK 191
Query: 285 ALSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
IS K+ W N+ +A ++AVP P G L++G +I YH+ A+A
Sbjct: 192 TYEISVRDKEFSKGPWKQDNVETEASMVIAVPEPFCGSLIIGQESITYHNGDKYVAVA-- 249
Query: 344 NYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV 397
+ +S+ +DA + +L D++ G L +L + + DG V
Sbjct: 250 --------PPAIKQSTLICHGRVDANGSRYLLGDMS------GRLFMLLLEKEELIDGSV 295
Query: 398 -VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
V+ L + + + +T + N + +LGSRLGDS L++ + S
Sbjct: 296 TVKDLKVELLGETSIAECLTYLDNGVVYLGSRLGDSQLIKLNVDADDS 343
>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
Length = 1128
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 136/299 (45%), Gaps = 27/299 (9%)
Query: 1113 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
A+GTA + E+ + GR+LLF +T+V KE+KG L G LL +
Sbjct: 819 AVGTAILNPEESEPKQGRILLFHWCEGK------LTQVAEKEIKGGCYTLVEFNGKLLAS 872
Query: 1172 SGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
+ L +WT + L F + LY L + +FIL+GD+ +S+ L +K+
Sbjct: 873 INSTVRLFEWTSEKELRLECSHFNNIVALY---LKVKGDFILVGDLMRSMSLLQYKQMEG 929
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
+A+D+ A E L D + L ++ N+ + + + Q++ +
Sbjct: 930 SFEEIARDYSPNWMTAVEILDDDTFLG---AENSFNLFVCQKDSAATTDEERQQMGYMGQ 986
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1348
FHVG V R ++A +D AAP + +L T+ G+I + L + F L
Sbjct: 987 FHVGDMVNVMRRGALVAQLADT--AAPVAR-----PVLLATVSGAICLVVQLSQELFDFL 1039
Query: 1349 QSLQKKLVDSVPHVAGLNPRSF-RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
L+++L ++ V + P SF R F+++ K P + +D +L+ + L + Q E
Sbjct: 1040 HQLEERLTHTIKSVGKI-PHSFWRSFNTDIKTE-PA-EGFIDGDLIESFLDLSRDMQQE 1095
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 102/460 (22%), Positives = 176/460 (38%), Gaps = 107/460 (23%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G L +DPQ R G+ +Y I+ + + L S R+E +N+
Sbjct: 120 GILAVIDPQARVIGLRLYDGLFKIIPLDKDSTEL----------KAASLRLEE---LNVY 166
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
DL+ F+HG P ++++H+ ++ +H I I+ K+ I
Sbjct: 167 DLE--------FLHGCSNPTLILIHQD-------LNGRH----IKTHEINLRDKEFMKIP 207
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSP+GG +V+G +I YH + A+A ++
Sbjct: 208 WKQDNVETEASILIPVPSPLGGAIVIGQESIVYHDGQSYVAVAPPQIKTPINC------- 260
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----VVQRLDLSKTNPSVLTSD 414
+D +L D+A G L +L + R V+ L + +
Sbjct: 261 --YCRVDVRGLRYLLGDIA------GRLFMLLLELSERDGTASVRDLKVELLGDIPIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + N + F+GSRLGDS LV+ + DA + + + +
Sbjct: 313 MTYLDNGVVFVGSRLGDSALVRLAA-----------------VRDDASQYVQPMETFT-S 354
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L N + F +G L+ G+ I AS
Sbjct: 355 LAPIVDMCVVDLERQGQNQLITCSGAF---------KMGSLRIIRNGIGIQEQAS----- 400
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + + G +H L++S +T VL
Sbjct: 401 -------IDLPGIKGMWAL----TLGQGP-----------HHDTLVLSFVGQTRVLTLNG 438
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQV + G R++
Sbjct: 439 EEVEETEIKGFVSDRQTFFTGNVC-HDQLIQVTDEGIRLI 477
>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
Length = 1143
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 37/286 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF---RQFHSNGK 1378
+ L E + L +Q +L + V + + R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERK 1084
>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
Length = 278
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 116/250 (46%), Gaps = 22/250 (8%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIV 1205
VY KE+KGA ++ S+ G L++A + L +WT + L F + LY+ + N
Sbjct: 15 VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKTKN-- 72
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1265
+ IL+GD+ +S+ LS+K + +A+DF + A E + + L + +
Sbjct: 73 -DLILVGDLMRSLSLLSYKSMESTFEKVARDFMTNWMSACEIIDSDNFLG-----AENSY 126
Query: 1266 QIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1323
+F +K G +L F++G V F + AT D AP
Sbjct: 127 NLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYHS 179
Query: 1324 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1383
++L+GT DG IG I + + + LQ +QK+L + + ++ +R F + ++
Sbjct: 180 SILYGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSE--A 237
Query: 1384 PDSIVDCELL 1393
P+ +D +L+
Sbjct: 238 PNGFIDGDLI 247
>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
Length = 1143
Score = 76.6 bits (187), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 37/286 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF---RQFHSNGK 1378
+ L E + L +Q +L + V + + R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERK 1084
>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
[Strongylocentrotus purpuratus]
Length = 630
Score = 76.6 bits (187), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/303 (24%), Positives = 132/303 (43%), Gaps = 30/303 (9%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+G A V ++ + GR+++F S G+ + E+ KE+KGA +L G LL
Sbjct: 322 VGLANVHPDEAEPKSGRIVVFQYSDGK--------LQEIAEKEIKGAPYSLVEFNGKLLA 373
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
+ + L +WT + + + L +FI++GD+ +SI L++K L
Sbjct: 374 SVNSVVRLFEWTPEHSLRVECSHYNNVLALYLKTKGDFIVVGDLMRSITLLAYKPMEGCL 433
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1290
+A+D+ A E L D + L ++ N+ + + + L FH
Sbjct: 434 EEIARDYSPNWMSAVEILDDDTFLG---AENSSNLFTCQKDSAATTDEERRHLQEVGLFH 490
Query: 1291 VGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1346
+G V F L +Q + S+ T + +LFGT+ GS+G + L+E +R
Sbjct: 491 LGEFVNVFRHGSLVMQNIGESTIPTTGS----------VLFGTVSGSVGLVTQLNEEFYR 540
Query: 1347 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
L +Q KL + V + +R F+S K D+ +D +LL + L + E
Sbjct: 541 FLLEVQNKLTKVIKSVGKIKHSFWRSFYSERKTE--PMDNFIDGDLLESFLDLSRDTMDE 598
Query: 1407 IAH 1409
+A
Sbjct: 599 VAQ 601
>gi|387593561|gb|EIJ88585.1| hypothetical protein NEQG_01275 [Nematocida parisii ERTm3]
gi|387597215|gb|EIJ94835.1| hypothetical protein NEPG_00359 [Nematocida parisii ERTm1]
Length = 1261
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/251 (23%), Positives = 126/251 (50%), Gaps = 26/251 (10%)
Query: 1055 LHRTYTVEEY---EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ET 1110
L R Y+++ Y E+++ R G ++++E ++VTL + E
Sbjct: 894 LTRAYSLKIYSHEEMKVC--SRTEGVLMAVDEYRLENNEYIAYHKIVTLPDKQNTEGFSE 951
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQ 1165
+ + T Y+ ED+ ARGR+++ R+ ++ + + +++ KGA + ++
Sbjct: 952 FVIVCTTYITDEDLMARGRLIVLEIASVVPQRDRIETRHKLKALAAEKTKGATTCCDIVK 1011
Query: 1166 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
G++++ G K++++ + E L +AF+D +++ S +++N I+ GD +K + L ++
Sbjct: 1012 GNIVVCVGTKLMIYMFDRNEGLRAVAFHDIH-VFLTSCMVMRNIIVCGDAYKGTFLLFYQ 1070
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDG-------STLSLVVSDEQKNIQIFYYAPKMSES 1277
L++L++ G + +L+ G + LSL+ D K + I+ Y+P+ S
Sbjct: 1071 SDPPLLHMLSQSSGGV------YLLKGIGMTLHDTALSLISYDSLKTVCIYTYSPQHILS 1124
Query: 1278 WKGQKLLSRAE 1288
G +L+SR E
Sbjct: 1125 QDGSRLISRGE 1135
>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
Length = 1124
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 128/285 (44%), Gaps = 36/285 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 805 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 858
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 859 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 913
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 914 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 970
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 971 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1020
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAG--LNPRSFRQFHSNGK 1378
+ L E + L +Q +L + ++ + ++R FH+ K
Sbjct: 1021 LVTSLSESWYNLLLDMQNRLNKVIKNITHSLTHLSTWRSFHTERK 1065
>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
Length = 836
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/326 (25%), Positives = 141/326 (43%), Gaps = 50/326 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+S+L +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSILRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTNPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ GF ++S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIPL------------------GFD---KTSYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL---KQHPLIWSAMNL 304
I + GY P + LH + TW GR V+ T I LS+ + KQ + +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIVLSLEPDVIHEKQDIVATVSYQF 228
Query: 305 PHDAYKLLAVPS--PIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF 360
+ Y L + P G+L++ N+I Y S ++ S L + V + + P SSF
Sbjct: 229 SYVPYNALQIVDCYPTNGILILTINSIIYLSTTSFESFILPFGKFFV-IPKNNNRPLSSF 287
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSVL 411
+ T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 288 QI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYKY 344
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 345 HPTTFTISSNHLFIGSTVHDSYIYNY 370
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
K+ + L +G ED +G+ +F N +N L+ ++ + K ++ A+ +
Sbjct: 519 KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572
Query: 1166 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1213
G L +ASG ++ L +W + I+ + PL V+ K ILL D
Sbjct: 573 GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMEKGNEKECYLILLSD 632
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
++S+ L +K + L KD ++ C + F+I S++ D ++N+ + Y+
Sbjct: 633 FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+E ++ A F++G ++ KF RL G + ++ T++GS
Sbjct: 693 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+G I+ ++E ++ L+ + K+ H AG N +R G +D ++L
Sbjct: 735 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794
Query: 1394 SHYEMLPLEEQ 1404
+ +L E+Q
Sbjct: 795 KQFRLLNEEQQ 805
>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
Length = 1141
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 159/368 (43%), Gaps = 62/368 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
++ +I+ HNL +D H T E +L P +E AL+
Sbjct: 782 DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA V ++ + GR+LL+ S G+ +T+V KE+K
Sbjct: 817 LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY L +FIL+G
Sbjct: 869 GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFILVG 925
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 926 DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 983 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G+IG + + + L++L+++L + V + +R F++ K + + +
Sbjct: 1033 TVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090
Query: 1389 DCELLSHY 1396
D +L+ +
Sbjct: 1091 DGDLIESF 1098
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 88/420 (20%), Positives = 160/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSR GDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRFGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +++ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIIDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501
>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
Length = 1141
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 159/368 (43%), Gaps = 62/368 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
++ +I+ HNL +D H T E +L P +E AL+
Sbjct: 782 DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA V ++ + GR+LL+ S G+ +T+V KE+K
Sbjct: 817 LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY L +FIL+G
Sbjct: 869 GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFILVG 925
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 926 DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 983 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G+IG + + + L++L+++L + V + +R F++ K + + +
Sbjct: 1033 TVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090
Query: 1389 DCELLSHY 1396
D +L+ +
Sbjct: 1091 DGDLIESF 1098
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 90/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501
>gi|166158025|ref|NP_001107422.1| damage-specific DNA binding protein 1, 127kDa [Xenopus (Silurana)
tropicalis]
gi|157422734|gb|AAI53474.1| Zgc:63840 protein [Danio rerio]
gi|163916541|gb|AAI57552.1| LOC100135265 protein [Xenopus (Silurana) tropicalis]
Length = 306
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 121/279 (43%), Gaps = 35/279 (12%)
Query: 1127 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG--- 1183
+GR+++F D V E KE+KGA+ ++ G LL + + L++WT
Sbjct: 11 QGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKE 64
Query: 1184 --TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
TE N + + LY L +FIL+GD+ +S+ L++K +A+DF
Sbjct: 65 LRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPNW 119
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF--- 1298
A E L D + L ++ N+ + + + Q L FH+G V F
Sbjct: 120 MSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHG 176
Query: 1299 -LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1357
L LQ L SS T + +LFGT++G IG + L E + L LQ +L
Sbjct: 177 SLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNK 226
Query: 1358 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ V + +R FH+ K + +D +L+ +
Sbjct: 227 VIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 263
>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
Length = 1043
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 128/283 (45%), Gaps = 33/283 (11%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1140
+T P+ + E ++ L + + ++ +GTAYV E+ +GR+L+F D
Sbjct: 759 STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 811
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1195
L+ E KE KGA+ +L + G LL A KI L+KW GT EL + +
Sbjct: 812 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 868
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
L + + +FI++GD+ KSI L +K + + A+D+ + A E L D L
Sbjct: 869 IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 927
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1312
+ N IF K SE + +L E+H+G V +F ++
Sbjct: 928 -----AENNFNIF-TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 975
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1355
P SD ++FGT++G IG IA L + L+ LQ L
Sbjct: 976 -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANL 1017
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P W+ NL + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL G L LL + ++ V L +
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ ++ DA
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421
Query: 584 EARTMVL 590
+ T +L
Sbjct: 422 ISETRIL 428
>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
Length = 1175
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/352 (21%), Positives = 151/352 (42%), Gaps = 37/352 (10%)
Query: 1094 TVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKE 1153
++RV+ L T L+ G ED +G V LFS ++ ++ V
Sbjct: 844 SLRVLEL--TINSVKRVLVGCGVNTQTTEDDPVKGNVFLFSLESTSEGTIRHISTVCDG- 900
Query: 1154 LKGAISALASLQGHLLIASGPKIILHK------WTGTELNGIAFYDAPPLYVVSLNIVKN 1207
K A+ A+ S+ G+L +A G ++ + K W + I+ + + + + KN
Sbjct: 901 -KKAVHAINSIGGYLAVAEGNELQILKGKTESLWVKKCFSDISIL-INTITFLPMTLSKN 958
Query: 1208 -------FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
ILL D+++S+ L ++ Q + L KD + F++D ++ D
Sbjct: 959 KVDEMCYLILLNDMYRSVILLLFQPQKKSVIPLGKDGRDIHAIDAAFVLDKDYFHVLEID 1018
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1320
++N+ + Y +E+ + A F+VG + + RL++
Sbjct: 1019 YERNLSVMNYL--RTETERISIFEVAATFNVGVDILRLTRLRL----------------G 1060
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
N + ++ + GS+G + ++E +++ L+ + K+ H AG NP FR G +
Sbjct: 1061 NGYVFVYLSAQGSVGYLTVVNERSYQTLRQINAKMNREPWHFAGTNPEEFRMEKGYGVGY 1120
Query: 1381 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
I+D ++L + L E+Q + + T+ S +++ L++ +S L
Sbjct: 1121 GRRKQVILDGDILKEFHFLTQEQQKRVCLR-NTSISDVVNILDNALQRSSLL 1171
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 82/363 (22%), Positives = 153/363 (42%), Gaps = 48/363 (13%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
+IL F+ A++SV+ ++ + + S+HCFE PE ++ + P + +D +GRC
Sbjct: 74 LILLFKQARLSVMRYNTETNRFVVHSLHCFEYPELRIREKCTPTAYDDPRMFIDKKGRCI 133
Query: 193 GVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFV 252
+L Y + ++ GS SS+ ++L + + D I +
Sbjct: 134 SLLCYDRLLWVIP--------------LGSN-------RSSYRVDLEKFGVSRIVDVISL 172
Query: 253 HGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL--KQHPLIWSAMN----LP 305
GY P + LH TW R V+ T I+ ++++ + ++ + +N LP
Sbjct: 173 SGYETPTLAFLHMTVPTWDARTVNTGEATNEIAIINVNPGVVGEEEQECANVVNRISRLP 232
Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE----------L 355
++ K++ P+ G+L++ + ++ Y S ++S + L + + + L
Sbjct: 233 YNTLKMVEC-YPLPGILLLASVSVLYISTTSSESFIL-PFGTYFNPPEVWKGVVPFLKLL 290
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
P ++L + QN + L T GD + + +VQ + LS P +
Sbjct: 291 PMKIRIIQLVKSIHQLSQN-LYLTFTDKGDSYYIHLNCVEGIVQEIVLSNA-PYKFIPNT 348
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
++ + FLGS DS L +T C G G + FG DA K L+ S
Sbjct: 349 VSLYDDYIFLGSVFHDSYLFNYTICEYG-----KGDIKPFGIHCGDAVRIKNLQERSGQM 403
Query: 475 LQD 477
+D
Sbjct: 404 EED 406
>gi|119594339|gb|EAW73933.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_a [Homo
sapiens]
Length = 1094
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 132/296 (44%), Gaps = 38/296 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKL---VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
+ L E + L +Q +L + SV + +P F +G+ ++P S V
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSHPPG-DPFTPSGRQNQPQVSSTV 1093
>gi|297267724|ref|XP_001082958.2| PREDICTED: DNA damage-binding protein 1 [Macaca mulatta]
Length = 1092
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 125/286 (43%), Gaps = 38/286 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ L E + L +Q +L + V + FH +HR
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIE----HSFHLEILSHR 1080
>gi|221040048|dbj|BAH11787.1| unnamed protein product [Homo sapiens]
Length = 1092
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 125/286 (43%), Gaps = 38/286 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ L E + L +Q +L + V + FH +HR
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIE----HSFHLEILSHR 1080
>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Apis florea]
Length = 1141
Score = 75.1 bits (183), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 164/380 (43%), Gaps = 62/380 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
++ +I+ HNL +D H T E +L P +E AL+
Sbjct: 782 DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA V ++ + GR+LL+ S G+ +T+V KE K
Sbjct: 817 LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEXK 868
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY+ S +FIL+G
Sbjct: 869 GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKG---DFILVG 925
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 926 DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ + Q++ +FH+G V F L +Q L SS T +L G
Sbjct: 983 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLXG 1032
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+ G+IG + + + + L++L+ +L + V + +R F++ K + + +
Sbjct: 1033 TVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090
Query: 1389 DCELLSHYEMLPLEEQLEIA 1408
D +L+ + L ++ E+A
Sbjct: 1091 DGDLIESFLDLSPDKMAEVA 1110
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKTDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501
>gi|324501533|gb|ADY40680.1| Splicing factor 3B subunit 3 [Ascaris suum]
Length = 1214
Score = 74.7 bits (182), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 129/290 (44%), Gaps = 37/290 (12%)
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
++A+ +G L+ G KI L+ +L P+ VV + + I++ D +S
Sbjct: 946 VNAVHDFRGMALVGVGKKIRLYDLGKKKLLAKCENKQLPVQVVDIRSMGQRIVVSDSQES 1005
Query: 1218 IYFLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
++F+ +K+Q QL++ D F + C ++D T++ V D +I +
Sbjct: 1006 LHFMRYKKQDNQLSIFCDDTSPRFVTCIC-----ILDYDTVA--VGDRFGSIAVLRLPKG 1058
Query: 1274 MSESWKGQKLLSRAEFHVG---------AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
++E + RA + G H+ +F + TS +T PG++
Sbjct: 1059 VTEEVQEDPTGVRALWDRGNLNGASQKVEHIGQFY-VGDTVTSMQKTSLVPGAND----C 1113
Query: 1325 LLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
L++ T+ G+IG + P DE F Q+L+ L P + G + +FR F+ K
Sbjct: 1114 LVYTTISGTIGMLVPFVSRDEFDF--FQNLEMHLRVEFPPLCGRDHLAFRSFYFPVKC-- 1169
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
I+D +L Y ++PL++Q +A + G ++I L D+ +F
Sbjct: 1170 -----IIDGDLCEQYALMPLDKQKAVAEELGRKPAEIHKKLEDIRTRYAF 1214
>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
C-169]
Length = 1205
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 119/496 (23%), Positives = 203/496 (40%), Gaps = 76/496 (15%)
Query: 964 NCNHGFIYVTSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
C GF V S+ +L+I L G ++ Q+V L+ TP + E N+ + +
Sbjct: 747 QCPEGFCAV-SKSMLRILTLERLGEAFN-----QQVTRLRYTPRKFVVHPESNMLIVAEA 800
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--------------VRI 1068
PL + ++ E G ++D ++ EE + +R+
Sbjct: 801 DHAAVPLAERRAV----EDGMEMDAALTEGIEFDEERAAEEEQHGAPKNSTGRWASCIRV 856
Query: 1069 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG----EDV 1124
L+P QT + + + +E A+++ ++ N E +LA+GT VQG
Sbjct: 857 LDPTS----LQTSSVLELDGNEAAVSLCLLRFSNW--PEEGMVLAVGT--VQGLAFYPRT 908
Query: 1125 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI-SALASLQGHLLIASGPKIILHKWTG 1183
A G + L+ R D+ + L E+ K G I ALA+ +G LL GP + +++
Sbjct: 909 ADEGYIRLY---RFRDSGRQL--ELIHKTPTGGIPGALAAFKGRLLAGVGPTLRIYEAGK 963
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L + P ++ +L + I +GD+ +S+++ +K L A D
Sbjct: 964 KKLLRKCEHRKLPTHIATLATSGDRIFVGDLQESMHYFRYKANENALYEYADDIAPRHLT 1023
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFY------YAPKMSESWKGQKLLSRAEFHVGA---- 1293
A +D T V+ K IF + ++ E G K A GA
Sbjct: 1024 AA-LPLDYDT----VAGADKFCNIFVTRLPRDVSTQVEEDPTGGKFAGAAGLLNGAPHKL 1078
Query: 1294 -HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQ 1349
V F + L TS R PG R LL+ T+ G+IG + P +++ F
Sbjct: 1079 EDVVNF-HVGDLVTSLQRAVLQPG----GREVLLYATVMGAIGAMLPFPSREDVDF--FS 1131
Query: 1350 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1409
L+ L P + G + S+R ++ P D ++D +L H+ LP +Q IA
Sbjct: 1132 HLEMHLRQEHPPMGGRDHMSYR------GSYFPVKD-VIDGDLCEHFSQLPAAKQKSIAD 1184
Query: 1410 QTGTTRSQILSNLNDL 1425
+ T +IL L D+
Sbjct: 1185 ELERTPGEILKKLEDI 1200
>gi|378755148|gb|EHY65175.1| hypothetical protein NERG_01621 [Nematocida sp. 1 ERTm2]
Length = 822
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 114/212 (53%), Gaps = 9/212 (4%)
Query: 1085 PMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1138
P++ +E ++VTL + E + + + T Y+ ED+ ARGR+++ R+
Sbjct: 486 PLEDNEYIAHHKIVTLPDKQNTEGVSEFVIVCTTYITDEDLMARGRLIVLEIASVVPQRD 545
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPL 1197
++ + + +++ KGA + ++G++++ G K++++ + E L +AF+D +
Sbjct: 546 RIETRHKLKALAAEKTKGATTCCDIVKGNIVVCVGTKLMIYMFDRNEGLRAVAFHDIH-V 604
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF-ATEFLIDGSTLSL 1256
++ S +++N I+ GD +K + L ++ + + L+LL++ G + + GS LSL
Sbjct: 605 FLTSCMVMRNIIVCGDAYKGTFLLFYQSEPSLLHLLSQSSGGVYLLKGIGMTLYGSVLSL 664
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
+ D K + I+ Y+P+ S G +L+SR E
Sbjct: 665 LSYDSAKTVCIYSYSPQHILSQGGTRLISRGE 696
>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
Length = 1148
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 95/398 (23%), Positives = 166/398 (41%), Gaps = 64/398 (16%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E G +++ HNL +D H + ++ MQS E AL++
Sbjct: 789 EFGQEVEVHNLLVIDQHTFEVLHAHQF-------------------MQS-EYALSLISTK 828
Query: 1100 LFNTTTKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI 1158
L + + T +GTA V E + +GR+L+F + + +V KE+KGA
Sbjct: 829 LGD----DPNTYYIVGTAMVNPDESESKQGRILIFQF------QEGKLYQVAEKEIKGAA 878
Query: 1159 SALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
+L G LL + + L +WT + L F + LY L +FIL+GD+
Sbjct: 879 YSLVEFNGKLLASINSTVRLFEWTAEQELRLECSHFNNIISLY---LKTKGDFILVGDLI 935
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+S+ L +K +A+D A E + D + L ++ N+ + +
Sbjct: 936 RSMTLLQYKTMEGCFEEMARDHNPNWMTAVEIIDDDTFLG---AENSFNLFVCQKDSAAA 992
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKF----LRLQMLA-TSSDRTGAAPGSDKTNRFALLFGTL 1330
+ Q++ + FH+G V F L +Q + TS+ TG +LFGT+
Sbjct: 993 TDEERQQMHAVGMFHLGDMVNVFRHGSLVMQNVGETSTPTTGC-----------ILFGTV 1041
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
G+IG + + + L L+ KL + + V + +R F + K D +D
Sbjct: 1042 SGAIGLVTQISANFYNFLHELECKLTEVIKSVGKIKHSFWRSFTTEIKTEP--CDGFIDG 1099
Query: 1391 EL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+L LSH +M + L+I + +G + + +L
Sbjct: 1100 DLIESFLDLSHEKMKEVAAGLQIDNGSGMKQEATVDDL 1137
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/276 (21%), Positives = 116/276 (42%), Gaps = 61/276 (22%)
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
G++S G + +DP+ R G+ +Y + I+ + S L S R+E
Sbjct: 113 GKQS-ETGIIAVIDPEARVIGLRLYDGLLKIIPLGKDNSELKAS----------SIRMEE 161
Query: 233 SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
V +D F+HG P ++++H+ ++ +H + IS
Sbjct: 162 VEV-----------QDLNFLHGCQNPTIILIHQD-------INGRH----VKTHEISLRD 199
Query: 293 KQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---LNNYAVS 348
K+ + W N+ DA ++ VP P+ G +++G +I YH+ + A+A +N ++
Sbjct: 200 KEFVKMPWKQDNVEPDASIVIPVPEPLCGAIIIGQESILYHNGAGYVAVAPPVINQSTIT 259
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-------VQRL 401
+ ++D+ + +L D+A G L +L + + ++ L
Sbjct: 260 CYT-----------QVDSNGSRYLLGDMA------GHLFMLLLETEEKIDGTPCVKENGL 302
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ + IT + N + F+GSR GDS LV+
Sbjct: 303 KVELLGEISIPEAITYLDNGVLFIGSRCGDSQLVKL 338
>gi|443922899|gb|ELU42250.1| splicing factor 3B subunit 3 [Rhizoctonia solani AG-1 IA]
Length = 1212
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 116/487 (23%), Positives = 211/487 (43%), Gaps = 50/487 (10%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP 1024
C G I + +L+I Q+P S Q +PL TP +I E L+ +I S
Sbjct: 753 CPEGLIGIAGS-VLRIFQIPKLSDKLK----QVTMPLSYTPRKIAVHPEHQLFYVIESDH 807
Query: 1025 VL---KPLNQVLSLLIDQEVGHQIDNH--NLSSVD--LHRTYTVEEYE-VRILEPDRAGG 1076
+ N+ L+ L+ G QID +L + D L R + +RI++P
Sbjct: 808 RTWGSEAKNKRLAELV--RAGRQIDQELVDLPAEDFGLPRAGAGQWASCIRIIDPTEVFC 865
Query: 1077 PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
T I + ++E+A +V VV +ENE L +GTA + +V R V ++G
Sbjct: 866 S-ATLFKIELDNNESAFSVAVVPF---AARENELFLVVGTA--KDTNVLPRQCVGAVTSG 919
Query: 1137 R--NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
+++T E+ AL ++G L G + +++ +L +
Sbjct: 920 SLVKLGWSTHILTRPIQTEVDDVPLALLGIKGRLCAGVGKALRIYEMGKKKLLRKSENKG 979
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
+V+L + I++G++ +S+++ ++K + +L + A D S + L+D T+
Sbjct: 980 FATAIVTLTSQGSRIIVGEMQESVHYATYKPESNRLLVFADD-TSARWVTSAALVDYDTV 1038
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEF-HVGAHVTKFL---RLQMLAT 1306
+ V D+ NI + +S+ G ++ EF H H TK L + + T
Sbjct: 1039 A--VGDKFGNIFVNRLPANISQQVDDDPTGAGIMHEREFLHGAPHKTKLLAHYNVGDIVT 1096
Query: 1307 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1363
S R PG R + + L G+IG + PL +++ F + +L++ + +
Sbjct: 1097 SVHRAALVPG----GRDVVAYTGLHGTIGVLIPLASKEDVDF--ITTLEQHMRSEHSSLV 1150
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
G + ++R ++ KA +VD +L + MLP +Q IA + T ++L L
Sbjct: 1151 GRDHLAYRGYYVPVKA-------VVDGDLCERFAMLPSTKQKSIAGELDRTVGEVLKKLE 1203
Query: 1424 DLALGTS 1430
L + S
Sbjct: 1204 GLRVAGS 1210
>gi|297820284|ref|XP_002878025.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
lyrata]
gi|297323863|gb|EFH54284.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
lyrata]
Length = 454
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 88/381 (23%), Positives = 161/381 (42%), Gaps = 57/381 (14%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+RIL+P A T + +Q +E A +V V N KE TLLA+GT V+G
Sbjct: 103 IRILDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 153
Query: 1126 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ R + ++L ++ +++G AL QG LL GP + L+
Sbjct: 154 PKKNLVAGFIHIYRFVEEGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 212
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1236
L P ++S+ ++ I +GDI +S ++ ++ QL + A D
Sbjct: 213 KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 272
Query: 1237 ---------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
D F + + LS + ++ +I + K++ + K+
Sbjct: 273 ASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGA--PNKVDEI 330
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1343
+FHVG VT + M+ S+ ++++GT+ GSIG + D++
Sbjct: 331 VQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDDV 376
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 377 DF--FSHLEMHMRQEYPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 427
Query: 1404 QLEIAHQTGTTRSQILSNLND 1424
Q +IA + T ++IL L D
Sbjct: 428 QRKIADELDRTPAEILKKLED 448
>gi|255588145|ref|XP_002534515.1| spliceosomal protein sap, putative [Ricinus communis]
gi|223525135|gb|EEF27867.1| spliceosomal protein sap, putative [Ricinus communis]
Length = 1214
Score = 73.6 bits (179), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 93/387 (24%), Positives = 168/387 (43%), Gaps = 67/387 (17%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1121
+R+L+P A T + +Q +E A +V V N KE+ TLLA+GTA +
Sbjct: 863 IRVLDPRTAA----TTCLLELQDNEAAFSVCTV---NFHDKEHGTLLAVGTAKGLQFWPK 915
Query: 1122 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
++A G + ++ + D+ + L ++ +++G AL+ QG LL GP + L+
Sbjct: 916 RSLSA-GFIHIY---KFVDDGRALEL-LHKTQVEGVPLALSQFQGRLLAGIGPVLRLYDL 970
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----- 1236
L P +VS+ ++ I +GDI +S +F ++ QL + A D
Sbjct: 971 GKKRLLRKCENKLFPNSIVSIQTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDCVPRW 1030
Query: 1237 -----------FGSLDCFATEFLIDGSTLSLVVSDEQKNI----QIFYYAPKMSESWKGQ 1281
D F + + L VSDE + +I + K++ +
Sbjct: 1031 LTASHHVDFDTMAGADKFGNIYFV---RLPQDVSDEIEEDPTGGKIKWEQGKLNGA--PN 1085
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL- 1340
K+ +FH+G VT + ++ PG + +++GT+ GS+G + P
Sbjct: 1086 KVEEIVQFHIGDVVTSLSKASLI----------PGGGE----CIIYGTVMGSVGALLPFT 1131
Query: 1341 --DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
D++ F L+ L P + G + ++R A+ P D ++D +L +
Sbjct: 1132 SRDDVDF--FSHLEMHLRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPT 1182
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNLNDL 1425
LPL+ Q +IA + T +IL L ++
Sbjct: 1183 LPLDAQRKIADELDRTPGEILKKLEEV 1209
>gi|18410222|ref|NP_567015.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
gi|18410226|ref|NP_567016.1| putative splicing factor [Arabidopsis thaliana]
gi|7019653|emb|CAB75754.1| spliceosomal-like protein [Arabidopsis thaliana]
gi|7019655|emb|CAB75756.1| spliceosomal-like protein [Arabidopsis thaliana]
gi|332645831|gb|AEE79352.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
gi|332645833|gb|AEE79354.1| putative splicing factor [Arabidopsis thaliana]
Length = 1214
Score = 73.6 bits (179), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 87/381 (22%), Positives = 162/381 (42%), Gaps = 57/381 (14%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P A T + +Q +E A +V V N KE TLLA+GT V+G
Sbjct: 863 IRVLDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 913
Query: 1126 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ R ++ ++L ++ +++G AL QG LL GP + L+
Sbjct: 914 PKKNLVAGFIHIYRFVEDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 972
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1236
L P ++S+ ++ I +GDI +S ++ ++ QL + A D
Sbjct: 973 KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 1032
Query: 1237 ---------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
D F + + LS + ++ +I + K++ + K+
Sbjct: 1033 ASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGA--PNKVDEI 1090
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1343
+FHVG VT + M+ S+ ++++GT+ GSIG + D++
Sbjct: 1091 VQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDDV 1136
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 1137 DF--FSHLEMHMRQEYPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPTLPMDL 1187
Query: 1404 QLEIAHQTGTTRSQILSNLND 1424
Q +IA + T ++IL L D
Sbjct: 1188 QRKIADELDRTPAEILKKLED 1208
>gi|297816810|ref|XP_002876288.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322126|gb|EFH52547.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/381 (22%), Positives = 161/381 (42%), Gaps = 57/381 (14%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P A T + +Q +E A +V V N KE TLLA+GT V+G
Sbjct: 282 IRVLDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 332
Query: 1126 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ R + ++L ++ +++G AL QG LL GP + L+
Sbjct: 333 PKKNLVAGFIHIYRFVEEGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 391
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1236
L P ++S+ ++ I +GDI +S ++ ++ QL + A D
Sbjct: 392 KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 451
Query: 1237 ---------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
D F + + LS + ++ +I + K++ + K+
Sbjct: 452 ASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGA--PNKVDEI 509
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1343
+FHVG VT + M+ S+ ++++GT+ GSIG + D++
Sbjct: 510 VQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDDV 555
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 556 DF--FSHLEMHMRQEYPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 606
Query: 1404 QLEIAHQTGTTRSQILSNLND 1424
Q +IA + T ++IL L D
Sbjct: 607 QRKIADELDRTPAEILKKLED 627
>gi|242089089|ref|XP_002440377.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
gi|241945662|gb|EES18807.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
Length = 1783
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 151/373 (40%), Gaps = 59/373 (15%)
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+ + HQ + L+ +VEE E +R+L+ +++ P+ E ++
Sbjct: 717 RRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQT----FESLCVYPLDQYEFGCSI 772
Query: 1096 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1154
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE
Sbjct: 773 ISCSF----ADDNNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSLQLIVE---KET 822
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFI 1209
KGA+ +L + G LL A KI L+KW EL + L + + +FI
Sbjct: 823 KGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYT-QTRGDFI 881
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLL-----------------------AKDFGSLDCFATE 1246
++GD+ KSI L +K + L A+D+ + A E
Sbjct: 882 VVGDLMKSISLLVYKVVPLTVCLTHIVLSVIFFVSLFVVLESAIEERARDYNANWMTAVE 941
Query: 1247 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1306
L D V ++ N+ + + +L E+H+G V +F ++
Sbjct: 942 MLDDE---VYVGAENGYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR 998
Query: 1307 SSD-RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
D G P ++FGT++G IG IA L + L+ LQ LV + V L
Sbjct: 999 LPDSEIGQIP--------TVIFGTINGVIGIIASLPHDQYVFLEKLQSTLVKYIKGVGNL 1050
Query: 1366 NPRSFRQFHSNGK 1378
+ +R FH++ K
Sbjct: 1051 SHEQWRSFHNDKK 1063
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 108/464 (23%), Positives = 189/464 (40%), Gaps = 112/464 (24%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D + +A E K VL++D L +M + GR + G + +DP
Sbjct: 75 QDFLFIATERYKFCVLQWDAEKSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDC 127
Query: 190 RCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
R G+ +Y GL +I ++G F+ R+E V++++
Sbjct: 128 RLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQVLDIK--------- 167
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F+HG ++P +V+L++ +H AL + P WS N+ + A
Sbjct: 168 --FLHGCVKPTIVVLYQ------DNKDVRHVKTYEVALK-DKDFVEGP--WSQNNVDNGA 216
Query: 309 YKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAH 368
L+ VP+P+GGV+++G I Y + +++ ++ Q + R+ V+ D +
Sbjct: 217 GLLIPVPAPLGGVIIIGEEQIVYCNANSTFK--------AIPIKQSIIRAYGRVDPDGSR 268
Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSR 428
LL TG L LL + ++ V L + + + S I+ + N + ++GSR
Sbjct: 269 Y--------LLGDNTGILHLLVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSR 320
Query: 429 LGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
GDS LV+ +++ADA S + L+ VN + +
Sbjct: 321 FGDSQLVKL------------------NLQADASG------SFVEILERYVNLGPIVDFC 356
Query: 489 SASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
+ + + T S A +D G L+ G+ IN AS VEL G
Sbjct: 357 VVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQG 399
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
KG+W++ KSS ++D Y YL++S + T L
Sbjct: 400 IKGLWSL--KSS------------FNDPYDMYLVVSFISETRFL 429
>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1223
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/360 (24%), Positives = 147/360 (40%), Gaps = 62/360 (17%)
Query: 1114 IGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA+ E+V +RGR+L+ R + +LV E KE+KGA+ L + G LL
Sbjct: 873 VGTAFSLPEEVEPSRGRILVL---RADEGRLSLVAE---KEVKGAVYNLNAFNGKLLAGI 926
Query: 1173 GPKIILHKW---------------------------------------TGTELNGIAFYD 1193
K+ L KW T EL +
Sbjct: 927 NSKVQLFKWVSRGAGAGAGAGGGAEGGAVAMADGGGGGGGGGGAPAAATTCELASECSHH 986
Query: 1194 APPLYVVSL--NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1251
++V+L ++ +FI++GD+ KSI L +K + A+DF A L D
Sbjct: 987 G---HIVALYVDVRGDFIVVGDLMKSISLLVYKPDEGVIEERARDFNPNWMTAVCALDDE 1043
Query: 1252 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF------LRLQMLA 1305
+ L ++ N+ + + +L E+H+G V +F +RL
Sbjct: 1044 TYLG---AENSFNLFTVRKNSDAAADEERSRLDVIGEYHLGEFVNRFRAGSLVMRLPGDG 1100
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
+ S++ LFGT++G+IG +A L E T L +LQK + V V G
Sbjct: 1101 DGAGLGLGLDASNEAP--TQLFGTVNGAIGVVASLPESTHTFLAALQKAMNKVVSGVGGF 1158
Query: 1366 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ ++R FH+ ++ VD +L+ + L E+ E+A G ++ + +L
Sbjct: 1159 SHDAWRSFHNEHRSRLVEARGFVDGDLIESFLDLRPEKASEVASVVGVGVEELTKRIEEL 1218
Score = 49.3 bits (116), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 113/262 (43%), Gaps = 35/262 (13%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GP+ VDP+ R G+ +Y ++ Q G FS R+E V +++
Sbjct: 130 GPIGAVDPECRMYGLHLYDGLFKVIPMDQTGQ----------LREAFSVRLEELQVFDVK 179
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
F+ G +P + +L++ T GR + C+ +P W
Sbjct: 180 -----------FLAGTPKPTIAVLYQD--TKEGRHIKTYEVCLKDK-------DFNPGPW 219
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
+ ++ + L+AVP+P+GGV+VVG I Y ++ + + + ++
Sbjct: 220 AQNDVESGSRFLIAVPAPLGGVVVVGEKVIAYLNKETTHGVGDGGGGGGGGGGGMIVKA- 278
Query: 360 FSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
+++ DA T+ D LLS G L LL +++D V+ L L + + S +
Sbjct: 279 IAMQSDATIMTYGAVDKDGSRYLLSDSAGRLHLLVLMHDKTRVRALKLESLGQTSIASSL 338
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
+ + N + ++GS GDS LV+
Sbjct: 339 SYLDNGVVYVGSAYGDSQLVRL 360
>gi|430810873|emb|CCJ31593.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 301
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 128/292 (43%), Gaps = 41/292 (14%)
Query: 245 HVKDFIFV-HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
H+ D F+ + Y EP + IL+ T G + ++ T +A I++
Sbjct: 6 HIVDLWFIFYDYREPTLAILYSAFQTSTGLLPYRQDTMTSTA------------IYTVDK 53
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFSV 362
LP+D + +L +P+PIGG L++G N + Y Q+A A+++N++A + ++
Sbjct: 54 LPYDLFSVLPLPNPIGGTLLIGNNELVYVDQAARVKAVSVNSFARKCTHLDFIEDYDLNL 113
Query: 363 ELDAAHATWL-----QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV----LTS 413
L+ A +L Q LL + G V + DGRVV L + + SV L S
Sbjct: 114 RLNGAVGVYLELLDDQPGAVLLVIEDGRFVQVGFKLDGRVVSSLSVKILDQSVKNDFLKS 173
Query: 414 D---ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ I + N F+GS++ +S+L+++ S + E + +
Sbjct: 174 EASCIVLLNNEQLFIGSKVSNSVLLEWKRQSEIA-------------EKLLSEPRVIFDE 220
Query: 471 SSDALQDMVNGEELSLYGSASN-NTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ L D+ GE+ + ++S F + D+L + GP+ D + G
Sbjct: 221 DREVLNDLY-GEDFDIVDTSSILQRNGVFGDIQFRLFDTLYSCGPIVDMTIG 271
>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1074
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/323 (24%), Positives = 143/323 (44%), Gaps = 44/323 (13%)
Query: 1113 AIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
+GTAY+ ++ +GR+L+F + LV E +E KGA+ +L +L G LL A
Sbjct: 782 CVGTAYILPYEIEPTKGRILIFLV---EERKLRLVAE---RETKGAVYSLNALTGKLLAA 835
Query: 1172 SGPKIILHKWTGT----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
KII++KW +L Y L + +FI++GD+ +S+ L +K +
Sbjct: 836 VNQKIIVYKWVRRDNRHQLQSECSYRGCVL-ALHTQTHGHFIVVGDMVRSVSLLRYKYEE 894
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA-PKMSESWKGQKLLSR 1286
+ ++ +DF + A L D + +D N+ + P +
Sbjct: 895 GLIEVVTRDFNTKWITAVAMLDDDIYIG---ADNCCNLFTLHSGRPGVV----------- 940
Query: 1287 AEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSIGCIA--PLDEL 1343
E+H+G V + ++ +D G P ++FGT+ G+IG IA P D+
Sbjct: 941 GEYHLGDLVNRMHHGSLVMHHTDSEIGQIP--------TVIFGTISGAIGVIASFPYDQY 992
Query: 1344 TF-RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
F +LQS+ K + SV +++ + RSF +A + VD +L+ + L
Sbjct: 993 VFLEKLQSVLVKFIKSVGNLSHVEWRSFYNVSRTAEAR-----NFVDGDLIESFLSLSPS 1047
Query: 1403 EQLEIAHQTGTTRSQILSNLNDL 1425
+ E++ G ++ + +L
Sbjct: 1048 KMEEVSQVMGLRADELCKIVEEL 1070
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 135/605 (22%), Positives = 240/605 (39%), Gaps = 141/605 (23%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V N +EIY++ Q L+L+ L+G + +L
Sbjct: 31 NLIVAKCNRMEIYLLTPQ-------------------------GLQLMVDVPLYGTIATL 65
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ S +D + ++ E + VL +D L S + +++ GR +
Sbjct: 66 ELFRS----RSETQDFLFISMERYRCIVLHWDGRNSELITRSGG--DVSDFI----GRPT 115
Query: 177 FARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
G + +DPQ R G+ +Y GL +I F + G
Sbjct: 116 -DNGQIGVIDPQNRLIGLSLYDGLFKVI---------------PFDNKGNLK------EA 153
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
+N+R L V D F++G P +V+LH+ +H AL ++
Sbjct: 154 LNIR-LQEFLVLDIKFLYGCARPTVVVLHQ------DNKDSRHVKTYEVALEDKDFVEGS 206
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
WS NL + A+ L +P P+GGV+++G +TI Y S + AL++ Q +
Sbjct: 207 ---WSQSNLDNSAH--LLIPVPLGGVIIIGEHTIVYCSATTFKALSIK---------QSI 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
R+ V+ D + + N TG L L+ + ++ V L + + S I
Sbjct: 253 IRAVGRVDPDGSRYLYGDN--------TGALHLIVITHEWGRVTDLKTHYMGETSIASTI 304
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
+ + + L ++GSR GDS L++ +I+ADA + S + L
Sbjct: 305 SYLDSGLVYIGSRFGDSQLIKL------------------NIQADASA------SFVEIL 340
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
+ +N + + + + + G KD S I A + I+
Sbjct: 341 EQFMNTGPIVDFCVVDTERRGQGQVITCS--------GAYKDGS----IRAVRNGVVITD 388
Query: 536 QSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
Q++ VEL G KG+W++ KSS D+ + + +E H +L +++E LE D+
Sbjct: 389 QAS---VELRGMKGLWSM--KSSLNDPYDTFLVVTFINETH-FLAMNMENE---LEEVDI 439
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGS 654
+E+ +T+A G+ ++IQV R R++ S D F P+
Sbjct: 440 KGFDSET-------QTLACGSAI-HNQLIQVTSRSVRLVSSVSLELLDQWFAPARFSVNV 491
Query: 655 GSENS 659
+ N+
Sbjct: 492 AAANA 496
>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
Length = 1282
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/343 (23%), Positives = 152/343 (44%), Gaps = 33/343 (9%)
Query: 1039 QEVGHQIDNHNLSSVDLHR-TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ + HQ + + + V H + + ++ VR+++ G ++T + ++ E ++
Sbjct: 907 RRIAHQPETNTFAVVVEHLWSKSSQDCFVRLVD----DGSFETLSQFQLEDQELTSSLTS 962
Query: 1098 VTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
T +T T +GT ++ ED +RGR+L+F D+ LV+E KE++G
Sbjct: 963 CTFAGDST----TYYVVGTGIALETEDEPSRGRILVFKVD---DDQLVLVSE---KEVRG 1012
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP----LYVVSLNIVKNFILLG 1212
A+ L + +G LL K+ L KWT E + ++ ++IL+G
Sbjct: 1013 AVYNLNAFKGKLLAGINSKLELFKWTPREDEVHELVSECSHHGQIVTFAVKTRGDWILVG 1072
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ KS+ L +K + ++ +A+DF + A L D T + ++ +F +
Sbjct: 1073 DLMKSMSLLLYKPEEGAIDEVARDFNANWMTAVAMLDDDETY----LGAENSLNLFTVSR 1128
Query: 1273 KMSE--SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
++ + +L E+H+G V F ++ + D + + LLFGT
Sbjct: 1129 NVNAVTDEERSRLEITGEYHLGELVNAFAPGSLVMSLRD-------GESLSVPTLLFGTA 1181
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
+G IG +A L + + + LQ + + V GL +R F
Sbjct: 1182 NGVIGVLASLPKDVYEFTERLQASINKHIQGVGGLKHADWRSF 1224
Score = 44.3 bits (103), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 101/206 (49%), Gaps = 24/206 (11%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
N+R L+ V+D F+HG +P + +L+ R++ A V K + + ++
Sbjct: 376 EAFNIR-LEELRVEDIQFLHGTAKPTIAVLY-RDMKEA--VHIKTYEIGVREKEFVSS-- 429
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
W+ +L + K++ VP+P+GGV+V+G TI Y ++++ ++ V L +
Sbjct: 430 ----PWAQNDLEGGSSKIIPVPAPVGGVVVLGEETIVYLNKTS------DDTDVFLKAIN 479
Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
RSS +D + +L D G L LL +V+DG+ V L + + + +
Sbjct: 480 IPERSSIVCYGAIDPDGSRYLLGD------HDGTLYLLVLVHDGKRVNELKIERLGETSI 533
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
S ++ + N + F+GS GDS L++
Sbjct: 534 PSTVSYLDNGVVFVGSAYGDSQLIKL 559
>gi|397627714|gb|EJK68584.1| hypothetical protein THAOC_10223, partial [Thalassiosira oceanica]
Length = 456
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 123/289 (42%), Gaps = 47/289 (16%)
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
G + AL QG LL+ G + L++ +L P V +L + +GD+
Sbjct: 189 GPVLALVHFQGRLLVGIGKSLRLYEMGKRQLLKKCELRGLPTMVKTLQAAGDRAFVGDMM 248
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+S+ F+ + +L L+A+D + E L+D +T++ V D+ N+ P+ +
Sbjct: 249 QSMQFVRYDATANRLVLVARDRSARPITCQE-LLDVNTVA--VGDKFGNVTTLRL-PRGA 304
Query: 1276 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
++ W KL + +HVG VT R ++A ++
Sbjct: 305 DTGAVDVSGTRALWDSSREDATPKLETLCTYHVGEVVTSLTRASLVAGGAE--------- 355
Query: 1319 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
+L++ T+ G IG + P +++ F SL+ + VP G +P+S+R F+
Sbjct: 356 -----SLIYVTVTGRIGALVPFTSREDVEF--YTSLESHVRSEVPRPTGRDPQSYRSFYC 408
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
K ++D +L Y LP E + IA Q + +++ L D
Sbjct: 409 PVK-------HVIDGDLCEAYGGLPYEARERIADQMERSTGEVMKKLED 450
>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
Length = 1160
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/319 (23%), Positives = 131/319 (41%), Gaps = 30/319 (9%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA V E+ + GR+++F N ++ KE+KGA L G LL +
Sbjct: 851 VGTALVHPEEAEPKQGRIVIFHFHEGKLN------QIAEKEIKGAAYTLVEFNGKLLASI 904
Query: 1173 GPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
+ L +WT + L F LY L +FIL+GD+ +SI L +K
Sbjct: 905 NSTVRLFEWTTDKELRLECNYFNSIVALY---LKTKGDFILVGDLMRSITLLLYKPMEGT 961
Query: 1230 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1289
+A+D A E L D + L ++ N+ + Q L F
Sbjct: 962 FEEIARDCNPNWTTAVEILDDDNFLG---AENSFNLFTCQKDSASTTDEDRQNLQEVGMF 1018
Query: 1290 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1349
H+G V F ++ S T + ++L+GT++G++G + + + + LQ
Sbjct: 1019 HLGEFVNVFRHGSLVMQHSGETSTP------TQGSVLYGTVNGAVGLVTQVPQEFYSFLQ 1072
Query: 1350 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY------EMLPLEE 1403
+Q +L + V + +R FH+ K + +D +L+ + +M +
Sbjct: 1073 DIQSRLAKVIKSVGKIEHSFWRSFHTERKTE--ACEGFIDGDLIESFLDLNRDKMQETVK 1130
Query: 1404 QLEIAHQTGTTRSQILSNL 1422
L+I +G R + +L
Sbjct: 1131 GLQIDDGSGMKREATVDDL 1149
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 98/219 (44%), Gaps = 35/219 (15%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW--------AGRVSWKH--HTCMISAL 286
N+R L+ V D F+HG P ++++H+ L +S+K H +
Sbjct: 156 NIR-LEELTVIDIQFLHGCTTPTLILIHQANLNCYHLMTLCITNLLSFKQDQHGRHVKTY 214
Query: 287 SISTTLKQ-HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
IS K+ W N+ +A L+AVP P GG L++G +I YH +A
Sbjct: 215 EISLRDKEFQKGPWKQDNVETEACMLIAVPEPFGGALIIGQESITYHKGDNFIPIA---- 270
Query: 346 AVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----V 398
+ +S+ + ++DA + +L D+ G L +L + + ++ V
Sbjct: 271 ------PPAIKQSTLTCYGKVDANGSRYLLGDMM------GRLFMLMLEKEEKMDSTVTV 318
Query: 399 QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ L + + + IT + N++ ++GSRLGDS LV+
Sbjct: 319 KDLKVELLGETTIAECITYLDNAVVYIGSRLGDSQLVKL 357
>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
Length = 1084
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/270 (24%), Positives = 118/270 (43%), Gaps = 16/270 (5%)
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR+++F R D +T+V KE+KGA L G L
Sbjct: 771 TYYVVGTAMVYPEEAEPKQGRIIVF---RFHDGK---LTQVAEKEIKGAAYTLTEFNGKL 824
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
L + + L +WT + + + + L +FIL+GD+ +S+ LS+K
Sbjct: 825 LASINSTVRLFEWTAEKELRVECSYFNNIIALYLKTKGDFILVGDLMRSVTLLSYKPMEG 884
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
+A+D+ + + L D + L ++ NI + + Q L
Sbjct: 885 CFEEIARDYNPNWMTSIDVLDDDTFLG---AENSFNIFTCQKDSAATTDEERQHLQEVGL 941
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1348
+H+G V F R L +P + ++LFGT++G++G + L + + L
Sbjct: 942 YHLGEFVNVF-RHGSLVMQHPGECTSP-----TQGSVLFGTVNGALGLVTQLPQEFYLFL 995
Query: 1349 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+Q KL ++ V + +R FH+ K
Sbjct: 996 LEVQNKLAKTIKSVGKVEHAFWRSFHTERK 1025
>gi|357478323|ref|XP_003609447.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355510502|gb|AES91644.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 1225
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 165/379 (43%), Gaps = 51/379 (13%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P R G T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 874 IRVLDP-RTG---NTTCLLELQENEAAFSICTV---NFHDKEYGTLLAVGTAKGLQFTPK 926
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
V R D+ ++L ++ +++G AL QG LL GP + L+
Sbjct: 927 RSLTVGFIHIYRFLDDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGKRR 985
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
L + P+ +VS++ ++ I +GDI +S ++ ++ QL + A D S+ + T
Sbjct: 986 LLRKCENKSFPISIVSIHAYRDRIYVGDIQESFHYCKYRRDENQLYIFADD--SVPRWLT 1043
Query: 1246 -EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL---LSRAEFH 1290
+ ID T++ +D+ NI +S+ W+ KL L++ E
Sbjct: 1044 ASYHIDFDTMA--GADKFGNIFFARLPQDVSDEVEEDPTSGKIKWEQGKLNGALNKVEEI 1101
Query: 1291 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR--- 1347
V HV + TS + PG + +++GT+ +GC+ L T R
Sbjct: 1102 VQFHVGDVI------TSLQKAALVPGGGE----CIVYGTV---MGCVGALHAFTSRDDVD 1148
Query: 1348 -LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
L+ + P + G + ++R A+ P D ++D +L + LP++ Q +
Sbjct: 1149 FFSHLEMHMRQDNPPLCGRDHMAYR------SAYFPVKD-VLDGDLCEQFPTLPMDLQRK 1201
Query: 1407 IAHQTGTTRSQILSNLNDL 1425
IA + T +IL L +L
Sbjct: 1202 IADELDRTPGEILKKLEEL 1220
>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
Length = 836
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTVNSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
K+ + L +G ED +G+ +F N +N L+ ++ + K ++ A+ +
Sbjct: 519 KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572
Query: 1166 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1213
G L +ASG ++ L +W + I+ + PL V+ K ILL D
Sbjct: 573 GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 632
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
++S+ L +K + L KD ++ C + F+I S++ D ++N+ + Y+
Sbjct: 633 FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+E ++ A F++G ++ KF RL G + ++ T++GS
Sbjct: 693 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+G I+ ++E ++ L+ + K+ H AG N +R G +D ++L
Sbjct: 735 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794
Query: 1394 SHYEMLPLEEQ 1404
+ +L E+Q
Sbjct: 795 KQFRLLNEEQQ 805
>gi|119580419|gb|EAW60015.1| hCG2010549, isoform CRA_a [Homo sapiens]
Length = 323
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 46/74 (62%), Gaps = 1/74 (1%)
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLK 1002
LR+HP +G + +F + HNVNC GF+Y QG L+I LP+ +YD+ WPV+K IPL
Sbjct: 184 LRLHPVGINGPVNSFALFHNVNCPRGFLYFNRQGKLRISVLPAYLSYDSPWPVRK-IPLC 242
Query: 1003 ATPHQITYFAEKNL 1016
T H + Y E +
Sbjct: 243 CTVHCVAYHVESKI 256
>gi|351699158|gb|EHB02077.1| DNA damage-binding protein 1 [Heterocephalus glaber]
Length = 1144
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 133/306 (43%), Gaps = 42/306 (13%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKEL-KGAISALAS 1163
K+ T +GTA V E+ + GR+++F + +D + EV S+ L KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDEER----EVSSRGLVKGAVYSMVE 875
Query: 1164 LQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 876 FNGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSV 930
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 931 LLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDE 987
Query: 1279 KGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
+ Q L FH+G V F L +Q L +S T ++LFGT++G I
Sbjct: 988 ERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMI 1037
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVA----GLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
G + L E + L +Q +L + V L P FH+ K + +D
Sbjct: 1038 GLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSRAVSFHTERKTEQ--ATGFIDG 1095
Query: 1391 ELLSHY 1396
+L+ +
Sbjct: 1096 DLIESF 1101
>gi|340520436|gb|EGR50672.1| predicted protein [Trichoderma reesei QM6a]
Length = 1212
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 105/465 (22%), Positives = 202/465 (43%), Gaps = 67/465 (14%)
Query: 995 VQKVIPLKATPHQITYFAEKNLY-----------PLIVSVPVLKP--LNQVLSLLIDQEV 1041
+Q IPL TP ++ ++ L+ P + + + P +N +L +E
Sbjct: 787 IQSSIPLTYTPKKMVKHPDQPLFYVIEADNHTLSPALCAQLLADPARVNGDSKVLPPEEF 846
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1101
GH N +S + +++P G Q I ++ +E A+++ +VT
Sbjct: 847 GHPRGNRRWASC------------ISVVDPLAEDG--QVLQRIDLEENEAAVSLAIVTF- 891
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAI 1158
++ENET L +GT G+D+ R R + + LV ++ +++
Sbjct: 892 --ASQENETFLVVGT----GKDMVLNPRSFSDAFVHIYRFERDGRGLVF-IHKTKVEEPP 944
Query: 1159 SALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
A+ QG +L+ G + ++ +L + + P +VSLN + I++GD+ + I
Sbjct: 945 MAMIPFQGRVLVGIGKMLRIYDLGMRQLLRKSQAEVAPQQIVSLNAQGSRIVVGDVQQGI 1004
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
++ +K+Q +L D ++ + T ++D T + D+ NI I K SE
Sbjct: 1005 TYVVFKQQTNKLIPFVDD--TVARWTTCSTMVDYETTA--GGDKFGNIFIVRAPQKASEE 1060
Query: 1278 WK----GQKLL-SRAEFHVGAHVTKF---LRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
G LL +R+ H +H L Q + TS +T G + LL+
Sbjct: 1061 ADEEPAGLHLLNARSYLHGTSHRLDLMCHLYTQDIPTSITKTSLVVGGQE----VLLWSG 1116
Query: 1330 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
L G+IG + P ++ F QSL++ L P +AG + +R +++ K
Sbjct: 1117 LMGTIGVLIPFVTREDADF--FQSLEQHLRAEDPPLAGRDHLMYRSYYAPMKG------- 1167
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
++D +L Y +LP +++ IA + + +I ++D+ ++F
Sbjct: 1168 VIDGDLCERYALLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1212
>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
Length = 1088
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 161/351 (45%), Gaps = 48/351 (13%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1142
++ +E+AL++ + + ENE + IGTA+ + +V + GR+L+ GR
Sbjct: 756 LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804
Query: 1143 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1196
+ ++ K++ GA+ ++ +L + +L ++ K+++ ++ NG + +
Sbjct: 805 ---LEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861
Query: 1197 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1248
LYV +L N IL+GD+ KSI S+ G N L ++DF + A EF+
Sbjct: 862 VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
+ LS SD NI IF +ES + +L + A HVG + + + T S
Sbjct: 919 DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1367
+ + +LFG + G IG I + + L +Q +++ + + P
Sbjct: 975 TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTP 1027
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
++++ + K R +I+D ++ Y + E+Q EIAH +G QI
Sbjct: 1028 DNWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQI 1076
>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
Length = 1158
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 128/297 (43%), Gaps = 17/297 (5%)
Query: 1113 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
+GTA V E+ + GR++LF + AD +T V KE+KGA +L +L A
Sbjct: 847 VVGTALVVPEESEPKQGRIVLF---QWADGK---LTTVAEKEVKGACYSLVDFNSKILAA 900
Query: 1172 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1231
+ L++WT + + + + + L +FIL+GD+ +SI L +K
Sbjct: 901 INNVVRLYEWTAEKELRLECSNFNHIIALYLKRKGDFILVGDLMRSITLLQYKTMEGSFE 960
Query: 1232 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1291
+A+D A E L D + L ++ N+ + + + Q+L FH+
Sbjct: 961 EMARDSNPNWMSAVEILDDDTFLG---AENSFNLFVCQKDSAATTEEERQQLTEVGRFHL 1017
Query: 1292 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
G V F ++ + T P + +LFGT+ G+IG + L + L +
Sbjct: 1018 GDMVNVFRHGSLVMDHAAETLTTP-----TQGCVLFGTVHGAIGVVTQLPSEFYHFLSEV 1072
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
Q ++ + V + +R F + K + +D +L+ + L ++ E+A
Sbjct: 1073 QTRMARVIKPVGKIEHSFWRSFATERKVEP--CEGFIDGDLIESFLDLSSDKMKEVA 1127
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 89/395 (22%), Positives = 149/395 (37%), Gaps = 83/395 (21%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-WSAMNL 304
++D F++G P +VI+H+ H + IS K+ W N+
Sbjct: 164 IQDIAFLYGCANPTVVIIHQ-----------DAHGRHVKTREISLRDKEFAKTSWKQDNV 212
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
+A LL VP P GG L++G +I YH+ NY + + ++
Sbjct: 213 ETEAAMLLPVPEPYGGALIIGQESITYHNG--------QNYVTIAPPIIKQSTVTCYGKV 264
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDITTIG 419
D + +L D+A G L +L + DG V V+ + + + +T +
Sbjct: 265 DPNGSRYLLGDLA------GHLFMLVLEKEEKMDGTVTVRDIKIELLGEVSIPECLTYLD 318
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
N + F+GSR GDS LV+ + + E F ++ AP + DM
Sbjct: 319 NGVVFIGSRFGDSQLVKLNVTPDDNNSYVTVMETFTNL---AP------------IVDMT 363
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
+ T S A ++ G L+ G+ I+ AS
Sbjct: 364 -------IVDLDRQGQGQLVTCSGAYKE-----GSLRIIRNGIGIHEQAS---------- 401
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
++LPG KGIW + SS + D + +++S +T VL E
Sbjct: 402 --IDLPGIKGIWALKMGSSGNPSVDDT------------VVLSFVGQTRVLMLNGEEMEE 447
Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
TE +T GN+ G+ V+Q+ R++
Sbjct: 448 TEIPGLTADQQTFFCGNV-GKDSVLQITTGSVRLI 481
>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
histolytica HM-1:IMSS]
gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica HM-1:IMSS]
Length = 1150
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/315 (21%), Positives = 137/315 (43%), Gaps = 36/315 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
K+ + L +G ED +G+ +F N +N L+ ++ + K ++ A+ +
Sbjct: 833 KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 886
Query: 1166 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1213
G L +ASG ++ L +W + I+ + PL V+ K ILL D
Sbjct: 887 GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 946
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
++S+ L +K + L KD ++ C + F+I S++ D ++N+ + Y+
Sbjct: 947 FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 1006
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+E ++ A F++G ++ KF RL G + ++ T++GS
Sbjct: 1007 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 1048
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+G I+ ++E ++ L+ + K+ H AG N +R G +D ++L
Sbjct: 1049 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 1108
Query: 1394 SHYEMLPLEEQLEIA 1408
+ +L E+Q +
Sbjct: 1109 KQFRLLNEEQQKRVC 1123
>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica KU27]
Length = 836
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
K+ + L +G ED +G+ +F N +N L+ ++ + K ++ A+ +
Sbjct: 519 KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572
Query: 1166 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1213
G L +ASG ++ L +W + I+ + PL V+ K ILL D
Sbjct: 573 GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 632
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
++S+ L +K + L KD ++ C + F+I S++ D ++N+ + Y+
Sbjct: 633 FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+E ++ A F++G ++ KF RL G + ++ T++GS
Sbjct: 693 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+G I+ ++E ++ L+ + K+ H AG N +R G +D ++L
Sbjct: 735 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794
Query: 1394 SHYEMLPLEEQ 1404
+ +L E+Q
Sbjct: 795 KQFRLLNEEQQ 805
>gi|323447810|gb|EGB03719.1| hypothetical protein AURANDRAFT_72671 [Aureococcus anophagefferens]
Length = 760
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 168/403 (41%), Gaps = 65/403 (16%)
Query: 1020 IVSVPVL-KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1078
+VSVP+ +PL + H + +H + + + +R L R P+
Sbjct: 374 VVSVPLAEQPLC----------ICHDLQSHLFAVCTIDHREGDNQGVIRFL---RDEAPY 420
Query: 1079 QTRATIPMQSSENALTVRVVTLFNTTT-KENETLLAIGTAYVQGED--VAARGRVLLFST 1135
++ E L +++L + +T K+ +GTA+ E+ GR+++F +
Sbjct: 421 NDVHREALEPLEIPLCCSIISLDSISTYKDQRAHFVVGTAFAAQENDFEPCSGRMIIFRS 480
Query: 1136 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-------ELNG 1188
G+ P L + E GA+ +A+++ LL+ + I H + L
Sbjct: 481 GQANVAPSVL----FFVEANGAVYDVAAMRASLLVCAVNHAI-HIYDPVVRDNRRGHLKP 535
Query: 1189 IAFYDAPPLYVVSLNI--VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1246
A YD VV+L + N I++GD+ +S+ L+ Q + +A D+ + A E
Sbjct: 536 RASYDG---LVVALKVQCYGNLIVVGDMMRSVTLLNLIRQKMIIVEVACDYNTNWVCALE 592
Query: 1247 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1306
+ DGS + S ++ Y S KG L SRA+ H+G VT F R ++
Sbjct: 593 VIGDGSFIIADASGSLVALESLY-----GNSDKGYFLESRAKMHLGDVVTCFARGSIMTQ 647
Query: 1307 SSDRTGAAPGSDKTNRFA--LLFGTL-----------------DGSIGCIAPLDELTFRR 1347
R + K ++ A L+FG + G+IGCI +D++T
Sbjct: 648 DDWR------NPKVSKVATPLIFGCVTSSRVLVLTPLIARYQVSGAIGCIVSIDDVTHSL 701
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVD 1389
L+ L K L++ V + +F+ H+N P D +D
Sbjct: 702 LERLSKVLLEFHSGVGDFDHETFQALHNNVATCNAAPMDDFID 744
>gi|224100909|ref|XP_002312063.1| predicted protein [Populus trichocarpa]
gi|222851883|gb|EEE89430.1| predicted protein [Populus trichocarpa]
Length = 1213
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 167/384 (43%), Gaps = 61/384 (15%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P A T + +Q +E A +V V N KE+ TLLA+GTA +G
Sbjct: 862 IRVLDPRSA----TTTCLLELQDNEAAFSVCTV---NFHDKEHGTLLAVGTA--KGLQFW 912
Query: 1126 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ + D+ ++L ++ +++G AL QG LL G + L+
Sbjct: 913 PKRSLIAGFIHIYKFVDDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGSVLRLYDLGK 971
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
L P +VS++ ++ I +GDI +S +F ++ QL + A D S+ +
Sbjct: 972 KRLLRKCENKLFPNSIVSIHTYRDRIYVGDIQESFHFCKYRRDENQLYIFADD--SVPRW 1029
Query: 1244 AT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR----- 1286
T + +D T++ +D+ NI +S+ W+ KL
Sbjct: 1030 LTASYHVDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVE 1087
Query: 1287 --AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1341
+FH+G V + ++ PG + +++GT+ GS+G + P D
Sbjct: 1088 EIVQFHIGDVVNSLQKASLI----------PGGGE----CIMYGTVMGSVGALLPFTSRD 1133
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
++ F L+ L P + G + ++R A+ P D ++D +L + LPL
Sbjct: 1134 DVDF--FSHLEMHLRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPTLPL 1184
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDL 1425
+ Q +IA + T +IL L ++
Sbjct: 1185 DAQRKIADELDRTPGEILKKLEEV 1208
>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
Length = 1088
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 160/351 (45%), Gaps = 48/351 (13%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1142
++ +E+AL++ + + ENE + IGTA+ + +V + GR+L+ GR
Sbjct: 756 LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804
Query: 1143 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1196
+ V+ K++ GA+ ++ +L + +L I+ K+++ ++ NG + +
Sbjct: 805 ---LEIVFEKDVNGAVYSMKTLLKKYLAISIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861
Query: 1197 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1248
LYV +L N IL+GD+ KSI S+ G N L ++DF + A EF+
Sbjct: 862 VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
+ LS SD NI IF +ES + +L + A HVG + + + T S
Sbjct: 919 DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1367
+ + +LFG + G IG I + + L +Q +++ + + P
Sbjct: 975 TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTP 1027
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+++ + K R +I+D ++ Y + E+Q EIAH +G +I
Sbjct: 1028 DDWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEKI 1076
>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
HM-1:IMSS]
Length = 1088
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 160/351 (45%), Gaps = 48/351 (13%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1142
++ +E+AL++ + + ENE + IGTA+ + +V + GR+L+ GR
Sbjct: 756 LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804
Query: 1143 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1196
+ ++ K++ GA+ ++ +L + +L ++ K+++ ++ NG + +
Sbjct: 805 ---LEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861
Query: 1197 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1248
LYV +L N IL+GD+ KSI S+ G N L ++DF + A EF+
Sbjct: 862 VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
+ LS SD NI IF +ES + +L + A HVG + + + T S
Sbjct: 919 DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1367
+ + +LFG + G IG I + + L +Q +++ + + P
Sbjct: 975 TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTP 1027
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+++ + K R +I+D ++ Y + E+Q EIAH +G QI
Sbjct: 1028 DDWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQI 1076
>gi|397615212|gb|EJK63291.1| hypothetical protein THAOC_16062, partial [Thalassiosira oceanica]
Length = 322
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 123/289 (42%), Gaps = 47/289 (16%)
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
G + AL QG LL+ G + L++ +L P V +L + +GD+
Sbjct: 55 GPVLALVHFQGRLLVGIGKSLRLYEMGKRQLLKKCELRGLPTMVKTLQAAGDRAFVGDMM 114
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+S+ F+ + +L L+A+D + E L+D +T++ V D+ N+ P+ +
Sbjct: 115 QSMQFVRYDATANRLVLVARDRSARPITCQE-LLDVNTVA--VGDKFGNVTTLRL-PRGA 170
Query: 1276 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
++ W KL + +HVG VT R ++A ++
Sbjct: 171 DTGAVDVSGTRALWDSSREDATPKLETLCTYHVGEVVTSLTRASLVAGGAE--------- 221
Query: 1319 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
+L++ T+ G IG + P +++ F SL+ + VP G +P+S+R F+
Sbjct: 222 -----SLIYVTVTGRIGALVPFTSREDVEF--YTSLESHVRSEVPRPTGRDPQSYRSFYC 274
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
K ++D +L Y LP E + IA Q + +++ L D
Sbjct: 275 PVK-------HVIDGDLCEAYGGLPYEARERIADQMERSTGEVMKKLED 316
>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
Length = 270
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 69/253 (27%), Positives = 113/253 (44%), Gaps = 24/253 (9%)
Query: 1152 KELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP--PLYVVSLNI 1204
KE KGA+ +L + G LL A KI L+KW GT EL + LYV +
Sbjct: 1 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG- 59
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
+FI++GD+ KSI L +K + + A+D+ + A E L D L ++ N
Sbjct: 60 --DFIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFN 114
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRF 1323
+ + + + +L E+H+G V +F ++ D G P
Sbjct: 115 LLTVKKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP-------- 166
Query: 1324 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1383
++FGT++G IG IA L + + L+ LQ L + V GL+ +R F N +
Sbjct: 167 TVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSF--NNEKRTAE 224
Query: 1384 PDSIVDCELLSHY 1396
+ +D +L+ +
Sbjct: 225 ARNFLDGDLIESF 237
>gi|385304555|gb|EIF48567.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
[Dekkera bruxellensis AWRI1499]
Length = 353
Score = 70.5 bits (171), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 129/293 (44%), Gaps = 39/293 (13%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERE-LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
+K++ D+ F++ Y EP + IL+ E L+WAG + + LS++ + I
Sbjct: 69 VKNIMDYQFLYSYREPTIAILYAPEGLSWAGYLXKLKDNMKVVVLSLNLDTHKADSIMVL 128
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSF 360
NLP+D + +PSPI G L++G+N I H +S + + N Y + S
Sbjct: 129 PNLPYDLNSIYPLPSPINGFLLIGSNEILHVNSLGSIKGVYTNKYFPETSDMKLRDESDL 188
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG------RVVQRLDLSKTNPSVLTSD 414
++E + +++ +D LL ++ G +L+ G ++++ + + N SV ++
Sbjct: 189 NLECEGCSVSFVGDDQVLLISQIGKFYVLSFNESGGISNLNKIIEIPEANYCNVSV--NN 246
Query: 415 ITTIGN----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ I N + FL + DS+L+ + + P+ + +S
Sbjct: 247 VLQITNIEDCNSAFLCCQGSDSILLHWN--------------------YNVPTRGTVSKS 286
Query: 471 SSDALQDMVNGEELSLY--GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
++ ++ E+ LY S + + +F D LVN GP DF+ G
Sbjct: 287 NAGIEKE---DEDSWLYHEDETSQTSNRPLTSCTFTXIDKLVNCGPTSDFTIG 336
>gi|358378986|gb|EHK16667.1| hypothetical protein TRIVIDRAFT_40938 [Trichoderma virens Gv29-8]
Length = 1212
Score = 70.1 bits (170), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 105/464 (22%), Positives = 197/464 (42%), Gaps = 65/464 (14%)
Query: 995 VQKVIPLKATP--------HQITYFAEKN---LYPLIVSVPVLKP--LNQVLSLLIDQEV 1041
+Q IPL TP H + Y E + L P + + + P +N +L +E
Sbjct: 787 IQSSIPLTYTPKKMVKHPDHPLFYVIEADNHTLAPELCAKLLADPARVNGDTKILPAEEF 846
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1101
GH N +S + +++P G Q I ++ +E A++V +VT
Sbjct: 847 GHPRGNRRWASC------------ISVVDPLAEDG--QVLQRIDLEENEAAVSVAIVTF- 891
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAI 1158
++ENET L +GT G+D+ R R + + LV ++ +++
Sbjct: 892 --ASQENETFLVVGT----GKDMVVNPRSFSDAFVHIYRFERDGRGLVF-IHKTKVEEPP 944
Query: 1159 SALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
A+ QG +L+ G + ++ +L A + P ++SL+ + I++GD+ + I
Sbjct: 945 MAMIPFQGRVLVGIGKTLRIYDLGMRQLLRKAQAEVAPQQIISLSTQGSRIVVGDVQQGI 1004
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
+ +K+ +L D + T ++D T++ D+ NI I K SE
Sbjct: 1005 TYAVYKQSTNKLIPFVDDTVARWTTCTT-MVDYETVA--GGDKFGNIFIVRSPQKASEEA 1061
Query: 1279 KGQK-----LLSRAEFHVGAHVTKF---LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
++ L +R H +H L Q + TS +T G LL+ L
Sbjct: 1062 DEEQAGLHLLNARDYLHGTSHRLDLMCHLFTQDIPTSIAKTSLVVGGQD----VLLWSGL 1117
Query: 1331 DGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
G+IG + P ++ F QSL++ + P +AG + +R +++ K I
Sbjct: 1118 MGTIGVLIPFITREDTDF--FQSLEQHMRAEDPPLAGRDHLMYRSYYAPMKG-------I 1168
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+D +L Y +LP +++ IA + + +I ++D+ ++F
Sbjct: 1169 IDGDLCERYALLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1212
>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
Length = 1129
Score = 70.1 bits (170), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 65/275 (23%), Positives = 127/275 (46%), Gaps = 23/275 (8%)
Query: 1113 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
A+GTA V ++ ++ GR+L+F +++ + + V+ KE+KGA ++ L G L++A
Sbjct: 815 AVGTAVVLTDETESKSGRLLIFQVAPSSEGGRMRL--VHDKEIKGAAYSIQVLMGKLVVA 872
Query: 1172 SGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
+ L +WT + L F + LY+ + N + +L+GD+ +S+ L++K +
Sbjct: 873 INSCVRLFEWTAEKELRLECSDFDNVTALYLRTKN---DVVLVGDLMRSLSVLAYKPMES 929
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR-- 1286
+A+DF + A E +ID T + +F S +G +L +
Sbjct: 930 SFEKIARDFVTNWMTACE-IIDMETF----LGAEIMFNLFTVVKDCSSKDEGIRLQLQET 984
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1346
+++G V F ++AT D T + +L+GT DG +G I L +
Sbjct: 985 GMYYLGESVNAFCHGSLIATHIDLT-------PSFTTPILYGTSDGGLGVIVQLTPQFYD 1037
Query: 1347 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ L+ ++ + + +R F S+G+ +
Sbjct: 1038 FVHELETRIAAVTKNCMRIEHGQYRTFESDGRTEQ 1072
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 12/144 (8%)
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P +W N+ +A ++ +P P GGV+VVG I YH + N Y+
Sbjct: 200 PPLWKQDNIEAEACMVIPIPQPYGGVIVVGHEAISYHKDA-------NAYSAIAPPLIHQ 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVL-LTVVYDGRV-VQRLDLSKTNPSVLTS 413
+ S ++D +L D LS + L+L L V DG V+ L + + +
Sbjct: 253 SQISCYGKIDRDGQRYLLGD---LSGRIFMLLLDLDVATDGTASVKDLKVELLGETSIPE 309
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
+ + N + F+GSR GDS LV+
Sbjct: 310 CVVYLDNGVVFIGSRFGDSQLVRL 333
>gi|312072035|ref|XP_003138882.1| hypothetical protein LOAG_03297 [Loa loa]
gi|307765956|gb|EFO25190.1| hypothetical protein LOAG_03297 [Loa loa]
Length = 1197
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 81/364 (22%), Positives = 148/364 (40%), Gaps = 32/364 (8%)
Query: 1079 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1138
+T + P E A + +V F + L+ G A G + F N
Sbjct: 855 ETLSHFPFAEDEAAFAIAMVQ-FQNQSDTQFVLVGCGCELQLKPRKANGGCIYTFLLAAN 913
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
Q L+ + E+ ++A+ +G L G K+ L+ +L P
Sbjct: 914 GTTLQ-LLHRTATDEV---VNAIHDFRGMALAGVGKKVRLYDLGKRKLLAKCENRQIPTQ 969
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
VV + + I++ D +S++F+ +K+Q QL++ D S L+D T++ V
Sbjct: 970 VVDIRSMGQRIVVSDSQESVHFMRYKKQDGQLSIFC-DETSPRYVTCVCLLDYDTVA--V 1026
Query: 1259 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSSDR 1310
D NI + ++E + RA + G +L+ +A TS +
Sbjct: 1027 GDRFGNIAVLRLPKGVTEEVQEDPTGVRALWDRGNLNGASQKLEAIAHLYIGDAITSMQK 1086
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T PG++ L + T+ G IG + P DE F Q+L+ + P + G +
Sbjct: 1087 TSLVPGAND----CLCYTTISGIIGILVPFMSRDEFEF--FQNLEMHMRVEYPPLCGRDH 1140
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
++R ++ K SI+D +L Y ++PL++Q + + G ++I L D+
Sbjct: 1141 LAYRSYYFPVK-------SIIDGDLCEQYSLMPLDKQKSVGEELGRKSTEIHKKLEDIRT 1193
Query: 1428 GTSF 1431
+F
Sbjct: 1194 RYAF 1197
>gi|145348011|ref|XP_001418451.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578680|gb|ABO96744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1196
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 158/377 (41%), Gaps = 49/377 (12%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
VRI++P A ++ + + SE AL++ V L T NE LLA+GTA A
Sbjct: 847 VRIVDPKEA----KSTFVLELHKSEAALSLCHVFL----TGPNELLLAVGTAV--NLTFA 896
Query: 1126 AR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
R G + L+ G N + V+S G + AL +GHLL + ++ +
Sbjct: 897 PRNCDGGFIHLYRYG----NDGRTLNLVHSTPTDGPVGALCGYKGHLLAGVNNSLRIYDY 952
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
+L P ++ +L+ + I +GD+ +SI+++ +K + + A D
Sbjct: 953 GKKKLLRKVENRNFPNFITTLHAAGDRIYVGDVQESIHYVKYKADEGSIYIFADDTKPRY 1012
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----------GQKLLSRAEFHV 1291
AT +D TL+ +D+ NI + +SE Q +L+ A
Sbjct: 1013 ITAT-LPLDYDTLA--GADKFGNIFVNRLPKDVSEDMDDDPTGGKNIYSQGVLNGAPNKS 1069
Query: 1292 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRL 1348
++ + A + + PG + +++GT G IGC+ P E+ F
Sbjct: 1070 ETSAQTYIGETVCALT--KGALQPGGIEI----IMYGTFMGGIGCLLPFSSRSEIEF--F 1121
Query: 1349 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
L+ + P + G + +FR +++ K +++D +L + LP + Q IA
Sbjct: 1122 THLEMHMRQEAPSIVGRDHMAFRSYYAPVK-------NVIDGDLCEQFGALPADVQRRIA 1174
Query: 1409 HQTGTTRSQILSNLNDL 1425
+ T +IL L +
Sbjct: 1175 EEMDRTPGEILKKLEQV 1191
>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
Length = 1157
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 135/316 (42%), Gaps = 31/316 (9%)
Query: 1103 TTTKENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAIS 1159
T + + T +GTAYV E+ + GR+LLF S G+ + +V KE+KGA+
Sbjct: 833 TLSDDPHTYYCVGTAYVFPEEPEPKAGRLLLFHLSEGK--------LVQVAEKEVKGAVY 884
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTE--LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
+L G +L + + +WT + ++YD + + L +FIL+GD+ +S
Sbjct: 885 SLVEFNGKVLAGINSTVSIFEWTADKEFRYECSYYDN--ILALYLKTKGDFILVGDLMRS 942
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ L + +A DF A E L D + L ++ N+ +
Sbjct: 943 MTLLVYLPLEGSFQEIAHDFSPKWMTAIEILDDDTFLG---AENSYNLFTCTKDSGATTD 999
Query: 1278 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF--ALLFGTLDGSIG 1335
+ L ++H+G V F ++ PG D + F +LFGT++G IG
Sbjct: 1000 EERYHLQDAGQYHLGEFVNVFRHGSLVMEH-------PG-DASTPFQGCVLFGTVNGRIG 1051
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD---SIVDCEL 1392
+A + + F L +QKKL + V ++ + H + +H + +D +L
Sbjct: 1052 IVAQIAQDLFNFLIQVQKKLNKVIKSVGKIDHSLYPFPHCSNLSHSRKMEPAHGFIDGDL 1111
Query: 1393 LSHYEMLPLEEQLEIA 1408
+ + LP E+
Sbjct: 1112 IESFLDLPRARMEEVV 1127
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 95/212 (44%), Gaps = 40/212 (18%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
N+R L+ HV D F++G P +V +++ H + I+ L+ H
Sbjct: 155 NIR-LEELHVVDIQFLYGCANPTIVFIYQDP-----------HGRHVKTYEIN--LRDHE 200
Query: 297 LI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
W N+ +A +++AVP+P+GG L++G +I YH S A+A
Sbjct: 201 FAKGPWKQDNVEVEACRVIAVPNPLGGALIIGQESITYHKGSNYHAIA----------PP 250
Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKT 406
L +SS + ++D + +L D+ G L +L + + DG V+ L L
Sbjct: 251 ALKQSSLTCHGKIDTNGSRYLLGDM------NGRLYMLLLERQELIDGTYEVKDLKLEML 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+ + + + N + F+GS LGDS L + +
Sbjct: 305 GETSIAHCLVYLDNGVVFIGSMLGDSQLAKLS 336
>gi|168064351|ref|XP_001784126.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664326|gb|EDQ51050.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1214
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 167/385 (43%), Gaps = 62/385 (16%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1122
+R+L+P + T + +Q +E A ++ V + KE TL+A+GTA +
Sbjct: 862 IRVLDPKTS----TTTCLLELQENEAAFSLCAVNFHDN--KELGTLIAVGTAKDLQFMPK 915
Query: 1123 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1182
A+ G + ++ R AD + ++ V+ + G +AL QG LL+ G + ++
Sbjct: 916 KEASGGFIHIY---RFADEGK-VLELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIYDLG 971
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+L P +++++ + I +GDI +S +++ ++ QL A D S
Sbjct: 972 KRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--SCPR 1029
Query: 1243 FATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKG-------QKL 1283
+ T L ID T++ +D+ NI + +SE W+ K+
Sbjct: 1030 WLTASLHIDFDTMA--GADKFGNIYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKV 1087
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1340
+FHVG VT + ++ PG ++ +L+GT+ GS+G + P
Sbjct: 1088 EEIIQFHVGEVVTSLQKASLI----------PGGGES----VLYGTIMGSMGALLPFSSR 1133
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
+++ F L+ L P + G + FR A+ P D ++D +L Y ML
Sbjct: 1134 EDVDF--FSHLEMHLRQENPPLCGRDHMGFR------SAYFPVKD-VIDGDLCEQYPMLT 1184
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDL 1425
E Q +IA T +IL L D+
Sbjct: 1185 SELQKKIADDLDRTPGEILKKLEDI 1209
>gi|224109600|ref|XP_002315251.1| predicted protein [Populus trichocarpa]
gi|222864291|gb|EEF01422.1| predicted protein [Populus trichocarpa]
Length = 1213
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 92/385 (23%), Positives = 167/385 (43%), Gaps = 63/385 (16%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P A T + +Q +E A ++ V N KE+ TLLA+GTA +G
Sbjct: 862 IRVLDPRSAA----TTCLLELQDNEAAFSLCTV---NFHDKEHGTLLAVGTA--KGLQFW 912
Query: 1126 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ + D+ ++L ++ +++G AL QG LL G + L+
Sbjct: 913 PKRSLVTGFIHIYKFVDDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGSVLRLYDLGK 971
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1236
L P +VS++ ++ I +GDI +S +F ++ QL + A D
Sbjct: 972 KRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLT 1031
Query: 1237 ------FGSL---DCFATEFLIDGSTLSLVVSDEQKNI----QIFYYAPKMSESWKGQKL 1283
F S+ D F + + L VSDE + +I + K++ + K+
Sbjct: 1032 SSYHVDFDSMAGADKFGNIYF---ARLPQDVSDEIEEDPTGGKIKWEQGKLNGA--PNKV 1086
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1340
+FH+G V + ++ PG + +++GT+ GS+G + P
Sbjct: 1087 EEIVQFHIGDVVNSLQKASLI----------PGGGE----CIIYGTVMGSVGALLPFTSR 1132
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
D++ F L+ L P + G + S+R A+ P D ++D +L + LP
Sbjct: 1133 DDVDF--FSHLEMHLRQDHPPLCGRDHMSYR------SAYFPVKD-VIDGDLCEQFPTLP 1183
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDL 1425
L+ Q +IA + T +IL L ++
Sbjct: 1184 LDAQRKIADELDRTPGEILKKLEEV 1208
>gi|449283451|gb|EMC90093.1| DNA damage-binding protein 1 [Columba livia]
Length = 1140
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 147/346 (42%), Gaps = 57/346 (16%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1162
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 812 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 863
Query: 1163 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 864 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 918
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 919 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 975
Query: 1278 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+ Q L H+G V F L +Q L +S T ++LFGT++G
Sbjct: 976 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGM 1025
Query: 1334 IGCIAPLDELTFRRLQSLQKKL---VDSV--------PHVAGLNPRSFRQFHSNGKAHRP 1382
IG + L E + L +Q +L + SV P + L + + FH+ K P
Sbjct: 1026 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSLVQLRAWASQSFHTERKT-EP 1084
Query: 1383 GPDSIVDCELLSHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+D +L+ + +M + L+I +G R + +L
Sbjct: 1085 AT-GFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
>gi|356576847|ref|XP_003556541.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
Length = 1214
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/389 (23%), Positives = 168/389 (43%), Gaps = 71/389 (18%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1121
+R+L+P R G T + +Q +E A ++ + N KE TLLA+GTA ++
Sbjct: 863 IRVLDP-RTG---NTTCLLELQENEAAFSICTI---NFHDKEYGTLLAVGTAKGLQFLPK 915
Query: 1122 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
+ A G + ++ R ++ ++L ++ +++G AL QG LL GP + L+
Sbjct: 916 RTITA-GFIHIY---RFVEDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDL 970
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
L P +VS++ ++ I +GD+ +S ++ ++ QL + A D
Sbjct: 971 GKRRLLRKCENKLFPNTIVSIHAYRDRIYVGDVQESFHYCKYRRDENQLYIFADD----- 1025
Query: 1242 C----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR 1286
C + ID T++ +D+ NI +S+ W+ KL
Sbjct: 1026 CVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPTGGRIKWEQGKLNGA 1083
Query: 1287 -------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
+FH+G VT + ++ PG + ++FGT+ GS+G +
Sbjct: 1084 PNKVEEIVQFHIGDVVTCLQKASLI----------PGGGE----CIVFGTVMGSVGALHA 1129
Query: 1340 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
D++ F L+ + P + G + ++R A+ P D ++D +L Y
Sbjct: 1130 FTSRDDVDF--FSHLEMHMRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQY 1180
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
LP++ Q +IA + T +IL L ++
Sbjct: 1181 PTLPMDLQRKIADELDRTPGEILKKLEEV 1209
>gi|356536504|ref|XP_003536777.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
Length = 1214
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/389 (23%), Positives = 167/389 (42%), Gaps = 71/389 (18%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1121
+R+L+P + T + +Q +E A ++ V N KE TLLA+GTA ++
Sbjct: 863 IRVLDPRTS----NTTCLLELQENEAAFSICTV---NFHDKEYGTLLAVGTAKGLQFLPK 915
Query: 1122 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
V A G + ++ R ++ ++L ++ +++G AL QG LL GP + L+
Sbjct: 916 RTVTA-GFIHIY---RFVEDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDL 970
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
L P ++S++ ++ I +GD+ +S ++ ++ QL + A D
Sbjct: 971 GKKRLLRKCENKLFPNTIISIHAYRDRIYVGDVQESFHYCKYRRDENQLYIFADD----- 1025
Query: 1242 C----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR 1286
C + ID T++ +D+ NI +S+ W+ KL
Sbjct: 1026 CVPRWLTASYHIDFDTMA--GTDKFGNIYFVRLPQDVSDEIEEDPTGGRIKWEQGKLNGA 1083
Query: 1287 -------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
+FHVG VT + ++ PG + ++FGT+ GS+G +
Sbjct: 1084 PNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIVFGTVMGSVGALHA 1129
Query: 1340 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
D++ F L+ + P + G + ++R A+ P D ++D +L Y
Sbjct: 1130 FTSRDDVDF--FSHLEMHMRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQY 1180
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
LP++ Q +IA + T +IL L ++
Sbjct: 1181 PTLPMDLQRKIADELDRTPGEILKKLEEV 1209
>gi|224004656|ref|XP_002295979.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
associated protein [Thalassiosira pseudonana CCMP1335]
gi|209586011|gb|ACI64696.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
associated protein [Thalassiosira pseudonana CCMP1335]
Length = 1212
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 122/289 (42%), Gaps = 47/289 (16%)
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
G + +L QG LL+ G + L++ +L P V +L + +GD+
Sbjct: 945 GPVLSLVHFQGRLLVGVGKTVRLYEMGKRQLLKKCELRGMPTMVKTLQAAGDRAFVGDMM 1004
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+S+ F+ + +L L+AKD E L+D +T++ V D+ N+ I P+ +
Sbjct: 1005 QSMQFIRYDSTANRLVLVAKDRNPRPITCQE-LLDINTVA--VGDKFGNVTILRL-PRGA 1060
Query: 1276 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
++ W KL + +HVG VT R ++A ++
Sbjct: 1061 DAGAIDVTGTRALWDSARDDATPKLETLCTYHVGEVVTSMTRASLVAGGAE--------- 1111
Query: 1319 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
+L++ T+ G +G P D++ F SL+ L P G +P+S+R +++
Sbjct: 1112 -----SLIYVTVTGRVGAFVPFTSRDDVEF--YTSLEGFLRTETPRPTGRDPQSYRSYYA 1164
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
K IVD +L + LP E + +IA + +++ L D
Sbjct: 1165 PMK-------HIVDGDLCDAFAQLPYETKQKIAESLDRSVGEVMKKLED 1206
>gi|353236335|emb|CCA68332.1| probable splicing factor 3B subunit 3 [Piriformospora indica DSM
11827]
Length = 1243
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 114/495 (23%), Positives = 215/495 (43%), Gaps = 71/495 (14%)
Query: 965 CNHGFIYVTSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C GFI + + L+I Q+P GS Q+++PL TP + + + + LI S
Sbjct: 789 CPDGFIGI-ANNTLRIFQVPKLGSKLK-----QEILPLSYTPRKFVSHPQNSYFYLIES- 841
Query: 1024 PVLKPLNQVLSLLIDQEV------GHQIDNHNLS-SVDLHRTYTVEEYE----VRILEPD 1072
+ +I++ V G ++D L ++ + E +RI++P
Sbjct: 842 ----DHRAMSETMIEERVKAIEMSGEKVDREMLELDPRIYGHFKAPEGVWASCIRIIDPV 897
Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG---- 1128
++ A + ++E A ++ VV + E LL +GTA +A R
Sbjct: 898 N----LRSVAAFSLDNNEAAFSIAVVPF---AARNGELLLVVGTAV--DTHLAPRSCSTG 948
Query: 1129 --RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1186
RV F+ G + + ++ ++ +AL + QG L+ G + L+ +L
Sbjct: 949 YLRVYSFTEGGSG------LELLHKTDIDEVPTALMAFQGRLIAGVGKALRLYDIGKKKL 1002
Query: 1187 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1246
A +V+L+ + IL GDI++SIY++++K +L + A D S
Sbjct: 1003 LRKAENRQFATAIVTLSTQGSRILAGDINQSIYYVAYKAAENRLLIFADD-TSARWITAS 1061
Query: 1247 FLIDGSTLSLVVSDEQKNIQIFYY----APKMSESWKGQKLLSRAEFHVGA-HVTKFL-- 1299
++D +T +V +D+ N+ + + ++ + G +L +GA H TK L
Sbjct: 1062 TMLDYNT--VVAADKFGNVFVNRLDEAVSKQVDDDPTGAGILHEKSVLMGAPHKTKMLTH 1119
Query: 1300 -RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKL 1355
+ + TS R G+ R +++ L G+IG + PL D++ F + +L++ +
Sbjct: 1120 FHVGDVITSIQRVALVAGA----REVVVYFGLHGTIGILVPLVTKDDVDF--ISTLEQHM 1173
Query: 1356 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1415
+ G + S+R +++ KA VD +L ++ LP ++QL IA + T
Sbjct: 1174 RTENLSLVGRDHLSWRGYYTPVKA-------TVDGDLCEYFAKLPTQKQLAIAGELDRTV 1226
Query: 1416 SQILSNLNDLALGTS 1430
+L L L + S
Sbjct: 1227 GDVLKKLESLRVTAS 1241
>gi|357478269|ref|XP_003609420.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355510475|gb|AES91617.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 1225
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 112/509 (22%), Positives = 209/509 (41%), Gaps = 90/509 (17%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + V S+ + G T++ Q VIPL+ TP + ++ L +I S
Sbjct: 755 QCFEGVVSVASEALRIFTVERLGETFN-----QNVIPLRYTPRKFVLQPKRKLLVVIESD 809
Query: 1024 PVLKPLNQVLSLLID-----------------QEVGHQIDNHNLSSVDLHRTYTVEEYE- 1065
+ + + E G + ++++ S D H Y E +
Sbjct: 810 QGALTAEEREAARKECFEAAHAGENKTGSEDQMENGGEDEDNDDSLSDEHYGYPKSESDK 869
Query: 1066 ----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---- 1117
+R+L+P R G T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 870 WVSCIRVLDP-RTG---NTTCLLELQENEAAFSICTV---NFHDKEYGTLLAVGTAKGLQ 922
Query: 1118 YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKII 1177
+ + A G + ++ R D+ ++L ++ +++G AL QG LL GP +
Sbjct: 923 FTPKRSLTA-GFIHIY---RFLDDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLR 977
Query: 1178 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1237
L+ L + P +VS++ ++ I +G I +S ++ ++ QL + A D
Sbjct: 978 LYDLGKRRLLRKCENKSFPSSIVSIHAYRDRIYVGGIQESFHYCKYRRDENQLYIFADD- 1036
Query: 1238 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR 1286
+ + ID T++ +D+ NI +S+ W+ KL
Sbjct: 1037 SVPRWLTSSYHIDFDTMA--GADKFGNIFFARLPQDVSDEIEEDPTGGKIKWEQGKLNGA 1094
Query: 1287 -------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
+FHVG +T + ++ PG + +++GT+ GS+G +
Sbjct: 1095 PNKVEEIVQFHVGDVITSLQKASLV----------PGGGE----CIVYGTVMGSVGALHA 1140
Query: 1340 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
D++ F L+ + P + G + ++R A+ P D ++D +L +
Sbjct: 1141 FTSRDDVDF--FSHLEMHMRQDNPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQF 1191
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
LP++ Q +IA + T +IL L ++
Sbjct: 1192 PTLPMDLQRKIADELDRTPGEILKKLEEV 1220
>gi|324518783|gb|ADY47203.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 108
Score = 68.9 bits (167), Expect = 2e-08, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 60/100 (60%), Gaps = 5/100 (5%)
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1303
A +FLID ++ ++SDE NI +F Y P+ ES G++L+ R+E ++G +V F+R++
Sbjct: 10 AAQFLIDNRQMAFIMSDEAANIAVFNYLPEALESSGGERLILRSEINIGTNVNSFMRVKG 69
Query: 1304 LATSSDRTGAAPGSDKT-NRFALLFGTLDGSIGCIAPLDE 1342
+S G + NR ++LF +LDGS G + PL E
Sbjct: 70 HISS----GFVENEHYSLNRQSVLFCSLDGSFGFVRPLSE 105
>gi|358056808|dbj|GAA97158.1| hypothetical protein E5Q_03834 [Mixia osmundae IAM 14324]
Length = 1243
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 139/321 (43%), Gaps = 35/321 (10%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR----GRVLLFSTGRNADN 1141
+QS E+ + V+L A+GTA+ V AR GRVL F R+ D
Sbjct: 914 LQSDEHGTALETVSLHGAAH------FAVGTAF-SDRTVDAREPKKGRVLTFM--RDGDK 964
Query: 1142 PQNLVTEVYSKELKGAISALASLQGHLL--IASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
+ V V L+G + L L L IA+ + H ++ + + A
Sbjct: 965 FEQHVHAV----LEGGVFGLCQLPNSFLAAIANAQVKVFHVTEQAHIDQMTCW-AGTFLA 1019
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
S++ + I++GD+++S+ L W E L+ +A++ A EFL G T +
Sbjct: 1020 QSISSRDSQIIVGDLYRSVVLLQWDEAKDTLSEVAREHHVNGMSAVEFL--GFTDDRYIG 1077
Query: 1260 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT-GAAPGSD 1318
EQ+ + IF K + L + FH+G +VT+ + ++ +D + GAAP
Sbjct: 1078 TEQE-LNIFTLT-KTKTRERIDILETEGMFHIGEYVTRIRKGALVPGYTDTSFGAAP--- 1132
Query: 1319 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
LLFGT DGS+G I +L +L++ + + GL +R F + +
Sbjct: 1133 -----QLLFGTSDGSLGVIVNCTPEVSLKLFALERNMRAVIRAFGGLEQVDWRAFRAPHR 1187
Query: 1379 AHRPGPDSIVDCELLSHYEML 1399
H P VD +++ + L
Sbjct: 1188 VHEPV--GFVDGDMIGRFAEL 1206
>gi|268568396|ref|XP_002640241.1| C. briggsae CBR-TAG-203 protein [Caenorhabditis briggsae]
Length = 1218
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/320 (21%), Positives = 145/320 (45%), Gaps = 39/320 (12%)
Query: 1127 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1186
RG V F N D L + E + A+ +G L+ G + ++ +L
Sbjct: 923 RGCVYTFHLSPNGDRFDFL----HRTETPLPVGAIHDFRGMALVGFGKFLRMYDIGQKKL 978
Query: 1187 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLDC 1242
P+ +V++ I++ D +S++FL +++ QL + A D + S C
Sbjct: 979 LAKCENKNFPVNIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVSCVC 1038
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1302
++D T++ ++D+ N+ + +++E + +S++ + G +++
Sbjct: 1039 -----VLDYHTVA--IADKFGNLSVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVE 1091
Query: 1303 MLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSL 1351
++A TS +T PG+++ AL++ T+ G+IGC+ DE+ F +L
Sbjct: 1092 LVANFFIGDTITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNL 1145
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
+ + P + G + S+R +++ K S++D ++ + ++ L +Q E+A +
Sbjct: 1146 EMHVRSEYPPLCGRDHLSYRSYYAPCK-------SVIDGDICEQFSLMELSKQKEVAEEL 1198
Query: 1412 GTTRSQILSNLNDLALGTSF 1431
G T S+I L D+ +F
Sbjct: 1199 GKTVSEISKKLEDIRTRYAF 1218
>gi|68471462|ref|XP_720279.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
gi|46442139|gb|EAL01431.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
Length = 423
Score = 68.6 bits (166), Expect = 3e-08, Method: Composition-based stats.
Identities = 45/199 (22%), Positives = 93/199 (46%), Gaps = 15/199 (7%)
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP + +L ++ WAG + L++ LK ++ NLP++ +
Sbjct: 3 FLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTLDLNLKSTISVFKIDNLPYEIDR 62
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELD 365
++ +PSP+ G L+VG N IH + +A+N + S S Q+ +S +++L+
Sbjct: 63 VIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLE 120
Query: 366 AAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITT 417
+ +D LL +TG+ + DG+ ++R+ + + ++
Sbjct: 121 NCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAI 180
Query: 418 IGNSLFFLGSRLGDSLLVQ 436
+ ++ F+ + G+S L+Q
Sbjct: 181 LDKNMLFIANSNGNSPLIQ 199
>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
queenslandica]
Length = 1142
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 137/318 (43%), Gaps = 27/318 (8%)
Query: 1103 TTTKENETLLAIGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1161
TT E ++ +GTA V+ E+ ++ GR+L+F+ + ++ K GA+ +
Sbjct: 821 TTNDEERSVYVVGTALVKPEEKESSTGRILVFAVNSGK------LELLHEKLENGAVFQV 874
Query: 1162 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
G +L + + ++ L Y L + L +FIL+GDI +S+ L
Sbjct: 875 LGFNGKILNSVNSGVFVNALVDGALKEECAYKNNIL-ALYLKTKGDFILVGDILRSLKLL 933
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
+KE+ L + D CF T +ID + + ++I + K +E+
Sbjct: 934 VYKEE-LGLEEIGVDHNISPCFCTAIEMIDDENY---LGADGRHI---FICQKNTEATSE 986
Query: 1281 QKLLSRAE---FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1337
LL + + G +V F R + D GA S + +LFGT+ G+IG I
Sbjct: 987 ADLLYMVQPSRMYFGDNVNVFSRGSFVM---DHPGAGASSLLQGK-PILFGTVHGAIGLI 1042
Query: 1338 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHY 1396
L+ T+ L LQ+K+ ++ V + +R F + HR P +D +L+ +
Sbjct: 1043 GTLNMDTYTLLSKLQQKMAANIKSVGNIEHEIYRSFSNE---HRSKPFAGFIDGDLVEKF 1099
Query: 1397 EMLPLEEQLEIAHQTGTT 1414
LP + +I TT
Sbjct: 1100 LELPRPQMSQIVQGIKTT 1117
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 104/447 (23%), Positives = 165/447 (36%), Gaps = 92/447 (20%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L DL ++ D F+HG P + + E GRV + IS K+
Sbjct: 156 IRLEDL---YITDIQFLHGTENPTIAYISEEPSVATGRV--------LKTFVISQRDKEL 204
Query: 296 -PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
P W + A L +VPSP G++VVGA+++ Y N+ + ++D
Sbjct: 205 LPGPWKPNTIEGQASLLCSVPSPYNGLIVVGADSVAY----------FNDTSHTVDPIV- 253
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV------VQRLDLSKTNP 408
+ S S H+ +L D G L+ L + + + + + L
Sbjct: 254 IKESVISCIEPLDHSRYLLGDFR------GRLLTLFLEFSEEMESGMTNIVNMKLEVLGE 307
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ ++ + N + F+GS GDS LV+ LSS E G I
Sbjct: 308 ISIPHTLSYLDNGVVFVGSTKGDSQLVK---------LSSSPLENGGYI----------- 347
Query: 469 RSSSDALQDMVN-GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
D L+ M N G L + S Q L G L+ G+ IN
Sbjct: 348 ----DVLESMTNIGPILDM----SVVDLDKQGRDVLVCCSGLGKDGALRIVKSGIGINEA 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
AS ++LPG KGIW++ + A +DE ++++ +T
Sbjct: 400 AS------------IDLPGIKGIWSL-------------KCAGREDELDDTVVLTFVGQT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
M L A E TE +T N+ G +IQ+ + R++D M + P
Sbjct: 435 MALRLAGEEVEETELPALVTDQQTFYCSNVTG-NAIIQITTKSVRLMDDKAMELICDWSP 493
Query: 648 SNSE--SGSGSENSTVLSVSIADPYVL 672
+ S + +S V+ D Y L
Sbjct: 494 PDGRGISTAACNSSQVMVAVGCDLYYL 520
>gi|168045572|ref|XP_001775251.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673464|gb|EDQ59987.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1201
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/385 (22%), Positives = 163/385 (42%), Gaps = 62/385 (16%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1122
+R+L+P + T + +Q +E A ++ V + KE TL+A+GTA
Sbjct: 849 IRVLDPKTS----TTTCLLELQENEAAFSICTVNFHDN--KELGTLIAVGTAKDLQFMPR 902
Query: 1123 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1182
A+ G + ++ ++ V+ + G +AL QG LL+ G + ++
Sbjct: 903 KEASGGFIHIYRFAEEG----RVLELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIYDLG 958
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+L P +++++ + I +GDI +S +++ ++ QL A D S
Sbjct: 959 KRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--SCPR 1016
Query: 1243 FATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSRA--- 1287
+ T L ID T++ +D+ N+ + +SE W+ +L
Sbjct: 1017 WLTASLHIDFDTMA--GADKFGNVYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKV 1074
Query: 1288 ----EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1340
+FHVG VT + ++ PG ++ +L+GT+ GS+G + P
Sbjct: 1075 DEIIQFHVGEVVTSLQKASLI----------PGGGES----MLYGTVMGSMGALLPFSSR 1120
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
+++ F L+ L P + G + +FR A+ P D ++D +L Y ML
Sbjct: 1121 EDVDF--FSHLEMHLRQENPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYSMLT 1171
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDL 1425
E Q +IA T +I+ L D+
Sbjct: 1172 SELQKKIADDLDRTPGEIVKKLEDI 1196
>gi|339259094|ref|XP_003369733.1| splicing factor 3B subunit 3 [Trichinella spiralis]
gi|316965959|gb|EFV50595.1| splicing factor 3B subunit 3 [Trichinella spiralis]
Length = 1241
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 125/276 (45%), Gaps = 29/276 (10%)
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
A++ALAS +G LL ++G + ++ +L P + + + + I +GD+ +
Sbjct: 982 AVTALASFRGRLLASAGKMLRIYDLGKKKLLRKCENKHMPNLITHILTMGHRIFVGDVQE 1041
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL-VVSDEQKNIQ-----IFYY 1270
S++F +K QL + A D C A ++D T++L + SD ++Q I
Sbjct: 1042 SVFFYRYKPIENQLVVFADDTHQRFCSAM-CILDYDTVALRLPSDCTDDVQEDPTGIRAL 1100
Query: 1271 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
K + QK A F+VG VT + ++ PGS ++ L++ T+
Sbjct: 1101 WDKGILNGASQKCEMVATFYVGECVTCLQKAMLI----------PGSSES----LVYSTM 1146
Query: 1331 DGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
G IG + P + + Q L+ L P + G + ++R F++ K ++D
Sbjct: 1147 SGMIGALVPFSSKEDYEFFQHLEMHLRTEYPPLCGRDHLAYRSFYAPVKG-------VID 1199
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+L Y +L +Q EI+++ S+I+ L D+
Sbjct: 1200 GDLCEQYCLLEYGKQKEISNELDRVPSEIMKKLEDI 1235
>gi|156097003|ref|XP_001614535.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803409|gb|EDL44808.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 2558
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/323 (21%), Positives = 145/323 (44%), Gaps = 38/323 (11%)
Query: 1110 TLLAIGTAYVQGEDVA--ARGRVLLFSTGRNADNPQNLVTEVYSKELK-GAISALASLQG 1166
TL+ +GTA E + + G + +F + + Q + VY+ + G I+ L +
Sbjct: 2215 TLICVGTAN-NNERITEPSSGHIYVFVAKKQTN--QFEIKHVYTYNVSCGGITHLKQFRD 2271
Query: 1167 HLLIASGPKIIL------------HKWTGTELNGIAFYDA--------PPLYVVSLNIVK 1206
++ A +++ + + ++ I DA P +++SL++V+
Sbjct: 2272 KIVAAVNNTVVILDIGNFLANLGAYIYNSSKAIKIESNDAFLEVASFTPSSWIMSLDVVE 2331
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
N+I++GDI S+ LS+ + A LN + +D+ ++ C + L S +VSD + N
Sbjct: 2332 NYIVVGDIMTSVTLLSYDFENAILNEVCRDYANIWCTSVSAL---SENHFLVSDMESNFL 2388
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTGAAPGSDKTNRFA 1324
+ + + KL ++F+ G+ V K L+ L +R +++
Sbjct: 2389 VLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMFSTSLRNLVDDEERRNEILQKEQS---- 2444
Query: 1325 LLFGTLDGSIGCIAPLDE-LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1383
+L + +GSI + P L F+R ++ + D++ + L+ S+R++ + +
Sbjct: 2445 ILCASSEGSISALIPFSNFLQFKRALCIEIAINDNISSLGNLSHSSYREYKVSLAS--KN 2502
Query: 1384 PDSIVDCELLSHYEMLPLEEQLE 1406
+VD EL + LP E QL+
Sbjct: 2503 CKGVVDGELFKMFFYLPFERQLK 2525
>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1517
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/314 (23%), Positives = 137/314 (43%), Gaps = 42/314 (13%)
Query: 1105 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYSKELK---- 1155
T E + + IGTAY ED +GR+L+ P + + + YS+ ++
Sbjct: 1124 TSEYKPYILIGTAYAYPDEDEPTQGRILVVECNSGEAEPHLKSDDDMEDTYSRYVRHVTQ 1183
Query: 1156 ----GAISALASLQGHLLIAS-GPKIILHKWT--GTELNGIAFYDAP------PLYVVSL 1202
G + +++ G ++A+ K L + + ++ + F A L+V SL
Sbjct: 1184 MPTRGGVYSISPFYGGTVLATVNSKTHLCRLSIGCDQIGELKFVGAGHHGHMLSLFVKSL 1243
Query: 1203 -------------NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
K ++GD+ +SI + ++ + + LA+D+ + C A E L
Sbjct: 1244 AGSESESESSGTNRQAKQLAIVGDLMRSISLVEYQPKHNVIEELARDYNANFCTAVEMLT 1303
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G+ L S+ N+ + + S +L + E+H+G KF+ ++ S+
Sbjct: 1304 NGTYLG---SEGFNNLFVLRHNANASSEEARVRLDTVGEYHLGEMTNKFMGGSLIMPSN- 1359
Query: 1310 RTGAAPGSDKTNRFA-LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
+G G+ + LFGT+DGSIG + LD TF L LQ+ ++ V V ++
Sbjct: 1360 -SGGIMGAQNAYVGSQTLFGTVDGSIGSVLGLDGPTFAFLACLQRAILSIVKTVGDISHE 1418
Query: 1369 SFRQFHSNGKAHRP 1382
+R F + + RP
Sbjct: 1419 EYRAFRAERQV-RP 1431
>gi|428180158|gb|EKX49026.1| hypothetical protein GUITHDRAFT_68305 [Guillardia theta CCMP2712]
Length = 1202
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 89/391 (22%), Positives = 157/391 (40%), Gaps = 71/391 (18%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
VR+++P+ +T+ I + +E AL+V V T ++ ++ T L GTA G V
Sbjct: 847 VRVIDPNE----RETKQIIELDPNEAALSVCVATFYD---RKGHTFLCFGTAV--GHKVG 897
Query: 1126 ARGRVLLFSTGRNADNPQNLV----TEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
+R TG + ++V T V+ + G AL S QG LL+ G + L++
Sbjct: 898 SR-------TGSGFLHTYSVVGSQLTFVHKTPIDGVPRALCSFQGRLLVGVGSALRLYEM 950
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----- 1236
+L P VV+++ + + I +GD+ +SI FL + +L + A D
Sbjct: 951 GKRKLLRKCENRNIPNLVVTISTMGDRIYVGDVAESISFLKYNRILNELVIFADDTHPRW 1010
Query: 1237 -----------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
D F FL +S +S+E + +F +K ++++
Sbjct: 1011 MTAACPVDYDTVAGADKFGNIFLTRLPDNVSDEISEEPGAVGMFEGNDLQGAHYKAEEIV 1070
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---- 1340
++HVG V + + SD A+++GT+ G IG + P
Sbjct: 1071 ---QYHVGETVCSLQKATLSPGGSD--------------AIIYGTMYGGIGALQPFVSRE 1113
Query: 1341 DELTFRRLQSLQKKLVDSVPH------VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1394
D F L+ + + H + G + SFR ++ K +VD +L
Sbjct: 1114 DVDFFLHLEMHLRGAAGAREHKPAGEGICGRDQLSFRSYYFPVK-------DVVDGDLCE 1166
Query: 1395 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L Q +IA T ++ L D+
Sbjct: 1167 TFNYLSPSRQKQIAEDLDRTPGEVAKKLEDM 1197
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 110/554 (19%), Positives = 202/554 (36%), Gaps = 97/554 (17%)
Query: 99 ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
++ +C G + S+A G++ +D ++L + +ISVLEF +
Sbjct: 47 GKIQSICQMECFGLIRSMASFRLPGSN----KDYLVLGADSGRISVLEFSKERNQFERVH 102
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
+ + K G G + DP+GR + Q ++ V + D
Sbjct: 103 LETYG-------KSGCRRIVPGQFLASDPKGRAVMISAIEKQKLVY---------VFNRD 146
Query: 219 TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRV 274
S+++ S + H G+ P+ L + + G+
Sbjct: 147 A-------SSKLTISSPLEAHKASTIHFSIVGVDVGFDNPIFAALEMDYSDADADETGQ- 198
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP-----IGGVLVVGANTI 329
S + +++ + L + + P DA + +P P GVLV N I
Sbjct: 199 SAEEFNKVLTFYELDLGLNH---VVRKASEPIDAASNMLIPVPGDTDGPSGVLVCAENKI 255
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
Y +AL + Q L + +S H LL ++ GDL L
Sbjct: 256 AYKKPDHEDVVALIPRRQGMPLDQPLLITGYS------HLKQKDGFFFLLQSEIGDLYRL 309
Query: 390 TVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---ML 446
T+ Y+ V ++++ + + IT + F+ S G+ L QF G+ M+
Sbjct: 310 TLTYNDEEVSEINITYFDTVPVAQSITILKTGFLFVASEFGNHALYQFLSIKGSDESDMM 369
Query: 447 SSGLKEEFGDIE----ADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
++ E IE A P L ++L +++ L L G E + ++
Sbjct: 370 PVEVEIEGETIEIPHFAPRPLKNLLLVDEMESLSPILDMRVLDLAG------EETPQIYA 423
Query: 503 FAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHKSS 558
L GP L+ +GL + + + ELP +WTV
Sbjct: 424 ------LCGKGPRSTLRTLRHGLAV------------AEMAVSELPSNPLAVWTV----- 460
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
+G + D++ Y++++ T+VL D + EVT+S + +T++ +L
Sbjct: 461 KGSSKDAA---------DKYIVVTFANATIVLSIGDTVEEVTDS-GFLATNKTLSV-SLL 509
Query: 619 GRRRVIQVFERGAR 632
G ++QV G R
Sbjct: 510 GDDSLLQVHPNGLR 523
>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
Length = 702
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 159/358 (44%), Gaps = 25/358 (6%)
Query: 1078 WQTRATIPMQSSENALTVRVVTLFN-TTTKENETLLAIGTAYV-QGEDVAARGRVLLFST 1135
+ R +P++ SE AL V ++F + E + +GTA+V E+ ++GR+L+
Sbjct: 317 YAIRDELPLKPSEIALCVASGSIFPLSNAPERNEVFVVGTAFVLPEENEPSQGRLLVL-- 374
Query: 1136 GRNADNPQNLVTEVYSKELKGAISALASLQGHLL--IASGPKIILHKWTGTELNGIAFYD 1193
R ++ LV E L G ++ +G ++ + S ++ + ++ +A +
Sbjct: 375 -RAVEHRLELVAETM---LSGGCLSICLFKGKVVCGVNSELQVFDVDEKTSTISKLA-SE 429
Query: 1194 APPLYVVSL--NIVKNFILLGDI------HKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
+ V SL N I LGDI +K + + Q AQL +A + D A
Sbjct: 430 VACISVTSLSPNEADETIALGDILYSVVVYKLVLEVVRGRQLAQLECIASERRRRDVTAL 489
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPK--MSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1303
E L + + +VV D N+ + + + S + ++++ FH+ + +F+ +Q+
Sbjct: 490 ERLPEAQS-EMVVGDAYGNLMVMQVVEEADLDRSNPQKIVVTKESFHLDDQINRFVPVQL 548
Query: 1304 LAT-SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHV 1362
+ + D+ + F L F T+ G IG I L++ FR L++++ + + + V
Sbjct: 549 FRSGAEDKKKEKRAEESEIAFNLAFATVSGRIGMIGALNDREFRMLRAIETAMENVITPV 608
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1420
GL+ + +R SN +D +L+ + L E Q +IA T LS
Sbjct: 609 GGLDHKQWR--CSNTPFGIKNLAYCIDGDLVEMFLELDDESQAKIADSVSTELRSALS 664
>gi|221055487|ref|XP_002258882.1| CPSF (cleavage and polyadenylation specific factor), subunit A
[Plasmodium knowlesi strain H]
gi|193808952|emb|CAQ39655.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium knowlesi strain H]
Length = 2478
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 70/322 (21%), Positives = 139/322 (43%), Gaps = 36/322 (11%)
Query: 1110 TLLAIGTAYVQGEDVA--ARGRVLLFSTGRNADNPQNLVTEVYSKELK-GAISALASLQG 1166
TL+ +GTA E + + G + +F + + Q + +Y+ + G I+ L +
Sbjct: 2135 TLICVGTA-NNNERITEPSSGHIYVFVAKKKTN--QFEIKHIYTYNVNCGGITHLKQFRD 2191
Query: 1167 HLLIASGPKIILHKWTGTELN-GIAFYDA-------------------PPLYVVSLNIVK 1206
++ A +++ N G Y+A P +++SL++VK
Sbjct: 2192 KIVAAVNNTVLILDIRNFLTNLGTYIYNASKAMKVESNDAFLEVASFTPSSWIMSLDVVK 2251
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
N+I++GDI S+ LS+ + A LN + +D+ ++ C A S +VSD + N
Sbjct: 2252 NYIVVGDIMTSVTLLSYDFENAILNEVCRDYANIWCTAL------SEDHFLVSDMESNFL 2305
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFAL 1325
+ + + KL ++F+ G+ V K L + + + P ++
Sbjct: 2306 VLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMLSTSLRNLVDEYESEERPNEIVQKERSI 2365
Query: 1326 LFGTLDGSIGCIAPLDE-LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
L + +GSI + P + F+R ++ + D++ + L+ S+R++ +
Sbjct: 2366 LCASSEGSISTLIPFSNFIQFKRALCIEIAINDNISSLGNLSHSSYREYKITLAS--KNC 2423
Query: 1385 DSIVDCELLSHYEMLPLEEQLE 1406
+VD EL + LP E QL+
Sbjct: 2424 KGVVDGELFKMFFYLPFERQLK 2445
>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
Length = 1329
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 75/322 (23%), Positives = 142/322 (44%), Gaps = 35/322 (10%)
Query: 1099 TLFNTTTKENETLLAIGTAYVQGEDVAA--RGRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
++ + T E++ I +A V D +GR+L+ R+ +L V+ KE+ G
Sbjct: 995 SILSCTMGEDQNPFFILSAAVITADETEPLQGRLLMLRYERDGQGNSSL-NLVHEKEVNG 1053
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS--LNIVKNFILLGDI 1214
+ A+AS + LL+A ++L +W +++ G+ + L+V + L + IL+GDI
Sbjct: 1054 CVYAMASFKSKLLVAMNSSVLLFEW--SDVTGLQLVSSCSLFVTAMHLKVRDEVILVGDI 1111
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDF-----GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
+SI L + + A+D+ +++ ++ + SL ++ QK++Q
Sbjct: 1112 QRSIAVLRYVPSESSFVEEARDYHPNWISAIEVIDNDYFMAAEN-SLNITVSQKDLQ--- 1167
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
+SES Q + S H+G ++ F S + A S +N ++ GT
Sbjct: 1168 -QQPVSES---QVVKSAGRLHLGEYINVFKH----GALSMYSYAGISSLVSN--PIMIGT 1217
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL----NPRSFRQFHSNGKAHRPGPD 1385
+GSI + + FR L LQ+ D VP G + R + + N A
Sbjct: 1218 AEGSILIYCQIHDSHFRVLNDLQRCFSDIVPDNVGCIAYDSYRRYVVYEKNAPAF----- 1272
Query: 1386 SIVDCELLSHYEMLPLEEQLEI 1407
+D +L+ +P +E + +
Sbjct: 1273 GFIDGDLIEQLLEMPRQEAIRL 1294
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/359 (22%), Positives = 152/359 (42%), Gaps = 74/359 (20%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R + +SA L+ V ++ G + + + + G + + +++ ++++E+D+
Sbjct: 205 RFEVHSVSAEGLQYVTEGKMFGRIGAAKLFTPKGENKAL----MVIVTLKQDVAIVEYDN 260
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVYGLQMIILK 205
RI + L + E+F R G L+ V P G G+ + +
Sbjct: 261 G----RIKT---------LASRNISENFGRPASNGILLSVHPDGEVIGLRIMSSTFKCIT 307
Query: 206 ASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHE 265
++ S L S++ +N + H+ DF+F+HG+ PV+ +++
Sbjct: 308 WNRATSKL------------------STYSLNY---SLTHLSDFVFLHGFQFPVIALIY- 345
Query: 266 RELTWAGRVSWKHH-TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
G + +H TC IS + P WS ++ +A+ L+AVP P+ GV+VV
Sbjct: 346 ------GDLVGRHVITCRISL--DEQEFENGP--WSRGHIEWEAHTLIAVPPPLCGVIVV 395
Query: 325 GANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
G +++ Y +N +S S L +S + DAA L G
Sbjct: 396 GCSSLLY---------IRDNSTISTVSPPFLSKSIVNC-YDAAP----DGLTYFLGQLDG 441
Query: 385 DLVLLTVVYDGRVVQRLDLSKTNPSVL--TSDITTIG----NSLFFLGSRLGDSLLVQF 437
L LL + + ++ LS+ ++L TS ++ SL F+GSR+ DS L++
Sbjct: 442 TLSLLKLDIETDAEGKVTLSRMRATILGVTSPPDSLSYMHKESLLFVGSRIADSKLLRL 500
>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
Length = 1054
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 137/300 (45%), Gaps = 28/300 (9%)
Query: 1112 LAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+ +GTA+ + E ++GR+L+F R DN L+ EV L + L G LL
Sbjct: 754 VVVGTAFHNEVESQQSKGRILVF---RIEDNRLILLDEV---ALPACVYCLLPFNGRLLA 807
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGA 1228
++ W G + N + ++ + +S ++V +F+L+ D+ KS+ L +QGA
Sbjct: 808 GINKRVQAFNW-GVDTNKLTKAESYSGHTLSHSMVSRGHFVLVADLMKSMTLLVEDQQGA 866
Query: 1229 QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+ LA++ L + + +ID T + D N+ + + S + L +
Sbjct: 867 -IKELARN--PLPIWLSRIEMIDDETF--IGGDNSYNLIVVQKNAEASSEIDNELLDTVG 921
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1347
+FH+G + KF + L TS P D +LFGT+ G+IG I + + +
Sbjct: 922 QFHLGETINKF-KHGSLVTS-------PDMDSPKLPTILFGTVSGAIGVIVSISKDDYEF 973
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLE 1406
+ LQK L V V GL ++R F + H P + +D +L+ + L ++ LE
Sbjct: 974 FEKLQKGLNRVVHGVGGLPFENWRSFSTE---HMTIPSKNFIDGDLIETFLDLRHDKMLE 1030
>gi|449684814|ref|XP_004210722.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
magnipapillata]
Length = 725
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 121/272 (44%), Gaps = 19/272 (6%)
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GT+ V E+ + G+++LF + ++ SK + GA+ L G L
Sbjct: 414 TYYCVGTSMVYPEESEPKEGKIILFQLFEGK------LVQIGSKTVNGAVYVLQGFNGKL 467
Query: 1169 LIASGPKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
L + +++WT EL Y L + L +FIL+GD+ +S+ L++K G
Sbjct: 468 LAGVNSLVSVYEWTSDKELKQECCYHNTIL-ALYLKSKGDFILVGDLMRSMTLLAYKPLG 526
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+L +A DF A E + D + L ++ N+ I + L +
Sbjct: 527 -RLEEIAHDFSPNWMTAVEIIDDDTFLG---AENSFNLFICQKDNSSVNDEERHHLQTIG 582
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1347
++H+G V F ++ S S ++L+GT+ G+IG +A L + TF
Sbjct: 583 KYHLGDFVNVFKHGSLVMHHSTEQLTPISS------SILYGTVRGAIGLVAGLPKNTFDF 636
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
L +Q+KL ++ V + +R F+++ K
Sbjct: 637 LSQVQEKLSKTIKSVGKIEHEFWRSFYNDKKT 668
>gi|241560031|ref|XP_002400960.1| spliceosomal protein sap, putative [Ixodes scapularis]
gi|215501812|gb|EEC11306.1| spliceosomal protein sap, putative [Ixodes scapularis]
Length = 1019
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 91/395 (23%), Positives = 167/395 (42%), Gaps = 64/395 (16%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1124
+R+L P Q+ + ++ +E AL+V +V T+ +E + +G A +Q
Sbjct: 660 IRVLNPSDG----QSLCKVALEQNEAALSVALVRF---TSHPDEQFVVVGAAREMQLNPR 712
Query: 1125 AARGRVLLFSTGRNADNPQNLVTE------VYSKELKGAISALASLQGHLLIASGPKIIL 1178
RG LL T R A NP+ + V++ ++ A +AL QG LL G + L
Sbjct: 713 VCRGGGLLL-TYRLAPNPEEPMAGPTQLELVHATPVEEAPTALCPFQGRLLAGVGKCLRL 771
Query: 1179 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1238
+ +L P +VS+ + N +++ D+ +S +FL +K Q QL + A D
Sbjct: 772 YDLGRKKLLRKCENKYIPNAIVSIQAMGNRVVVSDVQESFFFLRYKRQENQLVIFADD-- 829
Query: 1239 SLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES---------------WKG-- 1280
S+ + T ++D T++ +D+ N+ I +S+ W G
Sbjct: 830 SVPRWITASCMLDYETVA--GADKFGNVSIIRLPSSISDDVDEDPTGIKSLWDRGWLGGS 887
Query: 1281 -QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
QK + FH+G V + ++ PG ++ L++ TL G++G + P
Sbjct: 888 SQKADVISNFHIGETVLSLQKATLI----------PGGSES----LVYVTLSGTVGVLVP 933
Query: 1340 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
++ F Q L+ + P + G + SFR + K +++D +L +
Sbjct: 934 FTAHEDHDF--FQHLEMHMRYENPPLCGRDHLSFRSSYFPVK-------NVIDGDLCEQF 984
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
L +Q IA + S++ L D+ +F
Sbjct: 985 NSLDPSKQKSIAEELDRNPSEVSKKLEDIRTRYAF 1019
>gi|449459948|ref|XP_004147708.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
gi|449513493|ref|XP_004164340.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
Length = 1214
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 97/403 (24%), Positives = 172/403 (42%), Gaps = 70/403 (17%)
Query: 1054 DLHRTYTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1108
D H Y E E +R+L+P A T + +Q +E A +V V N KE
Sbjct: 846 DEHYGYPKAESEKWVSCIRVLDPRSA----TTTCLLELQDNEAAFSVCTV---NFHDKEY 898
Query: 1109 ETLLAIGTA----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
TLLA+GTA + + A G + ++ R ++ ++L ++ +++G ALA
Sbjct: 899 GTLLAVGTAKGLQFFPKRSLVA-GYIHIY---RFLEDGKSLEL-LHKTQVEGVPLALAQF 953
Query: 1165 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
QG LL G + L+ L P +VS+ ++ I +GDI +S ++ ++
Sbjct: 954 QGRLLAGLGSVLRLYDLGKRRLLRKCENKLFPNTIVSIQTYRDRIYVGDIQESFHYCKYR 1013
Query: 1225 EQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE------- 1276
QL + A D S+ + T + +D T++ +D+ NI +S+
Sbjct: 1014 RDENQLYIFADD--SVPRWLTASYHVDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPT 1069
Query: 1277 ----SWKGQKLLSRA-------EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1325
W+ KL +FH+G VT + ++ PG + +
Sbjct: 1070 GGKIKWEQGKLNGAPNKVEEIIQFHIGDVVTSLQKASLI----------PGGGE----CI 1115
Query: 1326 LFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
L+GT+ GS+G + D++ F L+ + P + G + +R A+ P
Sbjct: 1116 LYGTVMGSLGALHAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMGYR------SAYFP 1167
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
D ++D +L + LPL+ Q +IA + T +IL L ++
Sbjct: 1168 VKD-VIDGDLCEQFPSLPLDMQRKIADELDRTPGEILKKLEEV 1209
Score = 44.7 bits (104), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 126/324 (38%), Gaps = 57/324 (17%)
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVLV N + Y +Q A+ + +LP + + AA LL
Sbjct: 246 GVLVCAENFVIYKNQGHPDVRAV------IPRRADLPAERGVLIVSAAMHKQKTMFFFLL 299
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
T+ GD+ +T+ ++ V+ L + + +T+ + + + F S G+ L QF
Sbjct: 300 QTEYGDIFKVTLEHNNDSVKELKIKYFDTIPVTASMCVLKSGFLFAASEFGNHSLYQFQA 359
Query: 440 -GSGTSMLSSG-----LKEEFGDIEADAPSTKRLRR-SSSDALQDMVNGEELSLYGSASN 492
G + SS +E F + K L R ++L +++ + ++L+
Sbjct: 360 IGEDADVESSSATLMETEEGFQPVFFQPRRLKNLMRIDQVESLMPIMDMKIINLF----- 414
Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
E + F+ R GP L+ GL I S + ELPG
Sbjct: 415 -EEETPQIFTLCGR------GPRSSLRILRPGLAI------------SEMAVSELPGVPS 455
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+WTV +DE+ AY+++S T+VL + + EV++S F+
Sbjct: 456 AVWTVKKN--------------INDEFDAYIVVSFANATLVLSIGETVEEVSDS--GFLD 499
Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
A +L G ++QV G R
Sbjct: 500 TTPSLAVSLIGDDSLMQVHPNGIR 523
>gi|170580631|ref|XP_001895346.1| splicing factor 3B subunit 3 [Brugia malayi]
gi|158597745|gb|EDP35799.1| splicing factor 3B subunit 3, putative [Brugia malayi]
Length = 1181
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/364 (21%), Positives = 145/364 (39%), Gaps = 32/364 (8%)
Query: 1079 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1138
+T + P E A + +V F + L+ G A G + F N
Sbjct: 839 ETLSHFPFAEDEAAFAIAMVQ-FQNQSDTQFVLVGCGCDLQLKPRKANGGCIYTFLLAAN 897
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
Q L + ++A+ +G L G K+ L+ +L P
Sbjct: 898 GTTLQLL----HRTPTDEVVNAIHDFRGMALAGVGKKVRLYDLGKRKLLAKCENRQIPTQ 953
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
VV + + I++ D +S++F+ +K+Q QL++ D S L+D T++ V
Sbjct: 954 VVDIRSMGQRIVVSDSQESVHFMRYKKQDGQLSIFC-DETSPRYVTCVCLLDYDTVA--V 1010
Query: 1259 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSSDR 1310
D N+ + ++E + RA + G +L+ +A TS +
Sbjct: 1011 GDRFGNVAVLRLPKGVTEEVQEDPTGVRALWDRGNLNGASQKLEAIAHLYIGDAITSMQK 1070
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T PG++ L + T+ G IG + P DE F Q+L+ + P + G +
Sbjct: 1071 TSLVPGAND----CLSYTTISGIIGILVPFMSRDEFEF--FQNLEMHMRVEYPPLCGRDH 1124
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
++R ++ K S++D +L Y ++PL++Q + + G ++I L D+
Sbjct: 1125 LAYRSYYFPVK-------SVIDGDLCEQYSLMPLDKQKSVGEELGRKPTEIHKKLEDIRT 1177
Query: 1428 GTSF 1431
+F
Sbjct: 1178 RYAF 1181
>gi|357496593|ref|XP_003618585.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355493600|gb|AES74803.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 702
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 105/479 (21%), Positives = 197/479 (41%), Gaps = 68/479 (14%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + + + + G T++ + VIPL+ TP + ++ L +I S
Sbjct: 276 QCVEGVVALAGEALRIFTIERLGETFN-----ETVIPLRYTPMKFVLQPKRKLLVVIES- 329
Query: 1024 PVLKPLNQVLSLLIDQEV-GHQIDNHNLSSVDLHRTYTVEEYE-----VRILEPDRAGGP 1077
+Q ++E G + ++ + D H Y E + +RIL+P
Sbjct: 330 ------DQGAFTAEEREANGGEDEDKDDPLSDEHYGYPKAESDKWASCIRILDPKTG--- 380
Query: 1078 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL--LFST 1135
T + +Q +E A + V N KE TLL +GTA +G R +
Sbjct: 381 -NTTCLLELQDNEAAFSGCTV---NFHDKEYGTLLDVGTA--KGLQFTPRRSLTAGFIHI 434
Query: 1136 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
R ++ ++L ++ +++G AL+ QG LL GP + + L
Sbjct: 435 YRFLEDGRSLEL-LHKTQVEGVPLALSQFQGRLLAGIGPVLRFYDLGKRRLLRKYENKLF 493
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC----FATEFLIDG 1251
P +VS+ ++ I +GD +S ++ ++ QL + A D C + ID
Sbjct: 494 PNTIVSIQTYRDRIYVGDTQESFHYCKYRWDENQLYIFADD-----CVPRWLTASYHIDF 548
Query: 1252 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT 1311
T++ + D +I + K++ + K+ +FHVG ++ + ++
Sbjct: 549 DTMAGIEEDPTGG-RIKWEQGKLNGA--PNKVEEIVQFHVGDVISCLQKASLI------- 598
Query: 1312 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
PG + +L GT+ GSIG + D++ F L+ + P + G +
Sbjct: 599 ---PGGGE----CILNGTVMGSIGALHAFTSRDDVDF--FSHLEMHMRQDNPPLCGRDHM 649
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
++R A+ P D ++D +L + LP++ Q +IA + TR +IL L + +
Sbjct: 650 AYRS------AYFPVKD-VIDGDLCEQFPTLPMDLQRKIADELDRTRGEILKKLEEYKI 701
>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
Length = 1209
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 126/571 (22%), Positives = 234/571 (40%), Gaps = 64/571 (11%)
Query: 887 LRNLRFSRTPLDAY------TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
L+N RT LD TR P + I + N+ G SR ++
Sbjct: 674 LQNGVLLRTVLDPVNGQLTDTRTRFLGSRPVRLIRV--NVHGLPSILALSSRSWLNYTYQ 731
Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS-GSTYDNYWPVQKVI 999
L P + D A++ + C G I + S +L+I Q+P+ G Q VI
Sbjct: 732 NLLHFTPLIFDPLEYAWSFSAEL-CPDGLIGI-SGNVLRIFQVPNLGQKLK-----QDVI 784
Query: 1000 PLKATPHQITYFAEKNLYPLIVSV-PVLKPLNQVLSLLIDQEVGHQIDNHNLS-SVDLHR 1057
PL TP ++ + L+ +I S VL P L + G ++D + D+
Sbjct: 785 PLSYTPRKMLQHPTERLFYVIESDHRVLSPEAADKKLQKLKSTGQRLDQEVIDLPADIFG 844
Query: 1058 TYTVEEYE----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1113
+ ++I++P ++ +P+ ++E A ++ + T + E L
Sbjct: 845 RPRADAGTWASCIQIIDPANV----RSVLEVPLDNNEAAFSLAITTFI---ARPGELFLV 897
Query: 1114 IGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTA +DV + T + +++ ++L ++ E+ AL S QG L+
Sbjct: 898 VGTA----QDVIVSPKSCKSGFLRTYKISEDGRSL-EFLHKTEVDDVPLALLSFQGRLVA 952
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
G + + L + +V+L+ + I++GD+ +SIYF ++K +L
Sbjct: 953 GIGKALRIFDMGKKRLLRKCENKSFATAIVTLSTQGSRIIVGDMAESIYFATYKPPENRL 1012
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSR 1286
+ A D ++D T+ D+ N+ + PK+ E G +L
Sbjct: 1013 LIFADD-SQPRWITASAMVDYDTVC--AGDKFGNVFVNRLPPKVGEQVDEDPTGAGVLHE 1069
Query: 1287 AEFHVGA-HVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1340
+GA H T L + + TS + G R +L+ L G+IG + P
Sbjct: 1070 KGLFMGAPHKTNMLAHYYVGDIITSMHKVALVTG----GRDIVLYTGLHGTIGVLIPFIS 1125
Query: 1341 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+++ F R +L++ + P + G + ++R ++ K +VD +L + +L
Sbjct: 1126 KEDVDFIR--TLEQHMRTEAPSLVGRDHLTYRGYYVPVKG-------VVDGDLCELFSLL 1176
Query: 1400 PLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
P ++Q IA + T S++L L L + T+
Sbjct: 1177 PTQKQQSIAGELDRTYSEVLKKLEQLRVTTT 1207
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 150/376 (39%), Gaps = 83/376 (22%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ + + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIDHEDEEVKTMKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 -TCGSGTSML--SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN-- 492
G + SS + G + + R R + L D +N + + +N
Sbjct: 368 QKLGDDDDEIEYSSVSYPDNGMADPIPQAYFRPRPLENLVLADELNSFDPIVDAKVTNLL 427
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
NT++ Q F+ R + + L+ +GL + S+ ELPG +W
Sbjct: 428 NTDTPQ-IFAACGRGARSSFRMLR---HGLDVEETVSS------------ELPGIPNAVW 471
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV K+ DD+Y AY+I+S T+VL + + EV+++ + T
Sbjct: 472 TVKLKA--------------DDQYDAYIILSFVNGTLVLSIGETIEEVSDT-GFLSSSPT 516
Query: 612 IAAGNLFGRRRVIQVFERGAR------------------ILDGS---------------- 637
IA + G ++QV+ G R I+ +
Sbjct: 517 IAVQQI-GEDSLLQVYPHGIRHVLSDRRVNEWRCPQHTTIVAATTNSRQVAIALSSAQLV 575
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDPST 689
Y DL G N S S VL++SIA+ PY+ +G D ++R++ DP T
Sbjct: 576 YFELDLE-GQLNEYQDRKSLGSGVLAMSIAEVPEGRQRTPYLAVGCEDQTVRIISLDPDT 634
Query: 690 C--TVSVQTPAAIESS 703
+S+Q A SS
Sbjct: 635 TLENISLQALTAPPSS 650
>gi|395330962|gb|EJF63344.1| hypothetical protein DICSQDRAFT_153890 [Dichomitus squalens LYAD-421
SS1]
Length = 1263
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 141/314 (44%), Gaps = 42/314 (13%)
Query: 1112 LAIGTAYVQGEDV-AARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASLQG 1166
A+GT Y++ E+ ++GR+LLFS G N ++L T + S + G + ALA+L
Sbjct: 944 FALGTVYIRPEEREPSKGRILLFSVSSTEGARGANVRSLHT-LASVNVGGCVYALANLSE 1002
Query: 1167 HLLIAS-GPKIILHKWTGTELNGIAFYDAPPL------------YVVSLNIVKNFILLGD 1213
+L++A+ ++L K T E ++ PL +V ++ + IL+GD
Sbjct: 1003 NLIVAAINTSVVLFKSTENEAG-----ESTPLSLEKVTEWNHNHFVTNVVVDGERILVGD 1057
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
S+ L W E+ +L +A+D+G L A I+G+ L+ ++ N+ F
Sbjct: 1058 AISSVSVLKWNERLERLESIARDYGPLWPIA----IEGTGNGLIGANADCNLFSFSLQSV 1113
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
++ L +H+ KF+R + TS+D D+ + + +F T G
Sbjct: 1114 PHRTY----LEKDGVYHLNDVTNKFVRGAL--TSTDV-----AEDQVVKASHVFFTSTGC 1162
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS--IVDCE 1391
IG I ++++T + +LQ+ + ++ G N R S + H S +D +
Sbjct: 1163 IGAILDMNDVTSLHMTALQRNMAKTLTGPGGDNHTKLRA-PSTPRGHTDAEASYGFLDGD 1221
Query: 1392 LLSHYEMLPLEEQL 1405
L Y P EQ
Sbjct: 1222 FLEQYLTHPHPEQF 1235
>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
Length = 1140
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 146/368 (39%), Gaps = 44/368 (11%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D + +L P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHHFVSPETISSLMSAKLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ + + + L LQ++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLQERLKKIIKSVGKIEHTYYRNFQINNKVE--PSEGFIDGDLIES 1097
Query: 1396 YEMLPLEE 1403
+ L E+
Sbjct: 1098 FLDLSREK 1105
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 107/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVMAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTSETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|124359136|gb|ABD32504.2| CPSF A subunit, C-terminal; WD40-like [Medicago truncatula]
Length = 632
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 103/478 (21%), Positives = 196/478 (41%), Gaps = 70/478 (14%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + + + + G T++ + VIPL+ TP + ++ L +I S
Sbjct: 210 QCVEGVVALAGEALRIFTIERLGETFN-----ETVIPLRYTPMKFVLQPKRKLLVVIES- 263
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-----VRILEPDRAGGPW 1078
+Q ++E + ++ + D H Y E + +RIL+P
Sbjct: 264 ------DQGAFTAEEREAAKKDEDKDDPLSDEHYGYPKAESDKWASCIRILDPKTG---- 313
Query: 1079 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL--LFSTG 1136
T + +Q +E A + V N KE TLL +GTA +G R +
Sbjct: 314 NTTCLLELQDNEAAFSGCTV---NFHDKEYGTLLDVGTA--KGLQFTPRRSLTAGFIHIY 368
Query: 1137 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1196
R ++ ++L ++ +++G AL+ QG LL GP + + L P
Sbjct: 369 RFLEDGRSLEL-LHKTQVEGVPLALSQFQGRLLAGIGPVLRFYDLGKRRLLRKYENKLFP 427
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC----FATEFLIDGS 1252
+VS+ ++ I +GD +S ++ ++ QL + A D C + ID
Sbjct: 428 NTIVSIQTYRDRIYVGDTQESFHYCKYRWDENQLYIFADD-----CVPRWLTASYHIDFD 482
Query: 1253 TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1312
T++ ++ +I + K++ + K+ +FHVG ++ + ++
Sbjct: 483 TMA----EDPTGGRIKWEQGKLNGA--PNKVEEIVQFHVGDVISCLQKASLI-------- 528
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
PG + +L GT+ GSIG + D++ F L+ + P + G + +
Sbjct: 529 --PGGGE----CILNGTVMGSIGALHAFTSRDDVDF--FSHLEMHMRQDNPPLCGRDHMA 580
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
+R A+ P D ++D +L + LP++ Q +IA + TR +IL L + +
Sbjct: 581 YRS------AYFPVKD-VIDGDLCEQFPTLPMDLQRKIADELDRTRGEILKKLEEYKI 631
>gi|168046759|ref|XP_001775840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672847|gb|EDQ59379.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1214
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/388 (22%), Positives = 166/388 (42%), Gaps = 68/388 (17%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1122
+R+L+P + T + +Q +E A ++ V + KE TL+A+GTA +
Sbjct: 862 IRVLDPKTS----TTTCLLELQENEAAFSLCAVNFHDN--KELGTLIAVGTAKNMQFMPK 915
Query: 1123 DVAARGRVLLF---STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1179
++ G + ++ GR ++ V+ + G +AL QG LL+ G + ++
Sbjct: 916 KESSGGFIHIYRFVEEGR-------ILELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIY 968
Query: 1180 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1239
+L P +++++ + I +GDI +S +++ ++ QL A D S
Sbjct: 969 DLGKRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--S 1026
Query: 1240 LDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKG------- 1280
+ T L ID T++ +D+ N+ + +SE W+
Sbjct: 1027 CPRWLTASLHIDFDTMA--GADKFGNVYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAP 1084
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
K+ +FHVG VT + ++ PG ++ +L+GT+ GS+G + P
Sbjct: 1085 NKVEEIIQFHVGEVVTSLQKASLI----------PGGGES----VLYGTIMGSVGALLPF 1130
Query: 1341 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
+++ F L+ L P + G + +FR A+ P D ++D +L Y
Sbjct: 1131 SSREDVDF--FSHLEMHLRQENPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYP 1181
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
ML E Q +IA T ++L L D+
Sbjct: 1182 MLTSELQRKIADDLDRTPGEVLKKLEDI 1209
>gi|17508021|ref|NP_491953.1| Protein TEG-4 [Caenorhabditis elegans]
gi|351060889|emb|CCD68627.1| Protein TEG-4 [Caenorhabditis elegans]
Length = 1220
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/394 (19%), Positives = 169/394 (42%), Gaps = 55/394 (13%)
Query: 1071 PDRAGGPWQTRATIPMQSSENALT---------VRVVTLFNTTTKENETLLAIGTAYVQG 1121
P A G W + ++ +S + L+ + V L + NE ++ +G +
Sbjct: 849 PRAARGKWASAISLISATSGDKLSYFELPQDENAKCVALVQFSKHPNEAMVLVGCGVNEV 908
Query: 1122 EDV-----------AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+V RG V F N D L + E + A+ +G L+
Sbjct: 909 LNVHDIDPNDTSIRPTRGCVYTFHLSANGDRFDFL----HRTETPLPVGAIHDFRGMALV 964
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
G + ++ +L P+ +V++ I++ D +S++FL +++ QL
Sbjct: 965 GFGRFLRMYDIGQKKLLAKCENKNFPVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQL 1024
Query: 1231 NLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
+ A D + C ++D T++ V+D+ N+ + +++E + +S++
Sbjct: 1025 VVFADDTTPRYVTCVC---VLDYHTVA--VADKFGNLAVVRLPERVNEDVQDDPTVSKSV 1079
Query: 1289 FHVGAHVTKFLRLQMLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
+ G ++++++ TS +T PG+++ AL++ T+ G+IGC+
Sbjct: 1080 WDRGWLNGASQKVELVSNFFIGDTITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSF 1135
Query: 1341 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
DE+ F +L+ + P + G + ++R +++ K S++D ++ +
Sbjct: 1136 MSKDEVDF--FTNLEMHVRSEYPPLCGRDHLAYRSYYAPCK-------SVIDGDICEQFS 1186
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
++ ++Q ++A + G T S+I L D+ +F
Sbjct: 1187 LMDTQKQKDVAEELGKTVSEISKKLEDIRTRYAF 1220
>gi|341884150|gb|EGT40085.1| CBN-DDB-1 protein [Caenorhabditis brenneri]
Length = 1134
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/310 (19%), Positives = 129/310 (41%), Gaps = 19/310 (6%)
Query: 1107 ENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
+N + +GTA V ++ + GR+++F +TE+ ++GA + L
Sbjct: 816 DNNSYYIVGTALVYPDESETKIGRIIVFEVDETDKTKLRFMTEIV---VRGAPMGIRILN 872
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
G L+ A + + +WT + + + V L ++ I + D+ +S+ LS++
Sbjct: 873 GKLVAAINSSVRMFEWTAEKELRVECSTFNHIAAVDLKVLNEEIAVADVMRSVSLLSYRT 932
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+AKD+ S EF+ S L +++ P + G+ +L
Sbjct: 933 LEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVTDD---GRYVLE 989
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLDEL 1343
+ +TK + +L D D + R+ +++GT GS+G + +D++
Sbjct: 990 PTGYWYLGELTKVMIRAVLVPQPD--------DNSIRYTQPIMYGTNQGSLGLVVQIDDM 1041
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
+ L S++K + D+ + + ++R F N + P +D +L+ +
Sbjct: 1042 YKKFLLSIEKAISDAEKNCMQIEHSTYRSFTYNKRIEPPS--GFIDGDLIESILDMDRSR 1099
Query: 1404 QLEIAHQTGT 1413
+EI + T
Sbjct: 1100 AIEILEKANT 1109
Score = 46.6 bits (109), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 143/354 (40%), Gaps = 82/354 (23%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VPSPI GV+V+G +++ Y S N+ V SS L + F+ +
Sbjct: 210 DSSMLIPVPSPISGVVVLGTHSLLYKSSE-------NDGEVVPYSSPLLENTIFTSHSIV 262
Query: 365 DAAHATWLQNDV--ALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGN 420
D ++ +D LL ++LL V + G V+ + + + + I I N
Sbjct: 263 DPTGERFIVSDTDGRLL------MLLLNAVENQSGLSVKEIRIDLLGDTSVAESINYIDN 316
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
+ F+GSR GDS L++ S S L + + ++DM+
Sbjct: 317 GVVFIGSRFGDSQLIRLLSEKTNSSYISVLDTYY----------------NIGPIRDMIM 360
Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
E ++ + T S A +D G L+ G+ I A+
Sbjct: 361 VE---------SDGQPQLVTCSGAEKD-----GSLRVIRNGIGIEELAT----------- 395
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
V+LPG GI+ + SS AD+ Y+I+SL T VL+ E
Sbjct: 396 -VDLPGVVGIFPIRLDSS----ADN------------YVIVSLVEETHVLQITGEELEDV 438
Query: 601 ESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSE 651
+ + T+ AG LFG V+QV ER R++ +++ + P+N E
Sbjct: 439 QFLQIDTALPTMFAGTLFGPNDSGLVVQVTERQVRLMSNGGLSK--FWEPANGE 490
>gi|307109500|gb|EFN57738.1| hypothetical protein CHLNCDRAFT_56079 [Chlorella variabilis]
Length = 1144
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 162/385 (42%), Gaps = 63/385 (16%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
VRI++P QT I M+ +E AL+V +V + E+ TLLA+GTA QG
Sbjct: 793 VRIVDPVS----LQTTHCIEMEDNEAALSVCLVEF--DSHPEHGTLLAVGTA--QGLKFY 844
Query: 1126 AR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
+ G V L+ R D+ + + ++ ++G A+A+ +G LL+ + L+
Sbjct: 845 PKECQNGFVHLY---RFLDDGKR-IELLHKTAVEGVPGAMAAFKGRLLVGVDAVLRLYDM 900
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
+ Y P + +L++ + I +GD +S +F+ +K+ Q + A D
Sbjct: 901 GKKRMLRKCEYRRLPTRIATLHVSGSRIYVGDGQESTFFMRYKKGDNQFYIFADDIVPRH 960
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG------------------QKL 1283
A L D TL+ +D N+ + ++S + KL
Sbjct: 961 VTAALHL-DYDTLA--GADRFGNVFVSRLPQEVSAQVEDDPTGGKYATETGLLGGAPNKL 1017
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1340
+ FHVG VT R + PG R +++GT++G+IG + P
Sbjct: 1018 RTINSFHVGETVTALQRAVL----------QPG----GRELIVYGTINGAIGVLYPFTSK 1063
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
++ F Q L+ + P + G + ++R F+ K +VD +L Y L
Sbjct: 1064 EDCDF--FQHLEMHMRQEHPPLLGRDHLAYRSFYFPVK-------DVVDGDLCEQYPQLA 1114
Query: 1401 LEEQLEIAHQTGTTRSQILSNLNDL 1425
++ +A + + ++L L D+
Sbjct: 1115 ADKARGVAEELDRSPGEVLKKLEDI 1139
>gi|147779836|emb|CAN63685.1| hypothetical protein VITISV_020449 [Vitis vinifera]
Length = 64
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/47 (74%), Positives = 42/47 (89%)
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
+PRSSFSVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGR+ L
Sbjct: 1 MPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLXYDGRLFTDL 47
>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
Length = 1140
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAKLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ + + + L L+++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIES 1097
Query: 1396 Y 1396
+
Sbjct: 1098 F 1098
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 92/207 (44%), Gaps = 33/207 (15%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
NLR +D +V D F+HG + P ++++H+ GR H I+ K+
Sbjct: 156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFM 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
I W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 204 KIAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA------- 249
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSV 410
+F +A N + LL G L +L + G V+ + + +
Sbjct: 250 -PLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEIS 308
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ IT + N ++G+R GDS LV+
Sbjct: 309 IPECITYLDNGFLYIGARHGDSQLVRL 335
>gi|409075182|gb|EKM75565.1| hypothetical protein AGABI1DRAFT_64324 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1213
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 116/469 (24%), Positives = 197/469 (42%), Gaps = 76/469 (16%)
Query: 996 QKVIPLKATPHQITYFAEKNLYPLIV-------SVPVLKPLNQVL--SLLIDQEVGHQID 1046
Q IPL TP + + NL+ LI V K LN++ + IDQEV
Sbjct: 785 QDSIPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELRQQNRAIDQEV----- 839
Query: 1047 NHNLSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNT 1103
NLS R +RI++P +T + +P+ +E+A ++ VV
Sbjct: 840 -LNLSPEVFGRPKAANGTWASNIRIIDPVEG----KTISVVPLDGNESAFSLAVVPF--- 891
Query: 1104 TTKENETLLAIGTAYVQGEDVAARG------RVLLF-STGRNADNPQNLVTEVYSKELKG 1156
+ K NE L +GTA ++ R RV F GR + V+ E+
Sbjct: 892 SAKGNELHLVVGTA--ADTKLSPRTCSTGFLRVYKFLEDGRQLE-------LVHKTEIDD 942
Query: 1157 AISALASLQGHLLIASGPKIILH----KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
AL + QG L+ G + ++ K ++ F A +V+L+ + IL+G
Sbjct: 943 VPLALMAFQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSA----IVTLSTQGSRILVG 998
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +SI+F +K +L + A D + ++D +T +V +D NI + P
Sbjct: 999 DMQESIFFAVYKAPENRLLIFADD-SQPRWISAATMVDYNT--VVAADRFGNIFVNRLDP 1055
Query: 1273 KMS----ESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFA 1324
++S E G +L ++GA H TK + + L TS + G R
Sbjct: 1056 RVSDQVDEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLVAG----GREV 1111
Query: 1325 LLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
LL+ L G+IG + P +++ F + +L++ + + G + +R ++ KA
Sbjct: 1112 LLYTGLHGTIGILVPFVTKEDVDF--ISTLEQHMRTEQVSLVGRDHLGWRGYYVPVKA-- 1167
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
+VD +L Y LP +Q IA + + ++L L L + S
Sbjct: 1168 -----VVDGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQLRVTAS 1211
Score = 45.8 bits (107), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
S TS SSG+ E +A P + R + AL D + + +
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423
Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
N N+++ Q F+ R + + L+ +GL + S+ +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467
Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
+ T+A + G ++QV G R I+ +
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKTIVAATTNKRQVVV 571
Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
Y DL G N + STVL++SI D PY+ +G D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630
Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
++ DP + T+S+Q P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655
>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
Length = 1140
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/371 (21%), Positives = 149/371 (40%), Gaps = 44/371 (11%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D T+ V +P ++ ++ ++
Sbjct: 780 NAEVGQEIDVHNLLIID-QNTFEV----------------LHAHQFVPPETISTLMSAKL 822
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
+ T + T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 823 -------GDDPNTYYVVATSLVYPEEPEPKVGRIIIFHYNDNK------LTQVAETKVDG 869
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L K+ +A+D A E L D + L D N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ Q L A FH+G + F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ + + + L L+++L + V ++ +R + N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKVE--PSEGFIDGDLIES 1097
Query: 1396 YEMLPLEEQLE 1406
+ L E+ E
Sbjct: 1098 FLDLNREKMRE 1108
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 109/264 (41%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H GR H I+ K+ I
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHRDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPI GV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPICGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L T G V+ + + + +
Sbjct: 260 CYA-RIDEKGLRY------LLGNMDGQLYMLFLGTTETSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++GSR GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGSRHGDSQLVRLS 336
>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=Protein piccolo
gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
melanogaster]
gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
Length = 1140
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ + + + L L+++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKVE--PSEGFIDGDLIES 1097
Query: 1396 Y 1396
+
Sbjct: 1098 F 1098
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|426192113|gb|EKV42051.1| hypothetical protein AGABI2DRAFT_229642 [Agaricus bisporus var.
bisporus H97]
Length = 1213
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 116/469 (24%), Positives = 197/469 (42%), Gaps = 76/469 (16%)
Query: 996 QKVIPLKATPHQITYFAEKNLYPLIV-------SVPVLKPLNQVL--SLLIDQEVGHQID 1046
Q IPL TP + + NL+ LI V K LN++ + IDQEV
Sbjct: 785 QDSIPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELRQQNKAIDQEV----- 839
Query: 1047 NHNLSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNT 1103
NLS R +RI++P +T + +P+ +E+A ++ VV
Sbjct: 840 -LNLSPEVFGRPKAANGTWASNIRIIDPVEG----KTISVVPLDGNESAFSLAVVPF--- 891
Query: 1104 TTKENETLLAIGTAYVQGEDVAARG------RVLLF-STGRNADNPQNLVTEVYSKELKG 1156
+ K NE L +GTA ++ R RV F GR + V+ E+
Sbjct: 892 SAKGNELHLVVGTA--ADTKLSPRTCSTGFLRVYKFLEDGRQLE-------LVHKTEIDD 942
Query: 1157 AISALASLQGHLLIASGPKIILH----KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
AL + QG L+ G + ++ K ++ F A +V+L+ + IL+G
Sbjct: 943 VPLALMAFQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSA----IVTLSTQGSRILVG 998
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ +SI+F +K +L + A D + ++D +T +V +D NI + P
Sbjct: 999 DMQESIFFAVYKAPENRLLIFADD-SQPRWISAATMVDYNT--VVAADRFGNIFVNRLDP 1055
Query: 1273 KMS----ESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFA 1324
++S E G +L ++GA H TK + + L TS + G R
Sbjct: 1056 RVSDQVDEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLVAG----GREV 1111
Query: 1325 LLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
LL+ L G+IG + P +++ F + +L++ + + G + +R ++ KA
Sbjct: 1112 LLYTGLHGTIGILVPFVTKEDVDF--ISTLEQHMRTEQVSLVGRDHLGWRGYYVPVKA-- 1167
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
+VD +L Y LP +Q IA + + ++L L L + S
Sbjct: 1168 -----VVDGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQLRVTAS 1211
Score = 45.8 bits (107), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
S TS SSG+ E +A P + R + AL D + + +
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423
Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
N N+++ Q F+ R + + L+ +GL + S+ +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467
Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
+ T+A + G ++QV G R I+ +
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKIIVAATTNKRQVVV 571
Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
Y DL G N + STVL++SI D PY+ +G D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630
Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
++ DP + T+S+Q P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655
>gi|392570042|gb|EIW63215.1| hypothetical protein TRAVEDRAFT_161375 [Trametes versicolor FP-101664
SS1]
Length = 1213
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 127/555 (22%), Positives = 222/555 (40%), Gaps = 100/555 (18%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
NI + SRP ++ + P + + A++ + C G I + S +L
Sbjct: 714 NIQQNPAILALSSRPWLNYTYQNFMHFTPLIFENLDYAWSFSAEL-CTEGLIGI-SGSLL 771
Query: 979 KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1038
+I Q+P T Q +PL TP + + P N +L L+
Sbjct: 772 RIFQIPKLGTKLK----QDSLPLSYTPRKF----------------MPHPTNGLLYLI-- 809
Query: 1039 QEVGHQIDNHNLSSVDLH----RTYTVEEYEVRILEPDRAGGP------WQT--RATIPM 1086
E H++ + +S L R ++E EV +L P++ G P W + R P+
Sbjct: 810 -EGDHRVMSEEAASKKLQEMRARGERIDE-EVLLLPPEQFGRPKAPAGTWASCIRIINPL 867
Query: 1087 QSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQGEDVAARGRVLLF-S 1134
+SS TV+V+ L N + NE L +GTA Q ++ R F
Sbjct: 868 ESS----TVKVIHLDNNEAAFSMAIVPFAARGNELHLVVGTA--QDTFLSPRSCTSGFLR 921
Query: 1135 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
T R D+ ++L ++ E A+ + QG L+ G + L+ +L
Sbjct: 922 TYRFIDDGRDL-EFLHKTETSDVPLAVMAFQGKLIAGVGKSLRLYDVGKKKLLRKVENKG 980
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
P +V+LN + I++GD+ +S+++ +K +L + A D AT L D +T
Sbjct: 981 FPAAIVTLNTQGSRIIVGDMQESVFYAVYKAPENRLLVFADDAQPRWVTATTML-DYNT- 1038
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLLSRAEFHVGAHVT 1296
+V D N+ + K+S+ KG K + + FHVG VT
Sbjct: 1039 -VVAGDRFGNVFVNRLDSKISDQIDDDPTGAGILHEKGVLFGAPHKSVMLSHFHVGDIVT 1097
Query: 1297 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKL 1355
++ ++A R LL+ L G+IG + P + + L +L++ +
Sbjct: 1098 SLHKVALVA--------------GGREVLLYTCLHGTIGILVPFVSKEDVDLLTTLEQHM 1143
Query: 1356 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1415
+ G + ++R ++ K S+VD +L + LP +Q IA + T
Sbjct: 1144 RTEQLSLVGRDHLTWRGYYVPVK-------SVVDGDLCESFAKLPANKQSTIAGELDRTV 1196
Query: 1416 SQILSNLNDLALGTS 1430
++L L L + S
Sbjct: 1197 GEVLKKLEQLRVTAS 1211
>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
Length = 1140
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAKLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ + + + L L+++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIES 1097
Query: 1396 Y 1396
+
Sbjct: 1098 F 1098
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCMNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|157128866|ref|XP_001655232.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882187|gb|EAT46412.1| AAEL002407-PA [Aedes aegypti]
Length = 980
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+ S L +N LLS K + LL + T
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
P IT + N + F+GSR GDS LV+ +G + + E F ++ AP
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L+ G+ ++ GS G L+ G+ I
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
A ++LPG KG+W + R+ D Y L++S T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
+L + E TE + +T N+ FG ++IQV AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 73/149 (48%), Gaps = 13/149 (8%)
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR++++ AD NL T+V KE+KG+ +L G +
Sbjct: 826 TYYIVGTALVNPEEPEPKVGRIIIY---HYADG--NL-TQVSEKEIKGSCYSLVEFNGRV 879
Query: 1169 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
L + + L++WT + L F + LY + +FIL+GD+ +SI L +K+
Sbjct: 880 LASINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
+A+D+ A E L D + L
Sbjct: 937 MEGSFEEIARDYQPNWMTAVEILDDDAFL 965
>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
Length = 1140
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 76/343 (22%), Positives = 134/343 (39%), Gaps = 40/343 (11%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D + +L + P A + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISALMSAKLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1157
V E E + GR+++F N +T+V ++ G
Sbjct: 832 VATSLVIPDEPEPKV---------------GRIIIFHYHDNK------LTQVAETKVDGT 870
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
AL G +L G + L++WT + + + + L +FIL+GD+ +S
Sbjct: 871 CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
I L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 931 ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987
Query: 1278 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 988 EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+ + + + L L+++L + V + +R F N K
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV 1083
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 112/264 (42%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMEKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELMVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D+ + LL G L +L + G V+ + + + +
Sbjct: 260 CYA-RVDSKGLRY------LLGNMHGQLYMLFLGTSESSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336
>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
Length = 1140
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/361 (20%), Positives = 147/361 (40%), Gaps = 44/361 (12%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D T+ V + +P ++ + ++ ++
Sbjct: 780 NAEVGQEIDVHNLLVID-QNTFEV----------------LHSHQFVPPETISSLMSAKL 822
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
+ T + T+ V ++ + GR+++F N +T+V ++ G
Sbjct: 823 -------GDDPNTYYVVATSLVFPDEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L K+ +A+D A E L D + L D N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ Q L A FH+G + F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
+ + + + L L+++L + V ++ +R + N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKVE--PSEGFIDGDLIES 1097
Query: 1396 Y 1396
+
Sbjct: 1098 F 1098
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 107/263 (40%), Gaps = 49/263 (18%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
+D V D F+HG P ++++H+ GR H + I W
Sbjct: 159 -MDELTVYDVEFLHGCQNPTVIVIHKDN---DGRHVKSHEINLRDKEFIKVA-------W 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A L+ VPS IGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 208 KQDNVETEATMLIPVPSSIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDI 415
++ +D+ + LL G L +L T G V+ + + + + I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGTTETSKGTTVKDIKVEQLGEISIPECI 313
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
T + N ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336
>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
Length = 1140
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 75/338 (22%), Positives = 136/338 (40%), Gaps = 42/338 (12%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D + +L + P + + Q ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAQLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
+ + + + L L+++L + V + + +R F
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKLVGKIGHKFYRNF 1077
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|449488592|ref|XP_004158102.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Cucumis sativus]
Length = 570
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 108/504 (21%), Positives = 194/504 (38%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + + G + +DP R G+ +Y GL +I
Sbjct: 95 ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVI----- 142
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
F + + N+R L+ V D F++G P +V+L++
Sbjct: 143 ----------------PFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSRPTIVVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + + P WS NL + A L+ VP P+ GV+++G T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + R+ V+ D + LL G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS LV+
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|322707263|gb|EFY98842.1| Pre-mRNA-splicing factor rse-1 [Metarhizium anisopliae ARSEF 23]
Length = 1212
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 110/484 (22%), Positives = 198/484 (40%), Gaps = 48/484 (9%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
C G + + Q I I +L G T QK + L TP ++ E+ L+ I
Sbjct: 761 QCEEGVVGIQGQSLRIFSIDRL--GDTL-----TQKAVSLTYTPKKLIKHPEQPLFYTIE 813
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
S P + L+ D V + D L D + I D Q
Sbjct: 814 SDNNTLPPDLRAQLMADPAVVNG-DATVLPPEDFGYPKGNRRWASCINVIDPVSEEPQVV 872
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRN 1138
T+ ++++E A++ +V +++NE+ L +GT G+DV R R
Sbjct: 873 QTVDLENNEAAVSAAIVPF---ASQDNESFLIVGT----GKDVVVNPRNFSEAYIYVYRF 925
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
+ + L ++ +++ AL QG LL G + ++ ++ A + P
Sbjct: 926 QEEGREL-EFIHKTKIEEPALALIPFQGKLLAGVGKTLRVYDLGMRQMLRKAQAEVAPQQ 984
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+VSLN + I++GDI + + ++++K +L A D + T ++D S+
Sbjct: 985 IVSLNTQGSRIIVGDIQQGVTYVTYKPTTNKLIPFADDTIARWTTCTT-MVDYE--SVAG 1041
Query: 1259 SDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDR 1310
D+ N+ I PK S E G L++ ++ G + Q + TS +
Sbjct: 1042 GDKFGNMFIVRCPPKASEEADEEQSGLHLMNARDYLHGTSQRLDLMCHFYTQDIPTSMAK 1101
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G LL+ L G+IG PL ++ F QSL+ L P +AG +
Sbjct: 1102 TSLVVGGQD----VLLWSGLMGTIGVFIPLISREDADF--FQSLESHLRTEDPPLAGRDH 1155
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
+R +++ K I+D +L Y +LP +++ IA + + +I ++D+
Sbjct: 1156 LMYRSYYAPVKG-------IIDGDLCERYTLLPNDKKQMIAGELDRSVREIERKISDIRT 1208
Query: 1428 GTSF 1431
++F
Sbjct: 1209 RSAF 1212
>gi|342320507|gb|EGU12447.1| Pre-mRNA-splicing factor RSE1 [Rhodotorula glutinis ATCC 204091]
Length = 1212
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 115/543 (21%), Positives = 211/543 (38%), Gaps = 88/543 (16%)
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G SRP +R L+ P + D A++ + C G I + L+
Sbjct: 716 VQGSPAILALSSRPWLNYAYRGILQFTPLIFDALDYAWSFSAEL-CPEGLIGIVGNS-LR 773
Query: 980 ICQLPS-GSTYDNYWPVQKVIPLKATPHQI-TYFAEKNLYPLIVSVPVLKP--LNQVLS- 1034
I P G Q VI L TP Q+ T + LY + P + + +S
Sbjct: 774 IFTFPRLGQKVQ-----QTVIDLSYTPRQLLTSPHSRLLYTVEADHRTFSPSAIQKTISD 828
Query: 1035 -LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-VRILEPDRAGGPWQTRATIPMQSSENA 1092
+ + EV ++ N + L R + VR+++P A +T + ++ +E A
Sbjct: 829 MRMAEMEVDEEVLNLDPKEFGLPRGPAGQWASCVRVIDPVTA----ETVFKVDLEQNEAA 884
Query: 1093 LTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---------LFSTGRNADNPQ 1143
+ +VT + NE L +GT G+D + R L GR
Sbjct: 885 FSAAIVTFH---SHPNEVFLVVGT----GQDTSLAPRACKQAYLHTYKLLEEGRQ----- 932
Query: 1144 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1203
+ ++ E+ AL + QG L+ G + L+ +L A +++LN
Sbjct: 933 --LELLHKTEVDDIPKALIAFQGRLVAGVGKALRLYDLGKKKLLRKAENKGFATMIMTLN 990
Query: 1204 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST---------- 1253
I++GD +S+Y+ +K +L + A D A+ ++D T
Sbjct: 991 TQGTRIIVGDAQESVYYALYKAPENRLLIFADDISPRWTTAS-IMVDYETVAAGDKFGNF 1049
Query: 1254 --------LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1305
+S V D+ I + P + + LL A +H+G +T ++ ++A
Sbjct: 1050 FVNRLPKGVSSDVDDDPTGAGIMHEKPYLMGAPHRTHLL--AHYHIGDIITSLHKVALVA 1107
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1362
D L++ L G++G + P +++ F +L+ L P +
Sbjct: 1108 GGRD--------------LLVYTGLMGTVGVLVPFVSNEDVDF--FTTLEMHLRSEAPSL 1151
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
G ++R ++ KA VD +L Y LP+ +Q +IA + T S+++ L
Sbjct: 1152 CGREHLAYRSAYTPVKA-------TVDGDLCEVYRSLPMAKQGQIAGELERTVSEVIKKL 1204
Query: 1423 NDL 1425
+++
Sbjct: 1205 DNV 1207
>gi|225448823|ref|XP_002282354.1| PREDICTED: splicing factor 3B subunit 3-like [Vitis vinifera]
Length = 1214
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 162/392 (41%), Gaps = 77/392 (19%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+RIL+P A T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 863 IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 907
Query: 1126 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1175
+ L F R+ D + ++ +++G AL QG LL G
Sbjct: 908 ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 964
Query: 1176 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1235
+ L+ L P +VS++ ++ I +GDI +S ++ ++ QL + A
Sbjct: 965 LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 1024
Query: 1236 DFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL 1283
D S+ + T + ID T++ +D+ NI +S+ W+ KL
Sbjct: 1025 D--SVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKL 1080
Query: 1284 LSR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
+FHVG VT + ++ PG + +++GT+ GS+G
Sbjct: 1081 NGAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIIYGTVMGSLGA 1126
Query: 1337 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+ D++ F L+ + P + G + ++R A+ P D ++D +L
Sbjct: 1127 LLAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLC 1177
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ LPL+ Q +IA + T +IL L ++
Sbjct: 1178 EQFPTLPLDLQRKIADELDRTPGEILKKLEEV 1209
>gi|296086939|emb|CBI33172.3| unnamed protein product [Vitis vinifera]
Length = 934
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 162/392 (41%), Gaps = 77/392 (19%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+RIL+P A T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 583 IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 627
Query: 1126 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1175
+ L F R+ D + ++ +++G AL QG LL G
Sbjct: 628 ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 684
Query: 1176 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1235
+ L+ L P +VS++ ++ I +GDI +S ++ ++ QL + A
Sbjct: 685 LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 744
Query: 1236 DFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL 1283
D S+ + T + ID T++ +D+ NI +S+ W+ KL
Sbjct: 745 D--SVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKL 800
Query: 1284 LSR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
+FHVG VT + ++ PG + +++GT+ GS+G
Sbjct: 801 NGAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIIYGTVMGSLGA 846
Query: 1337 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+ D++ F L+ + P + G + ++R A+ P D ++D +L
Sbjct: 847 LLAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLC 897
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ LPL+ Q +IA + T +IL L ++
Sbjct: 898 EQFPTLPLDLQRKIADELDRTPGEILKKLEEV 929
>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1156
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 78/345 (22%), Positives = 145/345 (42%), Gaps = 46/345 (13%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ ++G +++ HNL +D H + ++ MQ+ E A+++
Sbjct: 795 NDQLGQEVEIHNLLIIDQHTFEVLHAHQF-------------------MQT-EYAMSIVS 834
Query: 1098 VTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
L N + T +GTA V E +GR+++F D V E +E+KG
Sbjct: 835 TRLGN----DPNTYYIVGTANVLPDESDPKQGRIVVFHW---VDGKLEHVAE---QEIKG 884
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGD 1213
A ++ G LL A + L +W EL F + LY L +F+L+GD
Sbjct: 885 APYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALY---LRAKGDFVLVGD 941
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ +S+ L++K +A+D+ + + E L D + L ++ N+ +
Sbjct: 942 LMRSMSLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTFLG---AESTTNLFVCQKDSA 998
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+ + Q L +FH+G V F ++ T + + ++LFGT+ G+
Sbjct: 999 ATTDEERQHLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSP------TQGSVLFGTIHGA 1052
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
IG ++ L + L +Q+KL + V ++ +R F + K
Sbjct: 1053 IGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTERK 1097
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 90/401 (22%), Positives = 154/401 (38%), Gaps = 81/401 (20%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + +LK + W
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P G L++G +I YH+ + Y V + L R S V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259
Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
++DA + +L D+A G L +L + + ++ V+ L L +
Sbjct: 260 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 313
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + ++GSRLGDS L++ + E F ++
Sbjct: 314 ITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFTNL---------------GP 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 359 IVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS----- 401
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + + R E L++S +T VL +
Sbjct: 402 -------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQTRVLMLSG 454
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
E TE + +T GN+ +++IQV R++D
Sbjct: 455 EEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 494
>gi|399216895|emb|CCF73582.1| unnamed protein product [Babesia microti strain RI]
Length = 1232
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 98/406 (24%), Positives = 164/406 (40%), Gaps = 90/406 (22%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
VRI++P T A + + + E A++ L E LA+GT V G ++A
Sbjct: 866 VRIIDPT----TLSTAAKLLLDTDEAAISCCACDL------EGYRCLAVGT--VTGWNLA 913
Query: 1126 ARG----RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
+ +++ G N + +T ++S ++ G AL + +G LL GP +IL+
Sbjct: 914 NSNSNSCHIRMYAYGPNFE-----ITFLHSTKVTGIPRALLAYEGRLLAGVGPDVILYAL 968
Query: 1182 TGTELNGIAFYD-----------APP--------LYVVSLNIVKNFILLGDIHKSIYFLS 1222
+L A Y A P V+ L N I +GDI +SI L
Sbjct: 969 GKRQLLKKAEYRGGVIDIQGYGVATPRTIGNGGLFGVMWLGASGNRIFVGDIRESITVLK 1028
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES----- 1277
+ EQ A+L+L+ D ++D T++LV D+ + + S S
Sbjct: 1029 FDEQMAKLSLICDDIRP-RWITGATVLDHHTVALV--DKFDTFAVCRVPSEASASNLSSA 1085
Query: 1278 --------------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1323
G K A+FH+G L+T D+ G +
Sbjct: 1086 LNSGSLEAVMPTILGVGNKFEQEAQFHLGD----------LSTCIDKVTLCSGCTE---- 1131
Query: 1324 ALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
A+++ T+ GSIG + P DEL LQ L+ + + P ++G +R ++
Sbjct: 1132 AVVYATILGSIGALIPFISSDELD--TLQHLELLMANENPPLSGREHSIYRSYY------ 1183
Query: 1381 RPGP-DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
GP ++D +L +E L Q IA + T ++I+ L D+
Sbjct: 1184 --GPVQHVIDGDLCEEFESLDSITQSRIAAKIDKTVTEIIKKLRDI 1227
>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
Length = 1140
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 114/267 (42%), Gaps = 51/267 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+++Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L + G V+ + + K +
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
IT + N ++G+R GDS LV+ + S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 142/360 (39%), Gaps = 42/360 (11%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D T+ V L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID-QNTFEV-------LHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1157
V + + E GR+++F + +T+V ++ G
Sbjct: 832 VAT---------------SLVIPDEPEPKVGRIIIFHYHDSK------LTQVAETKVDGT 870
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
AL G +L G + L++WT + + + + L +FIL+GD+ +S
Sbjct: 871 CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
I L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 931 ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987
Query: 1278 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 988 EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ + + + L L+++L + V + +R F N K + +D +L+ +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098
>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1181
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 78/345 (22%), Positives = 145/345 (42%), Gaps = 46/345 (13%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ ++G +++ HNL +D H + ++ MQ+ E A+++
Sbjct: 820 NDQLGQEVEIHNLLIIDQHTFEVLHAHQF-------------------MQT-EYAMSIVS 859
Query: 1098 VTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
L N + T +GTA V E +GR+++F D V E +E+KG
Sbjct: 860 TRLGN----DPNTYYIVGTANVLPDESDPKQGRIVVFHW---VDGKLEHVAE---QEIKG 909
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGD 1213
A ++ G LL A + L +W EL F + LY L +F+L+GD
Sbjct: 910 APYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALY---LRAKGDFVLVGD 966
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ +S+ L++K +A+D+ + + E L D + L ++ N+ +
Sbjct: 967 LMRSMSLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTFLG---AESTTNLFVCQKDSA 1023
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+ + Q L +FH+G V F ++ T + + ++LFGT+ G+
Sbjct: 1024 ATTDEERQHLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSP------TQGSVLFGTIHGA 1077
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
IG ++ L + L +Q+KL + V ++ +R F + K
Sbjct: 1078 IGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTERK 1122
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 92/420 (21%), Positives = 156/420 (37%), Gaps = 94/420 (22%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + +LK + W
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P G L++G +I YH+ + Y V + L R S V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259
Query: 363 ---ELDAAHATWLQNDVA-------------------LLSTKTGDLVLLTVVYDGRV--- 397
++DA + +L D+A LL G L +L + + ++
Sbjct: 260 CYGKVDANGSRYLLGDMAGRLFMLLLEREDKMDGTXYLLGDMAGRLFMLLLEREDKMDGT 319
Query: 398 --VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
V+ L L + IT + N + ++GSRLGDS L++ + E F
Sbjct: 320 TTVKDLKLEFLGEITIAECITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFT 379
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
++ + DM + T S A ++ G L
Sbjct: 380 NL---------------GPIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSL 412
Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
+ G+ I+ AS ++LPG KG+W + + R E
Sbjct: 413 RIIRNGIGIHEHAS------------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAER 460
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
L++S +T VL + E TE + +T GN+ +++IQV R++D
Sbjct: 461 DNTLVLSFVRQTRVLMLSGEEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 519
>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
Length = 1140
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 78/360 (21%), Positives = 142/360 (39%), Gaps = 42/360 (11%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D T+ V L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID-QNTFEV-------LHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1157
V + + E GR+++F N +T+V ++ G
Sbjct: 832 VAT---------------SLVIPDEPEPKVGRIIIFHYHDNK------LTQVAETKVDGT 870
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
AL G +L G + L++WT + + + + L +FIL+GD+ +S
Sbjct: 871 CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
I L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 931 ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987
Query: 1278 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 988 EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ + + + L L+++L + V + +R F N K + +D +L+ +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 113/267 (42%), Gaps = 51/267 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+++Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELSVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L + G V+ + + K +
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
IT + N ++G+R GDS LV+ + S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339
>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
Length = 1104
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 138/323 (42%), Gaps = 51/323 (15%)
Query: 1107 ENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
+N IGTA+V E +RGR+L+ +N + + V+ E G++ L +
Sbjct: 779 DNTEYFIIGTAFVDPTETQPSRGRILI----SKLENKKEIAI-VHECEAAGSVYCLTKMC 833
Query: 1166 GH----LLIASGPKIILHKWTGT---------ELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G L+ +++ K+ T ++G + A VVSL+ + +L+G
Sbjct: 834 GKDTDDLVAGINNQVVHFKYDATGQDAAKKLRAVSGNQNFGA----VVSLDSCDDIVLVG 889
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA----TEFLIDGSTLSLVVSDEQKNIQIF 1268
D+ +++ + + QL ++ + A T FL+ SL V + F
Sbjct: 890 DMLNAVFVMQKAQDKLQLVAGSQTANWVSSCALVNETVFLVASHAHSLSVCQRE-----F 944
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT----NRFA 1324
M Q L ++ E ++G VT F+R + G+A D + N F
Sbjct: 945 EPGSTM------QTLNAKFEIYLGETVTSFVRAAL--------GSAAAVDSSMPLRNTF- 989
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
+FGT+ G + C+ PL L +L+ ++ + + + GL+ R FR + +
Sbjct: 990 FVFGTMGGGLACLLPLTPPQTELLTALECRMEEKIGGLGGLDHREFRTARDEQRMAQQVN 1049
Query: 1385 DSIVDCELLSHYEMLPLEEQLEI 1407
+VD +L+ + LP EEQ E+
Sbjct: 1050 PRLVDGDLVETFLQLPEEEQKEL 1072
>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
Length = 1140
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 141/360 (39%), Gaps = 42/360 (11%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1157
V E E + GR+++F N +T+V ++ G
Sbjct: 832 VATSLVIPDEPEPKV---------------GRIIIFHYHDNK------LTQVAETKVDGT 870
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
AL G +L G + L++WT + + + + L +FIL+GD+ +S
Sbjct: 871 CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMRS 930
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
I L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 931 ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987
Query: 1278 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 988 EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040
Query: 1337 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ + + + L L+++L + V + +R F N K + +D +L+ +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 113/264 (42%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLITVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D+ + LL G L +L + G V+ + + + +
Sbjct: 260 CYA-RVDSKGFRY------LLGNMDGQLYMLFLGTSETSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336
>gi|384080885|dbj|BAM11105.1| damage-specific DNA binding protein 1, 127kDa, partial
[Siebenrockiella crassicollis]
Length = 364
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 107/235 (45%), Gaps = 30/235 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1162
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 152 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 203
Query: 1163 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
G LL + + L++WT TE N + + LYV + +FIL+GD+ +S
Sbjct: 204 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALYVKTKG---DFILVGDLMRS 258
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 259 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 315
Query: 1278 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
+ Q L FH+G V F ++ + T + P D ++LFGT++G
Sbjct: 316 EERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STPTQD-----SVLFGTVNG 364
>gi|342885857|gb|EGU85809.1| hypothetical protein FOXB_03657 [Fusarium oxysporum Fo5176]
Length = 1189
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 103/480 (21%), Positives = 205/480 (42%), Gaps = 63/480 (13%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
C G + + Q I I +L G T +QK IPL TP ++ ++ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFNIDRL--GDTL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1079
+ P LL D ++ + D+ L D + + +++P G Q
Sbjct: 814 ADNNTLPPELRAQLLADPKIVNG-DSRVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870
Query: 1080 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1139
TI ++++E A++ +V+ ++++NE+ L +GT G+D+ R +S G
Sbjct: 871 VVQTIDLENNEAAVSAAIVSF---SSQDNESFLVVGT----GKDMVVNPRS--YSEG--- 918
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
+Y + G L + QG + +A G ++ ++ ++ + + +
Sbjct: 919 ------YLHIYRFQDGGENLTLLAFQGRVAVAVGTQLRIYDLGMRQMLRKSQAEVAAQQI 972
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
VSLN + I++GD+ + + ++ +K +L D + T ++D S+
Sbjct: 973 VSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE--SVAGG 1029
Query: 1260 DEQKNIQIFYYAPKMSES----WKGQKLLSRAEF-HVGAHVTKFLRLQMLATSSDRTGAA 1314
D+ N+ I K SE G L++ E+ H H + + TS +T
Sbjct: 1030 DKFGNMFIVRCPEKASEEADEEQTGLHLINAREYLHGTPH-------RDIPTSITKTSLV 1082
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
G + LL+ + G+IG P ++ F Q+L++ L P +AG + +R
Sbjct: 1083 VGGQEI----LLWSGIMGTIGVFIPFISREDADF--FQNLEQHLRTEDPPLAGRDHLMYR 1136
Query: 1372 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+++ K ++D +L Y +LP +++L IA + + +I ++D+ ++F
Sbjct: 1137 GYYAPVKG-------VIDGDLCERYNLLPNDKKLMIAGELDRSVREIERKISDIRTRSAF 1189
>gi|237837399|ref|XP_002367997.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
gi|211965661|gb|EEB00857.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
gi|221488748|gb|EEE26962.1| splicing factor 3B subunit, putative [Toxoplasma gondii GT1]
gi|221509241|gb|EEE34810.1| splicing factor 3B subunit, putative [Toxoplasma gondii VEG]
Length = 1233
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 119/299 (39%), Gaps = 49/299 (16%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V+S ++ AL + +G LL G K+ L+ L Y P V + + +
Sbjct: 957 VHSTPVEDYPMALTAFRGMLLAGVGHKLRLYALGRKRLLKKCEYKNLPCGVAFIRVAGDR 1016
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL---SLVVSDEQKNI 1265
+ +GD+ +S++ + ++ +LA D +L G L + V +D+ ++
Sbjct: 1017 LFVGDVRESVHVMRYRLSENLFYVLADDV------VPRWLTKGEVLDYHTFVAADKFDSV 1070
Query: 1266 QIFYYAPKMSESWKGQ------------------KLLSRAEFHVGAHVTKFLRLQMLATS 1307
I + E G KL S FH+G VT R + + +
Sbjct: 1071 FICRVPSEAKEDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEIVTALERATLTSAA 1130
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLN 1366
S+ ++++GT+ GSIG +P L + L+ + P +AG
Sbjct: 1131 SE--------------SIVYGTIMGSIGSFSPFLTKHELDLFTHLEMVMRSEKPPLAGRE 1176
Query: 1367 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
FR ++ K + VD +L Y +LP E+Q IA T + IL +L D+
Sbjct: 1177 HIMFRSYYHPAK-------NTVDGDLCESYALLPYEDQKRIAQDFEKTPADILKHLEDI 1228
>gi|429961863|gb|ELA41407.1| hypothetical protein VICG_01512 [Vittaforma corneae ATCC 50505]
Length = 1153
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 165/385 (42%), Gaps = 69/385 (17%)
Query: 1063 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1122
+Y + + PD ++ ++ M+ EN L + +FN L + T++ +GE
Sbjct: 804 QYSIELRSPD-----FKLISSTTME--ENELICDIKVIFNN-------FLVVCTSFPEGE 849
Query: 1123 DVAARGRVLLFSTGRNADNPQNL-VTE----VYSKELKGAISALASLQGHLLIASGPKII 1177
D +G+++++S +P NL +T+ + S+ LK ++ + + G K++
Sbjct: 850 DKMTKGKLIVYSLVNIVPDPDNLHITKKLKLICSETLKNPCLFCEEVRSLISVCVGTKLM 909
Query: 1178 LHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA-QLNLLAK 1235
++++ T L + ++ L SL + KN I + DI IYF + + +L+LL +
Sbjct: 910 IYEFNENTGLAAVGRHEL-SLLCTSLFVTKNLIAVSDIMNGIYFFFLRPRDPLKLHLLGR 968
Query: 1236 D--------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
G +D F F D S+V + ++IF Y+P S G +L+ RA
Sbjct: 969 SCLVPNCRFLGGID-FCPSFETDALQFSIVSVCKYGIVRIFTYSPYDPVSKNGNQLVKRA 1027
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG------SIGCIAPLD 1341
E VTK A P + ++FG ++ S + L
Sbjct: 1028 EI-----VTKL--------------ANP------LYKVVFGQINEFESILLSSNVMVLLR 1062
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE--LLSHYEML 1399
+ F +LQ++Q + + + G+N R++ + + P S++ CE LL +
Sbjct: 1063 AINFPKLQAIQHCISIFISNRCGINVRNYLE---TEEFVNPECKSVI-CEKILLEFFYFK 1118
Query: 1400 PLEEQLEIAHQTGTTRSQILSNLND 1424
PL ++ +I G I+ + D
Sbjct: 1119 PLVQE-KICKLVGLDYFNIVELIED 1142
>gi|358391805|gb|EHK41209.1| hypothetical protein TRIATDRAFT_135379 [Trichoderma atroviride IMI
206040]
Length = 1212
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 103/475 (21%), Positives = 194/475 (40%), Gaps = 87/475 (18%)
Query: 995 VQKVIPLKATP--------HQITYFAEKN---LYPLIVSVPVLKP--LNQVLSLLIDQEV 1041
+Q IPL TP H + Y E + L P + + + P +N +L +E
Sbjct: 787 IQNSIPLTYTPKKMVKHPEHPLFYVIEADNHTLSPDLCAKLLADPARVNGDAKVLSPEEF 846
Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1101
GH N +S + +++P G Q I + +E A+++ +VT
Sbjct: 847 GHPRGNRRWASC------------ISVVDPLAEDG--QALQKIDLDENEAAVSLAIVTF- 891
Query: 1102 NTTTKENETLLAIGT-------------AYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1148
+++NET L +GT AYV GR L+F
Sbjct: 892 --ASQDNETFLVVGTGKDMVVNPRSFSDAYVHIYRFEQEGRGLVF--------------- 934
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
++ +++ A+ QG +L+ G + ++ +L + P + SL+ N
Sbjct: 935 IHKTKVEEPPMAIIPFQGRVLVGIGKILRIYDLGMRQLLRKTQAEVAPQLINSLSTQGNR 994
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQI 1267
I++GD+ + I ++ +K+ +L D ++ + T ++D T++ D+ NI +
Sbjct: 995 IIVGDVQQGITYVVYKQTTNKLIPFVDD--TVARWTTCSTMVDYETVA--GGDKFGNIFV 1050
Query: 1268 FYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKF---LRLQMLATSSDRTGAAPGSDK 1319
K SE ++ L +R H +H L Q + TS +T G
Sbjct: 1051 VRSPQKASEEADEEQAGLHLLNARDYLHGRSHRLDLMCHLYTQDIPTSITKTSLVVGGQD 1110
Query: 1320 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
LL+ L G+IG + P ++ F QSL+ L P +AG + +R +++
Sbjct: 1111 ----VLLWSGLMGTIGVLIPFVTREDTDF--FQSLELHLRAEDPPLAGRDHLMYRSYYAP 1164
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
K ++D +L Y +LP +++ IA + + +I ++D+ ++F
Sbjct: 1165 VKG-------VIDGDLCERYTLLPNDKKQMIAAELDRSVREIERKISDIRTRSAF 1212
>gi|395333071|gb|EJF65449.1| hypothetical protein DICSQDRAFT_178021 [Dichomitus squalens LYAD-421
SS1]
Length = 1213
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 119/498 (23%), Positives = 196/498 (39%), Gaps = 77/498 (15%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI---- 1020
C G I + S +L+I Q+P T Q IPL TP ++ L LI
Sbjct: 759 CQEGLIGI-SGSLLRIFQIPKLGTKLK----QDSIPLSYTPRKLIPHPHNGLLYLIEGDH 813
Query: 1021 ---VSVPVLKPLNQVLSLLIDQEVGHQIDNH--NLSSVDLHRTYTVE---EYEVRILEPD 1072
K L Q+ +E G +D NL R +RI+ P
Sbjct: 814 RVMSEEAAAKQLQQL------RESGRAVDEEMVNLPPEQFGRPKAPAGTWASCIRIISPL 867
Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
A QT I + ++E A ++ +V K NE L +GTA Q +A R
Sbjct: 868 DA----QTVNVIHLDNNEAAFSLAIVPF---AAKNNELHLVVGTA--QDTFLAPRSCTSG 918
Query: 1133 F-STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAF 1191
F T R D+ +NL ++ E A+ + QG L+ G + L+ +L
Sbjct: 919 FLRTYRFTDDGRNLEL-LHKTETNDVPLAIMAFQGKLVAGVGKALRLYDIGKKKLLRKVE 977
Query: 1192 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1251
+V+LN + I++GD+ +S+++ +K +L + A D AT L D
Sbjct: 978 NKTLGSTIVTLNTQGSRIIIGDMQESVFYAVYKPPENRLLVFADDVQPRWVTATTML-DY 1036
Query: 1252 STLSLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLLSRAEFHVGA 1293
+T +V SD N+ + K+S+ KG K A FH+G
Sbjct: 1037 NT--VVASDRFGNVFVNRLDAKISDQIDDDPTGAGILHEKGVLFGAPHKTAMLAHFHIGD 1094
Query: 1294 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQ 1352
VT ++ ++A R +L+ L G+IG + P + + L +L+
Sbjct: 1095 IVTSLNKISLVA--------------GGREVILYTCLHGTIGILVPFVSKEDVDLLTTLE 1140
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+ + + G + ++R ++ KA +VD +L + LP +Q IA +
Sbjct: 1141 QHMRTEQLSLVGRDHLAWRGYYVPVKA-------VVDGDLCESFAKLPANKQSSIAGELD 1193
Query: 1413 TTRSQILSNLNDLALGTS 1430
T ++L L L + S
Sbjct: 1194 RTVGEVLKKLEQLRVTAS 1211
>gi|296814646|ref|XP_002847660.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
gi|238840685|gb|EEQ30347.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
Length = 1235
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 102/463 (22%), Positives = 190/463 (41%), Gaps = 53/463 (11%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + + Q + ++ S DN +Q+ IPL T + E YP+
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNL-LQESIPLAYTSRSLVRHPE---YPIFY-- 808
Query: 1024 PVLKPLNQVLS-----LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGG 1076
V+ N VLS L+ + DN L D + +++++P A
Sbjct: 809 -VIGSDNNVLSPSTKAKLLSESTTVNGDNAELPPEDFGYPRGTNHWASCIQVVDPINAKA 867
Query: 1077 PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
+ I ++ +E A+++ V+ T++E+ET L +GT +G V+ R F
Sbjct: 868 ---VMSRIELEDNEAAVSIAAVSF---TSQEDETFLVVGTG--KGMVVSPRSFTCGFIHI 919
Query: 1137 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1196
+ ++ +++ AL QG LL GP + ++ +L P
Sbjct: 920 YRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQITP 979
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
+V L + I++ D+ +S+ ++ +K Q L A D T ++D T++
Sbjct: 980 RVIVGLQTQGSRIIVSDVQESVTYVVYKYQENNLIPFADDIIPRWTTCTT-MVDYETVA- 1037
Query: 1257 VVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSS 1308
D+ NI + PK SE G L+ ++ GA + Q + TS
Sbjct: 1038 -GGDKFGNIWLLRCPPKASEEADEDGSGAHLIHERQYLQGAPNRLSLVAHFYSQDIPTSI 1096
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGL 1365
+T G R L++ L G++G + P D++ F Q+L+ +L P +AG
Sbjct: 1097 QKTQLVAG----GRDILVWTGLQGTVGMLIPFVTRDDVDF--FQTLEMQLTSQNPPLAGR 1150
Query: 1366 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+ +R +++ K ++D +L + +LP +++ IA
Sbjct: 1151 DHLIYRGYYAPCKG-------VIDGDLCETFFLLPNDKKQAIA 1186
>gi|403411971|emb|CCL98671.1| predicted protein [Fibroporia radiculosa]
Length = 1212
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 122/495 (24%), Positives = 200/495 (40%), Gaps = 71/495 (14%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI---- 1020
C G I + S +L+I Q+P T Q IPL TP + L+ LI
Sbjct: 758 CPEGLIGI-SGSVLRIFQIPKLGTKLK----QDAIPLSYTPRKFISHPTNGLFYLIEGDH 812
Query: 1021 ------VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE---EYEVRILEP 1071
S L L Q ++ D+ V NL R +RI+ P
Sbjct: 813 RVRSDEASAKALGELRQQGKMVDDELV-------NLPPETFGRQKAPAGTWASCIRIINP 865
Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
A +T IP+ ++E A ++ VV+ + + E L +GTA Q +A R
Sbjct: 866 VDA----KTVNIIPLDNNEAAFSLAVVSF---SARSGELHLVVGTA--QDTFLAPRSCTS 916
Query: 1132 LF-STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL---- 1186
F T R D+ NL ++ E A+ QG L+ G + L+ +L
Sbjct: 917 GFLRTYRFTDDGTNLEL-LHKTETNDVPLAVLGFQGRLVAGVGKALRLYDMGKKKLLRKV 975
Query: 1187 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1246
F A +VSL + IL+GD+ +S+ F +K +L + A D A
Sbjct: 976 ENKTFASA----IVSLATQGSRILVGDMQESVSFAVYKPPENKLLVFADDTQPRWTSAMT 1031
Query: 1247 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL-- 1299
++D +T++ +D NI + PK+SE G +L GA H T+ L
Sbjct: 1032 -MVDYNTVA--SADRFGNIYVNRLDPKVSEQVDDDPTGAGILHEKGLLAGAPHKTELLSH 1088
Query: 1300 -RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKL 1355
+ + TS ++ G R LL+ L G+IG + P +++ F + +L++ +
Sbjct: 1089 FHVGDIVTSINKVSLVAG----GREVLLYTGLHGTIGILVPFVSKEDVDF--ISTLEQHM 1142
Query: 1356 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1415
+ G + ++R ++ KA +VD +L + LP +Q IA + T
Sbjct: 1143 RTEQLSLVGRDHLTWRGYYVPVKA-------VVDGDLCETFARLPASKQSAIAGELDRTV 1195
Query: 1416 SQILSNLNDLALGTS 1430
++L L L + S
Sbjct: 1196 GEVLKKLEQLRVTAS 1210
>gi|195996829|ref|XP_002108283.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
gi|190589059|gb|EDV29081.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
Length = 1208
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/390 (21%), Positives = 167/390 (42%), Gaps = 61/390 (15%)
Query: 1070 EPDRAGGPWQT-------RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1122
EP G W + ++ + E AL+V + K +ET + +G A
Sbjct: 852 EPKAGNGQWASCIQLLAPDQSLELDQDEAALSVAICRF---AYKPDETFVVVGVAKELNL 908
Query: 1123 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1182
+ ++ L+ +T R A+ LV + +E+ A+A+ QG LL+ +G + ++
Sbjct: 909 NPSSSSGGLM-NTYRMANGQLELVHKTVVEEVP---RAMAAFQGRLLVGTGRILRVYDLG 964
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+L P +V+++ + + +++GD+ +S++F+ ++ + +L + A D
Sbjct: 965 RKKLLRKCENKNFPYRIVTISSMGSRVIVGDVQESVHFVKYRAKENRLVVFADDVSPRYV 1024
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLL 1284
AT FL D T++ V D+ +I I + +++ W QK
Sbjct: 1025 TATCFL-DYDTIA--VGDKFGSIAILRLSDDINDEIEEDPTGAKAFWDRGLLNGASQKAN 1081
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1341
A F++G V + LQ +T PG ++ L++ TL GSIG + P +
Sbjct: 1082 LEASFYIGETV---MSLQ-------KTTIIPGGSES----LIYTTLSGSIGVLLPFTSRE 1127
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
E+ F Q L+ L + G + ++R ++ K +++D ++ + L
Sbjct: 1128 EVDF--FQHLEMHLRSENAPICGRDHLAYRSYYFPAK-------NVIDGDMCEQFNALDG 1178
Query: 1402 EEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
++ +A + T +I L D+ +F
Sbjct: 1179 SKRRTLAMELDRTPPEISKKLEDMRTRYAF 1208
>gi|374095609|gb|AEY85032.1| spliceosomal-like protein [Camellia sinensis]
Length = 1212
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/388 (21%), Positives = 162/388 (41%), Gaps = 69/388 (17%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1122
+R+L+P A T + +Q +E A ++ L N KE TLLA+GTA +
Sbjct: 861 IRVLDPRTA----NTTCLLELQDNEAAFSI---CLVNFHDKEYGTLLAVGTAKGLQFWPK 913
Query: 1123 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1182
+ G + ++ R ++ ++L ++ ++ AL QG LL G + L+
Sbjct: 914 RSISSGYIHIY---RFVEDGKSLEL-LHKTQVDDVPLALCQFQGKLLAGVGSVLRLYDLG 969
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+L P + S++ ++ I +GDI +S ++ ++ QL + A D C
Sbjct: 970 KRKLLRKCENKLFPNTITSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFADD-----C 1024
Query: 1243 ----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR- 1286
+ ID T++ +D+ NI A +S+ W+ KL
Sbjct: 1025 VPRWLTASYHIDFDTMA--GADKFGNIYFVRLAQDVSDEIEEDPTGGKIKWEQGKLNGAP 1082
Query: 1287 ------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
+FHVG VT + ++ + + +++GT+ GS+G +
Sbjct: 1083 NKVEEIVQFHVGDVVTCLQKASLIPSGGE--------------CVIYGTVMGSLGALLAF 1128
Query: 1341 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
D++ F L+ + P + G + ++R A+ P D ++D +L +
Sbjct: 1129 TSRDDVDF--FSHLEMHMRQENPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFP 1179
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
LP++ Q +IA + T +IL L ++
Sbjct: 1180 TLPMDMQRKIADELDRTPGEILKKLEEV 1207
>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
Length = 1140
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/344 (20%), Positives = 138/344 (40%), Gaps = 42/344 (12%)
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ EVG +ID HNL +D T+ V +P ++ ++ ++
Sbjct: 780 NAEVGQEIDVHNLLIID-QNTFEV----------------LHAHQFVPPETISALMSAKL 822
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1156
+ T + T+ V ++ + GR+++F N +T+V ++ G
Sbjct: 823 -------GDDPNTYYVVATSLVFPDEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1157 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L K+ +A+D A E L D + L D N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
+ Q L A FH+G + F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+ + + + L L+++L + V ++ +R + N K
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKV 1083
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/263 (24%), Positives = 114/263 (43%), Gaps = 49/263 (18%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L +LR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATSLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
+D V D F+HG + P ++++H+ +H C L +K L W
Sbjct: 159 -MDELIVYDVEFLHGCLNPTVIVIHKDN-------DGRHVKCHEINLRDKEFMK---LAW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 208 KQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRV--VQRLDLSKTNPSVLTSDI 415
++ +D+ + LL G L +L + + G+V V+ + + + + I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGINETGKVPTVKDIKVEQLGEISIPECI 313
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
T + N ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336
>gi|167998730|ref|XP_001752071.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697169|gb|EDQ83506.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 172
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 8/96 (8%)
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
E+ A+ ++ A RR S+ + +L F + R +HCFE PE+ +L R
Sbjct: 28 ETAALRTEAAAPGIHRRPSLTMRLRI----ILAFTEC----RCLLIHCFEYPEYQYLNRS 79
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
RE FA V+ D GRC VL+Y Q++ LKA G
Sbjct: 80 RERFAMDLSVRADLVGRCASVLIYNSQLVTLKAGHG 115
>gi|413935524|gb|AFW70075.1| hypothetical protein ZEAMMB73_605375 [Zea mays]
Length = 1229
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 121/555 (21%), Positives = 214/555 (38%), Gaps = 92/555 (16%)
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+S Q SRP + + + P CD ++ + + C+ G + V +
Sbjct: 713 VSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCD-TLESAASFSSDQCSEGVVAVAGDALRI 771
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL-------KPLNQV 1032
G T++ + IPL+ TP + +K +I S +
Sbjct: 772 FTIERLGETFN-----ETAIPLRYTPRKFVILPKKKYIAVIESDKGAFSAEEREAAKKEC 826
Query: 1033 LSLLIDQEVGHQIDNHNLSSVDLH------RTYTVEEYE------------VRILEPDRA 1074
L E G+ + + + D T+ E+Y +RIL+P
Sbjct: 827 LDASGAAENGNANNGDPMENGDGQDGAEEGNTFPDEQYGYPKAESERWVSCIRILDPRSR 886
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL--- 1131
T + +Q +E A+++ V N KE+ TLLAIGTA +G R R L
Sbjct: 887 ----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAIGTA--KGLQFWPR-RTLAGG 936
Query: 1132 LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAF 1191
+ D ++L ++ +++ AL QG LL G + L+ +L
Sbjct: 937 FIHVYKFVDEGRSLEL-LHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLGKRKLLRKCE 995
Query: 1192 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1251
P +VS++ ++ I +GD+ +S ++ ++ QL + A D T ID
Sbjct: 996 NKLFPRTLVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADD-SVPRWLTTAQHIDF 1054
Query: 1252 STLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR-------AEFHVGA 1293
T++ +D+ NI +S+ W+ KL +FHVG
Sbjct: 1055 DTMA--GADKFGNIYFARLPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGD 1112
Query: 1294 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQS 1350
VT + ++ PG + L++GT+ GS+G + +++ F
Sbjct: 1113 VVTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSREDVDF--FSH 1156
Query: 1351 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
L+ L P + G + ++R A+ P D ++D +L Y LP + Q +IA +
Sbjct: 1157 LEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQYPSLPADMQRKIADE 1209
Query: 1411 TGTTRSQILSNLNDL 1425
T +IL L D+
Sbjct: 1210 LDRTPGEILKKLEDI 1224
>gi|330790247|ref|XP_003283209.1| CPSF domain-containing protein [Dictyostelium purpureum]
gi|325086890|gb|EGC40273.1| CPSF domain-containing protein [Dictyostelium purpureum]
Length = 1233
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/289 (21%), Positives = 128/289 (44%), Gaps = 29/289 (10%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
VY E + + A+A QG L G I ++ +L P +V+++ + +
Sbjct: 956 VYKTEAEEPVYAMAPFQGRLCAGVGKNIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDR 1015
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
+++GDI +SI+F+ +K+ L + A D + ++D T++ +D+ NI I
Sbjct: 1016 LVVGDIQESIHFIKYKKLENMLYVFADDLAP-RWITSSVMLDYDTVA--GADKFGNIFIL 1072
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVG---------AHVTKFLRLQMLATSSDRTGAAPGSDK 1319
+S+ + S+ +F G H+ + T + + GSD
Sbjct: 1073 RLPSNVSDEVEEDPTGSKLKFESGLLNGAPHKLEHIANIFAGDAITTLNKTSLVVGGSD- 1131
Query: 1320 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
LL+ T+ G+IG + P +++ F SL+ +L + + G + ++R ++
Sbjct: 1132 ----VLLYTTISGAIGALIPFVSREDVDF--FSSLELQLRNEHAPLCGRDHLAYRSYYFP 1185
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
K +I+D +L + L ++Q +IA + + S++L L D+
Sbjct: 1186 VK-------NIIDGDLCEQFITLDPQKQRQIAEELSRSPSEVLKKLEDI 1227
>gi|380490733|emb|CCF35810.1| pre-mRNA-splicing factor rse-1 [Colletotrichum higginsianum]
Length = 1212
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 114/504 (22%), Positives = 199/504 (39%), Gaps = 88/504 (17%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + + Q + G T QK IPL TP ++ E +P+ ++
Sbjct: 761 QCEEGVVGIQGQSLRIFAIENLGDTI-----TQKSIPLSYTPRRLLKHPE---HPMFYTI 812
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE----EYEVRILEPDRAGGP-- 1077
+ DN+ L DL E + RIL PD G P
Sbjct: 813 --------------------EADNNTLPP-DLRAKLIAEPGVVNGDARILPPDEFGYPKG 851
Query: 1078 ---WQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQGED 1123
W + ++ SE ++ V L N +++NE+ L +GT G+D
Sbjct: 852 KGRWASCISVIDPLSEEQRVLQTVDLDNNEAAVSAAIVSFASQDNESFLIVGT----GKD 907
Query: 1124 VAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIIL 1178
+ R FS G R +++ L ++ +++ SAL QG LL G + +
Sbjct: 908 MIVNPR--QFSEGYIHVYRFSEDGHEL-EFIHKTKVEEPPSALLGFQGRLLAGIGQTLRI 964
Query: 1179 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1238
+ ++ A D P +VSL+ + I++GD+ I ++ +K +L D
Sbjct: 965 YDLGLRQMLRKAQADVAPQLIVSLSTQGSRIIVGDVQHGITYVVYKPTTNKLIPFVDDTI 1024
Query: 1239 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLL-SRAEFHVGA 1293
S T ++D S+V D+ NI + K S E G LL +R H
Sbjct: 1025 SRWVTCTT-MVDYE--SVVGGDKFGNIFLVRCPEKASQEADEESGGLHLLNTRDYLHGTP 1081
Query: 1294 HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRR 1347
H L Q + TS +T G LL+ ++G+IG P +++ F
Sbjct: 1082 HRLSLLGHSYTQDVPTSITKTSLVVGGQD----VLLWSGINGTIGVFIPFVTREDVDF-- 1135
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1407
Q+L++ + +AG + +R ++ K ++D +L Y +LP E++ I
Sbjct: 1136 FQNLEQHMRTEDAPLAGRDHLMYRSYYVPVKG-------VIDGDLCERYTLLPSEKKQMI 1188
Query: 1408 AHQTGTTRSQILSNLNDLALGTSF 1431
A + + +I ++D+ ++F
Sbjct: 1189 AGELDRSVREIERKISDIRTRSAF 1212
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 137/603 (22%), Positives = 226/603 (37%), Gaps = 129/603 (21%)
Query: 74 QEEGSKESK--NSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
Q G+KE + ++ +L S + V + + G + S+A G++ +D
Sbjct: 27 QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITVLSHDIFGIIRSMAAFRLAGSN----KD 82
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVDP 187
+ILA + +I+++E+ I + + F+ LHL K G G + DP
Sbjct: 83 YLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACDP 131
Query: 188 QGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+GR V L ++ + SQ E T S A V+++ LD+
Sbjct: 132 KGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV- 181
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLIW 299
GY PV L E E T A G + + T ++ + L W
Sbjct: 182 ---------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRKW 230
Query: 300 SAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHY-HSQSASCALAL--NNYAVSLDSSQ 353
S P A L VP GVLV G I Y HS + + + A S +
Sbjct: 231 SESVDP-TASMLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRK 289
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDLSKT 406
S +L + + LL T+ GDL T+ DG V+RL +
Sbjct: 290 RHAVSGVMHKLKGSAGAFF----FLLQTEDGDLFKATLDMVEDTDGNPTGEVKRLKIKYF 345
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
+ ++S + + + + S+ G+ QF E+ GD + +
Sbjct: 346 DTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDDE------ 385
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
L SS D D G + + + + A+ +S+ ++ PL D
Sbjct: 386 LEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANLTGE 437
Query: 527 DA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSRMA 569
DA +A G +S + ++ ELPG +WT+ K +RG
Sbjct: 438 DAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG--------- 486
Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
D+Y AY+++S T+VL + + EV++S F+ A L G +IQV +
Sbjct: 487 ---DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQVHPK 541
Query: 630 GAR 632
G R
Sbjct: 542 GIR 544
>gi|346971485|gb|EGY14937.1| pre-mRNA-splicing factor RSE1 [Verticillium dahliae VdLs.17]
Length = 1230
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 105/487 (21%), Positives = 200/487 (41%), Gaps = 54/487 (11%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + + Q + G T QK IPL TP ++ E ++ I S
Sbjct: 779 QCEEGVVGIQGQSLRIFAIEKLGDTL-----TQKSIPLTYTPRRMVKHPEHPMFYTIESD 833
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT 1083
P LL D V + D L + + I D QT T
Sbjct: 834 NNTLPPELRAQLLADPSVVNG-DARTLPPAEFGYPRAKGRWASCISVIDPLSEELQTLQT 892
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1138
+ + ++E A++ +V T+++NE+ L +GT G+D+ R F+ G R
Sbjct: 893 VDLDNNEAAVSAAIVPF---TSQDNESFLVVGT----GKDMIVNPR--QFTEGYIHIYRF 943
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
+++ + L ++ +++ +AL + QG L+ G + ++ ++ A D P
Sbjct: 944 SEDGREL-EFIHKTKVEEPPTALLAFQGRLVAGVGKTLRIYDLGQKQMLRKAQADVAPQL 1002
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSL 1256
+VSL+ + I++GD+ + + ++ +K +L D + + C ++D S+
Sbjct: 1003 IVSLSTQGSRIVVGDVQQGVTYVVYKPLSNKLIPFVDDTVARWMTCTT---MVDYE--SV 1057
Query: 1257 VVSDEQKNIQIFYYAPKMS----ESWKGQKLL-SRAEFHVGAH----VTKFLRLQMLATS 1307
D+ NI I K S E G L +R H H V+ F +L +
Sbjct: 1058 AGGDKFGNIFIVRAPEKASQEADEEGAGLHLTNTRDYLHGTPHRLSLVSHFYSQDVLTSI 1117
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1364
+ + G D LL+ + G+IG P ++ F Q+L++ L +AG
Sbjct: 1118 TKTSLVVGGQD-----VLLWSGISGTIGVFIPFVTREDADF--FQTLEQHLRTEDAPLAG 1170
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+ +R +++ K ++D +L Y +LP +++ IA + + +I ++D
Sbjct: 1171 RDHLMYRGYYAPVKG-------VIDGDLCERYTLLPNDKKQMIAGELDRSVREIERKISD 1223
Query: 1425 LALGTSF 1431
+ ++F
Sbjct: 1224 IRTRSAF 1230
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 122/548 (22%), Positives = 203/548 (37%), Gaps = 125/548 (22%)
Query: 128 RRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDP 187
R + +ILA + +I+++E+ + + + + F K G G + DP
Sbjct: 97 RGFNYLILATDSGRIAIIEYLPAQNRFQRLHLETFG-------KSGIRRVVPGEFLACDP 149
Query: 188 QGRCGGVLVYGLQ-----MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
+GR L+ L+ ++ + SQ E T S A HV+++ LD
Sbjct: 150 KGRA--CLIASLEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVHVLSMVALD 198
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL----- 297
+ GY PV L E + T A + +AL + T L + L
Sbjct: 199 V----------GYANPVFAAL-ETDYTEADQDPTGQ-----AALDVETQLVYYELDLGLN 242
Query: 298 -IWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVS 348
+ + P D L P G GVLV G I Y HS + + + A
Sbjct: 243 HVVRKWSEPVDNTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQEAFRVPVPRRRGATE 302
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRL 401
S + + +L + + LL T+ GDL +T+ DG V+RL
Sbjct: 303 DPSRKRCIVAGVMHKLKGSAGAFF----FLLQTEDGDLFKITIDMIEDRDGNPTGEVKRL 358
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
+ + + S + + + ++ S+ G+ QF E+ GD
Sbjct: 359 KIKYFDTIPVASSLCILKSGFLYVASQFGNYQFYQF--------------EKLGD----- 399
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ L SS D D E + ++ + A+ +S+ ++ PL D
Sbjct: 400 -DDEELEFSSDDFPTDPKQSYEAVFF--------HPRELENLALVESIDSMNPLIDCKVA 450
Query: 522 LRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNAD 564
DA +A G +S + ++ ELPG +WT+ K SRG
Sbjct: 451 NLTGEDAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAVWTL--KLSRG---- 504
Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
D+Y AY+++S T+VL + + EV +S F+ A L G +I
Sbjct: 505 --------DQYDAYIVLSFTNATLVLSIGETVEEVNDS--GFLTSVPTLAAQLLGGEGLI 554
Query: 625 QVFERGAR 632
QV +G R
Sbjct: 555 QVHPKGIR 562
>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
Length = 1294
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 114/275 (41%), Gaps = 26/275 (9%)
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1216
I+ +AS QGHLL+A G + ++ + L A P + + + I L D+ +
Sbjct: 1002 ITTMASFQGHLLVAVGTSLRMYDLGKKQLLKKTQHPRATPHKITCIETCYDRIFLSDVQE 1061
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK--- 1273
S++ + +A D+ C T L+D T++ + D+ NI I P+
Sbjct: 1062 SVFLYRYSAADNLFLCIADDYLPKWC-TTMCLLDYDTVA--IGDKMGNISILRLPPEAGT 1118
Query: 1274 -MSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
+ + G L A H ++ +Q L+ TG P L +GTL
Sbjct: 1119 FIEQDPTGGLLSKEAPHHFQLEACYYVGSVIQCLSKVEWTTGDVP--------LLFYGTL 1170
Query: 1331 DGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
DG+IG + PL L Q+L+ +L + + G + ++R + + ++D
Sbjct: 1171 DGAIGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLAYRSYFFPVR-------HVID 1223
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+L + L LE+Q +I + + + L D
Sbjct: 1224 GDLCEEFYRLSLEQQEKIVKELDRSIVDVHRKLED 1258
>gi|336371417|gb|EGN99756.1| hypothetical protein SERLA73DRAFT_88390 [Serpula lacrymans var.
lacrymans S7.3]
gi|336384183|gb|EGO25331.1| hypothetical protein SERLADRAFT_355643 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1216
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 165/379 (43%), Gaps = 39/379 (10%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+RI++P A +T + I + S+E A ++ +V + + NE L +GTA +A
Sbjct: 861 IRIIDPVEA----KTLSMITLDSNECAFSLAIVPF---SARGNELHLVVGTA--ADTFLA 911
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
R F + ++ E AL + QG L+ G + ++ +
Sbjct: 912 PRSCSSGFLRTYKFTEDGTGLELLHKTETDDVPLALMAFQGRLVAGVGKALRIYDIGKKK 971
Query: 1186 LNGIAFYDAPPLY---VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
L A + +V+LN + I++GD+ +SI F+++K +L + A D
Sbjct: 972 LLRKVENKARATFSTAIVTLNTQGSRIIVGDMQESISFVAYKAPENRLLVFADDNQPRWI 1031
Query: 1243 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTK 1297
AT ++D +T++ D NI + PK+SE G +L +GA H TK
Sbjct: 1032 TATT-MVDYTTIA--AGDRFGNIFVNRLDPKVSEQVDDDPTGAGILHEKGLLMGAPHKTK 1088
Query: 1298 FL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSL 1351
L + L TS ++ G R LL+ L G+IG + P +++ F + +L
Sbjct: 1089 MLAHFHIGDLVTSINKVSLVAG----GREVLLYTGLHGTIGILVPFVSKEDVDF--ISTL 1142
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1411
++ + + G + S+R +++ K S+VD +L Y LP +Q IA +
Sbjct: 1143 EQHMRTEQGSLVGRDQLSWRGYYTPVK-------SVVDGDLCETYARLPGTKQSAIAGEL 1195
Query: 1412 GTTRSQILSNLNDLALGTS 1430
T ++L L L + S
Sbjct: 1196 DRTVGEVLKKLEQLRVTAS 1214
>gi|242060436|ref|XP_002451507.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
gi|241931338|gb|EES04483.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
Length = 1232
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 120/556 (21%), Positives = 216/556 (38%), Gaps = 93/556 (16%)
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+S Q SRP + + + P CD ++ + + + C+ G + V +
Sbjct: 715 VSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCD-TLESASSFSSDQCSEGVVAVAGDALRI 773
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP-VLKPLNQVLSLLID 1038
G T++ + IPL+ TP + +K +I S + +
Sbjct: 774 FTIERLGETFN-----ETAIPLRYTPRKFVILPKKKYIAVIESDKGAFSAEEREAAKKEC 828
Query: 1039 QEVGHQIDNHNLSSVDLHR-------------TYTVEEYE------------VRILEPDR 1073
E +N N ++ D T+ E+Y +RIL+P
Sbjct: 829 LEASGAAENGNANNGDPMENGDGQEDGAEEGNTFPDEQYGYPKAESERWVSCIRILDPRS 888
Query: 1074 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL-- 1131
T + +Q +E A+++ V N KE+ TLLA+GTA +G R R L
Sbjct: 889 R----DTTCLLELQENEAAVSICTV---NFHDKEHGTLLAVGTA--KGLQFWPR-RTLAG 938
Query: 1132 -LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIA 1190
+ D ++L ++ +++ AL QG LL G + L+ +L
Sbjct: 939 GFIHIYKFVDEGRSLEL-LHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLGKRKLLRKC 997
Query: 1191 FYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID 1250
P +VS++ ++ I +GD+ +S ++ ++ QL + A D A + ID
Sbjct: 998 ENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAAQH-ID 1056
Query: 1251 GSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR-------AEFHVG 1292
T++ +D+ NI +S+ W+ KL +FHVG
Sbjct: 1057 FDTMA--GADKFGNIYFARLPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVG 1114
Query: 1293 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQ 1349
VT + ++ PG + L++GT+ GS+G + +++ F
Sbjct: 1115 DVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSREDVDF--FS 1158
Query: 1350 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1409
L+ L P + G + ++R A+ P D ++D +L Y LP + Q +IA
Sbjct: 1159 HLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQYPSLPADMQRKIAD 1211
Query: 1410 QTGTTRSQILSNLNDL 1425
+ T +IL L D+
Sbjct: 1212 ELDRTPGEILKKLEDI 1227
>gi|408400551|gb|EKJ79630.1| hypothetical protein FPSE_00190 [Fusarium pseudograminearum CS3096]
Length = 1212
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 105/488 (21%), Positives = 208/488 (42%), Gaps = 56/488 (11%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
C G + + Q I I +L G T +QK IPL TP ++ ++ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFNIDRL--GETL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1079
+ P LL D V + D+ L D + + +++P G Q
Sbjct: 814 ADNNTLPPELRAQLLADPGVVNG-DSKVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870
Query: 1080 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1136
TI ++++E A++ +V+ ++++NE+ L IGT G+D+ R FS G
Sbjct: 871 VLQTIDLENNEAAVSAAIVSF---SSQDNESFLVIGT----GKDMVVNPRS--FSEGYLH 921
Query: 1137 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
R + + L ++ +++ AL + QG +L+A G + ++ ++ + +
Sbjct: 922 IYRFLEGGREL-EFIHKTKVEEPPLALLAFQGRVLVAVGTSLRIYDLGMRQMLRKSQAEV 980
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
+VSLN + I++GD+ + + ++ +K +L D + T ++D
Sbjct: 981 ATQQIVSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE-- 1037
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1306
S+ D+ N+ I K SE ++ + +R H H + Q + T
Sbjct: 1038 SVAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRVSLMCHFYTQDIPT 1097
Query: 1307 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1363
S + G + LL+ + G+IG P ++ F Q+L++ L P +A
Sbjct: 1098 SITKASLVVGGQE----VLLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
G + +R +++ K ++D +L Y +LP +++ IA + + +I ++
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREIERKIS 1204
Query: 1424 DLALGTSF 1431
D+ ++F
Sbjct: 1205 DIRTRSAF 1212
>gi|427798971|gb|JAA64937.1| Putative damage-specific dna binding complex subunit ddb1, partial
[Rhipicephalus pulchellus]
Length = 1259
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 84/372 (22%), Positives = 156/372 (41%), Gaps = 56/372 (15%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1124
+R+L P + QT + ++ +E AL+V +V ++ +E + +G A + +
Sbjct: 861 IRVLNPADS----QTLCKVALEQNEAALSVALVKF---ASQPDEQYVVVGAARELSLQPW 913
Query: 1125 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1184
AR LL T R + + + V++ ++ A +AL QG LL G + L+
Sbjct: 914 HARSGGLLL-TYRLSHAGETRLELVHATSVEEAPTALCPFQGRLLAGVGKCLRLYDLGRK 972
Query: 1185 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1244
+L P +VS+ + N +++GD+ +S +FL +K Q QL + A D
Sbjct: 973 KLLRKCENKYIPSAIVSIQSMGNRVVVGDVQESFFFLRYKRQENQLVIFADD-AVPRWIT 1031
Query: 1245 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES---------------WKG---QKLLSR 1286
++D T++ +D+ N+ I +S+ W G QK
Sbjct: 1032 ASCMLDYDTVA--GADKFGNVSIIRLPNSVSDEVDEDPTGIKSLWDRGWLGGSSQKAEVI 1089
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1343
+ FH+G V + ++ PG ++ L++ TL G+IG + P ++
Sbjct: 1090 SNFHIGETVLSLQKATLI----------PGGSES----LVYVTLSGTIGVLVPFTAHEDH 1135
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
F Q L+ + P + G + SFR + K +++D +L + L +
Sbjct: 1136 DF--FQHLEMHMRSENPPLCGRDHLSFRSSYFPVK-------NVIDGDLCEQFNSLDPSK 1186
Query: 1404 QLEIAHQTGTTR 1415
Q IA + R
Sbjct: 1187 QKSIAEELDRNR 1198
Score = 40.8 bits (94), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 60/265 (22%), Positives = 102/265 (38%), Gaps = 51/265 (19%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ + G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFLFVAAEFGNHCLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E GD AP AL++++ EEL A T
Sbjct: 362 ARLGEEDEEPEFSSAIPLEEGDTFFFAPR----------ALRNLLPVEELDSLSPAMGCT 411
Query: 495 ------ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC- 547
E + + R I L+ +GL + S + ELPG
Sbjct: 412 IADLANEDTPQLYVACGRGPRSCIRVLR---HGLEV------------SEMAVSELPGNP 456
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+WTV K+ D++Y AY+I+S T+VL + + EVT+S +
Sbjct: 457 NAVWTVKRKA--------------DEDYDAYIIVSFVNATLVLSIGETVEEVTDS-GFLG 501
Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
T++ + G ++QV+ G R
Sbjct: 502 TTPTLSCAQI-GDDALVQVYPEGIR 525
>gi|302807210|ref|XP_002985318.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
gi|300147146|gb|EFJ13812.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
Length = 1207
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 83/380 (21%), Positives = 159/380 (41%), Gaps = 52/380 (13%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P A T + +Q +E A ++ V + K T+LA+GTA
Sbjct: 855 IRVLDPKAAA----TTCLLELQENEAAFSICCVNFHDN--KNLGTVLAVGTAKDLEWWPK 908
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
R R ++ ++L V+ + G +AL QG LL G + ++ +
Sbjct: 909 RRSMGGFIHIYRFVEDGRSLEL-VHKTPIDGVPTALCQFQGRLLAGIGQILRIYDLGKRK 967
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------- 1236
L P + S++ + I +GDI +S +++ ++ QL A D
Sbjct: 968 LLRKCENKNFPNTITSIHSYGDRIYVGDIQESFHYVKYRRDENQLYAFADDSSPRWLTAS 1027
Query: 1237 ----FGSL---DCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
F ++ D F F + LS + D+ +I + +++ + K+ +
Sbjct: 1028 LHIDFDTMAAGDKFGNLFFVRLPQDLSEEIEDDPTGGKIKWEQGRLNGA--PNKVEEIIQ 1085
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1345
FHVG VT + ++ PG ++ +++GT+ GS+G + P +++ F
Sbjct: 1086 FHVGEVVTCMQKASLI----------PGGGES----VIYGTVMGSVGALLPFSSREDVDF 1131
Query: 1346 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1405
L+ + P + G + +FR A+ P D ++D +L Y LP + Q
Sbjct: 1132 --FSHLEMHMRQEHPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYPTLPPDLQR 1182
Query: 1406 EIAHQTGTTRSQILSNLNDL 1425
+IA + T +++ L D+
Sbjct: 1183 KIAEELDRTPGEVMKKLEDI 1202
>gi|302773427|ref|XP_002970131.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
gi|300162642|gb|EFJ29255.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
Length = 1207
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 83/380 (21%), Positives = 159/380 (41%), Gaps = 52/380 (13%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P A T + +Q +E A ++ V + K T+LA+GTA
Sbjct: 855 IRVLDPKAAA----TTCLLELQENEAAFSICCVNFHDN--KNLGTVLAVGTAKDLEWWPK 908
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
R R ++ ++L V+ + G +AL QG LL G + ++ +
Sbjct: 909 RRSMGGFIHIYRFVEDGRSLEL-VHKTPIDGVPTALCQFQGRLLAGIGQILRIYDLGKRK 967
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------- 1236
L P + S++ + I +GDI +S +++ ++ QL A D
Sbjct: 968 LLRKCENKNFPNTITSIHSYGDRIYVGDIQESFHYVKYRRDENQLYAFADDSSPRWLTAS 1027
Query: 1237 ----FGSL---DCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
F ++ D F F + LS + D+ +I + +++ + K+ +
Sbjct: 1028 LHIDFDTMAAGDKFGNLFFVRLPQDLSEEIEDDPTGGKIKWEQGRLNGA--PNKVEEIIQ 1085
Query: 1289 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1345
FHVG VT + ++ PG ++ +++GT+ GS+G + P +++ F
Sbjct: 1086 FHVGEVVTCMQKASLI----------PGGGES----VIYGTVMGSVGALLPFSSREDVDF 1131
Query: 1346 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1405
L+ + P + G + +FR A+ P D ++D +L Y LP + Q
Sbjct: 1132 --FSHLEMHMRQEHPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYPTLPPDLQR 1182
Query: 1406 EIAHQTGTTRSQILSNLNDL 1425
+IA + T +++ L D+
Sbjct: 1183 KIAEELDRTPGEVMKKLEDI 1202
>gi|389638952|ref|XP_003717109.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
gi|148887431|sp|Q52E49.2|RSE1_MAGO7 RecName: Full=Pre-mRNA-splicing factor RSE1
gi|351642928|gb|EHA50790.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
Length = 1216
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 112/515 (21%), Positives = 208/515 (40%), Gaps = 106/515 (20%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + V Q L+I + DN +QK IPL TP ++ + ++ I
Sbjct: 761 QCEEGMVGVNGQ-FLRIFAIEKLG--DNV--IQKSIPLTYTPRKLAKHPTQRIFYTI--- 812
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLS---SVDLHRTYTVEEYEVRILEPD-----RAG 1075
+ DN+ L+ L T + R+L PD R
Sbjct: 813 --------------------EADNNTLAPELREQLMAAPTAVNGDARVLPPDEFGYPRGN 852
Query: 1076 GPWQTRAT------------------IPMQSSENALTVRVVTLFNTTTKENETLLAIGTA 1117
G W + + I + ++E AL++ VV+ +++ E+ L +GT
Sbjct: 853 GRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAVVSF---ASQDGESFLVVGT- 908
Query: 1118 YVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
G+D+ R F+ G R +++ + L ++ +++ +AL QG L+
Sbjct: 909 ---GKDMVVNPR--RFTEGYIHVYRFSEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGI 962
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
G + ++ +L A + P +VSLN + I++GD+ + ++++K + +L
Sbjct: 963 GRMLRIYDLGLRQLLRKAQAEVAPQLIVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIP 1022
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1292
A D + T + ST +D+ N+ I K S+ +E H+
Sbjct: 1023 FADDTIARWTTCTTMVDYDSTAG---ADKFGNLWILRCPEKASQESDEPG----SEVHL- 1074
Query: 1293 AHVTKFL-----RLQMLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
H +L RL ++A TS +T G + LL+G G+IG + P
Sbjct: 1075 VHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQE----VLLWGGFQGTIGVLIP 1130
Query: 1340 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
++ F QSL++ L P +AG + +R + K ++D +L Y
Sbjct: 1131 FVSREDADF--FQSLEQHLRSEDPPLAGRDHLMYRGCYVPVKG-------VIDGDLCERY 1181
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
MLP +++ IA + + +I ++D+ ++F
Sbjct: 1182 TMLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1216
>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
Length = 1148
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 66/259 (25%), Positives = 113/259 (43%), Gaps = 26/259 (10%)
Query: 1114 IGTAYV-QGEDVAARGRVLLFS--TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTA V E +GR+++F G+ + +V KE+KGA +L G LL
Sbjct: 831 VGTAIVLPDESDPKQGRIIIFHWVDGK--------LQQVAEKEIKGAPYSLLEFNGKLLA 882
Query: 1171 ASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
+ + L +W EL+ F + LY L +FIL+GD+ +S+ L++K
Sbjct: 883 SINSTVRLFEWNAERELHNECSHFNNILALY---LKTKGDFILVGDLMRSMSLLAYKPLE 939
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+A+D+ + A E L D + L ++ N+ + + Q L
Sbjct: 940 GSFEEIARDYQTNWMCAVEILDDDTFLG---AESTTNLFVCQKDSAATTDEDRQHLQEVG 996
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1347
+FH+G V F R L ++P + ++LFGT+ G+IG +A L +
Sbjct: 997 QFHLGEFVNIF-RHGSLVMQHPGEASSP-----TQGSVLFGTIHGAIGLVAQLPSDFYNF 1050
Query: 1348 LQSLQKKLVDSVPHVAGLN 1366
L +Q L + V ++
Sbjct: 1051 LLEVQGNLTKVIKSVGKID 1069
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 94/402 (23%), Positives = 155/402 (38%), Gaps = 95/402 (23%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + IS LK + W
Sbjct: 166 VQDMEFLHGCKTPTIVLLHQD--------SQARH---MKTYEIS--LKDKEFVKGPWKQD 212
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P +G +I YH+ + + L R S V
Sbjct: 213 HVESEATIVIAVPEPFCDARCIGQESITYHNGDQDVVI-----------TPHLIRQSTIV 261
Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
++DA + +L D+A G L +L + + ++ V+ L L +
Sbjct: 262 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 315
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + N + ++GSRLGDS L++ L+S E+ +E T
Sbjct: 316 MTYLDNGVVYVGSRLGDSQLIK---------LNSERNEQGSYVEVMEVFTN--------- 357
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + F G L+ G+ I+ AS
Sbjct: 358 LGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS----- 403
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KGIW + N DSSR L++S +T VL +
Sbjct: 404 -------IDLPGIKGIWPLR------VNTDSSR--------DNTLVLSFVGQTRVLMLSG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
E TE + + +T GN+ ++IQV R++DG
Sbjct: 443 EEVEETELAGFDISQQTFFCGNV-RNNQLIQVTAAAVRLVDG 483
>gi|326469377|gb|EGD93386.1| splicing factor 3B subunit 3 [Trichophyton tonsurans CBS 112818]
Length = 1188
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 107/489 (21%), Positives = 203/489 (41%), Gaps = 59/489 (12%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + Q + ++ S DN +Q+ IPL TP E L+ +I S
Sbjct: 738 QCVEGMVGIQGQNL----RIFSIEKLDNNL-LQEPIPLAYTPRNFVRHPEYPLFYVIGSD 792
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1082
+L P + + L+ + D+ L D + I D P T++
Sbjct: 793 NNILSPATK--AKLLSESTAVNGDSAELPPEDFGYPRGTNHWASSIQVVD----PIHTKS 846
Query: 1083 TI---PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1136
+ ++ +E A+++ V+ T++E+ET L +GT G+D+ R F+ G
Sbjct: 847 VLSNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIH 897
Query: 1137 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
R + + L ++ +++ AL QG LL GP + ++ +L
Sbjct: 898 IYRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQI 956
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
P +V L + I++ D+ +S+ ++ +K Q L A D T ++D T+
Sbjct: 957 TPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETV 1015
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLA 1305
+ D+ NI + K SE G L+ ++ GA V F Q +
Sbjct: 1016 A--GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIP 1072
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1362
TS +T G R L++ L G++G P D++ F Q+L+ +L P +
Sbjct: 1073 TSIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPL 1126
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
AG + +R +++ K ++D +L + +LP +++ IA + + +I +
Sbjct: 1127 AGRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKI 1179
Query: 1423 NDLALGTSF 1431
+D+ ++
Sbjct: 1180 SDMRTKVAY 1188
>gi|326483043|gb|EGE07053.1| pre-mRNA-splicing factor rse1 [Trichophyton equinum CBS 127.97]
Length = 1209
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 107/489 (21%), Positives = 203/489 (41%), Gaps = 59/489 (12%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + Q + ++ S DN +Q+ IPL TP E L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNL-LQEPIPLAYTPRNFVRHPEYPLFYVIGSD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1082
+L P + + L+ + D+ L D + I D P T++
Sbjct: 814 NNILSPATK--AKLLSESTAVNGDSAELPPEDFGYPRGTNHWASSIQVVD----PIHTKS 867
Query: 1083 TI---PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1136
+ ++ +E A+++ V+ T++E+ET L +GT G+D+ R F+ G
Sbjct: 868 VLSNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIH 918
Query: 1137 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
R + + L ++ +++ AL QG LL GP + ++ +L
Sbjct: 919 IYRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQI 977
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
P +V L + I++ D+ +S+ ++ +K Q L A D T ++D T+
Sbjct: 978 TPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETV 1036
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLA 1305
+ D+ NI + K SE G L+ ++ GA V F Q +
Sbjct: 1037 A--GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIP 1093
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1362
TS +T G R L++ L G++G P D++ F Q+L+ +L P +
Sbjct: 1094 TSIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPL 1147
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
AG + +R +++ K ++D +L + +LP +++ IA + + +I +
Sbjct: 1148 AGRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKI 1200
Query: 1423 NDLALGTSF 1431
+D+ ++
Sbjct: 1201 SDMRTKVAY 1209
>gi|401407861|ref|XP_003883379.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
gi|325117796|emb|CBZ53347.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
Length = 1233
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 117/299 (39%), Gaps = 49/299 (16%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V+S ++ ALA +G LL G K+ L+ L Y P V + + +
Sbjct: 957 VHSTPVEDYPMALAPFRGMLLAGVGHKLRLYALGKKRLLKKCEYKNLPCGVAFIRVAGDR 1016
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL---SLVVSDEQKNI 1265
+ +GD+ +S++ + ++ +LA D +L G L + V +D+ ++
Sbjct: 1017 LFVGDLRESVHVMRYRLSENLFYVLADDV------VPRWLTKGEVLDYHTFVAADKFDSV 1070
Query: 1266 QIFYYAPKMSESWKGQ------------------KLLSRAEFHVGAHVTKFLRLQMLATS 1307
I + + G KL S FH+G VT R + A +
Sbjct: 1071 FICRVPSEAKQDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEVVTALERATLTAGA 1130
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLN 1366
S+ ++++GT+ GSIG +P L + L+ L P + G
Sbjct: 1131 SE--------------SIIYGTIMGSIGAFSPFLTKHELDLFTHLEMVLRSEKPPLGGRE 1176
Query: 1367 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
FR ++ K + VD +L Y +LP + Q IA T + IL +L D+
Sbjct: 1177 HIMFRSYYHPAK-------NTVDGDLCESYALLPYDVQKRIAQDFEKTPADILKHLEDI 1228
>gi|440792421|gb|ELR13643.1| splicing factor 3b subunit 3, putative [Acanthamoeba castellanii str.
Neff]
Length = 1227
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/293 (20%), Positives = 118/293 (40%), Gaps = 44/293 (15%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V+ +++G +A+ QG LL+ G + ++ +L P + S+
Sbjct: 952 VHKTQVEGVPTAVCGFQGRLLVGIGKMLRIYDLGKRKLLRKCENKGFPHCIQSITTQGER 1011
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FGSLDCFATEFLIDGS 1252
I++GD+ +S +F+ +++ QLN+ A D D F F++
Sbjct: 1012 IIVGDLAESFHFVKYRKAENQLNVYADDSNPRWLTASQMLDYDTMAGADKFGNVFIV--R 1069
Query: 1253 TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1312
S V + + N + K S + KL + FHVG + + + +D
Sbjct: 1070 LPSEVNEELEDNPMGNFLMSKQSLNGAAFKLQTLINFHVGDTINSMTKASLFTGGAD--- 1126
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
L++ TL G +G + P +++ F L+ + +P + G + +
Sbjct: 1127 -----------VLVYTTLMGGMGALLPFVSREDVDF--FSHLEMHMRSELPPLCGRDHLA 1173
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+R ++ K ++D +L + +LP E+Q IA + T ++L L
Sbjct: 1174 YRSYYFPVK-------DVIDGDLCEQFSLLPPEKQRTIAEELDRTPGEVLKKL 1219
>gi|347829304|emb|CCD45001.1| similar to pre-mRNA-splicing factor rse1 [Botryotinia fuckeliana]
Length = 1212
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 112/505 (22%), Positives = 216/505 (42%), Gaps = 91/505 (18%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + + Q L+I + + DN +Q+ IPL TP + E +
Sbjct: 762 QCTEGMVGIQGQN-LRIFSIEKLT--DNL--LQESIPLTYTPRRFVRHPEHACFY----- 811
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR-- 1081
V++ N +LS Q++ +++ ++++ D +L P+ G P T
Sbjct: 812 -VIEADNNILSPATKQKL---LEDPSVTNGD-----------ATVLPPEEFGYPRGTNHW 856
Query: 1082 ----------------ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+TI ++ +E A+++ VV ++E+ET L +GT G+D+
Sbjct: 857 ASCISVVDPVTEKKVLSTIHLEDNECAVSIAVVAF---ASQEDETFLCVGT----GKDMV 909
Query: 1126 ARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK 1180
R FS G R ++ + L ++ +++ AL + QG LL G + ++
Sbjct: 910 VSPRS--FSAGFIHVYRFHEDGKEL-EFIHKTKVEEPPMALLAFQGRLLAGVGKDLRIYD 966
Query: 1181 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1240
+L A + P +V L + I++ D+ +SI + +K Q +L D +
Sbjct: 967 LGMRQLLRKAQSEVAPNMIVGLQTQGSRIIVSDVQESITMVVYKFQENRLIPFVDDTIAR 1026
Query: 1241 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA-HV 1295
T ++D T++ D+ N+ + K SE G LL ++ GA H
Sbjct: 1027 WTSCTT-MVDYETVA--GGDKFGNLWLLRCPAKASEEADEEGSGAHLLHERQYLAGAPH- 1082
Query: 1296 TKFLRLQMLATSSDRTGAAPGS-DKTN-----RFALLFGTLDGSIGCIAPL---DELTFR 1346
RL ++A + + P S KTN R LL+ L G++G + P +++ F
Sbjct: 1083 ----RLTLMAHNFSQD--IPMSIQKTNLVAGGRDCLLWSGLQGTLGILIPFVSREDVDF- 1135
Query: 1347 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
Q+L++ L P +AG + +R ++ K ++D +L Y +LP +++
Sbjct: 1136 -FQTLEQHLRSEDPPLAGRDHLIYRSYYVPVKG-------VIDGDLCERYTLLPTDKKQM 1187
Query: 1407 IAHQTGTTRSQILSNLNDLALGTSF 1431
IA + + +I ++D+ +++
Sbjct: 1188 IAGELDRSVREIERKISDIRTRSAY 1212
Score = 41.2 bits (95), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 85/377 (22%), Positives = 151/377 (40%), Gaps = 52/377 (13%)
Query: 320 GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDV 376
GVLV G + I Y HS + +A+ + + Q V +L A +
Sbjct: 253 GVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF---- 308
Query: 377 ALLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
LL T GDL +T+ V D V+RL + + + + + + + F+ S
Sbjct: 309 FLLQTDDGDLFKITIEMVEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEF 368
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLY 487
G+ QF + + ++F G E+ P R + + +L + ++ +
Sbjct: 369 GNHQFYQFEKLGDDDEETEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMD 428
Query: 488 GSASNNT-ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
+N T E A + +S + LK +GL ++ + ELPG
Sbjct: 429 CKVANLTDEDAPQIYSICGTGARSTFRTLK---HGLEVSEIVES------------ELPG 473
Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+WT K +RG D Y AY+I+S T+VL + + EVT++ +
Sbjct: 474 VPSAVWTT--KLTRG------------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GF 518
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
T+A L G +IQV +G R + + + + P + + + N ++V+
Sbjct: 519 LSSAPTLAVQQL-GEDSLIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVA 576
Query: 666 IAD-PYVLLGM-SDGSI 680
++ V M SDGS+
Sbjct: 577 LSSGEIVYFEMDSDGSL 593
>gi|322693432|gb|EFY85292.1| Pre-mRNA-splicing factor RSE1 [Metarhizium acridum CQMa 102]
Length = 1221
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 106/476 (22%), Positives = 194/476 (40%), Gaps = 74/476 (15%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
C G + + Q I I +L G T QK I L TP ++ E+ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFSIDRL--GDTL-----TQKAISLTYTPKKLIKHPEQPLFYTIE 813
Query: 1022 S-----VPVLKP--------LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1068
S P L+ +N +L +E G+ N +S + +
Sbjct: 814 SDNNTLPPDLRAQLIADPAVVNGDTKVLPPEEFGYPKGNRRWASC------------INV 861
Query: 1069 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG 1128
++P Q T+ ++++E A++ +V +++NE+ L +GT G+D+
Sbjct: 862 IDP--VSEEPQVVQTVDLENNEAAVSAAIVPF---ASQDNESFLIVGT----GKDIVVNP 912
Query: 1129 RVL---LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
R R + + L ++ +++ AL QG LL G + ++ +
Sbjct: 913 RNFSEAYIYVYRFQEEGREL-EFIHKTKIEEPALALIPFQGKLLAGVGKTLRVYDLGMRQ 971
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
+ A + P +VSLN + I++GD+ + + ++++K +L A D + T
Sbjct: 972 MLRKAQAEVAPQQIVSLNTQGSRIIVGDVQQGVTYVTYKPTTNKLIPFADDIIARWITCT 1031
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGA----HVTK 1297
++D S+ D+ N+ I PK S E G L++ ++ G +
Sbjct: 1032 T-MVDYE--SVAGGDKFGNMFIVRCPPKASEEADEEQSGLHLMNARDYLHGTSQRLDLMC 1088
Query: 1298 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKK 1354
Q + TS +T G LL+ L G+IG PL ++ F QSL+
Sbjct: 1089 HFYTQDIPTSMAKTSLVVGGQDV----LLWSGLMGTIGVFIPLISREDADF--FQSLESH 1142
Query: 1355 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
L P +AG + +R +++ K I+D +L Y +LP +++ IA +
Sbjct: 1143 LRTEDPPLAGRDHLMYRSYYAPVKG-------IIDGDLCERYTLLPNDKKQMIAGE 1191
>gi|125580741|gb|EAZ21672.1| hypothetical protein OsJ_05303 [Oryza sativa Japonica Group]
Length = 1224
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 124/563 (22%), Positives = 212/563 (37%), Gaps = 107/563 (19%)
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+S Q SRP + + + P CD ++ + + C+ G + V +
Sbjct: 707 VSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCD-TLESAASFSSDQCSEGVVAVAGDALRI 765
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP-VLKPLNQVLSLLID 1038
G T++ + IPL+ TP + +K +I S L + +
Sbjct: 766 FTIEHLGETFN-----ETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEEREAAKKEC 820
Query: 1039 QEVGHQIDNHNLSSVDL-------------HRTYTVEEYE------------VRILEPDR 1073
E +N N ++ D T E+Y +RIL+P
Sbjct: 821 LEAAGVTENGNANNGDQMENGDGQEDGAEDRNTLPDEQYGYPKAESERWVSCIRILDPKS 880
Query: 1074 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
T + +Q +E A+++ V N KE+ TLLA+GTA + L F
Sbjct: 881 R----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-----------KGLQF 922
Query: 1134 STGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPKIILHKWTG 1183
RN + V E S EL K + AL QG LL G + L+
Sbjct: 923 WPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLGK 982
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L P +VS++ ++ I +GD+ +S ++ ++ QL + A D
Sbjct: 983 RKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLT 1042
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR------ 1286
A ID T++ +D+ NI +S+ W+ KL
Sbjct: 1043 AANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEE 1099
Query: 1287 -AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1342
+FHVG VT + ++ PG + L++GT+ GS+G + ++
Sbjct: 1100 IVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSRED 1145
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
+ F L+ L P + G + ++R A+ P D ++D +L + LP +
Sbjct: 1146 VDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPSLPAD 1196
Query: 1403 EQLEIAHQTGTTRSQILSNLNDL 1425
Q +IA + T +IL L D+
Sbjct: 1197 MQRKIADELDRTPGEILKKLEDI 1219
>gi|42409127|dbj|BAD10377.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa Japonica
Group]
gi|42409258|dbj|BAD10521.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa Japonica
Group]
gi|125538000|gb|EAY84395.1| hypothetical protein OsI_05771 [Oryza sativa Indica Group]
Length = 1234
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 124/563 (22%), Positives = 212/563 (37%), Gaps = 107/563 (19%)
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+S Q SRP + + + P CD ++ + + C+ G + V +
Sbjct: 717 VSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCD-TLESAASFSSDQCSEGVVAVAGDALRI 775
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP-VLKPLNQVLSLLID 1038
G T++ + IPL+ TP + +K +I S L + +
Sbjct: 776 FTIEHLGETFN-----ETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEEREAAKKEC 830
Query: 1039 QEVGHQIDNHNLSSVDL-------------HRTYTVEEYE------------VRILEPDR 1073
E +N N ++ D T E+Y +RIL+P
Sbjct: 831 LEAAGVTENGNANNGDQMENGDGQEDGAEDRNTLPDEQYGYPKAESERWVSCIRILDPKS 890
Query: 1074 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
T + +Q +E A+++ V N KE+ TLLA+GTA + L F
Sbjct: 891 R----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-----------KGLQF 932
Query: 1134 STGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPKIILHKWTG 1183
RN + V E S EL K + AL QG LL G + L+
Sbjct: 933 WPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLGK 992
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L P +VS++ ++ I +GD+ +S ++ ++ QL + A D
Sbjct: 993 RKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLT 1052
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR------ 1286
A ID T++ +D+ NI +S+ W+ KL
Sbjct: 1053 AANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEE 1109
Query: 1287 -AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1342
+FHVG VT + ++ PG + L++GT+ GS+G + ++
Sbjct: 1110 IVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSRED 1155
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
+ F L+ L P + G + ++R A+ P D ++D +L + LP +
Sbjct: 1156 VDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPSLPAD 1206
Query: 1403 EQLEIAHQTGTTRSQILSNLNDL 1425
Q +IA + T +IL L D+
Sbjct: 1207 MQRKIADELDRTPGEILKKLEDI 1229
>gi|308504990|ref|XP_003114678.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
gi|308258860|gb|EFP02813.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
Length = 270
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/260 (21%), Positives = 123/260 (47%), Gaps = 37/260 (14%)
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDG 1251
P+ +V++ I++ D +S++FL +++ QL + A D + S C ++D
Sbjct: 24 PVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVSCVC-----VLDY 78
Query: 1252 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA------ 1305
T++ V+D+ N+ + +++E + +S++ + G +++++A
Sbjct: 79 HTVA--VADKFGNLAVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVELVANFFIGD 136
Query: 1306 --TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVP 1360
TS +T PG+++ AL++ T+ G+IGC+ DE+ F +L+ + P
Sbjct: 137 TITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNLEMHVRSEYP 190
Query: 1361 HVAGLNPRSFRQFHSNGKAHRPGP---------DSIVDCELLSHYEMLPLEEQLEIAHQT 1411
+ G + ++R +++ K S++D ++ + ++ L +Q E+A +
Sbjct: 191 PLCGRDHLAYRSYYAPCKVCFNFLLFRSIVSLFQSVIDGDICEQFSLMDLSKQKEVAEEL 250
Query: 1412 GTTRSQILSNLNDLALGTSF 1431
G T S+I L D+ +F
Sbjct: 251 GKTVSEISKKLEDIRTRYAF 270
>gi|294875343|ref|XP_002767276.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
gi|239868839|gb|EEQ99993.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
Length = 1258
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/383 (21%), Positives = 152/383 (39%), Gaps = 54/383 (14%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1124
+R+++P A ++ + + E A + V + K+N L +GTA V +
Sbjct: 902 IRVVDPLTASTSFK----LDLDVDEAATAMTVCYFYQL--KDNRPCLVVGTATGVDPHNP 955
Query: 1125 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA------SGPKIIL 1178
+ + D NL ++ L+G SA+ +G LL+A P + +
Sbjct: 956 SRSAHGKCYIKTYLYDESYNLQL-IHVTPLEGVPSAMYPFEGRLLVALRGSPTVAPVLRI 1014
Query: 1179 HKWTGTELNGIAFYDAPPLY--VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1236
++ L Y P ++ L++ K+ I D SI L W+ Q+ +++ D
Sbjct: 1015 YELGKKRLLKKCEYKFLPESGGIMWLDVNKDRIFAADSRDSILVLRWRYSDNQMQVISDD 1074
Query: 1237 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA--PKMSESWKGQ---------KLLS 1285
C ++D +T+ VV D+ NI + K + +W K+
Sbjct: 1075 TYP-RCITAAAVLDYNTI--VVGDKFDNIAVLRVPGDAKDAGAWGRDNDYASGNTFKMDL 1131
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD---E 1342
FHVG +T R+ M+A ++ +++ T+ G+IG + P E
Sbjct: 1132 IGHFHVGETITSLQRVTMVAGGAE--------------IVIYSTVLGTIGALYPFSSKRE 1177
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
F + + + + P + G +R F+ K + VD +L Y LP E
Sbjct: 1178 HGFLQALEMHMRNTAASPSLTGREHVMYRSFYHPIK-------NFVDADLCEVYYQLPAE 1230
Query: 1403 EQLEIAHQTGTTRSQILSNLNDL 1425
+Q +IA T +++ L D+
Sbjct: 1231 KQRQIAVDMDKTPQEVMKKLEDI 1253
>gi|301630307|ref|XP_002944263.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Xenopus (Silurana) tropicalis]
Length = 92
Score = 59.7 bits (143), Expect = 1e-05, Method: Composition-based stats.
Identities = 36/93 (38%), Positives = 54/93 (58%), Gaps = 1/93 (1%)
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+ E T+RRL LQ L +PH AGLNPR+FR +S+ + + +++D ELL+ Y L
Sbjct: 1 MQEKTYRRLLMLQNALT-VLPHHAGLNPRAFRMLNSSRRMLQNPVRNVLDGELLNRYLYL 59
Query: 1400 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
E+ E+A + GTT IL +L ++ TS
Sbjct: 60 SNMERSELARKIGTTTDIILDDLLEIDRVTSLF 92
>gi|406698009|gb|EKD01256.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
Length = 1216
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 128/285 (44%), Gaps = 30/285 (10%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+A+ QG+LL G + L++ L + P V ++N+V I++GD+ +S +
Sbjct: 949 AVAAFQGYLLAGVGKSLRLYEMGKKALLRKCENNGFPTGVATINVVGARIIVGDLQESTF 1008
Query: 1220 FLSWKE-QGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ ++ QL + A D S F T +D T+ +D+ NI + ++SE
Sbjct: 1009 YCVYRSIPSRQLLIFADD--SQPRFLTAVCNVDYDTV--CCADKFGNIFVNRLEERVSEK 1064
Query: 1278 WK----GQKLLSRAEFHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
G +L F +G A+ T + + + TS + APG R +++ T
Sbjct: 1065 VDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPG----GRDVVVYTT 1120
Query: 1330 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
+ G++G + P D++ F + +L+ + + G + ++R +++ KA
Sbjct: 1121 ISGAVGALVPFISNDDVEF--MTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKA------- 1171
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+VD +L + MLP +Q IA ++L L L G+ F
Sbjct: 1172 VVDGDLCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQLRTGSVF 1216
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 129/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
G T+ +L S L+ +C G V ++A G +D I+L+ + ++S+
Sbjct: 34 GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89
Query: 145 LEFDDSIHGLRITSMHCFES-PEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
+E L I+ FES + ++ K G G + VDP+GR G V L
Sbjct: 90 IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
I+ + ++G A + V+N+ D GY P+
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183
Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
L EL + A R + KH T L ++ +++ WS P D
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233
Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
L V P GGVLV + + H +++ + ++
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+SS + + AA ++ LL ++ GDL T+ ++G V+ L + + + +
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
+ + + F+ S GD L QF S T GL EE P
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
L + +L +++ + ++L G+AS+ + ++ R GP + +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458
Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
GL +N LVE LPG +WT+ + DDEY +
Sbjct: 459 GLDVNV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
Y+++S T+VL + + EV ++ + G T+A L G ++QV G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543
>gi|297598550|ref|NP_001045829.2| Os02g0137400 [Oryza sativa Japonica Group]
gi|255670583|dbj|BAF07743.2| Os02g0137400 [Oryza sativa Japonica Group]
Length = 845
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 124/563 (22%), Positives = 212/563 (37%), Gaps = 107/563 (19%)
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+S Q SRP + + + P CD ++ + + C+ G + V +
Sbjct: 328 VSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCD-TLESAASFSSDQCSEGVVAVAGDALRI 386
Query: 980 ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP-VLKPLNQVLSLLID 1038
G T++ + IPL+ TP + +K +I S L + +
Sbjct: 387 FTIEHLGETFN-----ETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEEREAAKKEC 441
Query: 1039 QEVGHQIDNHNLSSVDL-------------HRTYTVEEYE------------VRILEPDR 1073
E +N N ++ D T E+Y +RIL+P
Sbjct: 442 LEAAGVTENGNANNGDQMENGDGQEDGAEDRNTLPDEQYGYPKAESERWVSCIRILDPKS 501
Query: 1074 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
T + +Q +E A+++ V N KE+ TLLA+GTA + L F
Sbjct: 502 R----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-----------KGLQF 543
Query: 1134 STGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPKIILHKWTG 1183
RN + V E S EL K + AL QG LL G + L+
Sbjct: 544 WPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLGK 603
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L P +VS++ ++ I +GD+ +S ++ ++ QL + A D
Sbjct: 604 RKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLT 663
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR------ 1286
A ID T++ +D+ NI +S+ W+ KL
Sbjct: 664 AANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEE 720
Query: 1287 -AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1342
+FHVG VT + ++ PG + L++GT+ GS+G + ++
Sbjct: 721 IVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSRED 766
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
+ F L+ L P + G + ++R A+ P D ++D +L + LP +
Sbjct: 767 VDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPSLPAD 817
Query: 1403 EQLEIAHQTGTTRSQILSNLNDL 1425
Q +IA + T +IL L D+
Sbjct: 818 MQRKIADELDRTPGEILKKLEDI 840
>gi|401883281|gb|EJT47496.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
Length = 1216
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 128/285 (44%), Gaps = 30/285 (10%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+A+ QG+LL G + L++ L + P V ++N+V I++GD+ +S +
Sbjct: 949 AVAAFQGYLLAGVGKSLRLYEMGKKALLRKCENNGFPTGVATINVVGARIIVGDLQESTF 1008
Query: 1220 FLSWKE-QGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ ++ QL + A D S F T +D T+ +D+ NI + ++SE
Sbjct: 1009 YCVYRSIPSRQLLIFADD--SQPRFLTAVCNVDYDTV--CCADKFGNIFVNRLEERVSEK 1064
Query: 1278 WK----GQKLLSRAEFHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
G +L F +G A+ T + + + TS + APG R +++ T
Sbjct: 1065 VDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPG----GRDVVVYTT 1120
Query: 1330 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
+ G++G + P D++ F + +L+ + + G + ++R +++ KA
Sbjct: 1121 ISGAVGALVPFISNDDVEF--MTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKA------- 1171
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+VD +L + MLP +Q IA ++L L L G+ F
Sbjct: 1172 VVDGDLCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQLRTGSVF 1216
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 130/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
G T+ +L S L+ +C G V ++A G +D I+L+ + ++S+
Sbjct: 34 GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89
Query: 145 LEFDDSIHGLRITSMHCFES-PEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
+E L I+ FES + ++ K G G + VDP+GR G V L
Sbjct: 90 IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
I+ + ++G A + V+N+ D GY P+
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183
Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
L EL + A R + KH T L ++ +++ WS P D
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233
Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
L V P GGVLV + + H +++ + ++
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+SS + + AA ++ LL ++ GDL T+ ++G V+ L + + + +
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
+ + + F+ S GD L QF S T GL EE P
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
L + +L +++ + ++L G+AS+ + ++ R GP + +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458
Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
GL IN LVE LPG +WT+ + DDEY +
Sbjct: 459 GLDINV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
Y+++S T+VL + + EV ++ + G T+A L G ++QV G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543
>gi|388582014|gb|EIM22320.1| hypothetical protein WALSEDRAFT_60013 [Wallemia sebi CBS 633.66]
Length = 1208
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/357 (23%), Positives = 144/357 (40%), Gaps = 72/357 (20%)
Query: 1107 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE------VYSKELKGAISA 1160
E+ET L +GTA + R+++ + + A VT+ ++ ++ A
Sbjct: 892 EDETHLVVGTA---------KDRMMMPQSHKEAYLRVYKVTQDSQLELLHKTDIDDVPYA 942
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
+ + +G LL G + L+ L + +V+LN+V + I +GD+ +S+ F
Sbjct: 943 IHAFKGRLLAGVGKALRLYDLGKKRLLRKCENKSFAAGIVNLNVVGSRIYVGDMQESVSF 1002
Query: 1221 LSWKEQGAQLNLLAKDFGSL----------------DCFATEFL--IDGSTLSLVVSDEQ 1262
+K +L + A D S D F F+ +D ST V DE
Sbjct: 1003 AVYKAPENRLLVFADDIMSRWTTTATPVDYDTVAGGDKFGNIFITRVDKSTSEWVDEDES 1062
Query: 1263 KN-----IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1317
+++ AP S KLL A F+VG VT + Q+ A D
Sbjct: 1063 GGGLLHARGLYHGAPNRS------KLL--AHFYVGDIVTSITKSQLSAGGRD-------- 1106
Query: 1318 DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
L++ L G++G I P D++ F + +L+ + P + G + FR ++
Sbjct: 1107 ------VLVYTCLHGTVGMIIPFASKDDIEF--MSTLELHMRQESPSLVGRDHLGFRSYY 1158
Query: 1375 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
KA VD +L Y LP+ +Q IA++ T ++L + L F
Sbjct: 1159 IPCKA-------FVDGDLCELYASLPVTKQQAIANELDRTSGEVLKKIESLRSAAGF 1208
>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
occidentalis]
Length = 1154
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/304 (21%), Positives = 134/304 (44%), Gaps = 30/304 (9%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA++ E+ + GR+ + R D + E KE GA ++ L IA
Sbjct: 846 VGTAFINQEESEPKVGRIFVL---RWHDGKLETIAE---KEAAGAPYSIREFHQKLAIAI 899
Query: 1173 GPKIILHKWTGT---ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
+ L+ W + F++ + ++ L + ++IL+GD+ +S+ L++
Sbjct: 900 NSTVRLYSWNAEKDLQSECTPFFN---IVILHLKCLGDYILVGDLMRSMTLLNYNADITS 956
Query: 1230 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1289
L + +D+ + A E L + + L+ ++ N+ + P ++ + Q + A +
Sbjct: 957 LEEIGRDYQTNWTTAVEILDEDTFLA---AESNLNLYVCKRDPSAADDTR-QHMHEVALY 1012
Query: 1290 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT--NRFALLFGTLDGSIGCIAPLDELTFRR 1347
H+G V ++ ++ A PG N+ + L+G+L G++G I P+ + +
Sbjct: 1013 HLGEMVNVIVKGSLVM-------AQPGDMPLPLNK-SFLYGSLHGAVGVIVPIKQELYAI 1064
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1407
L +Q L ++ V + +R F + K P +D +L+ LP +E LE
Sbjct: 1065 LNQIQTNLAKTIKSVGKIEHGFWRTFLAERKIE-PAT-GFIDGDLIEQLLDLP-KEALES 1121
Query: 1408 AHQT 1411
Q+
Sbjct: 1122 VSQS 1125
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 99/428 (23%), Positives = 165/428 (38%), Gaps = 82/428 (19%)
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+PV+ I++E + T +H + AL L + P W NL +A L+ V
Sbjct: 178 DPVLAIVYEEQQT-------RHMKTHVIALR-DKELMKGP--WGQRNLDLEADMLIPVED 227
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
GV++VG TI YH +Y S + S +D V
Sbjct: 228 TETGVIIVGGETIVYHYG--------QDYICIQPSFLRTTKISCYCRIDNNRL------V 273
Query: 377 ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
+L G L +LT+ + + V L + ++ + N + F+GSRLGDS L++
Sbjct: 274 FILGGICGRLFILTLRRENKKVVSHSLDLLGSVSIPECLSYLDNGVVFVGSRLGDSQLIR 333
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSASNNTE 495
+ A P + L ++ A+ DM+ +L G T
Sbjct: 334 --------------------MHAQEPFIEVLESYTNLGAILDMI-VVDLEKQGQDQLITC 372
Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYH 555
S Q G L+ G+ I+ A VEL G KGIW +
Sbjct: 373 SGQGA-----------CGSLRIIRNGIGIHELAC------------VELSGIKGIWAL-- 407
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--TADLLTEVTESVDYFVQGRTIA 613
R + A DD L++S +T V + + L +VT + + +T
Sbjct: 408 ---RMNTAQLEEDTPTDDT----LVLSFVGQTRVFNCSSTEELEQVTLPAAFDIDSQTFC 460
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQ-DLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
A N+ G +VIQV ++ ++ + T+ D F P + N +++++ + V
Sbjct: 461 ARNVLG-NQVIQVTDKRVNLISVTSKTRVDQWFPPEGEIITQCACNDVQVALALKNVLVY 519
Query: 673 LGMSDGSI 680
L + DGS+
Sbjct: 520 LEIRDGSL 527
>gi|169848339|ref|XP_001830877.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
gi|116508046|gb|EAU90941.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
Length = 1213
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 103/485 (21%), Positives = 200/485 (41%), Gaps = 74/485 (15%)
Query: 977 ILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1035
+L+I +P GS Q +PL TP ++ E N + LI S + +
Sbjct: 770 VLRIFTIPKLGSKLK-----QDTLPLSYTPRKLITHPENNYFYLIESDHRVYSEEATKAK 824
Query: 1036 LID-QEVGHQIDNH--NLSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSS 1089
L + Q+ G +ID +L + R +RI++P +T A P+ ++
Sbjct: 825 LDELQKKGKKIDEEIISLPPSEFGRPKAPAGTWASNIRIIDPVEN----KTVAVFPLDNN 880
Query: 1090 ENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLV 1146
E A ++ +V + + E L +GTA +D R T + +N L
Sbjct: 881 EAAFSIAIVPF---SARNGELHLVVGTA----KDTTVSPRTCESGFLRTYKFTENGTGLE 933
Query: 1147 TEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1206
++ E AL + QG L G + ++ +L + +V+L
Sbjct: 934 L-LHKTETDDVPMALLAFQGRLAAGVGKALRIYDIGKKKLLRKVENKSFTTAIVTLTTQG 992
Query: 1207 NFILLGDIHKSIYFLSWKEQGAQLNLLAK-------------DFGSL---DCFATEFL-- 1248
+ IL+GD+ +S+ ++ +K+ +L A D+ ++ D F F+
Sbjct: 993 SRILVGDMQESVQYVVYKQPENRLLTFADDTQPRWVTAITMVDYNTIVAGDRFGNIFVNR 1052
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1308
+D S +S V ++ I + P + + K++ A FHVG +T ++ ++A
Sbjct: 1053 LD-SKVSDQVDEDPTGAGILHEKPILMGAPHKTKMI--AHFHVGDIITSLHKVSLVA--- 1106
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGL 1365
R +++ L G+IG + P +++ F + +L++ + P + G
Sbjct: 1107 -----------GGREVIVYTGLHGTIGILMPFISKEDVDF--ISTLEQHMRTEQPSLVGR 1153
Query: 1366 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ ++R ++ KA +VD +L Y LP +Q IA++ T ++L L +
Sbjct: 1154 DQLAYRGYYVPVKA-------VVDGDLCETYAHLPASKQSSIANELDRTVGEVLKKLEQM 1206
Query: 1426 ALGTS 1430
+ +S
Sbjct: 1207 RVTSS 1211
>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
Length = 2870
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/218 (22%), Positives = 103/218 (47%), Gaps = 16/218 (7%)
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
P +++++++ ++I++GDI S+ L + + +QL + +D+ ++ C + L S
Sbjct: 2631 TPSSWIMTVDVYGDYIVVGDIMTSVTILQYDYENSQLFEVCRDYSNIWCTS---LCALSK 2687
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL---QMLATSSDR 1310
+VVSD N I + KL S + F+ G+ + K L L ++ D+
Sbjct: 2688 SHIVVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPLSNTNLIEEDYDK 2747
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL-TFRRLQSLQKKLVDSVPHVAGLNPRS 1369
+D +L + +GSI + P F++ ++ + D++ + L+ +
Sbjct: 2748 RNILTKND-----GILCASSEGSISVLIPFSSFANFKKALCIEIAITDNISSIGNLSHNA 2802
Query: 1370 FRQFHSNGKA-HRPGPDSIVDCELLSHYEMLPLEEQLE 1406
+R++ N ++ H G IVD ELL + + E+Q +
Sbjct: 2803 YREYKVNFRSKHCKG---IVDGELLKMFFHMSFEKQYK 2837
>gi|410045300|ref|XP_508472.4| PREDICTED: DNA damage-binding protein 1 [Pan troglodytes]
Length = 1107
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 90/199 (45%), Gaps = 20/199 (10%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 830 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 883
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 884 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 938
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 939 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 995
Query: 1280 GQKLLSRAEFHVGAHVTKF 1298
Q L FH+G V F
Sbjct: 996 RQHLQEVGLFHLGEFVNVF 1014
>gi|327309050|ref|XP_003239216.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
gi|326459472|gb|EGD84925.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
Length = 1209
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/367 (22%), Positives = 161/367 (43%), Gaps = 44/367 (11%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG----- 1136
+++ ++ +E A+++ V+ T++E+ET L +GT G+D+ R F+ G
Sbjct: 870 SSLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHIY 920
Query: 1137 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1196
R + + L ++ +++ AL QG LL GP + ++ +L P
Sbjct: 921 RFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQITP 979
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
+V L + I++ D+ +S+ ++ +K Q L A D T ++D T++
Sbjct: 980 RVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALISFADDIIPRWTTCTT-MVDYETVA- 1037
Query: 1257 VVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLATS 1307
D+ NI + K SE G L+ ++ GA V F Q + TS
Sbjct: 1038 -GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPTS 1095
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1364
+T G R L++ L G++G P D++ F Q+L+ +L P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLAG 1149
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+ +R +++ K ++D +L + +LP +++ IA + + +I ++D
Sbjct: 1150 RDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKISD 1202
Query: 1425 LALGTSF 1431
+ ++
Sbjct: 1203 MRTKVAY 1209
>gi|255081708|ref|XP_002508076.1| predicted protein [Micromonas sp. RCC299]
gi|226523352|gb|ACO69334.1| predicted protein [Micromonas sp. RCC299]
Length = 1199
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 86/388 (22%), Positives = 162/388 (41%), Gaps = 72/388 (18%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
VRI++P A T+ + M +E AL V + +E LA+G+A
Sbjct: 851 VRIVDPASA----STKQIVEMTGNEAALCCCHVYF----PQADELFLAVGSA-------- 894
Query: 1126 ARGRVLLFSTGRNADN--------PQNLVTEVYSKE-LKGAISALASLQGHLLIASGPKI 1176
V L + R+++ Q+ E++ K L G A+ +G LL+ G +
Sbjct: 895 ----VSLTFSPRDSEGGFIHLYRYTQDGGIELFHKTPLDGVPGAMCGFKGRLLVGVGNTL 950
Query: 1177 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1236
L+ + +L P ++ +++ I +GD+ +S +++ +K + + ++A D
Sbjct: 951 RLYDFGKKKLLRKVENRNFPNFIKTIHAQGERIYVGDVQESFHYVRYKREDGSMYIVADD 1010
Query: 1237 ----------------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
D F F+ S L+ VSDE + P ++ G
Sbjct: 1011 VQPRHVTAACPLDYDTIAGGDRFGNVFV---SRLAQDVSDEIEE------DPTGGKTAYG 1061
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
Q L+ A + VT+F + + + T A G + ++++ TL G++G + P
Sbjct: 1062 QGALNGASHKIN-QVTQFHVGETVCALTKGTLQAGGLE-----SMIYATLMGTLGALMPF 1115
Query: 1341 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
+++ F L+ + +P + G + +FR ++ P D ++D +L Y
Sbjct: 1116 GNREDVDF--CTHLEMHMRQELPPLLGRDHLAFR------SSYFPVKD-VIDGDLCEMYT 1166
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+LP E Q +A T S++L L DL
Sbjct: 1167 VLPHEAQRRVAEDMDRTVSEVLKKLEDL 1194
>gi|258570355|ref|XP_002543981.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
gi|237904251|gb|EEP78652.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
Length = 1209
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/365 (21%), Positives = 162/365 (44%), Gaps = 44/365 (12%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR------GRVLLFSTGR 1137
I ++ +E A++V V +++++ET L +GT G+D+ G + ++ R
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVNPPSSSCGYIHIY---R 921
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1197
++ + L ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 922 FQEDGKEL-EFIHKTKVESPPQALLAFQGRLLAGIGTNLRIYDLGMKQLLRKCQAEVVPR 980
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
+V L + I++ D+ +S+ ++ +K Q +L A D + T ++D T++
Sbjct: 981 MIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDIIARWTTCTT-MVDYETVA-- 1037
Query: 1258 VSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSD 1309
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 1038 GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMIHFYPQDIPTSIQ 1097
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1366
+T G R L++ L G+IG + P +++ F QSL+ +L P +AG +
Sbjct: 1098 KTQLVAG----GRDILVWTGLQGTIGMLIPFVSREDVDF--FQSLEMQLTSQTPPIAGRD 1151
Query: 1367 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
+R +++ K +D +L Y LP +++L IA + + +I ++D+
Sbjct: 1152 HLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISDMR 1204
Query: 1427 LGTSF 1431
++
Sbjct: 1205 TKVAY 1209
Score = 41.2 bits (95), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 40/265 (15%)
Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
LL T+ GDL +T+ V D V+RL L + + S + + N F+ S G
Sbjct: 307 LLQTEDGDLFKVTIDMVEDDNGQPTGEVRRLKLKYFDTVPIASSLCILKNGFLFVASENG 366
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGD--IEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
+ QF + ++F +E AP R R + + L + +N +
Sbjct: 367 NHHFYQFEKLGDDDEETEFTSDDFSSDPLEPLAPVYFRPRPAENLNLVESINSVNPLMSC 426
Query: 489 SASNNTES-AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
+N TE A + ++ + LK +GL + S+ EL +P
Sbjct: 427 KVANLTEDDAPQLYTLCGTGARSTFRTLK---HGLEV---------SEIVESELPSVPS- 473
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+WT K +R +D+Y AY+I+S T+VL + + EVT++ +
Sbjct: 474 -AVWTT--KLTR------------NDQYDAYIILSFTNGTLVLSIGETVEEVTDT-GFLS 517
Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
T+A L G +IQV +G R
Sbjct: 518 SAPTLAVQQL-GEDSLIQVHPKGIR 541
>gi|167390599|ref|XP_001739420.1| DNA damage-binding protein [Entamoeba dispar SAW760]
gi|165896898|gb|EDR24200.1| DNA damage-binding protein, putative [Entamoeba dispar SAW760]
Length = 1088
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/349 (22%), Positives = 158/349 (45%), Gaps = 44/349 (12%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQN 1144
++ +E+AL++ + + + + + IGTA+ + +V + GR+L+ D
Sbjct: 756 LKENEHALSIEEIVV------DEKEMFVIGTAFAKPNEVEPSSGRILIVQI---KDGKLE 806
Query: 1145 LVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP---- 1196
+V + K++ GA+ ++ +L + +L ++ K+++ ++ NG + +
Sbjct: 807 IV---FEKDVNGAVYSIKTLLKKYLAMSIEKKLVIFEYQRIITNGEFEVKLQEKGSCNVK 863
Query: 1197 ---LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL---LAKDFGSLDCFATEFLID 1250
LYV ++ N IL+GD+ KSI S+ G N +++DF + A EF+ +
Sbjct: 864 LIGLYVKTMG---NKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYTTAIEFVDE 920
Query: 1251 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
LS SD N+ +F +ES + +L + A HVG + + + T S
Sbjct: 921 NCYLS---SDSNSNLLVFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHSTY 976
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-PRS 1369
+ + +LFG + G IG I + + L +Q +++ + + P
Sbjct: 977 -------ETIQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDE 1029
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+++ + K R +I+D ++ Y + E+Q EIAH +G QI
Sbjct: 1030 WKKVIDDWK--RMPSSNIIDGNIVESYLEMSKEKQCEIAHLSGVNEEQI 1076
>gi|298713790|emb|CBJ27162.1| spliceosomal protein sap, putative [Ectocarpus siliculosus]
Length = 1256
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/378 (22%), Positives = 153/378 (40%), Gaps = 55/378 (14%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P T + + +E AL+V V N + E +A+GTA +
Sbjct: 909 IRLLDPVEG----TTVECLDLDDNEAALSVAPVAFHN---RNGEAFVAVGTA--KSLTFH 959
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
RG F +N + ++ E+ A+ QG LL+ G + ++ +
Sbjct: 960 PRGHEGCFVHVYRI--LENRLVLLHKTEVPDVPLAMKEFQGRLLVGVGQSLRMYDLGRKK 1017
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLD 1241
L P VVSL + + + GD +S + ++ +L A D F +
Sbjct: 1018 LLRKCENKRMPSMVVSLTVTGDRVFAGDQMESCHCFKYRRAENRLVEFADDQVPRFMTKT 1077
Query: 1242 CFATEFLIDGST---------LSLVVSDEQKNI---QIFYYAPKMSESWKGQKLLSRAEF 1289
C I G+ + L VSD+ N ++ + + +S + K+ + +F
Sbjct: 1078 CLLDYDSIAGADKFGNIFVLRVPLDVSDDVDNPTGNRLLWDSGHLSGA--PNKVQQQLQF 1135
Query: 1290 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFR 1346
HVG V+ S RT PG + LL+ T++GSIG + P D++ F
Sbjct: 1136 HVGEVVS----------SLRRTTLVPGGAEV----LLYSTINGSIGALLPFKSRDDVDF- 1180
Query: 1347 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
++ + P + G + S+R ++ K ++D +L + LP E+Q
Sbjct: 1181 -FTHMEMYMRQEKPTLCGRDHISYRSYYLPAK-------DVIDGDLCEQFSSLPFEKQKL 1232
Query: 1407 IAHQTGTTRSQILSNLND 1424
+A+ T +++ L D
Sbjct: 1233 VANGLDRTVGEVVKKLED 1250
>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1409
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 137/321 (42%), Gaps = 40/321 (12%)
Query: 1114 IGTAYV----QGEDVAARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALASLQGHL 1168
+GT+Y+ E+ +GR+L+ G ++D NP +V S ELKGA +LA + L
Sbjct: 826 VGTSYLADPEMDENSEVKGRILVL--GVDSDKNPYQIV----SHELKGACRSLAVMGDKL 879
Query: 1169 LIASGPKIILHKW------TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+ ++++ + +G+ L F P + V L++ N I + D+ +S+ +
Sbjct: 880 VAGLSKTVVVYDYAEESSTSGSLLKLATFR--PSTFPVDLDVNGNMIGVADLMQSMTLIE 937
Query: 1223 W--KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
+ + G + L+ + +AT L + +D Q N+ + P
Sbjct: 938 FIPAQDGNKARLVERARHFQYIWATAVCHLEQDL-WIEADAQGNLMVLRRNPNAPTEHDK 996
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
+++ +EFH+G + K L +++ +D P K T++GSI A +
Sbjct: 997 KQMEVISEFHLGEQINKIRPLDVVSGEND-----PIEPKA-----FLATIEGSIYVFADI 1046
Query: 1341 DE------LTFR-RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
L F+ RL + K L + AGL+ S+R F N K GP VD EL+
Sbjct: 1047 KPEYQSLLLQFQERLAGVIKTLGQADEPGAGLSFMSWRGFR-NAKRSADGPFRFVDGELI 1105
Query: 1394 SHYEMLPLEEQLEIAHQTGTT 1414
+ L Q + G T
Sbjct: 1106 ERFLDLDAGRQEAVVQGLGPT 1126
>gi|409045147|gb|EKM54628.1| hypothetical protein PHACADRAFT_210427 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1213
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 115/498 (23%), Positives = 202/498 (40%), Gaps = 77/498 (15%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV- 1023
C G I + S +L+I Q+P T Q IPL TP ++ + +I
Sbjct: 759 CPEGLIGI-SGSVLRIFQIPKLGTKLK----QDSIPLSYTPRKMIVHPTNGFFYMIEGDH 813
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNH--NLSSVDLHRTYTVE---EYEVRILEPDRAGGPW 1078
V+ + + ++ G Q+D NL R +RI+ P A
Sbjct: 814 RVMGEEAKQKKIEELRQQGKQVDEEMLNLPPEVFGRPKAPAGTWASGIRIVSPVDA---- 869
Query: 1079 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF-STGR 1137
QT IP+ ++E A ++ VV T + E L +GTA Q +A R F T R
Sbjct: 870 QTVNFIPLDNNEAAFSIAVVPF---TARGGELTLVVGTA--QDTFLAPRSCTSGFLRTYR 924
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL----NGIAFYD 1193
D+ ++ + ++ E A+ + QG L G + ++ +L F +
Sbjct: 925 FLDDGRD-IELLHKTETNDVPLAIMAFQGRLAAGIGKALRIYDIGKKKLLRKVESKNFSN 983
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
A +V+LN + I++GD+ +SI++ +K +L + A D A +ID +T
Sbjct: 984 A----IVTLNTQGSRIIVGDMQESIFYAVYKPPENRLLIFADDAQPRWITAVT-MIDYNT 1038
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLLSRAEFHVGAHV 1295
++ D N+ + PK+S+ KG K A FHVG V
Sbjct: 1039 VA--AGDRFGNVFVNRLDPKISDQVDDDPTGAGILHEKGILMGAPHKTAMIAHFHVGDIV 1096
Query: 1296 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQ 1352
T ++ ++A R LL+ L G++G + P +++ F + +L+
Sbjct: 1097 TSIHKVSLVA--------------GGRELLLYTGLHGTVGILVPFVSKEDVDF--ISTLE 1140
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+ + + G + ++R ++ KA +VD +L + LP +Q IA +
Sbjct: 1141 QHMRTEQLSLVGRDHLAWRGYYVPVKA-------VVDGDLCEMFARLPASKQSSIATELD 1193
Query: 1413 TTRSQILSNLNDLALGTS 1430
T ++L L L + S
Sbjct: 1194 RTVGEVLKKLEQLRVTAS 1211
>gi|301110252|ref|XP_002904206.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
gi|262096332|gb|EEY54384.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
Length = 1197
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 126/298 (42%), Gaps = 48/298 (16%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V++ E+ A+ QG LL++ G + ++ ++ P +V L +
Sbjct: 922 VHTTEIDDIPHAMCEFQGRLLVSVGRALRIYDLGKKKMLRKCENRNFPSILVELKAAGDR 981
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE-FLIDGSTLSLVVSDEQKNIQI 1267
I D+H+S +F+ +K+ QL + A D + F T L+D TL +D+ N+ +
Sbjct: 982 IYASDMHESFHFVKYKKDENQLVIFADD--CVPRFITSSVLLDYDTL--CGADKFGNVFV 1037
Query: 1268 FYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
++S+ W KL A+FHVG VT +R ++
Sbjct: 1038 SRLPSEVSDEIDNPTGNRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMVRSSLV------ 1091
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
PG + A+++ T+ G IG + P +++ F L+ + P + G +
Sbjct: 1092 ----PGGTE----AVIYATIMGRIGALIPFTSREDVDF--YTHLEMYMRQEQPPLCGRDH 1141
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
S+R ++ K +I D +L + L +E+Q +A T +++L L D+
Sbjct: 1142 LSYRSYYIPVK-------NITDGDLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 68/306 (22%), Positives = 114/306 (37%), Gaps = 86/306 (28%)
Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVE--LDAAHATWLQND 375
VLV+G NT+ Y ++ +CA+ PR + + AT Q D
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAI---------------PRREGEHRDIIIVSAATHKQRD 291
Query: 376 V--ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ LL ++ GDL +++ Y G VV+ + + + + S + L F S +
Sbjct: 292 LFFVLLQSELGDLYKISLDYSGNVVEEIKIQFFDTIPVASSMCITKTGLLFCASEFSNHY 351
Query: 434 LVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA---------------LQD 477
L QF + G G K ++ ST LR+ ++ A + D
Sbjct: 352 LFQFLSIGEG----DDAAKCSSLAMDPTEFSTFPLRKLTNLALASSSASLSPVTQLLVDD 407
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ N + +Y NN S+ L+ +GL I A++
Sbjct: 408 LANEQTPQMYALCGNNNRSS-----------------LRVLRHGLPITEMAASA------ 444
Query: 538 NYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
LPG K +W + +Y D Y Y+++S E T+VLE + +
Sbjct: 445 ------LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETV 484
Query: 597 TEVTES 602
EV +S
Sbjct: 485 EEVAQS 490
>gi|312370905|gb|EFR19207.1| hypothetical protein AND_22901 [Anopheles darlingi]
Length = 1287
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 68/311 (21%), Positives = 126/311 (40%), Gaps = 43/311 (13%)
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
DN N + V+ E+ A AL + QG LL G + ++ +L P +
Sbjct: 1001 DNQTNELEHVHRTEIDDAPGALCAFQGRLLAGIGKVLRMYDLGKKKLLRKCENKHIPNQI 1060
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
V++ + + + D+ +S+Y L +K QL + A D + L+D T++
Sbjct: 1061 VNIQGMGQRVYVSDVQESVYCLKYKRPENQLIIFADDTHP-RWVTSATLLDYDTVA--TG 1117
Query: 1260 DEQKNIQIFYYAPKMS----ESWKGQKLL----------SRAE----FHVGAHVTKFLRL 1301
D+ NI + +S E G K L +AE FH+G V +
Sbjct: 1118 DKFGNIAVLRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGEIVMSLQKA 1177
Query: 1302 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVP 1360
++ PG ++ L++ T+ G++G + P + Q L+ + + P
Sbjct: 1178 TLI----------PGGSES----LIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENP 1223
Query: 1361 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1420
+ G + S+R ++ K +++D +L + L +Q IA G T S++
Sbjct: 1224 PLCGRDHLSYRSYYYPVK-------NVMDGDLCEQFTSLDPAKQKSIASDLGRTPSEVAK 1276
Query: 1421 NLNDLALGTSF 1431
L D+ +F
Sbjct: 1277 KLEDIRTRYAF 1287
>gi|302916981|ref|XP_003052301.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256733240|gb|EEU46588.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1212
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 105/488 (21%), Positives = 206/488 (42%), Gaps = 56/488 (11%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
C G + + Q I I +L G T +QK IPL TP ++ E+ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFNIDRL--GDTL-----IQKSIPLTYTPKKLVKHPEQPLFYTIE 813
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1079
+ P LL D V + D+ L D + + +++P G Q
Sbjct: 814 ADNNTLPPELRAQLLADPNVVNG-DSQVLPPEDFGYPRANRRWASCINVVDPLSEEG--Q 870
Query: 1080 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1136
T+ +++E A++ VV+ +++NE L +GT G+D+ + +S G
Sbjct: 871 VLQTVHFENNEAAVSATVVSF---ASQDNENFLVVGT----GKDMIVNPQS--YSEGYLY 921
Query: 1137 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
R ++ + L ++ +++ AL QG + +A G ++ ++ ++ A +
Sbjct: 922 IYRFVEDGREL-EFIHKTKIEEPPLALLPFQGKVAVAVGTQLRIYDLGMRQMLRKAQAEV 980
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
+VSLN + I++GD+ + + + +K +L A D + T ++D
Sbjct: 981 SAQRIVSLNTQGSRIVVGDVQQGVTLVVYKSATNKLIPFADDTVARWTTCTT-MVDYE-- 1037
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1306
S+ D+ N+ I K SE ++ + +R H H + Q + T
Sbjct: 1038 SIAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRLGLMCHFYTQDVPT 1097
Query: 1307 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1363
S +T G + LL+ + G+IG P ++ F Q+L++ L P +A
Sbjct: 1098 SITKTSLVVGGQEI----LLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
G + +R +++ K ++D +L Y +LP +++ IA + + +I ++
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREIERKIS 1204
Query: 1424 DLALGTSF 1431
D+ ++F
Sbjct: 1205 DIRTRSAF 1212
>gi|242772631|ref|XP_002478075.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
10500]
gi|218721694|gb|EED21112.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
10500]
Length = 1209
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 101/454 (22%), Positives = 195/454 (42%), Gaps = 58/454 (12%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVP-VLKPLNQVLSLLIDQEV--GHQI----DN 1047
+Q+ IPL TP E+ L+ +I S VL P Q LL D ++ G + +
Sbjct: 785 LQESIPLAYTPRHFVKHPEQPLFYVIESENNVLAPATQT-RLLEDSKLQNGEAVIPPAET 843
Query: 1048 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1107
H +E +++P + + + ++ +E+A++V V+ +++
Sbjct: 844 FGFPRATGHWASCIE-----VVDPINSKS---VLSRLELEENESAVSVAAVSF---ASQD 892
Query: 1108 NETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALA 1162
NET L +GT G+DV R FS G R ++ + L ++ +++ AL
Sbjct: 893 NETFLVVGT----GKDVVTYPRS--FSAGFIHIYRFQEDGREL-EFIHKTKIEEPPLALL 945
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+ QG L+ G + ++ ++ +A P +V L + I++ D+ +S+ ++
Sbjct: 946 AFQGRLVAGIGKNLRVYDLGMKQMLRKCQVEASPNLIVGLQTQGSRIIVSDVQESVTYVV 1005
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES----W 1278
+K Q QL D + AT ++D T + D+ N+ + K+SE
Sbjct: 1006 YKYQENQLIPFVDDVIARWTTATT-MVDYETTA--GGDKFGNLWLVRCPKKVSEESDEDG 1062
Query: 1279 KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
G L+ + G + Q + TS +T G R L++ L G+I
Sbjct: 1063 SGAHLIHERSYLQGTPNRLDLMVHFYTQDIPTSLHKTNLVVG----GRDILVWTGLQGTI 1118
Query: 1335 GCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
G + P +++ F Q+L+ +L P +AG +R ++ K ++D +
Sbjct: 1119 GMMIPFISREDVDF--FQNLEMQLASQNPPLAGREHLIYRSYYVPVKG-------VIDGD 1169
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
L Y +LP +++L IA + + +I ++D+
Sbjct: 1170 LCESYFLLPNDKKLMIAGELDRSVREIERKISDM 1203
>gi|212531303|ref|XP_002145808.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
18224]
gi|210071172|gb|EEA25261.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
18224]
Length = 1209
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 80/358 (22%), Positives = 160/358 (44%), Gaps = 42/358 (11%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1138
+ ++ +E+A+++ V+ +++NET L +GT G+DV R FS G R
Sbjct: 872 LELEENESAVSIAAVSF---ASQDNETFLVVGT----GKDVVTYPRS--FSAGFIHIYRF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
++ + L ++ +++ AL + QG L+ G + ++ ++ +A P
Sbjct: 923 QEDGREL-EFIHKTKIEEPPLALLAFQGRLVAGIGKNLRIYDLGMKQMLRKCQVEAVPNL 981
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+V L + I++ D+ +S+ ++ +K Q QL D + AT ++D T +
Sbjct: 982 IVGLQTQGSRIIVSDVQESVTYVVYKYQENQLIPFVDDVIARWTTATT-MVDYETTA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDR 1310
D+ N+ + K S E G L+ + G L + Q + TS +
Sbjct: 1039 GDKFGNLWLVRCPQKASDDSDEDGSGAHLIHERSYLQGTANRLNLMIHYYTQDIPTSLHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ L G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1099 TNLVVG----GRDILVWTGLQGTIGIMVPFISREDVDF--FQNLETQLASQNPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R ++ K ++D +L Y +LP +++L IA + + +I ++D+
Sbjct: 1153 LIYRSYYVPSKG-------VIDGDLCESYFLLPNDKKLMIAGELDRSVREIERKISDM 1203
>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
queenslandica]
Length = 1214
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 76/389 (19%), Positives = 160/389 (41%), Gaps = 59/389 (15%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1123
+R++ P++ +T + + +E A ++ V + + E + +GTA +
Sbjct: 862 LRVMHPNQG----KTLDIVQFEQNEAAFSLAVCQF--VSKGDLEWFVVVGTAKDMIITPR 915
Query: 1124 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ G +++F + + V++ +L A+A QG LL+ G + ++
Sbjct: 916 AISSGSLIVFRLSPDGSK----LEHVHTTQLDDVPIAMAPFQGRLLVGVGKLLRIYDIGK 971
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
++ P VV + ++ + +GD+ ++++FL ++ QL + A + C
Sbjct: 972 KKMLRKCENKHLPYLVVDIKVMGRRVYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFC- 1030
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1285
T ++D +T++ +D+ NI I +++ W QK
Sbjct: 1031 TTSCILDYNTVA--SADKFGNITILRLPSDVTDQVDEDPSGSRSLWDRGFLNGATQKANV 1088
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1342
+HVG + ++ ++ PG + L++ TL GSIG + P ++
Sbjct: 1089 MTSYHVGEGINTLHKVSLI----------PGGSE----VLVYTTLSGSIGILVPFSSKED 1134
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
F Q L+ + ++ G + SFR ++ K S++D +L Y L
Sbjct: 1135 SDF--FQHLEMHMRSEWSNLVGRDHLSFRSYYVPVK-------SVIDGDLCEVYNSLDPS 1185
Query: 1403 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
++ EIA + S++ L DL +F
Sbjct: 1186 KRREIALDLDRSPSEVAKKLEDLRTRYAF 1214
>gi|164656549|ref|XP_001729402.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
gi|159103293|gb|EDP42188.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
Length = 1207
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 85/361 (23%), Positives = 143/361 (39%), Gaps = 35/361 (9%)
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS 1134
G QT A ++ E AL++ +V NE L +G+A + R S
Sbjct: 859 GTTMQTCAEYALEKDEAALSMALVPF---AACGNELFLVVGSA-LGVTHAPLTWRAAFLS 914
Query: 1135 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
T R DN L V+ E+ AL + G LL +GP + + +L
Sbjct: 915 TYRLTDNGCGLAL-VHKTEVDHVPLALRAFHGRLLAGTGPYVRIFDMGTKKLLRKCQSRP 973
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
P VVSL + +++GD+ +S+++ +K L A D + ++D T+
Sbjct: 974 FPSKVVSLQVQGYRVIVGDMQESVHYSVYKPATNTLVAFADDIMPRWTTSALLMLDYDTV 1033
Query: 1255 SLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLRLQMLA----- 1305
+ D+ N+ + S E G L + + +GA R Q+LA
Sbjct: 1034 --MAGDKFGNVFVLRIDSSASLSADEDPTGLMLQNERSYLMGAA----HRAQLLAHYHVG 1087
Query: 1306 ---TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPH 1361
TS PG R +L+ ++G+IG + P + R +L+ +
Sbjct: 1088 DIITSLSMESLVPG----GRPVVLYTCVNGTIGALVPFISREDVRLFTTLEMHMRQENLS 1143
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
+ G + ++R ++ KA +VD +L Y LP E+Q IA + T + I
Sbjct: 1144 LTGRDHLAYRGHYTPVKA-------VVDADLCELYTALPHEKQESIADELDRTPADIAKK 1196
Query: 1422 L 1422
L
Sbjct: 1197 L 1197
>gi|164429062|ref|XP_957282.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
gi|157072394|gb|EAA28046.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
Length = 1192
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 106/475 (22%), Positives = 192/475 (40%), Gaps = 89/475 (18%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+QK IPL TP + E+ + I S P LL Q G
Sbjct: 769 IQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG------------ 816
Query: 1055 LHRTYTVEEYEVRILEPD-----RAGGPWQTRATIPMQSSENALTVRVVTLFNT------ 1103
+ +L P+ RA G W + +I SE ++ + L N
Sbjct: 817 ----------DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSA 866
Query: 1104 -----TTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKE 1153
++E E+ L +GT G+D+ R F+ G R ++ ++L ++
Sbjct: 867 AIVPFASQEGESFLVVGT----GKDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTR 919
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
++ AL QG LL G + ++ +L A D P +VSL N I++GD
Sbjct: 920 VEEPPLALIPFQGRLLAGVGKTLRIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGD 979
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+ + I ++ +K +G +L A D +L+ + T ++D S+ D+ NI I
Sbjct: 980 LQQGITYVVYKAEGNRLIPFADD--TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPE 1035
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFL-----RL--------QMLATSSDRTGAAPGSDK 1319
++S+ +E H+ H +L RL Q L TS +T G
Sbjct: 1036 RVSQETDEPG----SEIHL-MHARNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD 1090
Query: 1320 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
LL+ L G++G P +++ F Q+L+ + P +AG + +R +++
Sbjct: 1091 ----VLLWSGLQGTVGVFIPFVSREDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTP 1144
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
K ++D +L + +LP +++ IA + + +I ++D+ ++F
Sbjct: 1145 VKG-------VIDGDLCERFSLLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1192
>gi|348667612|gb|EGZ07437.1| hypothetical protein PHYSODRAFT_565381 [Phytophthora sojae]
Length = 1197
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 124/298 (41%), Gaps = 48/298 (16%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V++ E+ A+ QG LL++ G + ++ ++ P +V L +
Sbjct: 922 VHTTEIDDIPHAMCEFQGRLLVSVGRALRIYDLGKKKMLRKCENRNFPSILVELKAAGDR 981
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE-FLIDGSTLSLVVSDEQKNIQI 1267
I D+H+S +F+ +K+ QL + A D + F T L+D TL +D+ N+ +
Sbjct: 982 IYASDMHESFHFVKYKKDENQLVIFADD--CVPRFITSSVLLDYDTL--CGADKFGNVFV 1037
Query: 1268 FYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
++S+ W KL A+FHVG VT + R
Sbjct: 1038 SRLPSEVSDEIDNPTANRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMV----------R 1087
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T PG + A+++ T+ G IG + P ++ F L+ + P + G +
Sbjct: 1088 TSLVPGGIE----AIIYATIMGRIGALIPFTSRQDVDF--YTHLEMYMRQEQPPLCGRDH 1141
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
S+R ++ K ++ D +L + L +E+Q +A T +++L L D+
Sbjct: 1142 LSYRSYYIPVK-------NVTDGDLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 116/301 (38%), Gaps = 76/301 (25%)
Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV- 376
VLV+G NT+ Y ++ +CA+ Q PR V + AT Q D+
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAIP---------RRQGEPRDIVIV----SAATHKQRDLF 293
Query: 377 -ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
LL ++ GDL +++ Y G V+ + + + + S + L F S + L
Sbjct: 294 FVLLQSELGDLYKISLDYSGNAVEEIKIQFFDTVPVASSMCITKTGLLFCASEFSNHYLF 353
Query: 436 QF-TCGSGT------------SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
QF + G G + LS+ + +++ A S L + + D+ N +
Sbjct: 354 QFLSIGEGDDTAKCSSLAMDPTELSTFPLRKLTNLQL-ASSMPSLSPVTQLLVDDLANEQ 412
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
+Y N+ S+ L+ +GL I A++
Sbjct: 413 TPQMYALCGNSNRSS-----------------LRVLRHGLPITEMAASA----------- 444
Query: 543 ELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
LPG K +W + +Y D Y Y+++S E T+VLE + + EVT+
Sbjct: 445 -LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETVEEVTQ 489
Query: 602 S 602
S
Sbjct: 490 S 490
>gi|336469942|gb|EGO58104.1| pre-mRNA splicing factor RSE1 [Neurospora tetrasperma FGSC 2508]
Length = 1192
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 106/475 (22%), Positives = 192/475 (40%), Gaps = 89/475 (18%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+QK IPL TP + E+ + I S P LL Q G
Sbjct: 769 IQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG------------ 816
Query: 1055 LHRTYTVEEYEVRILEPD-----RAGGPWQTRATIPMQSSENALTVRVVTLFNT------ 1103
+ +L P+ RA G W + +I SE ++ + L N
Sbjct: 817 ----------DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSA 866
Query: 1104 -----TTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKE 1153
++E E+ L +GT G+D+ R F+ G R ++ ++L ++
Sbjct: 867 AIVPFASQEGESFLVVGT----GKDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTR 919
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
++ AL QG LL G + ++ +L A D P +VSL N I++GD
Sbjct: 920 VEEPPLALIPFQGRLLAGVGKTLRIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGD 979
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+ + I ++ +K +G +L A D +L+ + T ++D S+ D+ NI I
Sbjct: 980 LQQGITYVVYKAEGNRLIPFADD--TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPE 1035
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFL-----RL--------QMLATSSDRTGAAPGSDK 1319
++S+ +E H+ H +L RL Q L TS +T G
Sbjct: 1036 RVSQETDEPG----SEIHL-MHARNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD 1090
Query: 1320 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
LL+ L G++G P +++ F Q+L+ + P +AG + +R +++
Sbjct: 1091 ----VLLWSGLQGTVGVFIPFVSREDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTP 1144
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
K ++D +L + +LP +++ IA + + +I ++D+ ++F
Sbjct: 1145 VKG-------VIDGDLCERFSLLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1192
>gi|302831461|ref|XP_002947296.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
gi|300267703|gb|EFJ51886.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
Length = 1221
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 135/344 (39%), Gaps = 60/344 (17%)
Query: 1109 ETLLAIGTA----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL--KGAISALA 1162
E LL +G A Y+ + AA RV R AD + L E+ K + G AL
Sbjct: 906 EKLLVVGCAKGLRYMPTDCEAAYIRVY-----RLADGGKRL--ELVHKTIVDGGVPGALC 958
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+G LL GP + L+ +L Y+ P ++++ + I +GD +S++ +
Sbjct: 959 GFKGRLLAGVGPTLRLYDMGKKKLLRKCEYNRLPHQIMNITVQGPRIYVGDAQESVHMMR 1018
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG-- 1280
+K+ + A D T +D TL+ D+ N + + S+ +
Sbjct: 1019 YKKADNAFYIFADDIAP-RYLTTILPLDYDTLA--AGDKFGNFVVLRLPREASQQVEDDP 1075
Query: 1281 ----------------QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
KL +FHVG +T R +M A +
Sbjct: 1076 TGGKMAAASGRLNGAPHKLEEVVKFHVGDTITSLQRAEMQAGGQE--------------V 1121
Query: 1325 LLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
LL+ T+ G+IG + P +++ F L+ L P + G + SFR A+
Sbjct: 1122 LLYSTVMGAIGVLYPFTNREDVDF--FGHLEMHLRQEHPPLCGRDHLSFR------SAYF 1173
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
P S VD +L Y +P ++Q IA T ++L L D+
Sbjct: 1174 P-VRSCVDGDLCGQYASIPAKKQQMIAEAMDRTPGEMLKKLEDI 1216
Score = 41.6 bits (96), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 71/353 (20%), Positives = 130/353 (36%), Gaps = 76/353 (21%)
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGAN 327
A ++ KH T L ++ L++ W+ + + A L+AVP GGVLV N
Sbjct: 199 AASMAQKHLTFYEMDLGLNNVLRK----WTE-PIDNGANLLVAVPGGADGPGGVLVCAEN 253
Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
I Y +Q A+ L + + S++ A++ +L + ++ GD+
Sbjct: 254 FIIYKNQDHEEVRAVIPRRSDLPGDRGVLIVSYATHKKKAYSFFL------VQSEYGDIY 307
Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
+T+ Y+G V L + + + I + F S G+ L QF
Sbjct: 308 KVTLAYEGEAVTELKIKYFDTIPPCTSIAVLKTGFLFAASEYGNHALYQFV--------- 358
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
G E+ D+E SSS AL G + + + + + D
Sbjct: 359 -GTGEDDEDVE-----------SSSAALVQTEEGFQPVFF--------EPRPLKNLLLID 398
Query: 508 SLVNIGPLKDFS-----------------YGLRINADASATGISKQSNYELVELPGCK-G 549
+ ++ P+ D +G R + G++ + + LPG
Sbjct: 399 EMASLMPITDMKVANLLNEEIPQIYALCGHGPRASLSVLRPGLAV-TELAVSPLPGAPTA 457
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
+WTV ++ DE+ A++++S T+V + + E ES
Sbjct: 458 VWTVRRNAT--------------DEFDAFIVVSFANATLVFSIGEEVKETNES 496
>gi|443918546|gb|ELU38987.1| CPSF A subunit region domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 1037
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 79/348 (22%), Positives = 146/348 (41%), Gaps = 53/348 (15%)
Query: 1097 VVTLFNTTTKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1155
+VT T + + GTA + GE+ GR++LF + +N++ SK+++
Sbjct: 704 MVTSIGVYTHGGNSYILAGTAIINPGENEPLAGRIILF-----GQDEENMIKFKASKDVE 758
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTEL---NGIAFYDAPPLYVVSLNIVK-NFILL 1211
G +S++ L ++ A G I L+ E+ + +A ++ Y+V IV+ N I++
Sbjct: 759 GGVSSIKQLGARIIAAIGHGIYLYNLGRGEVTISDPVARWERG--YIVHDIIVRPNMIVV 816
Query: 1212 GDIHKSIYFLSWKEQGA-----------------QLNLLAKDFGSLDCFATEFLIDGSTL 1254
D +S+ L + E+ + Q +A D ++ + E L D T
Sbjct: 817 SDRLRSVSVLRFIERTSTPESHEEIETEEDSTILQFETVAMDMHAVWPTSVEVLPDNKT- 875
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
++ S NI ++ + L RA FH G + KF+ S+ ++ A
Sbjct: 876 -IIASQTDGNI--------LTWELEDGNLEPRAAFHTGEIIHKFI------ASTAKSSAG 920
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
P R +F T G IG ++ +D+ +L L+ KL D++ + + +R
Sbjct: 921 P------RTVAIFVTNTGRIGTLSTVDDADALQLTRLEMKLGDAIKGLGNIKHPEWRAPK 974
Query: 1375 SNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
+P P + D + + + L EE I +G+ I SN
Sbjct: 975 LLHTGTKPPPRRGVTDGDFIKKFLELSSEEAKRIL-SSGSAAETIGSN 1021
>gi|389740093|gb|EIM81285.1| hypothetical protein STEHIDRAFT_86633 [Stereum hirsutum FP-91666 SS1]
Length = 1213
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 122/539 (22%), Positives = 218/539 (40%), Gaps = 75/539 (13%)
Query: 923 HQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
HQ L+ S W + L P + D A++ + C G I +T +L+I
Sbjct: 717 HQPAILALSSRTWLNYTHQNLMHFTPLIFDNLDFAWSFSAEL-CPEGLIGITGS-VLRIF 774
Query: 982 QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI-VSVPVLKPLNQVLSLLIDQE 1040
Q+P T Q IPL TP + + + +I V P + +
Sbjct: 775 QMPKLGTKLK----QDSIPLSYTPRKFASHPSNSYFYIIEADHRVTGPDATAKKVAELRS 830
Query: 1041 VGHQIDNHNL---SSVDLHRTYTVEEYE--VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
G +D+ L + V + +RI++P ++ A IP+ ++E A ++
Sbjct: 831 QGKMVDDEMLELPAEVFGRSKAPAGTWASCIRIVDPAEG----KSVAEIPIDNNEAAFSL 886
Query: 1096 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF-STGRNADNPQNLVTEVYSKEL 1154
+V + + NE L +GTA Q +A R F T + D+ L ++ E
Sbjct: 887 AIVPF---SVRNNEYHLVVGTA--QDTFLAPRSCTSGFLRTYKFVDDGAGLEL-LHKTET 940
Query: 1155 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
+L + QG L+ G + ++ +L A ++SLN + I++GD+
Sbjct: 941 DDIPMSLLAFQGRLVAGIGKALRIYDIGKKKLLRKAESKTFASAIISLNTQGSRIIVGDM 1000
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+SI + +K +L + A D + + C ++D +T++ D NI I
Sbjct: 1001 QESIAYAVYKAPENKLLVFADDTQARWVTC---STMVDYTTVA--AGDRFGNIFINRLDS 1055
Query: 1273 KMSESW------------KG------QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
K+S+ KG K A FHVG VT ++ ++A
Sbjct: 1056 KVSDQVDDDPTGAGILHEKGILMGAPHKTAMLAHFHVGDLVTSIHKVSLVA--------- 1106
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
R LL+ L G+IG + PL +++ F + +L++ + + G + ++R
Sbjct: 1107 -----GGREVLLYTGLHGTIGMLVPLVSKEDVDF--ISTLEQHIRTEQTSLVGRDHLAWR 1159
Query: 1372 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
++ KA +VD +L + LP +Q IA + T S++L L+ L + S
Sbjct: 1160 GYYVPVKA-------VVDGDLCETFARLPAAKQSMIAGELDRTVSEVLKKLDQLRVTAS 1211
>gi|402077250|gb|EJT72599.1| pre-mRNA-splicing factor RSE1 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1216
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 111/505 (21%), Positives = 214/505 (42%), Gaps = 86/505 (17%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQI----------TYFAE 1013
C G + V Q L+IC + G D+ +QK I L TP ++ T AE
Sbjct: 761 QCEEGMVGVQGQ-FLRICAI--GKLGDSM--IQKSISLAYTPKKLIKNPTHPIFYTIEAE 815
Query: 1014 KNLYPLIVSVPVLKP---LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1070
N P + +L +N +L +E G+ N +S + +++
Sbjct: 816 NNTLPPELREQLLAAPTAVNGDTKVLPPEEFGYPRGNGRWASC------------ISVVD 863
Query: 1071 PDRAGGPWQTRA--TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG 1128
P G + I ++++E ++V VV ++++E+ L +GT G+D+
Sbjct: 864 PLGDGEEREPSVLQQIHLENNEATVSVAVVPF---ASQDSESFLVVGT----GKDMVLNP 916
Query: 1129 RVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
R F+ G R ++ + L ++ +++ AL + QG L+ G + ++
Sbjct: 917 RC--FTEGYIHVYRFLEDGREL-EFIHKTKVEEPPMALLAFQGKLVAGVGRSLRIYDLGL 973
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L A + P +VSL + I++GD + ++++K++ +L A D S+ +
Sbjct: 974 RQLLRKAQSEVAPRVIVSLQTQGSRIVVGDSQHGLIYVAYKQEANKLIAFADD--SIQRW 1031
Query: 1244 AT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--- 1299
T ++D S D+ NI I K S+ +E H+ H +L
Sbjct: 1032 TTCSTMVDYE--STAGGDKFGNIWILRCPEKASQEADQPG----SEVHL-MHARDYLHGT 1084
Query: 1300 --RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFR 1346
RL Q +ATS +T G + LL+G + G+IG + P ++ F
Sbjct: 1085 SNRLALMAHVYTQDIATSICKTNLVVGGQE----VLLWGGIQGTIGVLIPFVSREDADF- 1139
Query: 1347 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
Q+L++ + P +AG + +R ++ K ++D +L + MLP +++
Sbjct: 1140 -FQTLEQHMRSEDPPLAGRDHLMYRSYYVPVKG-------VIDGDLCERFTMLPNDKKQM 1191
Query: 1407 IAHQTGTTRSQILSNLNDLALGTSF 1431
IA + + +I ++D+ ++F
Sbjct: 1192 IAGELDRSVREIERKISDIRTRSAF 1216
Score = 43.5 bits (101), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 60/279 (21%), Positives = 110/279 (39%), Gaps = 68/279 (24%)
Query: 378 LLSTKTGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
LL T+ GDL +T+ +G VQRL + + ++S++ + + F+ S G
Sbjct: 310 LLQTEDGDLFKVTIDMLEDAEGNTTGEVQRLKIKYFDTIPVSSNLCILKSGFLFVASEFG 369
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+ QF E+ GD + L SS + D E + +
Sbjct: 370 NHHFYQF--------------EKLGD------DDEELEFSSENFPSDPAEPYEPAYF--- 406
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV---- 542
+ T + A+ +S+ ++ PL D + DA + +G +S + ++
Sbjct: 407 -----YPRPTENLALVESVESMNPLMDLKVANLTDEDAPQIYTVSGNGARSTFRMLKHGL 461
Query: 543 --------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
+LPG +WT A DD+Y +Y+++S T+VL
Sbjct: 462 EVNEIVASQLPGTPSAVWTT--------------KIARDDQYDSYIVLSFTNGTLVLSIG 507
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ + EV+++ F+ + A G ++QV RG R
Sbjct: 508 ETVEEVSDT--GFLSSVSTLAVQQLGEDGLVQVHPRGIR 544
>gi|189044515|sp|Q7RYR4.2|RSE1_NEUCR RecName: Full=Pre-mRNA-splicing factor rse-1
Length = 1209
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 106/475 (22%), Positives = 192/475 (40%), Gaps = 89/475 (18%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+QK IPL TP + E+ + I S P LL Q G
Sbjct: 786 IQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG------------ 833
Query: 1055 LHRTYTVEEYEVRILEPD-----RAGGPWQTRATIPMQSSENALTVRVVTLFNT------ 1103
+ +L P+ RA G W + +I SE ++ + L N
Sbjct: 834 ----------DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSA 883
Query: 1104 -----TTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKE 1153
++E E+ L +GT G+D+ R F+ G R ++ ++L ++
Sbjct: 884 AIVPFASQEGESFLVVGT----GKDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTR 936
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
++ AL QG LL G + ++ +L A D P +VSL N I++GD
Sbjct: 937 VEEPPLALIPFQGRLLAGVGKTLRIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGD 996
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+ + I ++ +K +G +L A D +L+ + T ++D S+ D+ NI I
Sbjct: 997 LQQGITYVVYKAEGNRLIPFADD--TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPE 1052
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFL-----RL--------QMLATSSDRTGAAPGSDK 1319
++S+ +E H+ H +L RL Q L TS +T G
Sbjct: 1053 RVSQETDEPG----SEIHL-MHARNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD 1107
Query: 1320 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
LL+ L G++G P +++ F Q+L+ + P +AG + +R +++
Sbjct: 1108 ----VLLWSGLQGTVGVFIPFVSREDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTP 1161
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
K ++D +L + +LP +++ IA + + +I ++D+ ++F
Sbjct: 1162 VKG-------VIDGDLCERFSLLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209
>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
Length = 1116
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 126/309 (40%), Gaps = 30/309 (9%)
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
+E + +GT Y D GR+L+F + D+ L+ E ++G+I + +
Sbjct: 810 EEQSEYIVVGTTY-HCHDRKECGRILVF---KMIDSRLILLDET---TVRGSIFCMIAFN 862
Query: 1166 GHLLIASGPKIILHKWTGT----ELNGIAFY--DAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL+A + + W+G +L G Y LY+ +F+L+GD+ KS+
Sbjct: 863 GQLLVAINKSVHRYTWSGDSSSGKLTGEEIYGGHTASLYLAGRG---DFVLVGDMMKSMA 919
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L + G + L++ F+ D + L SD N+ + + +
Sbjct: 920 LL--QASGKDVKELSRSSQPFWLTGLTFIDDDTYLG---SDNSYNLILMKKNTETANEVD 974
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L + H G + +F LAT +D P S ++F T+ G IG I+
Sbjct: 975 SQLLDNIGHIHTGEFINRF-HHGTLATLTDVDSPKPNS-------IIFATISGCIGVIST 1026
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+ + + LQ L + + G + +R F N + +D +L+ + L
Sbjct: 1027 ISKQDYDFFSKLQVGLNRVIRGIGGFSHDRWRSFQ-NEHISNIESRNFIDGDLVEQFLHL 1085
Query: 1400 PLEEQLEIA 1408
++ LE+
Sbjct: 1086 RHDKMLEVT 1094
Score = 42.7 bits (99), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 41/196 (20%)
Query: 504 AVRDSLVNIGPLKDF----------------SYGLR---INADASATGISKQSNYELVEL 544
V D+ N+GP+ DF S G + + + GI++Q++ ++L
Sbjct: 335 TVLDTFANLGPIPDFCLVDIEKQGQNQIVACSGGFKEGSLRVIRNGIGITEQAS---IDL 391
Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
PG K IW++ S R YLI+S + T VLE E TE
Sbjct: 392 PGIKAIWSLARGSDR------------------YLILSFISSTKVLEFQGEDIEETEIAG 433
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
+ +Q T+ GN+ ++++Q+ G ++D + PS+ S + +
Sbjct: 434 FDLQSPTLYCGNV-ADKQILQISTSGIYLVDHETNLNYDVWKPSSGSINLASHQGNQILI 492
Query: 665 SIADPYVLLGMSDGSI 680
S + + D I
Sbjct: 493 SFGKTLIYFEIKDQKI 508
>gi|310793065|gb|EFQ28526.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1212
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 107/507 (21%), Positives = 200/507 (39%), Gaps = 94/507 (18%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + + Q + G T QK IPL TP ++ E YP+ ++
Sbjct: 761 QCEEGVVGIQGQSLRIFAIENLGDTI-----TQKSIPLSYTPRRLLKHPE---YPMFYTI 812
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE----EYEVRILEPDRAGGP-- 1077
+ DN+ L DL E + RIL PD G P
Sbjct: 813 --------------------EADNNTLPP-DLRAKLIAEPGVVNGDARILPPDEFGYPKG 851
Query: 1078 ---W--------------QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ 1120
W + TI + ++E A++ +V+ ++++E+ L +GT
Sbjct: 852 KGRWASCISVIDPLAEDQRVLQTIDLDNNEAAVSAAIVSF---ASQDSESFLIVGT---- 904
Query: 1121 GEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1175
G+D+ R FS G R ++ L ++ +++ SAL QG LL G
Sbjct: 905 GKDMVVNPR--QFSEGYIHVYRFGEDGHEL-EFIHKTKVEEPPSALLGFQGRLLAGIGKT 961
Query: 1176 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1235
+ ++ ++ A D P +VSL+ + I++GD+ I ++ +K +L
Sbjct: 962 LRIYDLGLRQMLRKAQADVTPQLIVSLSTQGSRIIVGDVQHGITYVVYKPTTNKLIPFVD 1021
Query: 1236 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFH 1290
D S T ++D S+ D+ N+ + + K ++ + + +R H
Sbjct: 1022 DTVSRWVTCTT-MVDYE--SVAGGDKFGNMFLVRCSEKATQEADDESGGLHLINTRDYLH 1078
Query: 1291 VGAHVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELT 1344
H L Q + TS +T G LL+ ++G+IG P +++
Sbjct: 1079 GTPHRLSLLAHSYTQDVPTSITKTSLVVGGQD----VLLWSGINGTIGVFIPFVTREDVD 1134
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
F Q+L++ + +AG + +R ++ K ++D +L Y +LP E++
Sbjct: 1135 F--FQNLEQHMRTEDAPLAGRDHLMYRGYYVPVKG-------VIDGDLCERYTLLPSEKK 1185
Query: 1405 LEIAHQTGTTRSQILSNLNDLALGTSF 1431
IA + + +I ++D+ ++F
Sbjct: 1186 QMIAGELDRSVREIERKISDIRTRSAF 1212
Score = 44.7 bits (104), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 129/570 (22%), Positives = 212/570 (37%), Gaps = 131/570 (22%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ + G + S+A G++ +D +ILA + +I+++E+ I + + F+
Sbjct: 62 HDIFGIIRSMAAFRLAGSN----KDYLILATDSGRITIIEY--------IPAQNRFQR-- 107
Query: 167 WLHL----KRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDT 219
LHL K G G + DP+GR V L ++ + SQ E T
Sbjct: 108 -LHLETFGKSGVRRVIPGEYLACDPKGRACLIASVEKNKLVYVLNRNSQA-------ELT 159
Query: 220 FGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWA-----GRV 274
S A V+++ LD+ GY PV L E E T A G
Sbjct: 160 ISSP--LEAHKPGVLVLSMVALDV----------GYANPVFAAL-EIEYTEADQDPTGEA 206
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTI 329
+ + T ++ + L WS P D L P G GVLV G I
Sbjct: 207 AREAETQLV-YYELDLGLNHVVRKWSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENI 262
Query: 330 HY-HSQSASCALAL--NNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
Y HS + + + A S + S +L + + L+ T+ GDL
Sbjct: 263 TYRHSNQEAFRVPIPRRRGATEDPSRKRHVVSGVMHKLKGSAGAFF----FLIQTEDGDL 318
Query: 387 VLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
T+ DG V+RL + + ++S + + + + S+ G+ QF
Sbjct: 319 FKATIDMVEDADGNPTGEVKRLKIKYFDTIPVSSSLCILKSGFLYAASQFGNHQFYQF-- 376
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
E+ GD + L SS D D G + + +
Sbjct: 377 ------------EKLGD------DDEELEFSSDDFPTDPKAGYDAVYF--------HPRP 410
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV------------E 543
+ A+ +S+ ++ PL D DA +A G +S + ++ E
Sbjct: 411 LENLALVESIDSMNPLLDCKVANLTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASE 470
Query: 544 LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
LPG +WT+ K +RG D+Y AY+++S T+VL + + EV++S
Sbjct: 471 LPGIPSAVWTL--KLNRG------------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS 516
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
F+ A L G +IQV +G R
Sbjct: 517 --GFLTSVPTLAAQLLGEDGLIQVHPKGIR 544
>gi|392593521|gb|EIW82846.1| hypothetical protein CONPUDRAFT_81012 [Coniophora puteana RWD-64-598
SS2]
Length = 1213
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 108/506 (21%), Positives = 204/506 (40%), Gaps = 99/506 (19%)
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK 1027
G I +T +L+I Q+P Q +PL TP + V
Sbjct: 762 GLIGITG-SVLRIFQIPKLGMKLK----QDAVPLSYTPRKF----------------VSH 800
Query: 1028 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILE--------PDRAGGPW 1078
P NQ L L+ + H++ + + + L T ++ + IL+ P G W
Sbjct: 801 PANQYLYLI---QGDHRVMSESAAEKKLQEMRTKGQKVDEEILQLPVEVFGRPKAPAGTW 857
Query: 1079 ------------QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAA 1126
+T T+ + ++E+A +V VV ++NE L +GTA +
Sbjct: 858 ASAICIIDPIEARTIHTVELDNNESAFSVAVVPF---AARDNELHLVVGTA--ADTLLTP 912
Query: 1127 RG-RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
R R T R D ++L ++ E A+ + QG L+ G + L++ +
Sbjct: 913 RSCRSGYLRTYRFTDEGRSLEL-LHKTETDDVPLAVMAFQGRLIAGVGKSLRLYEIGKKK 971
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
L A + +V+LN + I++GD+ +S++F ++K +L + A D A
Sbjct: 972 LLRKAENKSFASAIVTLNTQGSRIIVGDMQESVHFAAYKAPENRLLIFADDMQPRWVTAL 1031
Query: 1246 EFLIDGSTLSL------------------VVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
++D +T+++ V D+ I + ++S + KLL
Sbjct: 1032 T-MVDYTTIAVGDRFGNVFINRLDMRVSDQVDDDPTGAGILHEKGQLSGAPHKTKLL--C 1088
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELT 1344
FHVG +T ++ ++A R LL+ + G+IG + P +++
Sbjct: 1089 HFHVGDLITSIHKVSLVA--------------GGREVLLYTGIHGTIGILVPFVSKEDVD 1134
Query: 1345 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1404
F + +L++ + + G + S+R +++ KA +VD +L + L +Q
Sbjct: 1135 F--ISTLEQHMRSEQSSLVGRDQLSWRGYYTPVKA-------VVDGDLCEAFARLTGSKQ 1185
Query: 1405 LEIAHQTGTTRSQILSNLNDLALGTS 1430
IA + T ++L L L + +S
Sbjct: 1186 SAIAGELDRTVGEVLKKLEQLRVTSS 1211
Score = 46.2 bits (108), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 87/384 (22%), Positives = 146/384 (38%), Gaps = 97/384 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ +D V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIDHDEDEVKSLKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
S TS S G+ E F + + R R + AL D + + L
Sbjct: 368 QKLGDDDDEPEFSSTSFPSFGMAESFIPLPH---AHFRPRGLDNLALADEIESLDPILDA 424
Query: 489 SASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
N N+++ Q F+ R S L+ +GL + S+ ELP
Sbjct: 425 KVMNILPNSDTPQ-IFTACGRGSRSTFRMLR---HGLEVEESVSS------------ELP 468
Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
G +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 469 GIPNAVWTTKRTE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT-G 513
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGA------------RILDGS--------------- 637
+ T+A + G ++QV +G R+ G
Sbjct: 514 FLSSAPTLAVQQI-GSDALLQVHPQGIRHVLSDRRVNEWRVPQGKTIVCATTNKRQVVVA 572
Query: 638 -------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
Y DL G N + STVL++S+ + PY+ +G D ++R+
Sbjct: 573 LSSAELVYFELDLD-GQLNEYQDWKAMGSTVLALSVGEVPEGRQRTPYLAVGCEDQTVRI 631
Query: 683 LVGDPSTC--TVSVQT----PAAI 700
+ DP + T+S+Q P+AI
Sbjct: 632 ISLDPESTLETISLQALTAPPSAI 655
>gi|350290373|gb|EGZ71587.1| Pre-mRNA-splicing factor rse-1 [Neurospora tetrasperma FGSC 2509]
Length = 1209
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 106/475 (22%), Positives = 192/475 (40%), Gaps = 89/475 (18%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+QK IPL TP + E+ + I S P LL Q G
Sbjct: 786 IQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG------------ 833
Query: 1055 LHRTYTVEEYEVRILEPD-----RAGGPWQTRATIPMQSSENALTVRVVTLFNT------ 1103
+ +L P+ RA G W + +I SE ++ + L N
Sbjct: 834 ----------DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSA 883
Query: 1104 -----TTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKE 1153
++E E+ L +GT G+D+ R F+ G R ++ ++L ++
Sbjct: 884 AIVPFASQEGESFLVVGT----GKDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTR 936
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
++ AL QG LL G + ++ +L A D P +VSL N I++GD
Sbjct: 937 VEEPPLALIPFQGRLLAGVGKTLRIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGD 996
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+ + I ++ +K +G +L A D +L+ + T ++D S+ D+ NI I
Sbjct: 997 LQQGITYVVYKAEGNRLIPFADD--TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPE 1052
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFL-----RL--------QMLATSSDRTGAAPGSDK 1319
++S+ +E H+ H +L RL Q L TS +T G
Sbjct: 1053 RVSQETDEPG----SEIHL-MHARNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD 1107
Query: 1320 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
LL+ L G++G P +++ F Q+L+ + P +AG + +R +++
Sbjct: 1108 ----VLLWSGLQGTVGVFIPFVSREDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTP 1161
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
K ++D +L + +LP +++ IA + + +I ++D+ ++F
Sbjct: 1162 VKG-------VIDGDLCERFSLLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209
>gi|440478305|gb|ELQ59147.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae P131]
Length = 1223
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 113/514 (21%), Positives = 206/514 (40%), Gaps = 110/514 (21%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + V Q L+I + DN +QK IPL TP ++ + ++ I
Sbjct: 761 QCEEGMVGVNGQ-FLRIFAIEKLG--DNV--IQKSIPLTYTPRKLAKHPTQRIFYTI--- 812
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSV---DLHRTYTVEEYEVRILEPD-----RAG 1075
+ DN+ L+ L T + R+L PD R
Sbjct: 813 --------------------EADNNTLAPELREQLMAAPTAVNGDARVLPPDEFGYPRGN 852
Query: 1076 GPWQTRAT------------------IPMQSSENALTVRVVTLFNTTTKENETLLAIGTA 1117
G W + + I + ++E AL++ VV+ +++ E+ L +GT
Sbjct: 853 GRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAVVSF---ASQDGESFLVVGT- 908
Query: 1118 YVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
G+D+ R F+ G R +++ + L ++ +++ +AL QG L+
Sbjct: 909 ---GKDMVVNPR--RFTEGYIHVYRFSEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGI 962
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
G + ++ +L A + P +VSLN + I++GD+ + ++++K + +L
Sbjct: 963 GRMLRIYDLGLRQLLRKAQAEVAPQLIVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIP 1022
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1292
A D + T + ST +D+ N+ I K S+ +E H+
Sbjct: 1023 FADDTIARWTTCTTMVDYDSTAG---ADKFGNLWILRCPEKASQESDEPG----SEVHL- 1074
Query: 1293 AHVTKFL-----RLQMLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
H +L RL ++A TS +T G + LL+G G+IG + P
Sbjct: 1075 VHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQE----VLLWGGFQGTIGVLIP 1130
Query: 1340 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
++ F QSL++ L P +AG + +R + K ++D +L Y
Sbjct: 1131 FVSREDADF--FQSLEQHLRSEDPPLAGRDHLMYRGCYVPVKG-------VIDGDLCERY 1181
Query: 1397 EMLPLEEQLEIAHQTGTT----RSQILSNLNDLA 1426
MLP +++ IA + + +I +N D+A
Sbjct: 1182 TMLPNDKKQMIAGELDRSVREIERKISTNFVDIA 1215
>gi|71004436|ref|XP_756884.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
gi|74704394|sp|Q4PGM6.1|RSE1_USTMA RecName: Full=Pre-mRNA-splicing factor RSE1
gi|46095609|gb|EAK80842.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
Length = 1221
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 132/580 (22%), Positives = 228/580 (39%), Gaps = 81/580 (13%)
Query: 887 LRNLRFSRTPLDAYTREETPH-----GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
L+N RT LDA T + T G+ R+ I + G +R +++
Sbjct: 685 LQNGVLLRTILDAVTGQLTDTRTRFLGSKAVRL-IRTKVHGQAAVMALSTRTWLSYTYQD 743
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
RL+ P + D A++ + C G I + L+I +PS ++ Q + L
Sbjct: 744 RLQFVPLIFDVLDHAWSFSAEL-CPEGLIGIVG-STLRIFTIPSLASKLK----QDSVAL 797
Query: 1002 KATPHQIT-YFAEKNLYPLI-VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1059
TP +I + E+ L+ ++ L P Q + +G ++ H +DL+
Sbjct: 798 SYTPRKIANHPNEQGLFYVVEAEHRTLSPGAQRRRT---EMLGKELKPHQRGVLDLN--- 851
Query: 1060 TVEEYEVRILEPDRA-------GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1112
E +R + A G QT + M +E A ++ VV + E E +L
Sbjct: 852 PAEFGAIRAEAGNWASCIRAVDGVQAQTTHRLEMDDNEAAFSIAVVPF---ASAEKEVML 908
Query: 1113 AIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
+G+A DV R +T R DN + L ++ E+ L + QG LL
Sbjct: 909 VVGSAV----DVVLSPRSCKKAYLTTYRLLDNGRELEL-LHKTEVDDIPLVLRAFQGRLL 963
Query: 1170 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
G + ++ +L + P VVSL+ + I++GD+ +SI F S+K +
Sbjct: 964 AGIGKALRIYDLGKKKLLRKCENRSFPTAVVSLDAQGSRIVVGDMQESIIFASYKPLENR 1023
Query: 1230 LNLLAKD----------------FGSLDCFATEFL--IDGSTLSLVVSDEQKNIQIFYYA 1271
L A D + D F ++ +DG+T S V ++ + I +
Sbjct: 1024 LVTFADDVMPKFVTRCTMLDYDTVAAADKFGNIYVLRLDGNT-SRSVDEDPTGMTIVHEK 1082
Query: 1272 PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1331
P + + L+ A F VG +T R M+A R LL+ L
Sbjct: 1083 PVLMGAAHKASLV--AHFFVGDIITSLHRTAMVA--------------GGREVLLYTGLS 1126
Query: 1332 GSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
GSIG + P + + L +L+ L + G + ++R ++ K S++D
Sbjct: 1127 GSIGALVPFVSKEDVDTLSTLESHLRQENNSIVGRDHLAYRSSYAPVK-------SVIDG 1179
Query: 1391 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
+L + +L +Q IA + +I L L G +
Sbjct: 1180 DLCETFGLLSPAKQNAIAGELDRKPGEINKKLAQLREGAT 1219
>gi|432089478|gb|ELK23419.1| DNA damage-binding protein 1 [Myotis davidii]
Length = 1047
Score = 57.0 bits (136), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 75/155 (48%), Gaps = 17/155 (10%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
L++K +A+DF A E L D + L
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFL 966
>gi|328770812|gb|EGF80853.1| hypothetical protein BATDEDRAFT_29900 [Batrachochytrium dendrobatidis
JAM81]
Length = 1213
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 112/543 (20%), Positives = 205/543 (37%), Gaps = 74/543 (13%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
I G G SRP + R ++ P L S+ + + C G + +T +
Sbjct: 715 KIQGETGLLALSSRPWISFTYHSRSKLMP-LSYESLEYGSSFCSEQCTEGIVAITGNTLR 773
Query: 979 KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV-PVLKPLNQVLSLLI 1037
+ GS + Q IPLK TP + + + +I S V P ++ L
Sbjct: 774 ILNTEKLGSVFK-----QASIPLKYTPRRFIFDQVSQNFVVIESDHGVFCPSDRAKIL-- 826
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+++D EE V +AG P + + I + S + T+ +
Sbjct: 827 ----------ETKATLDADEGTIPEELAVEQFGHSKAG-PERWASCIRVISPIHGETLHL 875
Query: 1098 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK----- 1152
L +NE I Q ++ T N + + TE Y +
Sbjct: 876 EDL-----DDNEAAFCISFCIFQSSLTTTH---IVVGTASNVNLSSSSFTEGYLRVYKLA 927
Query: 1153 ------------ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
+KG + S QG LL+ G + ++ ++ P +V
Sbjct: 928 PDGTSLEFLHKTPIKGIPKVMCSFQGRLLVGVGSLLRIYDLGKKKMLRKCECKGFPTTIV 987
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
+L+ N I+LGD +S+++ ++ ++ + A D AT ++D T +V D
Sbjct: 988 TLHTQGNRIILGDAQESVHYAMYRAFDNRIVIFADDTIPRWVTAT-CMVDYDT--VVGGD 1044
Query: 1261 EQKNIQIFYYAPKMS-----ESWKGQKLLSRAEF----HVGAHVTKFLRLQMLATSSDRT 1311
+ NI + + ++S ++ Q + R H H F + L TS +T
Sbjct: 1045 KMGNIFVNRLSAEVSKGIDEDTTGNQAIFDRGYLQGAPHKVHHEADFFLGETL-TSLTKT 1103
Query: 1312 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
PG R LL+ TL G IG + P D++ F Q+L+ + P + G +
Sbjct: 1104 SLVPG----GREILLYTTLMGGIGLLIPFISKDDVDF--FQTLEMTMRSECPPLCGRDHL 1157
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
++R F++ A I+D +L + ++ +++ IA + + + L D+
Sbjct: 1158 AYRSFYTPVHA-------IIDGDLCEMFNVMVGDKKRGIAESVDRSVADVGKKLEDMRTR 1210
Query: 1429 TSF 1431
+F
Sbjct: 1211 VAF 1213
>gi|219110831|ref|XP_002177167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411702|gb|EEC51630.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1303
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 126/289 (43%), Gaps = 47/289 (16%)
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
G + +LA QG LL+ G + L++ +L + P +V ++ V +GD+
Sbjct: 1036 GPVLSLAHFQGRLLVGIGTTLRLYEMGKRQLLRKSELRNFPTFVKTVQTVGERAYIGDMM 1095
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+SI + + +L L+A D E L+D +T++ V D+ NI + P+ +
Sbjct: 1096 QSIQIVRYDVSANRLVLIANDASPRPIVCQE-LLDWNTVA--VGDKFGNISVMRL-PRGA 1151
Query: 1276 ES----WKGQKLL---SR----------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
++ GQ+ L SR +++VG VT R ++A ++
Sbjct: 1152 DTSAIDVTGQRALWDSSREDMIPKLELLCQYYVGEVVTSMTRSSLVAGGAE--------- 1202
Query: 1319 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1375
+L++ T+ G IG P +++ F L+ +L G +P+S+R +++
Sbjct: 1203 -----SLIYVTVSGRIGAFVPFTNRNDVDF--YSQLESELRGDASRPTGRDPQSYRSYYA 1255
Query: 1376 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
H +VD +L + L E+Q +IA + T +I+ L D
Sbjct: 1256 P-MMH------VVDGDLCDAFNSLGPEKQNKIAEKLDRTVGEIMKKLED 1297
>gi|398019848|ref|XP_003863088.1| CPSF-domain protein, putative [Leishmania donovani]
gi|322501319|emb|CBZ36398.1| CPSF-domain protein, putative [Leishmania donovani]
Length = 1347
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 123/308 (39%), Gaps = 37/308 (12%)
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1161
+ +E + LL IG+++ ++ AR + + T R Q L + SK++ GA+
Sbjct: 913 GVSEEEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRL-HLIASKDIGGALQCC 971
Query: 1162 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1209
A++ +G + + + L++W + G+ PLY SL + +
Sbjct: 972 AAVPHYKGRIALGVNGCVCLYQWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVL 1029
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
+ D+ S +F+ L +L +D D L L D+ N
Sbjct: 1030 VALDVRHSAFFIEVDTLQGNLKVLCRDADLRGIMDGHVGSDAENLCLF--DDSLNFTALK 1087
Query: 1270 YAPKMSESWKGQ-----------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
P E+ G + RA+ H+G VT +R A +S A S
Sbjct: 1088 VVPLPVEARDGDAAAAARATAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAHCSS 1146
Query: 1319 KTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1370
N+ L+F T G G + P+ T+ L++L+ LV +VP V GL+ ++F
Sbjct: 1147 AQNQLLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAF 1206
Query: 1371 RQFHSNGK 1378
R+ G+
Sbjct: 1207 REVLRAGQ 1214
>gi|301124447|ref|XP_002909707.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106897|gb|EEY64949.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 328
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 109/281 (38%), Gaps = 75/281 (26%)
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGS-------------------IVAFTVLHNVNC 965
G F G+ P W + R P S +++FT H+ +C
Sbjct: 3 GAFFRGAHPMWILGDRGHASFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSC 62
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWP-----VQKVIPLKATPHQITYFA-------- 1012
+GFIY S+G L++C+LPS T P V + AT H + Y
Sbjct: 63 PNGFIYFHSRGALRVCELPSSKT-STILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVA 121
Query: 1013 ---EKNLYPLIVSVPVLKPLNQVLSLL------------IDQEVGHQIDNHNLSSVDL-- 1055
E Y ++ S LKP + + +D N + ++
Sbjct: 122 EALEAPTYAVVCSA-RLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFA 180
Query: 1056 -----HRTYTVEE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT--- 1104
H +T E+ YE+R+++ D G W R + + E L+V+++ L++++
Sbjct: 181 DYETDHMAHTEEDVYELRLVQTDEF-GEWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMK 239
Query: 1105 ----------TKENETLLAIGTAYV--QGEDVAARGRVLLF 1133
K+ L +GT +V GED + RGR+LL+
Sbjct: 240 EEVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLY 280
>gi|146094112|ref|XP_001467167.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
gi|134071531|emb|CAM70220.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
Length = 1347
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 70/304 (23%), Positives = 122/304 (40%), Gaps = 37/304 (12%)
Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL- 1164
+E + LL IG+++ ++ AR + + T R Q L + SK++ GA+ A++
Sbjct: 917 EEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRL-HLIASKDIGGALQCCAAVP 975
Query: 1165 --QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFILLGD 1213
+G + + + L++W + G+ PLY SL + ++ D
Sbjct: 976 HYKGRIALGVNGCVCLYQWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVLVALD 1033
Query: 1214 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ S +F+ L +L +D D L L D+ N P
Sbjct: 1034 VRHSAFFIEVDTLQGNLKVLCRDADLRGIMDGHVGSDAENLCLF--DDSLNFTALKVVPL 1091
Query: 1274 MSESWKGQ-----------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1322
E+ G + RA+ H+G VT +R A +S A S N+
Sbjct: 1092 PVEARDGDAAAAARATAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAHCSSAQNQ 1150
Query: 1323 F--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
L+F T G G + P+ T+ L++L+ LV +VP V GL+ ++FR+
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAFREVL 1210
Query: 1375 SNGK 1378
G+
Sbjct: 1211 RAGQ 1214
>gi|440473070|gb|ELQ41892.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae Y34]
Length = 1229
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 110/502 (21%), Positives = 202/502 (40%), Gaps = 106/502 (21%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + V Q L+I + DN +QK IPL TP ++ + ++ I
Sbjct: 761 QCEEGMVGVNGQ-FLRIFAIEKLG--DNV--IQKSIPLTYTPRKLAKHPTQRIFYTI--- 812
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSV---DLHRTYTVEEYEVRILEPD-----RAG 1075
+ DN+ L+ L T + R+L PD R
Sbjct: 813 --------------------EADNNTLAPELREQLMAAPTAVNGDARVLPPDEFGYPRGN 852
Query: 1076 GPWQTRAT------------------IPMQSSENALTVRVVTLFNTTTKENETLLAIGTA 1117
G W + + I + ++E AL++ VV+ +++ E+ L +GT
Sbjct: 853 GRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAVVSF---ASQDGESFLVVGT- 908
Query: 1118 YVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
G+D+ R F+ G R +++ + L ++ +++ +AL QG L+
Sbjct: 909 ---GKDMVVNPR--RFTEGYIHVYRFSEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGI 962
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
G + ++ +L A + P +VSLN + I++GD+ + ++++K + +L
Sbjct: 963 GRMLRIYDLGLRQLLRKAQAEVAPQLIVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIP 1022
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1292
A D + T + ST +D+ N+ I K S+ + +E H+
Sbjct: 1023 FADDTIARWTTCTTMVDYDSTAG---ADKFGNLWILRCPEKASQ----ESDEPGSEVHL- 1074
Query: 1293 AHVTKFL-----RLQMLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
H +L RL ++A TS +T G + LL+G G+IG + P
Sbjct: 1075 VHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQE----VLLWGGFQGTIGVLIP 1130
Query: 1340 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
++ F QSL++ L P +AG + +R + K ++D +L Y
Sbjct: 1131 FVSREDADF--FQSLEQHLRSEDPPLAGRDHLMYRGCYVPVKG-------VIDGDLCERY 1181
Query: 1397 EMLPLEEQLEIAHQTGTTRSQI 1418
MLP +++ IA + + +I
Sbjct: 1182 TMLPNDKKQMIAGELDRSVREI 1203
>gi|336257679|ref|XP_003343663.1| hypothetical protein SMAC_08834 [Sordaria macrospora k-hell]
gi|380091896|emb|CCC10625.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1209
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 102/459 (22%), Positives = 190/459 (41%), Gaps = 57/459 (12%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+QK IPL TP ++ E+ + I S P LL Q G D L D
Sbjct: 786 IQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG---DATILPPED 842
Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1114
+ I D + I + ++E A++ +V ++E E+ L +
Sbjct: 843 FGYPKAKGRWASCISIIDPISDEPRVLQRIDLDNNEAAVSAAIVPF---ASQEGESFLVV 899
Query: 1115 GTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
GT G+D+ R F+ G R ++ ++L ++ ++ AL QG LL
Sbjct: 900 GT----GKDMVLNPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPMALIPFQGRLL 952
Query: 1170 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
G + ++ +L A D P +VSL N I++GD+ + + ++ +K +G +
Sbjct: 953 AGVGKTLRIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGVTYVVYKAEGNR 1012
Query: 1230 LNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
L D +L+ + T ++D S+ D+ NI I ++S+ +E
Sbjct: 1013 LIPFVDD--TLNRWTTCTTMVDYE--SVASGDKFGNISIVRCPERVSQDTDEPG----SE 1064
Query: 1289 FHVGAHVTKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
H+ H +L RL Q L TS +T G LL+ L G++G
Sbjct: 1065 IHL-MHARNYLHGTPNRLSLQVHFFTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVG 1119
Query: 1336 CIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
P +++ F Q+L+ + P +AG + +R +++ K ++D +L
Sbjct: 1120 VFIPFVSREDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDL 1170
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+ +LP +++ IA + + +I ++D+ ++F
Sbjct: 1171 CERFSLLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209
>gi|46125735|ref|XP_387421.1| hypothetical protein FG07245.1 [Gibberella zeae PH-1]
Length = 1208
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 103/475 (21%), Positives = 200/475 (42%), Gaps = 56/475 (11%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
C G + + Q I I +L G T +QK IPL TP ++ ++ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFNIDRL--GETL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1079
+ P LL D V + D+ L D + + +++P G Q
Sbjct: 814 ADNNTLPPELRAQLLADPGVVNG-DSRVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870
Query: 1080 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1136
TI ++++E A++ +V ++++NE+ L IGT G+D+ R FS G
Sbjct: 871 VLQTIDLENNEAAVSAAIVPF---SSQDNESFLVIGT----GKDMVVNPRS--FSEGYLH 921
Query: 1137 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
R + + L ++ +++ AL + QG +L+A G + ++ ++ + +
Sbjct: 922 IYRFLEGGREL-EFIHKTKVEEPPLALLAFQGRVLVAVGTSLRIYDLGMRQMLRKSQAEV 980
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
+VSLN + I++GD+ + + ++ +K +L D + T ++D
Sbjct: 981 ATQQIVSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE-- 1037
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1306
S+ D+ N+ I K SE ++ + +R H H + Q + T
Sbjct: 1038 SVAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRVSLMCHFYTQDIPT 1097
Query: 1307 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1363
S + G + LL+ + G+IG P ++ F Q+L++ L P +A
Sbjct: 1098 SITKASLVVGGQE----VLLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
G + +R +++ K ++D +L Y +LP +++ IA + + +I
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREI 1199
>gi|393243160|gb|EJD50676.1| hypothetical protein AURDEDRAFT_112250 [Auricularia delicata
TFB-10046 SS5]
Length = 1140
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 124/281 (44%), Gaps = 33/281 (11%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQN 1144
MQ +N V +L E + +GTAY++ ++ +RGR+L+F + ++ +
Sbjct: 797 MQLDDNEEITSVASLPIMPESRTE-MFVVGTAYIKDSEMEPSRGRILVFGSLEDSGTGGS 855
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL------- 1197
+T ++ GA+ +L S+ G ++ +IL++ L+ L
Sbjct: 856 WLTAFL--QVTGAVLSLTSVDGLIVAGVNTAVILYELRRNTLSEAERASHLTLRQKKEWN 913
Query: 1198 --YVV-SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
YVV SL + I +GD SI L WK + L+ +A+ FG + A + + G
Sbjct: 914 HNYVVTSLAARGDTIYIGDSVASIAILRWKHE--TLHTIARHFGPIFPLALDVMSSG--- 968
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
S++ ++ N+ F+ ES +KL +H+G V KF+ ++ +A
Sbjct: 969 SVITANIDYNLHTFH-----QESPTDRKLEIDGSYHLGDQVNKFIPGRL---------SA 1014
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1355
P + +F T G IG +A D+ L +L++ +
Sbjct: 1015 PTVGASIVLEQVFVTSLGRIGIVAEADKDASWALSALERNI 1055
>gi|388855100|emb|CCF51231.1| probable splicing factor 3B subunit 3 [Ustilago hordei]
Length = 1221
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 130/582 (22%), Positives = 228/582 (39%), Gaps = 85/582 (14%)
Query: 887 LRNLRFSRTPLDAYTREETPH-----GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
L N RT LDA T + T G+ R+ I + G +R ++
Sbjct: 685 LSNGVLLRTVLDAMTGQLTDTRTRFLGSKAVRL-IRTKVHGQSAVMALSTRTWLSFTYQS 743
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
RL+ P + D A++ + C G I + L+I +PS ++ Q + L
Sbjct: 744 RLQFTPLIFDALDHAWSFSAEL-CPEGLIGIVG-STLRIFTIPSLASKLK----QDSVAL 797
Query: 1002 KATPHQITYFAEKN--LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT- 1058
TP +I + ++ Y + L P Q + + + ++ H +DL
Sbjct: 798 SYTPRKIAHHPDEQGLFYVVEADRRTLSPGAQRRRV---EALEKELKPHQRGVLDLKPAE 854
Query: 1059 YTVEEYE-------VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
+ + E VR+++ G QT I + +E A +V +V + E ++
Sbjct: 855 FGLIRGEAGNWASCVRVVD----GPQSQTTHKIELDDNEAAFSVAIVPF---ASAEKQSF 907
Query: 1112 LAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSK-ELKGAISALASLQGH 1167
L +G+A DV R +T R + + L EV+ K E+ L QG
Sbjct: 908 LVVGSAV----DVVLSPRSFKKAYLTTYRLINGGREL--EVHHKTEIDDIPLVLRPFQGR 961
Query: 1168 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
LL G + ++ +L + P +VSL+ + I++GD+ +SI F S+K
Sbjct: 962 LLAGVGKALRIYDLGKKKLLRKCENKSFPTAIVSLDAQGSRIVVGDMQESIVFTSYKPLE 1021
Query: 1228 AQLNLLAKD----------------FGSLDCFATEFL--IDGSTLSLVVSDEQKNIQIFY 1269
+L A D + D F ++ ID T S V ++ + I +
Sbjct: 1022 NRLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNLYVLRIDADT-SRSVDEDPTGMTIVH 1080
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
P + + LL A + VG + TS +RT PG R L++
Sbjct: 1081 EKPVLMGAAHKATLL--AHYFVGD----------IITSLNRTVMVPG----GREVLMYTG 1124
Query: 1330 LDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
+ G+IG + P + + L +LQ +L + G + ++R ++ K S++
Sbjct: 1125 ISGTIGALVPFVSKEDVDTLSTLQTQLRQENNSLVGRDHLAYRSSYAPVK-------SVI 1177
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
D +L + +L +Q IA + S+I L L G +
Sbjct: 1178 DGDLCETFGLLQPAKQNAIAQELDRKPSEINKKLAQLREGAT 1219
>gi|16197726|emb|CAC94909.1| damaged-DNA recognition protein 1 [Mus musculus]
Length = 994
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 75/155 (48%), Gaps = 17/155 (10%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
L++K +A+DF A E L D + L
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFL 966
>gi|70992271|ref|XP_750984.1| UV-damaged DNA binding protein [Aspergillus fumigatus Af293]
gi|66848617|gb|EAL88946.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
Af293]
gi|159124553|gb|EDP49671.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
A1163]
Length = 1140
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 129/314 (41%), Gaps = 40/314 (12%)
Query: 1114 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTAY+ E D + RGR+L+F DN + L T+V +KGA ALA L ++ A
Sbjct: 834 VGTAYLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGDKIVAAL 888
Query: 1173 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
++++K + L +A Y AP V + + N I + D+ KS+ + +K
Sbjct: 889 VKTVVVYKVINNNFGAMRLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSMCLVEYK 944
Query: 1225 E----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
E + +A+ F ++ + + L SD + N+ + + E
Sbjct: 945 EGENGTPDTMTEVARHFQTVWATGVANIAPDTFLE---SDAEGNLIVLHRNTTGVEEDDK 1001
Query: 1281 QKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1338
++L E +G V + + +Q LA+ P + GT++GSI A
Sbjct: 1002 RRLEVTGEISLGEMVNRIRPVNIQQLAS----VAVTPRA--------FLGTVEGSIYLFA 1049
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
++ L LQ + V V + FR F S + + P VD EL+ +
Sbjct: 1050 IINPDHQDFLMRLQATIAGKVELVGNIPFNEFRGFRSMVREAKE-PYRFVDGELIERFLT 1108
Query: 1399 LPLEEQLEIAHQTG 1412
Q EI G
Sbjct: 1109 CEPSLQEEIVSTVG 1122
>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
Length = 1169
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/279 (24%), Positives = 119/279 (42%), Gaps = 40/279 (14%)
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY-VVSLNIVKNFILLGDI 1214
G + A +G LL + G +I L+ +L A + + + LN V + I GDI
Sbjct: 917 GVVRAFTGYEGRLLASVGKRIRLYALGKKQLLLKAEHRTCSDHGFIWLNAVGSRIFAGDI 976
Query: 1215 HKSIYFLSWK---EQGAQLNLLAKDFGSLDCFATEFLIDGSTL--SLVVSDEQKNIQIFY 1269
+ I L K E+ A+ + G +L + L S V++ ++ + IF
Sbjct: 977 REGIQILRIKFYSEEAAEFEWVGGATGP------RWLTSCAQLDYSTVIAGDKFD-SIFV 1029
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
ES + +L + +FH+G L T+ D+ + + +L+GT
Sbjct: 1030 TRVPQEESTRHIQLENVCQFHLGD----------LPTAMDKAALSQSTH-----VVLYGT 1074
Query: 1330 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
+ GSIG + P DEL F LQ L+ + P + G +R ++ +
Sbjct: 1075 VMGSIGALVPFQSKDELDF--LQHLEMLMATEAPPLCGREHSFYRSYYVPVQ-------Q 1125
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+VD +L + L +Q ++A Q TT + +L L+D+
Sbjct: 1126 VVDGDLCEQFRHLTEAQQRKVAQQLDTTVNNVLRKLDDI 1164
>gi|302654423|ref|XP_003019019.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
gi|291182709|gb|EFE38374.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
Length = 460
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 99/435 (22%), Positives = 182/435 (41%), Gaps = 54/435 (12%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
+Q+ IPL TP E L+ +I S +L P + + L+ + D+ L
Sbjct: 17 LQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNILSPATK--AKLLSESTTVNGDSAELPPE 74
Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTLFNTTTKENET 1110
D + I D P T++ + ++ +E A+++ V+ T++E+ET
Sbjct: 75 DFGYPRGTNHWASSIQVVD----PIHTKSVLSNLELEDNEAAVSIAAVSF---TSQEDET 127
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQ 1165
L +GT G+D+ R F+ G R + + L ++ +++ AL Q
Sbjct: 128 FLVVGT----GKDMVVSPRT--FTCGFIHIYRFQEEGKEL-EFIHKTKVEQPPLALLGFQ 180
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
G LL GP + ++ +L P +V L + I++ D+ +S+ ++ +K
Sbjct: 181 GRLLAGIGPDLRIYDLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKY 240
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW----KGQ 1281
Q L A D T ++D T++ D+ NI + K SE G
Sbjct: 241 QENALIPFADDIIPRWTTCTT-MVDYETVA--GGDKFGNIWLLRCPTKASEEADEDGSGA 297
Query: 1282 KLLSRAEFHVGAH-----VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
L+ ++ GA V F Q + TS +T G R L++ L G++G
Sbjct: 298 HLIHERQYLQGAPNRLSLVIHFYS-QDIPTSIQKTQLVAGG----RDILVWTGLQGTVGM 352
Query: 1337 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
P D++ F Q+L+ +L P +AG + +R +++ K ++D +L
Sbjct: 353 FVPFITRDDVDF--FQTLEMQLASQNPPLAGRDHLIYRGYYAPCKG-------VIDGDLC 403
Query: 1394 SHYEMLPLEEQLEIA 1408
+ +LP +++ IA
Sbjct: 404 ETFLLLPNDKKQAIA 418
>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
Length = 1515
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 111/266 (41%), Gaps = 37/266 (13%)
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD--EDTFGSGGGFSARIESSHVINLRD 240
V VDP C V +Y + I+ + G L D E + ++ RIE +++
Sbjct: 101 VLVDPGANCVVVRLYHGLLRIIPLNGIGEKLTTDSLEVNQYAANTYNVRIEEGNIV---- 156
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
D F+HGY P +++E EL H L+ L
Sbjct: 157 -------DMAFLHGYTLPTFAMIYEDELVL--------HMKTYEISGREPALRNVQLTLD 201
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
++ D+ L+ VP P GGV++VG N I+YH++ ++ Y +SQ L ++
Sbjct: 202 SIE--PDSKLLIPVPKPFGGVILVGDNIIYYHTKDGP---HISQYIPQAKASQVLCYAAV 256
Query: 361 SVEL----DAAHATWLQNDVALLSTKTGDLVLLT----VVYDGRVVQ-RLDLSKTNPSVL 411
+ D A ++ + +A T +G+ +L + V R+ R++L +
Sbjct: 257 DAQRYLLGDMAGRLYMVHLLAEDHTPSGNGLLGSTSSAAVPSARIGSIRIEL--LGETAT 314
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+G LGDS L++
Sbjct: 315 PESIAYVDNGVVFIGCTLGDSQLIRL 340
>gi|358060450|dbj|GAA93855.1| hypothetical protein E5Q_00501 [Mixia osmundae IAM 14324]
Length = 1153
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/303 (21%), Positives = 131/303 (43%), Gaps = 47/303 (15%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
++ EL ++A+ QG ++ G + L+ + L + + P VVSL + +
Sbjct: 875 LHKTELDDVPLSIAAFQGRAVVGVGRALRLYTMGKSRLLRKSENKSFPAAVVSLQVQGSR 934
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FGSLDCFATEFL--ID 1250
I D S+YF+++K +L + A D S D F F+ +D
Sbjct: 935 IYASDAQDSVYFVAYKAADNRLLIFADDTQQRWITCNTVVDYDTVASGDKFGNVFVSRVD 994
Query: 1251 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
+S V ++Q I + P + +LL+ F+VG +T ++ ++A
Sbjct: 995 -KLVSEDVDEDQTGAGILHEKPLFMGAPHRLQLLT--HFNVGDILTCIQKVSLVAG---- 1047
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
R LL+ L G++G + P +++ F +L+ + P + G +
Sbjct: 1048 ----------GREILLYTCLGGTVGMLIPFISKEDVEFS--STLEMHMRAENPSIVGRDH 1095
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
++R ++ KA VD +L + +LP+++Q +IA + + S++L ++ + +
Sbjct: 1096 LAYRGYYVPQKA-------TVDGDLCETFALLPMQKQAQIAGELDRSVSEVLKKIDSMRI 1148
Query: 1428 GTS 1430
+S
Sbjct: 1149 LSS 1151
>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
Length = 986
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 97/222 (43%), Gaps = 43/222 (19%)
Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
E+G +I+ HNL +D H + + + ++E AL+
Sbjct: 778 EIGQEIEVHNLLIIDQHTFEVLHAH--------------------TLMATEYALS----- 812
Query: 1100 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1155
L +T E+ T +GTA++ ++ + GR+LL+ S G+ T+V KE+K
Sbjct: 813 LISTRLGEDPTSYFVVGTAFINPDETEPKMGRILLYHWSEGK--------FTQVAEKEIK 864
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 865 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 921
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
D+ +S+ L +K +A+D+ + E L D + L
Sbjct: 922 DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFL 963
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 31/203 (15%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFSKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSP+ G +++G +I YH N+Y + + +
Sbjct: 208 RQDNVEREAMMVIPVPSPMCGAIIIGQESILYHDG--------NSYVAVVPPIIKQSTIT 259
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV-VQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L + DG + V+ L + +
Sbjct: 260 CYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTLSVKDLKVELLGEISIPEC 313
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 314 ITYLDNGVIYVGSRLGDSQLIKL 336
>gi|302504585|ref|XP_003014251.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
gi|291177819|gb|EFE33611.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
Length = 460
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 99/435 (22%), Positives = 182/435 (41%), Gaps = 54/435 (12%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
+Q+ IPL TP E L+ +I S +L P + + L+ + D+ L
Sbjct: 17 LQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNILSPATK--AKLLSESTTVNGDSAELPPE 74
Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTLFNTTTKENET 1110
D + I D P T++ + ++ +E A+++ V+ T++E+ET
Sbjct: 75 DFGYPRGTNHWASSIQVVD----PIHTKSVLSNLELEDNEAAVSIAAVSF---TSQEDET 127
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQ 1165
L +GT G+D+ R F+ G R + + L ++ +++ AL Q
Sbjct: 128 FLVVGT----GKDMVVSPRT--FTCGFIHIYRFQEEGKEL-EFIHKTKVEQPPLALLGFQ 180
Query: 1166 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
G LL GP + ++ +L P +V L + I++ D+ +S+ ++ +K
Sbjct: 181 GRLLAGIGPDLRIYDLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKY 240
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW----KGQ 1281
Q L A D T ++D T++ D+ NI + K SE G
Sbjct: 241 QENALIPFADDIIPRWTTCTT-MVDYETVA--GGDKFGNIWLLRCPTKASEEADEDGSGA 297
Query: 1282 KLLSRAEFHVGAH-----VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
L+ ++ GA V F Q + TS +T G R L++ L G++G
Sbjct: 298 HLIHERQYLQGAPNRLSLVIHFYS-QDIPTSIQKTQLVAGG----RDILVWTGLQGTVGM 352
Query: 1337 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
P D++ F Q+L+ +L P +AG + +R +++ K ++D +L
Sbjct: 353 FVPFITRDDVDF--FQTLEMQLASQNPPLAGRDHLIYRGYYAPCKG-------VIDGDLC 403
Query: 1394 SHYEMLPLEEQLEIA 1408
+ +LP +++ IA
Sbjct: 404 ETFLLLPNDKKQAIA 418
>gi|300122534|emb|CBK23104.2| unnamed protein product [Blastocystis hominis]
Length = 172
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 65/128 (50%), Gaps = 15/128 (11%)
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
+ ++ +A+FH+ + +T L + + P N + T +G++G +
Sbjct: 26 RNVVRQADFHLASQITSILPISL-----------PDGQCIN----VILTAEGAMGVFLFV 70
Query: 1341 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1400
+ +L SLQK+L++++P A LN +FR++ S+G P ++D ++ Y ML
Sbjct: 71 TGEEYTKLSSLQKRLIEALPQNAALNNFNFRKYMSDGMMKYPRRKGVLDMGVIRKYLMLS 130
Query: 1401 LEEQLEIA 1408
+EQ +IA
Sbjct: 131 TQEQEDIA 138
>gi|393217872|gb|EJD03361.1| hypothetical protein FOMMEDRAFT_108572 [Fomitiporia mediterranea
MF3/22]
Length = 1213
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 114/511 (22%), Positives = 195/511 (38%), Gaps = 103/511 (20%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP 1024
C G I + S +L+I Q+P T Q +PL TP +
Sbjct: 759 CPEGLIGI-SGSVLRIFQIPRLGTKLK----QDSMPLTYTPRKF---------------- 797
Query: 1025 VLKPLNQVLSLLIDQEVGHQI---DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
+ P+NQ ++ E H++ D +L + + EV L P+ G P
Sbjct: 798 IPHPMNQYFYMI---EADHRVMGDDAAKEKLAELRQRGVKYDQEVVDLPPEVFGRPKAPA 854
Query: 1082 AT----IPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQGEDVAA 1126
T I + N TV+VV L N + +E L +GTA +A
Sbjct: 855 GTWGSCIRILDPINKATVKVVHLDNNEAAFSIAIVPFAARNSELFLCVGTA--SSTFLAP 912
Query: 1127 RG------RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK 1180
R R F+ G AD + V+ E AL + QG L G + +++
Sbjct: 913 RSCSSGFIRTYAFTNG-GAD-----LELVHKTEADDVPMALMAFQGRLCAGVGKSLRIYE 966
Query: 1181 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1240
+L +V+LN + I++GD+ +SI + +K +L + A D
Sbjct: 967 IGKKKLLRKVETKTYGSAIVTLNTQGSRIIVGDMQESIVYAVFKPPENRLLIFADD-SQP 1025
Query: 1241 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSR---------- 1286
+ ++D +T++ D+ N+ I K+S E G +L
Sbjct: 1026 RWTTSAVMVDYTTIA--AGDKFGNVFINRLDSKISDQVDEDPTGAGILHEKGLLMGAPHK 1083
Query: 1287 ----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1340
A FHVG VT ++ ++A R LL+ L G+IG + P
Sbjct: 1084 TGMIAHFHVGDIVTSIHKISLVA--------------GGREVLLYTCLHGTIGILVPFVS 1129
Query: 1341 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+++ F + +L++ + + G + ++R ++ KA +VD +L + L
Sbjct: 1130 KEDVDF--ISTLEQHMRSEKLSLVGRDHLAWRGYYVPVKA-------VVDGDLCEQFARL 1180
Query: 1400 PLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
P +Q IA + T ++L L L + S
Sbjct: 1181 PANKQSAIAVELDRTVGEVLKKLEQLRVTAS 1211
>gi|119471789|ref|XP_001258220.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
181]
gi|119406372|gb|EAW16323.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
181]
Length = 1140
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 129/314 (41%), Gaps = 40/314 (12%)
Query: 1114 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTAY+ E D + RGR+L+F DN + L T+V +KGA ALA L ++ A
Sbjct: 834 VGTAYLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGDKIVAAL 888
Query: 1173 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
+++++ + L +A Y AP V + + N I + D+ KS+ + +K
Sbjct: 889 VKTVVVYRVINNNFGAMRLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSMCLVEYK 944
Query: 1225 E----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
E + +A+ F ++ + + L SD + N+ + + E
Sbjct: 945 EGENGTPDTMTEVARHFQTVWATGVANIAPDTFLE---SDAEGNLIVLHRNTTGVEEDDK 1001
Query: 1281 QKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1338
++L E +G V + + +Q LA+ P + GT++GSI A
Sbjct: 1002 RRLEVTGEISLGEMVNRIRPVNIQQLAS----VAVTPRA--------FLGTVEGSIYLFA 1049
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
++ L LQ + V V + FR F S + + P VD EL+ +
Sbjct: 1050 IINPDHQDFLMRLQATIAGKVELVGNMPLNEFRGFRSMVREAKE-PYRFVDGELIERFLT 1108
Query: 1399 LPLEEQLEIAHQTG 1412
Q EI G
Sbjct: 1109 CEPSLQEEIVSTVG 1122
>gi|121699866|ref|XP_001268198.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
1]
gi|119396340|gb|EAW06772.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
1]
Length = 1140
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/317 (24%), Positives = 129/317 (40%), Gaps = 46/317 (14%)
Query: 1114 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA++ E D + RGR+L+F DN + L T+V +KGA ALA L ++ A
Sbjct: 834 VGTAFLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGNRIVAAL 888
Query: 1173 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
++++K + +L +A Y AP V + + N I + D+ KS+ + +K
Sbjct: 889 VKTVVVYKAVSNNFGAMKLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSVCLVEYK 944
Query: 1225 EQ----GAQLNLLAKDFGS-----LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
E L +A+ F + + C A + ++ SD + N+ I
Sbjct: 945 EGEDGLPDTLTEVARHFQTVWATGVACIAQDTFLE--------SDAEGNLIILCRNTTGV 996
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
E ++L E +G V + + + +S P + T++GSI
Sbjct: 997 EEDDKRRLEVTGEISLGEMVNRIRPVNIQQLTS--VAVTPRA--------FLATVEGSIY 1046
Query: 1336 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1395
A ++ L LQ + V V + FR FHS + + P VD EL+
Sbjct: 1047 LFAMINPDHQDFLMRLQATIAGKVELVGNMPFNEFRGFHSMVREAQE-PYRFVDGELIER 1105
Query: 1396 YEMLPLEEQLEIAHQTG 1412
+ Q EI G
Sbjct: 1106 FLACEPSVQEEIVSIVG 1122
Score = 41.6 bits (96), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+L++G +I Y V D+++ + R LD A
Sbjct: 248 LIPVPAPLGGLLILGETSIKY---------------VDADNNEIISRP-----LDEATIF 287
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D V+ L + S + +G + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDNQVESWKLDLLGKTSRASVLVYLGGGVLFL 347
Query: 426 GSRLGDSLLVQFTCGS 441
GS GDS +++ + GS
Sbjct: 348 GSHQGDSQVLRISNGS 363
>gi|148709424|gb|EDL41370.1| damage specific DNA binding protein 1 [Mus musculus]
Length = 968
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 73/151 (48%), Gaps = 17/151 (11%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLID 1250
L++K +A+DF A E L D
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDD 962
>gi|320037168|gb|EFW19106.1| pre-mRNA-splicing factor rse1 [Coccidioides posadasii str. Silveira]
Length = 970
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 97/488 (19%), Positives = 204/488 (41%), Gaps = 57/488 (11%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI-VS 1022
C G + + Q + ++ S DN +Q+ IPL TP E+ L+ +I
Sbjct: 520 QCVEGMVGIQGQNL----RIFSIEKLDNNL-LQETIPLAYTPRHFVRHPEQPLFYVIEAD 574
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1082
+L P + L + ++ R +++++P + +
Sbjct: 575 NNILSPSTKAKLLQDSKAANGEVAELPPEDFGYPRGTGHWASCIQVVDPINSKA---VIS 631
Query: 1083 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTG 1136
I ++ +E A++V V +++++ET L +GT G+D+ ++ G + ++
Sbjct: 632 RIELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQ 684
Query: 1137 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1196
+ + ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 685 EDGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVP 740
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTL 1254
+V L + I++ D+ +S+ ++ +K Q +L A D + C A ++D T+
Sbjct: 741 RLIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETV 797
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLAT 1306
+ D+ N+ + K SE G L+ ++ GA L + Q + T
Sbjct: 798 A--GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPT 855
Query: 1307 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1363
S +T G R L++ L G++G + P +++ F QSL+ +L P +A
Sbjct: 856 SIQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLA 909
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
G + +R +++ K +D +L Y LP +++L IA + + +I ++
Sbjct: 910 GRDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKIS 962
Query: 1424 DLALGTSF 1431
D+ ++
Sbjct: 963 DMRTKVAY 970
>gi|402223178|gb|EJU03243.1| hypothetical protein DACRYDRAFT_115454 [Dacryopinax sp. DJM-731 SS1]
Length = 1175
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 108/440 (24%), Positives = 179/440 (40%), Gaps = 79/440 (17%)
Query: 926 FFLSGSRPCWCMVFRERLRVHP-QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
F G RP + +RL P +L D I A +VLH FI+ ++ +L I Q+
Sbjct: 718 IFACGDRPALLFLKNDRLTASPIKLRD--IHAGSVLHIPQFPSSFIFASASTLL-IGQIR 774
Query: 985 SGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQ 1044
D V+ + TP ++TY Y ++ K LN+ D+E+
Sbjct: 775 ESQKID----VRTISLGLDTPIRLTYHRGLRAYGVVCQ---RKELNRE----DDREIYS- 822
Query: 1045 IDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTT 1104
SS L T E PD + V T+ ++T
Sbjct: 823 ------SSFKLFDDITFEYLNNFTARPDEQ-------------------MMCVTTIPDST 857
Query: 1105 TKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
+E+ L +GT G E+ ++GR+L+F + P + V S ++ G + A+ +
Sbjct: 858 GEEDSDFLVVGTYEATGAEEDVSKGRILIFE-----EVPNRKLKLVVSHDVGGCVYAVTN 912
Query: 1164 LQGHLLIASGPKI---ILHK-WTGTELNGIAFYDAPPLYVVSLNIVK-NFILLGDIHKSI 1218
+ +L A + LH+ + +A + + YV S I + N +L+GD +++
Sbjct: 913 VGANLAAAINGTLQVFSLHRSHDDIRIESVAKWSSA--YVASSLICRGNTLLVGDAMRAV 970
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
L W GA+L L D+ SL E + +G V+ E N + +W
Sbjct: 971 CILRWT--GAKLETLYHDYASLWIQTLESIDEGG----VIGAELNNNIV---------TW 1015
Query: 1279 KGQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIG 1335
+ L R ++ G + +F R + A AAPG+ N L+F T G IG
Sbjct: 1016 RKDGKLERDGMWYFGEGINRFRRASLNA-------AAPGAGGNNAGRGNLVFCTNTGRIG 1068
Query: 1336 CIAPLDELTFRRLQSLQKKL 1355
+A LDE +L +LQ+ +
Sbjct: 1069 IVASLDEDLSMQLSNLQRNI 1088
>gi|295666353|ref|XP_002793727.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278021|gb|EEH33587.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 1209
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 97/454 (21%), Positives = 192/454 (42%), Gaps = 46/454 (10%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLI-VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
+Q+ I L TP Q E+ L+ +I +L P + LL D + + D L S
Sbjct: 785 LQETISLTYTPRQFIKHPEQPLFYVIEADNNILSPATRA-KLLNDSDAVNG-DATVLPSE 842
Query: 1054 DLHRTYTVEEYE--VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
D + ++I++P + + I ++ +E A++V V+ +++++ET
Sbjct: 843 DFGYPRGTGHWASCIQIVDPVNSKS---VVSQIELEENEAAVSVAAVSF---SSQDDETF 896
Query: 1112 LAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
L +GT G+D+ R R ++ + L ++ +++ AL QG L
Sbjct: 897 LVVGT----GKDMVVNPRSCSAGFIHIYRFQEDGKEL-EFIHKTKVEQPPVALLGFQGRL 951
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
L G + ++ ++ P VV L + I++ D+ +S+ ++ +K Q
Sbjct: 952 LAGIGTDVRIYDLGMRQMLRKCQASVVPHLVVGLQTQGSRIIVSDVQESVTYVVYKSQEN 1011
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW----KGQKLL 1284
+L D S T ++D T++ D+ N+ + K SE G L+
Sbjct: 1012 RLIPFVDDVISRWTTCTT-MVDYETVA--GGDKFGNLWLLRCPAKASEEADEDGSGAHLI 1068
Query: 1285 SRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
++ GA ++ Q L TS + G R L++ L G++G + P
Sbjct: 1069 HERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTG----GRDILVWTGLQGTVGMLIPF 1124
Query: 1341 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
+E+ F QSL+ +L P +AG + +R +++ K +D +L Y
Sbjct: 1125 ISREEVDF--FQSLEMQLAAQNPPLAGRDHLIYRSYYAPAKG-------TIDGDLCETYL 1175
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+LP +++ +IA + + +I + D+ ++
Sbjct: 1176 LLPNDKKQQIAGELDRSVREIERKIADMRTKVAY 1209
>gi|303324325|ref|XP_003072150.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
delta SOWgp]
gi|240111860|gb|EER30005.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
delta SOWgp]
Length = 1209
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/367 (21%), Positives = 165/367 (44%), Gaps = 48/367 (13%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1137
I ++ +E A++V V +++++ET L +GT G+D+ ++ G + ++ R
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIY---R 921
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1197
++ + L ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 922 FQEDGKEL-EFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1255
+V L + I++ D+ +S+ ++ +K Q +L A D + C A ++D T++
Sbjct: 981 LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037
Query: 1256 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1307
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1364
+T G R L++ L G++G + P +++ F QSL+ +L P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+ +R +++ K +D +L Y LP +++L IA + + +I ++D
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISD 1202
Query: 1425 LALGTSF 1431
+ ++
Sbjct: 1203 MRTKVAY 1209
>gi|391341057|ref|XP_003744848.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Metaseiulus
occidentalis]
Length = 1211
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 104/247 (42%), Gaps = 27/247 (10%)
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
P +V++N V N I++GD+ +S +F+ ++ QL + A DF A ++D T
Sbjct: 981 PNLIVTINAVGNRIVVGDVQESFFFIRYRMLENQLIIFADDFTPRWTTAA-CMVDYRT-- 1037
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--------AHVTKFLRLQMLATS 1307
+V D+ N+ I S+ R+ + G A V + L S
Sbjct: 1038 VVGGDKFGNVYILRLPGNTSDDVDEDPTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVS 1097
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1364
+T PG + A+++ T+ G +G + P D+ F Q L+ + P + G
Sbjct: 1098 LQKTALIPGGPE----AIVYTTIAGGVGALIPFSSKDDHEF--FQHLEMYMRTEHPPICG 1151
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+ SFR ++ KA ++D +L Y L +Q +IA + ++ L D
Sbjct: 1152 RDHLSFRSYYFPVKA-------VIDGDLCEQYNSLDANKQKQIADELERLPHEVAKKLED 1204
Query: 1425 LALGTSF 1431
+ +F
Sbjct: 1205 IRTKFAF 1211
>gi|17541566|ref|NP_502299.1| Protein DDB-1 [Caenorhabditis elegans]
gi|74965443|sp|Q21554.2|DDB1_CAEEL RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|5824558|emb|CAA92824.2| Protein DDB-1 [Caenorhabditis elegans]
Length = 1134
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/281 (19%), Positives = 119/281 (42%), Gaps = 17/281 (6%)
Query: 1105 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
T ++ T +GT + ++ + GR+++F D ++ + V+ ++G+ A+
Sbjct: 814 TNDSSTYYVVGTGLIYPDETETKIGRIVVFEVD---DVERSKLRRVHELVVRGSPLAIRI 870
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L G L+ A I L +WT + + + + L ++ + + D+ +S+ LS+
Sbjct: 871 LNGKLVAAINSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSY 930
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+ +AKD+ S EF+ S L +++ P + G+ +
Sbjct: 931 RMLEGNFEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDD---GRYV 987
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLD 1341
L + + K + L + D +++ ++FGT G+IG I +D
Sbjct: 988 LEPTGYWYLGELPKVMTRSTLVIQPE--------DSIIQYSQPIMFGTNQGTIGMIVQID 1039
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
+ + L +++K + DSV + + S+R F +A P
Sbjct: 1040 DKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAEPP 1080
Score = 47.8 bits (112), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 96/396 (24%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP IGGV+V+G+N++ Y + Y SL L ++F+ +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262
Query: 365 DAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
DA+ +L LS G L++L T G V+ + + + + I I N
Sbjct: 263 DASGERFL------LSDTDGRLLMLLLNVTESQSGYTVKEMRIDYLGETSIADSINYIDN 316
Query: 421 SLFFLGSRLGDSLLVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
+ F+GSRLGDS L++ T +G S S + E + +I ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTEPNGGSY--SVILETYSNI---------------GPIRDMV 359
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
E ++ + T + A +D G L+ G+ I+ AS
Sbjct: 360 MVE---------SDGQPQLVTCTGADKD-----GSLRVIRNGIGIDELAS---------- 395
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
V+L G GI+ + S NAD+ Y+I+SL T VL+ E
Sbjct: 396 --VDLAGVVGIFPIRLDS----NADN------------YVIVSLSDETHVLQITGEELED 437
Query: 600 TESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
+ ++ TI A LFG ++Q E+ R++ S +++ + P+N E S
Sbjct: 438 VKLLEINTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK--FWEPTNGEIISK- 494
Query: 657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
+SV+ A+ ++L D ++ LL TC V
Sbjct: 495 -----VSVNAANGQIVLAARD-TVYLL-----TCIV 519
>gi|391341059|ref|XP_003744849.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Metaseiulus
occidentalis]
Length = 1223
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 104/247 (42%), Gaps = 27/247 (10%)
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
P +V++N V N I++GD+ +S +F+ ++ QL + A DF A ++D T
Sbjct: 993 PNLIVTINAVGNRIVVGDVQESFFFIRYRMLENQLIIFADDFTPRWTTAA-CMVDYRT-- 1049
Query: 1256 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--------AHVTKFLRLQMLATS 1307
+V D+ N+ I S+ R+ + G A V + L S
Sbjct: 1050 VVGGDKFGNVYILRLPGNTSDDVDEDPTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVS 1109
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1364
+T PG + A+++ T+ G +G + P D+ F Q L+ + P + G
Sbjct: 1110 LQKTALIPGGPE----AIVYTTIAGGVGALIPFSSKDDHEF--FQHLEMYMRTEHPPICG 1163
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+ SFR ++ KA ++D +L Y L +Q +IA + ++ L D
Sbjct: 1164 RDHLSFRSYYFPVKA-------VIDGDLCEQYNSLDANKQKQIADELERLPHEVAKKLED 1216
Query: 1425 LALGTSF 1431
+ +F
Sbjct: 1217 IRTKFAF 1223
>gi|71413926|ref|XP_809084.1| cleavage and polyadenylation specificity factor-like protein
[Trypanosoma cruzi strain CL Brener]
gi|70873410|gb|EAN87233.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma cruzi]
Length = 499
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 54/260 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIDSSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
NN + D E + F V L A+ T + N++ +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFMVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIE 458
C + M S + F IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499
>gi|323508292|emb|CBQ68163.1| probable splicing factor 3B subunit 3 [Sporisorium reilianum SRZ2]
Length = 1221
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 133/579 (22%), Positives = 231/579 (39%), Gaps = 79/579 (13%)
Query: 887 LRNLRFSRTPLDAYTREETPH-----GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
L+N RT LDA T + T G+ R+ I + G +R +++
Sbjct: 685 LQNGVLLRTVLDAVTGQLTDTRTRFLGSKAVRL-IRTKVHGQSAVMALSTRTWLSFTYQD 743
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
RL+ P + D A++ + C G I + L+I +PS ++ Q + L
Sbjct: 744 RLQFVPLIFDALDHAWSFSAEL-CPEGLIGIVG-STLRIFTMPSLASKLK----QDSVAL 797
Query: 1002 KATPHQITYFAEKN--LYPLIVSVPVLKPLNQVL-SLLIDQEVG-HQ--IDNHNLSSVDL 1055
TP + + + Y + L P Q + ++++E+ HQ + + N + L
Sbjct: 798 SYTPRKFAHHPNEQGLFYVVEAEHRTLSPGAQRRRTEMLEKELKPHQRGVLDLNPAEFGL 857
Query: 1056 HRTYTVEEYE-VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1114
R VR+++ G QT + + +E A +V VV + E E +L +
Sbjct: 858 IRAEAGNWASCVRVVD----GVQSQTTHKLELDDNEAAFSVAVVPF---ASAEKEAMLVV 910
Query: 1115 GTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSK-ELKGAISALASLQGHLLI 1170
G+A DV R +T R +N + L EV K E+ L QG LL
Sbjct: 911 GSAV----DVVLSPRSFKKAYLTTYRLTNNGREL--EVLHKTEVDDIPLVLRPFQGRLLA 964
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
G + ++ +L + +VSL+ + I++GD+ +SI F S+K +L
Sbjct: 965 GIGKALRIYDLGKKKLLRKCENKSFATAIVSLDAQGSRIVVGDMQESIIFTSYKPLENRL 1024
Query: 1231 NLLAKD----------------FGSLDCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAP 1272
A D + D F ++ ID T S V ++ + I + P
Sbjct: 1025 VTFADDVMPKFVTRCAMLDYDTVAAADKFGNVYVLRIDADT-SRSVDEDPTGMTIVHEKP 1083
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
+ + L+ A F VG + TS +RT PG R LL+ + G
Sbjct: 1084 VLMGAAHKATLV--AHFFVGD----------IVTSLNRTVMVPG----GREVLLYTGVSG 1127
Query: 1333 SIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
+IG + P + + L +L+ L + G + ++R ++ K S++D +
Sbjct: 1128 TIGALVPFVSKEDVDTLSTLESHLRQENSSLVGRDHLAYRSSYAPVK-------SVIDGD 1180
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
L + +LP +Q IA + S+I L L G++
Sbjct: 1181 LCETFGLLPPAKQNAIATELDRKPSEINKKLAQLREGST 1219
>gi|392869416|gb|EJB11761.1| pre-mRNA-splicing factor rse1 [Coccidioides immitis RS]
Length = 1209
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/367 (20%), Positives = 162/367 (44%), Gaps = 48/367 (13%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1137
I ++ +E A++V V +++++ET L +GT G+D+ ++ G + ++
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQE 924
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1197
+ + ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 925 DGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1255
+V L + I++ D+ +S+ ++ +K Q +L A D + C A ++D T++
Sbjct: 981 LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037
Query: 1256 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1307
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1364
+T G R L++ L G++G + P +++ F QSL+ +L P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+ +R +++ K +D +L Y LP +++L IA + + +I ++D
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISD 1202
Query: 1425 LALGTSF 1431
+ ++
Sbjct: 1203 MRTKVAY 1209
>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
Length = 1104
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 112/268 (41%), Gaps = 30/268 (11%)
Query: 1114 IGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI-SALASLQGHLLIA 1171
+GTA++ E+ +GR+L+ R LVTE KE+KGA + L ++ +L +
Sbjct: 835 VGTAFIVPEEPEPTKGRILVLEHVR-------LVTE---KEVKGAAYNVLPFVKDKILAS 884
Query: 1172 SGPKIILHKWTGTELNGIAFYDAPP------LYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
K+ +G +L G+ A + + L N +++GD+ +S+ LS+
Sbjct: 885 VNSKV---PASGCDLGGVRVELASECSYLGNILALYLATRGNLVVVGDLMRSVSLLSYNV 941
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+ L A D+ S + E L D + L D N+ + + + +L
Sbjct: 942 EQGVLEHRAADYNSGWTTSVEALDDDTYLE---GDNHLNLVVLRRNADSATDEERARLQV 998
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1345
E+H G V +F ++ D + LLFG DG +G IA L +
Sbjct: 999 VGEYHTGTFVNRFRHGSLVMRPPDSEFV------SLPVPLLFGGTDGRLGVIARLPPGLY 1052
Query: 1346 RRLQSLQKKLVDSVPHVAGLNPRSFRQF 1373
L LQ L V V GL+ ++ F
Sbjct: 1053 EMLTKLQSALRQVVRGVGGLSHEAWIAF 1080
Score = 47.0 bits (110), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 94/242 (38%), Gaps = 55/242 (22%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL + G + LL + +DG V L + S + + + L F+GSR GDS LV+
Sbjct: 281 LLGNRQGGMQLLVLAHDGSRVSGLRTEPLGYTCAPSCLAYLDSGLTFVGSRSGDSQLVRI 340
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ + P T S +L +V+ + L
Sbjct: 341 SAQP-----------------VNQPPTYLELVDSFPSLAPIVDFVVMDL---------ER 374
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
Q + + + G L+ G+ IN A+ VELPG KG+W++
Sbjct: 375 QGQGQLVMCSGIDSDGSLRVVRNGIGINRQAT------------VELPGIKGVWSL---- 418
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
R H YDDEY YL+++ E R + L T + L E E + +T+ G
Sbjct: 419 -RSH---------YDDEYDKYLLLTFVGETRLLALNTEEELDE-AELPGFDSGSQTLWCG 467
Query: 616 NL 617
N+
Sbjct: 468 NM 469
>gi|302680006|ref|XP_003029685.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
gi|300103375|gb|EFI94782.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
Length = 1213
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 157/376 (41%), Gaps = 36/376 (9%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+RI++P + T A IP+ ++E A ++ VV + + E L +GTA V+
Sbjct: 861 IRIIDPTQN----STVAVIPLDNNEAAFSIAVVPF---SARNGELFLVVGTA--ANTRVS 911
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
R + N + + E AL + QG L G + ++ +
Sbjct: 912 PRTCSSGYLRTYQFTNDGAGLELHHKTETDDVPLALLAFQGRLAAGVGKALRIYDIGKKK 971
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
L A +V+LN + I+ GD+ +S+++ +K +L + A D +
Sbjct: 972 LLRKAENKGFGTTIVTLNTQGSRIIAGDMQESLFYAVYKAPENRLLVFADD-SQPRWISA 1030
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL- 1299
++D T++ D N+ + K+SE G +L +GA H TK L
Sbjct: 1031 ATMVDYYTVA--AGDRFGNVFVNRLDYKVSEQVDDDPTGAGILHEKGILMGAPHKTKLLC 1088
Query: 1300 --RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKK 1354
+ L TS + G R LL+ L G+IG + P +++ F + +L++
Sbjct: 1089 HFHVGDLITSIHKVALVAG----GREVLLYTGLHGTIGMLVPFVSKEDVDF--ISTLEQH 1142
Query: 1355 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
+ + G + S+R ++ KA +VD +L + LP +Q IA++ T
Sbjct: 1143 MRSEQSSLVGRDHLSWRGYYVPVKA-------VVDGDLCETFAKLPASKQSAIANELDRT 1195
Query: 1415 RSQILSNLNDLALGTS 1430
++L L+ L TS
Sbjct: 1196 VGEVLKKLDSLRTTTS 1211
>gi|392580116|gb|EIW73243.1| hypothetical protein TREMEDRAFT_37240 [Tremella mesenterica DSM 1558]
Length = 1214
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 81/393 (20%), Positives = 162/393 (41%), Gaps = 67/393 (17%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA-------- 1117
+RIL+P QT +TI + E A ++ + N E L +GTA
Sbjct: 862 IRILDPLTN----QTVSTIELDEDEAAFSLTIAYFENMA---GEPSLVVGTAVKTTLTPR 914
Query: 1118 -----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+++ + GR L F ++ +L +A QG+LL+ +
Sbjct: 915 GCKEGWLRVYAIKENGRTLEF---------------MHKTKLDEIPLCVAGFQGYLLVGA 959
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE-QGAQLN 1231
G + L++ L ++ P + ++N++ I++GD+ +S +F ++ QL
Sbjct: 960 GKSLRLYEAGKKALLRKCENNSFPTVIATINVIGARIIVGDMQESTFFCVYRSIPTRQLL 1019
Query: 1232 LLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLS 1285
+ D L C +D T++ D+ N+ + +SE G +L
Sbjct: 1020 VFGDDTQPRFLTCVTN---VDYDTVA--CGDKFGNVFVNRMDQAVSEKVDDDPTGAGILH 1074
Query: 1286 RAEFHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-- 1339
F +G AH T + ++ + TS + PG R L++ T+ G++G + P
Sbjct: 1075 EKGFLMGAAHKTTLIAHYQVGSVVTSLTKVSLVPG----GRDVLVYTTISGAVGALVPFI 1130
Query: 1340 -LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1398
+D++ F + +L+ + + G + ++R +++ +VD +L Y
Sbjct: 1131 SMDDVEF--MTTLEMHMRSQNISLVGRDHLAYRGYYAPVMG-------VVDGDLCDAYSS 1181
Query: 1399 LPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
LP +Q IA++ + +L L + ++F
Sbjct: 1182 LPYTKQSSIANELDRSVGDVLKKLEQMRTSSAF 1214
>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
Length = 1284
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/286 (23%), Positives = 121/286 (42%), Gaps = 30/286 (10%)
Query: 1111 LLAIGTAYVQ-GEDVAARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASL- 1164
L +GTAY ED +RGR+L++S V ++ +G + ++
Sbjct: 931 FLLVGTAYAMPDEDEPSRGRILVYSCQADEASGTPTSTRAVRQITEMSTQGGVYSICQFY 990
Query: 1165 QGHLLIASGPKIILHKWTGT------ELNGIAFYDAPPLYVVSLNI---VKNFILLGDIH 1215
G+ L K + + E GI + ++VSL + K ++GD+
Sbjct: 991 DGNFLCTVNSKTHVVQIVADCGVLRLEYVGIGHHG----HIVSLFVKSRAKPLAIVGDLM 1046
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+S+ + + Q L +A+DF A E L D + + E N K +
Sbjct: 1047 RSVSLMQYYPQHETLEEVARDFNPNWTTAVEMLTD----DVYIGAENWNNLFCLRRNKAA 1102
Query: 1276 ESWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
S + + +L + EFH+G KF+ ++ + S ++R A LFGT++GS+
Sbjct: 1103 TSEEIRCRLDNIGEFHLGEMCNKFMSGSLVMP------VSSNSTTSSRRATLFGTVEGSL 1156
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
G I LD T +L++ + ++ V G + + +R + + H
Sbjct: 1157 GVILGLDGRTAAFFITLERAIAKTIQPVGGFSHQLYRSCQAELRVH 1202
>gi|390342012|ref|XP_793599.3| PREDICTED: uncharacterized protein LOC588842 [Strongylocentrotus
purpuratus]
Length = 1161
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 111/281 (39%), Gaps = 46/281 (16%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GP+ +DP+ R G+ +Y I+ + L F+ R+E +VI+++
Sbjct: 50 GPIGIIDPECRMIGLRLYDGLFKIIPLDRDNKEL----------KAFNIRLEELNVIDVQ 99
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
F++G +P +V LH+ GR H + P W
Sbjct: 100 -----------FLYGCHQPTIVFLHQDP---HGR-----HVKTYEVNLREKEFNRGP--W 138
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++AVP P GG L++G +I YH A+A + S+
Sbjct: 139 KQDNVETEATMVIAVPQPYGGALIIGQESITYHKGDNYVAIA----------PPTIKNST 188
Query: 360 FSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDIT 416
LD + +L D L L+ DG V+ L L + + +T
Sbjct: 189 LVCYGRLDNNGSRYLLGD--LTGRLFLLLLDKEESMDGAATVKDLKLEFLGETSIAECLT 246
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 247 YLDNGVVFIGSRLGDSQLVRLNTESDESGSYVTMMETFTNL 287
>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
Length = 1064
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 76/336 (22%), Positives = 141/336 (41%), Gaps = 38/336 (11%)
Query: 1103 TTTKENE---TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAIS 1159
T+TK ++ T LA+GT+ + D GRVLLF N + + LV + + +
Sbjct: 749 TSTKFDDDPCTYLAVGTS-INIPDRQTSGRVLLF----NINEAKKLVL-LEEISFRSGVL 802
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----NFILLGDI 1214
L G L+ A ++ +++ ++ + ++ I+K +F+L+GD+
Sbjct: 803 YLHQFNGRLIAAVLKRLYSIRYSYSKEKNCKVISSENVHKGHTMILKLASRGHFMLVGDM 862
Query: 1215 HKSIYFLSWKEQGAQLNLLAKD-----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
KS+ L E G+ L +AK+ S+ ++ I T + V ++ N
Sbjct: 863 MKSMSLLGQSENGS-LVQIAKNPQPIWIRSIAMINDDYFIGSETSNNFVVVKKNN----- 916
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
+ + + L S +H+G + ML S R P SD +L+ +
Sbjct: 917 ---DSTNELERELLDSVGHYHIGESIN-----SMLCGSLVR---LPDSDAPPIPTILYAS 965
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
++GSIG IA + + + LQK L V + G S+R F ++ H + +D
Sbjct: 966 VNGSIGVIASISKEDYEFFSKLQKGLNRVVNGIGGFTHESWRAFSND--HHTVESRNFID 1023
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+L+ + L +E ++ T + L + L
Sbjct: 1024 GDLIEMFPDLKIESMAKVIQDMNVTLDETLKRIESL 1059
Score = 42.4 bits (98), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 128/565 (22%), Positives = 214/565 (37%), Gaps = 117/565 (20%)
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
++G + L + S GG ++D + ++ E K +L +D + + E
Sbjct: 14 IYGRISVLKLFSAGG-----KQDYLFISTESFKFCILAYDSEKKEIVTKASGNAED---- 64
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
GR + A G L +DP GR +I L +G L+ E G +
Sbjct: 65 --TIGRPTEA-GQLGIIDPDGR----------LIALHLYEGLLKLINIEK------GLNN 105
Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
I+ + N R L+ V D F++G P + +L + KH I +
Sbjct: 106 PIQKTAA-NTR-LEELQVMDMTFLYGCKIPTIAVL------FKDTKDEKH----IVTYEV 153
Query: 289 STTLKQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
S ++ P WS N+ Y + V P+GGVLVV N I Y + + ++A
Sbjct: 154 SQKDQELCPGPWSQSNV--GVYSSMLVAVPLGGVLVVADNGITYMNGRTTRSIA------ 205
Query: 348 SLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+P + F +D + +L D G L +L ++ + V L
Sbjct: 206 -------IPYTKFLAYDRVDKDGSRYLFGD------HFGRLSVLVLLNHQQRVTELKFET 252
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + + + F+GS GDS L++ + E D P+T
Sbjct: 253 LGRTSIPSSISYLDSGVVFIGSSSGDSQLIRL------------------NTEKD-PATD 293
Query: 466 RLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVNIGPLKDFSYGLR 523
S L++ N G + + AQ T S RD G L+ G+
Sbjct: 294 ----SYISHLENFTNIGPIVDFCLVDTEKQGQAQIVTCSGTYRD-----GTLRVIRNGI- 343
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
GI++++ L+EL G KG+W + N S + D YLI+S
Sbjct: 344 --------GIAEKA---LIELEGVKGLWPI------KENDPSDPLNPKD----QYLIVSF 382
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
T VL+ E TE TI N+ ++QV + +++ ++ D
Sbjct: 383 IGYTKVLQFQGEEIEETEFEGLDSNSSTILCSNIDKENVIVQVTNQAINLINPITFKRVD 442
Query: 643 LSFGPSNSESGSGSENSTVLSVSIA 667
PS S S N + +++SI
Sbjct: 443 QWKSPSGSPINLVSSNQSQIALSIG 467
>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
Length = 1205
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 105/472 (22%), Positives = 196/472 (41%), Gaps = 83/472 (17%)
Query: 996 QKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1055
Q+ +PL TP + + LI S +Q L ++E ++ + N+S D
Sbjct: 778 QESVPLAFTPRKFVTHPGNRFFYLIES-------DQ--RTLSEEEEKKKLADLNISRDD- 827
Query: 1056 HRTYTVEEYEVRILEPDRA-GGPWQTR------------ATIPMQSSENALTVRVVTLFN 1102
+ + + +I RA G W +R AT+P++++E A ++ VV +
Sbjct: 828 ---HPILQLPAKIFGRTRASAGHWASRIHIFDPMEAKTVATLPLKANEAAFSIAVVPFAS 884
Query: 1103 TTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNADNPQNL-VTEVYSKELKGAIS 1159
T E L +GTA ++ A+ + ++ + L T + EL
Sbjct: 885 TG---GEYHLVVGTAMHHLVTPPQASASYLKVYKIVNEGTGLELLHETPIQDSELP---R 938
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
AL + QG LL G + ++ +L A +P +VSL + I++GD+ +S
Sbjct: 939 ALLAFQGRLLAGVGKALRIYDLGKKKLLRKAETKSP-TAIVSLATQGSRIVIGDMQESTL 997
Query: 1220 FLSWKEQGAQLNLLAK-------------DFGSL---DCFATEFL--IDGSTLSLVVSDE 1261
F +KE +L + D+ ++ D F F+ +D ST+S V ++
Sbjct: 998 FAVYKEAENRLLIFGDDTQPRWVSAMTMVDYNTVAVGDKFGNIFVNRLD-STISDQVDED 1056
Query: 1262 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
I + ++ + K+L A FHVG +T ++ ++
Sbjct: 1057 PTGAGILHEKATLNGAPHKTKML--AHFHVGDIITSIHKVSLVV--------------GG 1100
Query: 1322 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
R LL+ L G+IG + PL +++ F L L++ + + + G + S+R ++ K
Sbjct: 1101 REVLLYTGLQGTIGILVPLTSKEDIEF--LTMLEQHIRNEQGSLVGRDHLSWRGYYVPVK 1158
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1430
A ++D +L Y L +Q IA + T +L L+ + + +S
Sbjct: 1159 A-------VIDGDLCETYGGLSSSKQSAIASELDRTVGDVLKKLDQMRVASS 1203
>gi|300120114|emb|CBK19668.2| unnamed protein product [Blastocystis hominis]
Length = 1240
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 124/543 (22%), Positives = 209/543 (38%), Gaps = 103/543 (18%)
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV-------------LHNVNCNHGFIYV- 972
L+G PC L V P LC + T+ N +C+ G + V
Sbjct: 715 ILAGGNPCVLA-----LSVKPWLCYCANNTLTLTSLVSDPLDLAAPFCNEDCSEGIVCVA 769
Query: 973 -TSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
T+ I++I L T + IPL TP ++ +YP + +L+ +
Sbjct: 770 GTNLNIIRIDDLTQPFTATS-------IPLSYTPRELV------VYPGQPRLLLLETDHN 816
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRI---LEPDRAGG--------PWQT 1080
S L Q Q HN+S V+ EY+ EPD+ QT
Sbjct: 817 AYSELEKQSFYQQ---HNVSYVN--------EYDCGAPIPAEPDKWASCIRVVDAISLQT 865
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG------RVLLFS 1134
+ + +E A ++ V +K +E + IGTA + + R V F
Sbjct: 866 LERLELADNEAAFSMCVCRF---ASKGDEPFVVIGTA--KNLKIHPRSCSQGFISVFRFV 920
Query: 1135 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
G + + ++ E+ +AL G L G + ++ +L A
Sbjct: 921 EGHS-------LQLLHRTEVDEVPAALCEFDGKLAAGIGRSVRVYDLGKKKLLRKCENKA 973
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
P +V L + + GD+ ++ F+ +++ QL A D G ++D +T+
Sbjct: 974 MPHFVTKLRAMGERLYAGDLTDNVSFVKFRKGTNQLVEFA-DGGIPRSITALDVLDYNTV 1032
Query: 1255 SLVVSDEQKNIQIFYYAPKM---------SESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1305
V D+ N+ + PK+ S S LLS A A + L +
Sbjct: 1033 --VCGDKGGNLFVERVDPKVDDDIANPTGSRSLWNSGLLSAAPNK--AEQAASIYLGEIV 1088
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1362
TS +T PG D+ +L+GT+ G+IG + P+ D+L L ++ + P +
Sbjct: 1089 TSVQKTVLIPGGDEV----VLYGTIFGTIGALLPMPSRDDL--HHLMHIEMYIRKQEPSL 1142
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
G + S+R ++ K I+D L + MLP +Q EIA+ + S I+ +
Sbjct: 1143 VGRDILSWRSAYTPMKG-------IIDGNLCETFSMLPQIKQEEIANALVLSVSSIVKKM 1195
Query: 1423 NDL 1425
DL
Sbjct: 1196 EDL 1198
>gi|425768510|gb|EKV07031.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum PHI26]
gi|425775700|gb|EKV13954.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum Pd1]
Length = 1209
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 99/458 (21%), Positives = 186/458 (40%), Gaps = 66/458 (14%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
+Q+ IPL TP + +++L+ +I S VL P + LID + +L
Sbjct: 785 LQESIPLSYTPRRFVKHPDQHLFYVIESDNNVLSPATR--QRLIDDSQAQNGEVADLPPA 842
Query: 1054 DLHRTYTVEEYE--VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
D + V+I++P +T+ ++ +E A+++ V+ +++++ET
Sbjct: 843 DFGYPRATGHWASCVQIVDPITTKS---VISTLDLEDNEAAVSLAAVSF---SSQDDETF 896
Query: 1112 LAIGTA-------------YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI 1158
L +GTA ++ GR L F D P
Sbjct: 897 LVVGTAKDMTVSPPSSSCGFIHIYRFQEDGRELEFIHKTQVDEPP--------------- 941
Query: 1159 SALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
AL QG LL GP + ++ +L P +V L + I++ DI +S+
Sbjct: 942 LALLGFQGRLLAGIGPVLRVYDLGMKQLLRKCQAPVVPKTIVGLQTQGSRIIVSDIRESV 1001
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1278
++ +K Q L A D + +T ++D T + D+ N+ + K+SE
Sbjct: 1002 TYVVYKYQDNVLIPFADDSIARWTSSTT-MVDYETTA--GGDKFGNLWLVRCPSKISEQA 1058
Query: 1279 ----KGQKLL-SRAEFHVGAHVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
G L+ + H H + + Q + TS +T G R +++ L
Sbjct: 1059 DEDGSGAHLIHEKGYLHGTPHRLELMVHFFAQDIPTSLHKTQLVAG----GRDIVVWTGL 1114
Query: 1331 DGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
G+IG P +++ F Q L+ +L P +AG + +R +++ K +
Sbjct: 1115 QGTIGMFVPFVSREDVDF--FQLLETQLASQQPPLAGRDHLMYRGYYAPVKG-------V 1165
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+D +L Y +LP + +L IA + + +I ++D+
Sbjct: 1166 IDGDLCEMYLLLPNDTKLMIAGELDRSVREIERKISDM 1203
>gi|225683909|gb|EEH22193.1| pre-mRNA-splicing factor rse1 [Paracoccidioides brasiliensis Pb03]
Length = 1209
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 77/362 (21%), Positives = 156/362 (43%), Gaps = 38/362 (10%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNAD 1140
I ++ +E A++V V+ +++++ET L +GT G+D+ R R +
Sbjct: 872 IELEENEAAVSVAAVSF---SSQDDETFLVVGT----GKDMVVNPRSCSAGFIHIYRFQE 924
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
+ + L ++ +++ AL QG LL G + ++ ++ P VV
Sbjct: 925 DGKEL-EFIHKTKVEQPPVALLGFQGRLLAGIGTDVRIYDLGMRQMLRKCQASVVPHLVV 983
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
L + I++ D+ +S+ ++ +K Q +L D S T ++D T++ D
Sbjct: 984 GLQTQGSRIIVSDVQESVTYVVFKSQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGD 1040
Query: 1261 EQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTG 1312
+ N+ + K SE G L+ ++ GA ++ Q L TS +
Sbjct: 1041 KFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQ 1100
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
G R L++ L G++G + P +E+ F QSL+ +L P +AG +
Sbjct: 1101 LVTG----GRDILVWTGLQGTVGMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLI 1154
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1429
+R +++ K +D +L Y +LP +++ +IA + + +I + D+
Sbjct: 1155 YRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADMRTKV 1207
Query: 1430 SF 1431
++
Sbjct: 1208 AY 1209
>gi|340721347|ref|XP_003399083.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus terrestris]
gi|350406701|ref|XP_003487854.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus impatiens]
Length = 1217
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/389 (20%), Positives = 153/389 (39%), Gaps = 60/389 (15%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1123
+RI+ P T T + E L ++L + + ++ L +G A +
Sbjct: 866 IRIIAP-------TTGQTFEVHRLEQNLAALCLSLVKFSNQGDQLFLIVGIAKEFQLNPR 918
Query: 1124 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
V++ G + + N + V+ L A+ QG +L+ G + L+
Sbjct: 919 VSSGGFLYTYRVNSECTN----LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLYDMGK 974
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L P VVS+N + I + D+ +S+Y + +K Q QL + A D
Sbjct: 975 KKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHP-RWI 1033
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1285
T ++D T++ +D+ NI + A +++ W QK +
Sbjct: 1034 TTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVDEDPTGNKALWDRGLLNGASQKADT 1091
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1342
A FHVG V + ++ PG ++ L++ TL G++G + P ++
Sbjct: 1092 VACFHVGETVMSLQKATLI----------PGGSES----LVYTTLSGTVGVLVPFTSHED 1137
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
F Q L+ + P + G + SFR ++ K +++D +L + +
Sbjct: 1138 HDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQFNSIEPT 1188
Query: 1403 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+Q I+ T S++ L D+ +F
Sbjct: 1189 KQKSISGDLERTASEVSKKLEDIRTRYAF 1217
Score = 42.4 bits (98), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPELYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|66553024|ref|XP_623333.1| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Apis mellifera]
gi|380015815|ref|XP_003691890.1| PREDICTED: splicing factor 3B subunit 3-like [Apis florea]
Length = 1217
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/389 (20%), Positives = 152/389 (39%), Gaps = 60/389 (15%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1123
+RI+ P T T + E L ++L + ++ L +G A +
Sbjct: 866 IRIIAP-------STGQTFEVHRLEQNLAALCLSLVKFANQGDQLFLIVGIAKEFQLNPR 918
Query: 1124 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
V++ G + + N + V+ L A+ QG +L+ G + L+
Sbjct: 919 VSSGGFLYTYKVNSECTN----LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLYDMGK 974
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+L P VVS+N + I + D+ +S+Y + +K Q QL + A D
Sbjct: 975 KKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHP-RWI 1033
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1285
T ++D T++ +D+ NI + A +++ W QK +
Sbjct: 1034 TTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVDEDPTGNKALWDRGLLNGASQKADT 1091
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1342
A FHVG V + ++ PG ++ L++ TL G++G + P ++
Sbjct: 1092 VACFHVGETVMSLQKATLI----------PGGSES----LVYTTLSGTVGVLVPFTSHED 1137
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
F Q L+ + P + G + SFR ++ K +++D +L + +
Sbjct: 1138 HDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPIK-------NVIDGDLCEQFNSIEPA 1188
Query: 1403 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+Q I+ T S++ L D+ +F
Sbjct: 1189 KQKSISSDLERTASEVSKKLEDIRTRYAF 1217
Score = 42.4 bits (98), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|190345965|gb|EDK37945.2| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
6260]
Length = 1206
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 123/611 (20%), Positives = 236/611 (38%), Gaps = 102/611 (16%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
+ ++ +CH ++ G ++++ + +GG++ D +++ + ++S+LEFD
Sbjct: 55 SGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD------- 103
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
+ F + H K G G + VDPQ R + ++ KA + L
Sbjct: 104 ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL---- 159
Query: 218 DTFGSGGGFSARIES----SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
S+ +ES + I + LD GY P++ + E +A
Sbjct: 160 -------ELSSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHY 199
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANT 328
+ + S L++ + L + A + +P + L+ +P+PIGGV+V G++
Sbjct: 200 DASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSF 259
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I YH+ + L L S S +P ++V H N LL + GD
Sbjct: 260 IFYHNPTIDQQLYLP--IPSRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFR 312
Query: 389 LTVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSM 445
+ + Y D V L + + ++ I F D +L Q G S
Sbjct: 313 VLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSY 372
Query: 446 LSSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
+SS D+ + + + R + AL +++ G+ +ES + +
Sbjct: 373 ISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMI 432
Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
S LK ++G+ I+ LV P +V+
Sbjct: 433 TGHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF---------- 463
Query: 565 SSRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
++R++A + + Y++IS A +T+VL +++ EV +S FV + G +
Sbjct: 464 TTRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKS 520
Query: 623 VIQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
+IQ++ G R + +G + + P T++S S VL+G+S+
Sbjct: 521 LIQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSN 572
Query: 678 GSIRLLVGDPS 688
+ DP+
Sbjct: 573 RELCYFEIDPT 583
>gi|19114492|ref|NP_593580.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe 972h-]
gi|46395602|sp|O13807.1|DDB1_SCHPO RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|2330717|emb|CAB11219.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe]
Length = 1072
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 95/492 (19%), Positives = 190/492 (38%), Gaps = 97/492 (19%)
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RES GPL+ VDP R + VY + I+ + + + FS RI+
Sbjct: 111 RES-QSGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V+ D ++ P + +L++ + ++K ++
Sbjct: 170 NVV-----------DIAMLYNSSRPSLAVLYKDSKSIVHLSTYK------------INVR 206
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
+ + + + HD + +PS GGV V G ++Y S+ + L Y
Sbjct: 207 EQEIDEDDV-VCHDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY-------- 257
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P ++FS + T L + + +++ ++G L ++ V ++L K S + S
Sbjct: 258 --PITAFSPSISNDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIAS 314
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ + ++ F+GS +S+L+Q PS + +
Sbjct: 315 CLIALPDNHLFVGSHFNNSVLLQL------------------------PSITK-NNHKLE 349
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
LQ+ VN +S + + T S+ T S A +D G L+ + I
Sbjct: 350 ILQNFVNIAPISDFIIDDDQTGSSIITCSGAYKD-----GTLRIIRNSINI--------- 395
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
N L+E+ G K ++V S A YD+ + +L + E R +++
Sbjct: 396 ---ENVALIEMEGIKDFFSV------------SFRANYDN--YIFLSLICETRAIIVSPE 438
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
+ + + D + TI ++G +++Q+ + R+ DG + +S P + G
Sbjct: 439 GVF---SANHDLSCEESTIFVSTIYGNSQILQITTKEIRLFDGKKLHSWIS--PMSITCG 493
Query: 654 SGSENSTVLSVS 665
S ++ ++V+
Sbjct: 494 SSFADNVCVAVA 505
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 64/290 (22%), Positives = 124/290 (42%), Gaps = 38/290 (13%)
Query: 1112 LAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+ +GT + +D GR+++F +DN + E +++G+++ L L HL++
Sbjct: 770 VVVGTGFNFPDQDAPDSGRLMVFEM--TSDNNIEMQAE---HKVQGSVNTLV-LYKHLIV 823
Query: 1171 A--SGPKIILHKWTGTE--LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1226
A + I GT N I P Y + +++ ++ I+ D+ KSI L + +
Sbjct: 824 AGINASVCIFEYEHGTMHVRNSIR----TPTYTIDISVNQDEIIAADLMKSITVLQFIDD 879
Query: 1227 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY---YAPKMSESWKGQKL 1283
QL +A+D+ L + E L S V++ N I +P++S+ +KL
Sbjct: 880 --QLIEVARDYHPLWATSVEIL---SERKYFVTEADGNAVILLRDNVSPQLSDR---KKL 931
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
+F++G + K R D++ P LL T+DGS+ +
Sbjct: 932 RWYKKFYLGELINK-TRHCTFIEPQDKSLVTP--------QLLCATVDGSLMIVGDAGMS 982
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
L LQ + +P GL+ + ++++ + P ++D L+
Sbjct: 983 NTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET---SPSDLIDGSLI 1029
>gi|331238007|ref|XP_003331659.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|309310649|gb|EFP87240.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1213
Score = 54.3 bits (129), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 91/398 (22%), Positives = 169/398 (42%), Gaps = 68/398 (17%)
Query: 1071 PDRAGGPWQT--RATIPMQSS----------ENALTVRVVTL-FNTTTKENETLLAIGTA 1117
P G W + R + P++ + E A ++ +V+ F E LL +G+A
Sbjct: 847 PRAEAGKWASCIRISDPIEKTTLVREDLGDNEAATSLAIVSFAFAAHHPATEPLLVVGSA 906
Query: 1118 ---YVQGEDVAARG--RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+VQ G RV F N ++ V+ E+ SAL QG L
Sbjct: 907 KDAFVQPR-TCKNGFLRVYRFV------NDGKVIELVHKTEVDEMPSALVGFQGRLAAGV 959
Query: 1173 GPKIILHKWTGTEL----NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
G + ++ +L +F A ++SL++ + IL+GD S+ + +K
Sbjct: 960 GKALRIYDLGKKKLLRKVENKSFGSA----IISLSVQGSRILVGDSQDSVSYAVYKPAEN 1015
Query: 1229 QLNLLAKDF-GSLDCFATEFLIDGSTLS-------LVVSDEQKNIQIFYYAPKMSESWKG 1280
+L + A D AT ++D T++ L VS KN+ + ++ E G
Sbjct: 1016 RLIVFADDVVPRWTTCAT--MVDYDTVAGGDRFGNLWVSRLPKNV-----SDEVDEDPTG 1068
Query: 1281 QKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1336
++ + +GA H K L L + TS +T PG R LL+ + GSIG
Sbjct: 1069 AGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTSLVPG----GREVLLYTGVQGSIGI 1124
Query: 1337 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+ P +++ F Q+L+ + + +P + G + ++R ++ K + VD +L
Sbjct: 1125 LVPFISKEDVDF--FQTLEMHMRNEMPSLVGRDHLAYRGYYFPVK-------NCVDGDLC 1175
Query: 1394 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+ +LP +QL++A + + S +L + + + + +
Sbjct: 1176 ESFALLPSAKQLQVASELDRSVSDVLKKIEAVRVSSGY 1213
>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1256
Score = 54.3 bits (129), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 54/289 (18%), Positives = 127/289 (43%), Gaps = 29/289 (10%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
+Y E++ + A+A QG L+ G I ++ +L P +V+++ + +
Sbjct: 979 LYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDR 1038
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
+++GDI +SI+F+ +K L + A D + ++D T++ +D+ NI +
Sbjct: 1039 LVVGDIQESIHFIKYKRSENMLYVFADDLAP-RWMTSSVMLDYDTVA--GADKFGNIFVL 1095
Query: 1269 YYAPKMSESWKGQKLLSRAEFHVGA---------HVTKFLRLQMLATSSDRTGAAPGSDK 1319
+S+ + ++ +F G H+ F + T + + G +
Sbjct: 1096 RLPLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLVVGGPE- 1154
Query: 1320 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
+L+ T+ G+IG + P +++ F +L+ + + G + ++R ++
Sbjct: 1155 ----VILYTTISGAIGALIPFTSREDVDF--FSTLEMNMRSDCLPLCGRDHLAYRSYYFP 1208
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
K +I+D +L + L ++QL I+ + + S+++ L ++
Sbjct: 1209 VK-------NIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250
Score = 43.9 bits (102), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 75/338 (22%)
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQN 374
GGVLV + I Y +Q + + +PR S L +H++ Q
Sbjct: 256 GGVLVASEDYIVYRNQDHA------------EVRSRIPRRYGSDPNKGVLIISHSSHKQK 303
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
+ L+ ++ GDL +T+ Y G V ++++ + VL + +T + N F S GD
Sbjct: 304 GMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDH 363
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL----RRSSSDALQDMVNGEELSLYG 488
L F S G +EE G + L R S ++++ N E S
Sbjct: 364 TLYFFK--------SIGDEEEEGQAKRLEDKDGHLWFTPRNSCGTKMEELKNLEPTSHLS 415
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGP-------------LKDFSYGLRINADASATGISK 535
S S F V D + P LK +GL + +A
Sbjct: 416 SLS-------PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTAN---- 464
Query: 536 QSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
LPG GIWTV +S NA D+ Y+++S T VL D
Sbjct: 465 --------LPGVPSGIWTVPKSTS--PNA--------IDQTDKYIVVSFVGTTSVLSVGD 506
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ E ES ++ T G +IQVF G R
Sbjct: 507 TIQENHES--GILETTTTLLVKSMGDDAIIQVFPTGFR 542
>gi|315053737|ref|XP_003176243.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
gi|311338089|gb|EFQ97291.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
Length = 1181
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 103/467 (22%), Positives = 189/467 (40%), Gaps = 61/467 (13%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
C G + + Q + ++ S DN +Q+ IPL TP E YPL
Sbjct: 725 QCVEGMVGIQGQNL----RIFSIEKLDNNL-LQEPIPLAYTPRNFVRHPE---YPLFY-- 774
Query: 1024 PVLKPLNQVLS-----LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGG 1076
V+ N VLS L+ + D+ L D + + +++P
Sbjct: 775 -VIGSDNNVLSPSTKAKLLGESTAVNGDSAELPPEDFGYPRGTNHWASCIEVVDPINTKS 833
Query: 1077 PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LF 1133
+ + ++ +E A+++ V+ T++E+ET L +GT G+D+ R
Sbjct: 834 ---VLSKLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRTYTCGFI 883
Query: 1134 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1193
R + + L ++ +++ AL QG LL GP + ++ +L
Sbjct: 884 HIYRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGVGPDLRIYDLGMRQLLRKCQAQ 942
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
P +V L + I++ D+ +S+ ++ +K Q L A D S T ++D T
Sbjct: 943 ITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIISRWTTCTT-MVDYET 1001
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQML 1304
++ D+ NI + K SE G L+ ++ GA V F Q +
Sbjct: 1002 VA--GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVVHFYS-QDI 1058
Query: 1305 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1361
TS +T G R L++ L G++G P D++ F Q+L+ +L
Sbjct: 1059 PTSIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNAP 1112
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+AG + +R +++ K ++D +L + +LP +++ IA
Sbjct: 1113 LAGRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIA 1152
>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
3D7]
gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
3D7]
Length = 1329
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 67/286 (23%), Positives = 113/286 (39%), Gaps = 45/286 (15%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
S G L+ + G K+ ++ +L Y P +VS+ I N I DI +S+
Sbjct: 1064 CFCSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVL 1123
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1266
+ L L++ D +E L + ++ L V +E K +
Sbjct: 1124 IFFYDPNQNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFILRVPEEAKQDEYGIT 1183
Query: 1267 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+Y M+ S K +KL FH+G VT ++++ TSS+
Sbjct: 1184 NKCWYGGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTSSE--------------C 1229
Query: 1325 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+++ T+ G+IG P D ELT Q L+ L P + G FR ++ +
Sbjct: 1230 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKPPLCGREHIFFRSYYHPVQ- 1284
Query: 1380 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
++VD +L + L + Q +IA+ T IL L D+
Sbjct: 1285 ------NVVDGDLCEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324
Score = 48.9 bits (115), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 80/379 (21%), Positives = 147/379 (38%), Gaps = 64/379 (16%)
Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQEL 355
LP D L +P P G GVL+ N + Y + CA Y L+ Q+
Sbjct: 261 LPIDITAHLLIPLPGGQQGPSGVLICCENFLVYKKVDHEDIYCA-----YPRRLEIGQDK 315
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
S + + L+ ++ GDL + V ++ +V+ + + + + I
Sbjct: 316 NISIICWTMHRIKKFFF----ILIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSI 371
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+ + + F+ + G+ QF+ G G A T +L+
Sbjct: 372 SVLKSGSLFVAAEFGNHYFYQFSGIGDDNKQFMCTSNHPLGKNAIIAFKTNKLKNL---Y 428
Query: 475 LQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADAS 529
L D + L + + NT + Q ++ R GP L+ +GL I A
Sbjct: 429 LVDQIYSLSPILDMKIIDAKNTHTPQ-IYTLCGR------GPRSSLRILQHGLSIEELAD 481
Query: 530 ATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
ELPG K IWT+ + EY Y+++S E T+
Sbjct: 482 N------------ELPGKPKYIWTIKKDNL--------------SEYDGYIVVSFEGNTL 515
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
+LE + + EV++++ + T N+ IQV++ G R ++G + + ++ P
Sbjct: 516 ILEIGESVEEVSDTL--LLNNVTTLHINILYDNSFIQVYDTGIRHINGKVVQEWVA--PK 571
Query: 649 NSESGSGSENSTVLSVSIA 667
N + + S NS+ + +S++
Sbjct: 572 NKQIKAASSNSSQIVISLS 590
>gi|383847297|ref|XP_003699291.1| PREDICTED: splicing factor 3B subunit 3-like [Megachile rotundata]
Length = 1217
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+ QG +L+ G + L+ +L P +VS+N + I + D+ +S+Y
Sbjct: 951 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAIVSINAIGQRIYVSDVQESVY 1010
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1277
+ +K Q QL + A D T ++D T++ +D+ NI + A +++
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVD 1067
Query: 1278 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
W QK + A FHVG V + ++ PG ++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116
Query: 1322 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
L++ TL G++G + P ++ F Q L+ + P + G + SFR ++ K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+++D +L + + +Q IA T S++ L D+ +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPAKQKSIAGDLERTPSEVSKKLEDIRTRYAF 1217
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|47230701|emb|CAF99894.1| unnamed protein product [Tetraodon nigroviridis]
Length = 953
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 92/216 (42%), Gaps = 34/216 (15%)
Query: 1175 KIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
++ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 707 QVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPMEGN 761
Query: 1230 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1289
+A+DF A E L D + L ++ N+ + + + Q L F
Sbjct: 762 FEEIARDFNPNWMSAIEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVGVF 818
Query: 1290 HVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1345
H+G V F L LQ L +S T ++LFGT++G IG + L E +
Sbjct: 819 HLGEFVNVFCHGSLVLQNLGETSTPTQG----------SVLFGTVNGMIGLVTSLSEGWY 868
Query: 1346 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
L LQ +L + ++R FH+ K +
Sbjct: 869 SLLLDLQNRLNKVI-------KTTWRSFHTERKTEQ 897
Score = 48.1 bits (113), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 71/341 (20%), Positives = 130/341 (38%), Gaps = 64/341 (18%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 68 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 123
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 124 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 173
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S L +++++ + +
Sbjct: 174 CLTYLDNGVVFVGSRLGDSQLVKVRVTHSLSEL---------NVDSNDQGSFVTVMETFT 224
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
L +V+ + L + F G L+ G+ I+ AS
Sbjct: 225 NLGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS---- 271
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + ++ R E L++S +T VL +
Sbjct: 272 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 309
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQ+ R++
Sbjct: 310 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 349
>gi|336369683|gb|EGN98024.1| hypothetical protein SERLA73DRAFT_109335 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382464|gb|EGO23614.1| hypothetical protein SERLADRAFT_449959 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1257
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 121/269 (44%), Gaps = 26/269 (9%)
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
T + GT E ++GR+L+F + + +T S E++G + A+ S+ G ++
Sbjct: 940 TYICAGTYKYVDEVEPSQGRLLVFD-AEDGSLLREKITMAVSLEVRGCVYAVGSVNGMII 998
Query: 1170 IASGPKIILHKW---TGTEL---NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
A +++++ T+L + I ++ L V +L + IL+GD SI FL
Sbjct: 999 AAINSSVVVYRPEIDASTQLLALHKITEWNHNYL-VTNLVCRGDKILVGDAINSISFLRM 1057
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
E +Q+ LA+D+GSL E L S + + +F +A + E+ + L
Sbjct: 1058 VE--SQIQCLARDYGSLWPVCVEMLDQSSIIG-----ANSDYNLFTFA--LQETELRKSL 1108
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DE 1342
+++G V KF+ + T+ D + P K LF T G IG I + DE
Sbjct: 1109 ERDGSYYIGDMVNKFIPGAL--TAHDVSVDMPLEPKQ-----LFFTSTGCIGVIVDMGDE 1161
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
L+ + +LQ+ + + G+ FR
Sbjct: 1162 LSL-HMTALQRNMSTYLSQTKGVTHTKFR 1189
>gi|159489018|ref|XP_001702494.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
reinhardtii]
gi|158280516|gb|EDP06273.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
reinhardtii]
Length = 1147
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 105/248 (42%), Gaps = 28/248 (11%)
Query: 1145 LVTEVYSKELKGAISALASLQGHLLIAS-GPKIILHKWTGTELNGI-----AFYDAPPLY 1198
LVTE KE+KGA + G ++AS K+ +++W E +G A+ A +
Sbjct: 895 LVTE---KEVKGAAYNVRPFAGDKILASVNNKVTVYRWVVREGSGGPGGCGAYELASECH 951
Query: 1199 ----VVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
V++L + +++GD+ +S+ LS+ + L A D+ S A E L D +
Sbjct: 952 HLGNVLALYLAARGGLVVVGDLMRSVSLLSYNAEQGVLEHRAADYNSGWTTAVEILDDDN 1011
Query: 1253 TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR--LQMLATSSDR 1310
++ +D N+ + + + +L EFH G + + L M S+
Sbjct: 1012 YIA---ADNHCNLYVVRRNADSATDEERARLQVVGEFHTGTFINQMRNGSLVMRLPDSEH 1068
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1370
G P LLF DG +G +A L + LQ + V V GL+ +
Sbjct: 1069 AGLPP--------PLLFAGTDGRLGVVARLPPALYEWATKLQTAMRSVVRGVGGLDHEQW 1120
Query: 1371 RQFHSNGK 1378
R F ++ +
Sbjct: 1121 RAFANDRR 1128
>gi|310796681|gb|EFQ32142.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1163
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 73/322 (22%), Positives = 136/322 (42%), Gaps = 41/322 (12%)
Query: 1114 IGTAYVQGEDVA----ARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALASLQGHL 1168
+GT+Y+ D+ +GR+L+ G ++D NP +V S ELKGA +L + L
Sbjct: 847 VGTSYLADPDIDESGDTKGRILVL--GVDSDKNPYLIV----SHELKGACRSLGVMGEKL 900
Query: 1169 LIASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
+ ++++ + T L +A + P + V +++ N I + D+ +S+ + +
Sbjct: 901 VAGLSKTVVVYDYVEESTTSGALRKLATF-RPSTFPVDIDVHGNMIGIADLMQSLTLVEF 959
Query: 1224 --KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1280
+ G + L+ + +AT ++G S + +D Q N+ + P
Sbjct: 960 VPAQDGNKAKLVERARHFQYIWATSVCHLEGH--SWLEADAQGNLMVLRRNPNAPTEHDR 1017
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
+++ EFH+G V K L + +D P K T++GS+ A +
Sbjct: 1018 KQMEVTGEFHLGEQVNKIRSLDITPNEND-----PIIPKA-----FLATVEGSLYVFADI 1067
Query: 1341 DELTFRRLQSLQKKLVDSVPHV--------AGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
L Q++L + V + +GL+ ++R F N K GP VD EL
Sbjct: 1068 KSEYQSLLIQFQERLAEVVRALGQADGEPGSGLSFTTWRGFR-NAKRAAEGPFRFVDGEL 1126
Query: 1393 LSHYEMLPLEEQLEIAHQTGTT 1414
+ + L +Q + G T
Sbjct: 1127 IERFLDLDEAKQEAVVQGLGPT 1148
>gi|268536658|ref|XP_002633464.1| C. briggsae CBR-DDB-1 protein [Caenorhabditis briggsae]
Length = 1134
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 53/271 (19%), Positives = 115/271 (42%), Gaps = 15/271 (5%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GT + ++ + GR+++F D + + V+ ++G+ AL L G L+ A
Sbjct: 823 VGTGLIYPDESDTKLGRIIVFEVD---DVERTKLRRVHELVVRGSPLALRILNGKLVAAI 879
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
+ L +WT ++ + + + + L ++ + + D+ +S+ LS++
Sbjct: 880 NSSVRLFEWTADKVLRLECSNFNHIVALDLKVMNEEVAVADLMRSVSLLSYRMMEGNFEE 939
Query: 1233 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF-HV 1291
+AKD+ S EF+ S L +++ P + G+ +L + ++
Sbjct: 940 VAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPITDD---GRYVLEPTGYWYL 996
Query: 1292 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1351
G +R ++ D T ++FGT G+IG + +D+ + L S+
Sbjct: 997 GELPKVMVRASLVVQPEDST-------IEYSHPIMFGTNQGTIGMLVQIDDKWKKFLVSI 1049
Query: 1352 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
+K + DSV + + ++R F + P
Sbjct: 1050 EKAISDSVKNCMQIEHSTYRSFIFQKRIEPP 1080
Score = 48.9 bits (115), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 71/137 (51%), Gaps = 19/137 (13%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP+P+GGV+V+GAN+ Y + + + Y+ SL L + F+ +
Sbjct: 210 DSQVLIPVPAPVGGVIVLGANSALYKASDVNGDVV--PYSCSL-----LKNTIFTCHGIV 262
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV-VYDGR---VVQRLDLSKTNPSVLTSDITTIGN 420
DA+ D LL+ G L++L + + +GR V+ + + + + + + N
Sbjct: 263 DAS------GDRFLLADTDGRLLMLLLNIGEGRSGTTVKEMRIEYLGETSVADSVNYVDN 316
Query: 421 SLFFLGSRLGDSLLVQF 437
+ F+GSRLGDS L++
Sbjct: 317 GVVFVGSRLGDSQLIRL 333
>gi|307166104|gb|EFN60356.1| Splicing factor 3B subunit 3 [Camponotus floridanus]
Length = 1201
Score = 53.5 bits (127), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 65/297 (21%), Positives = 122/297 (41%), Gaps = 55/297 (18%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+ QG +L+ G + L+ +L P VVS+N + I + D+ +S+Y
Sbjct: 935 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 994
Query: 1220 FLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+ +K Q QL + A D F + C ++D T++ +D+ NI + A ++
Sbjct: 995 AVRYKRQENQLIVFADDTHPRFITTTC-----VLDYDTVA--TADKYGNIAVIRLATGIN 1047
Query: 1276 ES-----------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1317
+ W QK + A FHVG V + ++ PG
Sbjct: 1048 DDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGG 1097
Query: 1318 DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1374
++ L++ TL G++G + P ++ F Q L+ + P + G + SFR ++
Sbjct: 1098 SES----LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYY 1151
Query: 1375 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
K +++D +L + + +Q I+ T S++ L D+ +F
Sbjct: 1152 YPVK-------NVIDGDLCEQFNSIEPVKQKSISGDLERTPSEVSKKLEDIRTRYAF 1201
>gi|116207186|ref|XP_001229402.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
gi|88183483|gb|EAQ90951.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
Length = 1211
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 99/455 (21%), Positives = 192/455 (42%), Gaps = 47/455 (10%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+QK IPL TP ++ E+ + I S P ++ + L++Q D L D
Sbjct: 786 IQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTLP-PELRAQLLEQSGAVNGDAAILPPED 844
Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1114
+ I D G I + +E A++ VV +++E E+ L +
Sbjct: 845 FGYPRATGRWASCISVVDPLGEEPSVLQRIDFEGNEAAVSAAVVPF---SSQEGESFLIV 901
Query: 1115 GTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
GT G+D+ R FS G R ++ ++L ++ +++ AL QG LL
Sbjct: 902 GT----GKDMVLNPRK--FSEGYIHVYRFHEDGRDL-EFIHKTKVEEPPMALIPFQGRLL 954
Query: 1170 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
G + ++ +L A + P +VSL + I++GD+ + I ++ +K + +
Sbjct: 955 AGIGKTLRVYDLGLRQLLRKAQGEVAPQLIVSLQTQGSRIIVGDVQQGITYVVYKPESNK 1014
Query: 1230 LNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG-----QKL 1283
L A D +++ + T ++D S+ D+ NI I + + S+ Q L
Sbjct: 1015 LLPFADD--TINRWTTCTTMVDYE--SVAGGDKFGNIWILRCSERASQESDEPGSEIQLL 1070
Query: 1284 LSRAEFHVGAH----VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
+R H GA Q L TS +T G L++ + G++G + P
Sbjct: 1071 HARNYLH-GAQSRLSAMAHFYTQDLPTSIVKTNLVVGGQD----VLVWSGIQGTVGVLIP 1125
Query: 1340 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+++ F Q+L+ + P +AG + +R ++ K ++D +L +
Sbjct: 1126 FVSREDVDF--FQNLESHMRAEDPPLAGRDHLIYRGYYVPVKG-------VIDGDLCERF 1176
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+LP +++ IA + + +I ++D+ ++F
Sbjct: 1177 SLLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1211
>gi|325189950|emb|CCA24429.1| splicing factor putative [Albugo laibachii Nc14]
Length = 1644
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 72/315 (22%), Positives = 129/315 (40%), Gaps = 45/315 (14%)
Query: 299 WSAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQS---ASCALALNNYAVSLDSS 352
WS + +P A KL+AVP GGVLV+ I Y +++ SC+ L + +
Sbjct: 660 WSQV-VPRSANKLVAVPGGNDGPGGVLVIAQGLIQYQNENHPPLSCSFPLRSTG-GPNPV 717
Query: 353 QELPRSSFSVELDAAHATWLQNDV--ALLSTKTGDLVLLTVVYDGRVVQRLDLS--KTNP 408
Q+ + + + + + AT Q D+ L+ ++ GDL +++ Y G VQ+L + T P
Sbjct: 718 QDERKQGYPMMI-VSTATHKQRDLFFVLMQSEWGDLFKISLEYAGSSVQKLRIQYFDTIP 776
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
L IT G L F S + L QF + + D E + PS +
Sbjct: 777 VALALCITKTG--LLFAASEFSNHYLFQFLSIGEDDDAAQCVSAAENDQEPEIPSFSVRK 834
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA 528
+ + ++ + ++ E + ++ + N L+ +GL + A
Sbjct: 835 LKNLAMISNIPSISPITQLLVDDFANEQTPQLYALCGQG---NRSSLRILRHGLPVMEMA 891
Query: 529 SATGISKQSNYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
++ LPG K +W + ++ D Y+++S E T
Sbjct: 892 ASA------------LPGVAKAVWCLKE--------------SFTDTCDKYIVVSFEDAT 925
Query: 588 MVLETADLLTEVTES 602
+VLE D + E+T+S
Sbjct: 926 LVLEIGDTVEEITDS 940
Score = 47.0 bits (110), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 66/300 (22%), Positives = 123/300 (41%), Gaps = 54/300 (18%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIA---FYDAPPLYVVSLNIV 1205
V++ + G A+ QG LL++ G + ++ +L ++ +P ++ L
Sbjct: 1369 VHTTPVDGIPYAMIEFQGRLLVSVGKVLRIYDLGKRKLLRKCENRYFTSP---MIDLKSA 1425
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKN 1264
+ I D+H+SI+F+ +K + QL A D + F T L+D T++ D+ N
Sbjct: 1426 GDRIYASDVHESIHFVKYKAEDNQLITFADD--CVPHFMTSSTLLDYDTIA--GGDKFGN 1481
Query: 1265 IQIFYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATS 1307
+ + ++S+ W KL A+FHVG +T L
Sbjct: 1482 VFVTRLPAEVSDEIDNPTGNRMLWDTGLLNGAPHKLEQIAQFHVGEVITSVL-------- 1533
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1364
RT PG + +L+ T+ G IG + P D++ F L+ + + G
Sbjct: 1534 --RTSLVPGGMEV----ILYTTILGRIGALVPFTSRDDVDF--YTHLEMYMRQEKAPLCG 1585
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+ S+R + K ++ D +L + L ++Q IA T ++++ L D
Sbjct: 1586 RDHLSYRSYFIPVK-------NVTDGDLCEQFSSLGPDKQKNIAEDLDRTPTEVVKKLQD 1638
>gi|331221690|ref|XP_003323519.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|309302509|gb|EFP79100.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1213
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 134/302 (44%), Gaps = 43/302 (14%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL----NGIAFYDAPPLYVVSLNI 1204
V+ E+ SAL QG L G + ++ +L +F A ++SL++
Sbjct: 936 VHKTEVDEMPSALVGFQGRLAAGVGKALRIYDLGKKKLLRKVENKSFGSA----IISLSV 991
Query: 1205 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF-GSLDCFATEFLIDGSTLS-------L 1256
+ IL+GD S+ + +K +L + A D AT ++D T++ L
Sbjct: 992 QGSRILVGDSQDSVSYAVYKPAENRLIVFADDVVPRWTTCAT--MVDYDTVAGGDRFGNL 1049
Query: 1257 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTG 1312
VS KN+ + ++ E G ++ + +GA H K L L + TS +T
Sbjct: 1050 WVSRLPKNV-----SDEVDEDPTGAGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTS 1104
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
PG R LL+ + GSIG + P +++ F Q+L+ + + +P + G + +
Sbjct: 1105 LVPG----GREVLLYTGVQGSIGILVPFISKEDVDF--FQTLEMHMRNEMPSLVGRDHLA 1158
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1429
+R ++ K + VD +L + +LP +QL++A + + S +L + + + +
Sbjct: 1159 YRGYYFPVK-------NCVDGDLCESFALLPSAKQLQVASELDRSVSDVLKKIEAVRVSS 1211
Query: 1430 SF 1431
+
Sbjct: 1212 GY 1213
>gi|307205956|gb|EFN84082.1| Splicing factor 3B subunit 3 [Harpegnathos saltator]
Length = 1217
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+ QG +L+ G + L+ +L P VVS+N + I + D+ +S+Y
Sbjct: 951 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1277
+ +K Q QL + A D T ++D T++ +D+ NI + A +++
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067
Query: 1278 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
W QK + A FHVG V + ++ PG ++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116
Query: 1322 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
L++ TL G++G + P ++ F Q L+ + P + G + SFR ++ K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+++D +L + + +Q I+ T S++ L D+ +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPAKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217
>gi|340960602|gb|EGS21783.1| hypothetical protein CTHT_0036510 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1100
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 133/308 (43%), Gaps = 48/308 (15%)
Query: 1114 IGTAYVQGEDVA---------ARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALAS 1163
+GT+Y+ D + ARGR+L+ G ++D NP ++ S +LKGA LA
Sbjct: 783 VGTSYLPDPDYSPAPSHGNPEARGRILVL--GIDSDRNPY----QILSYQLKGACRCLAV 836
Query: 1164 LQ-GHLLIASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
+ G +++ + + ++ T +L +A Y P Y V + I I + D+ KS
Sbjct: 837 MDDGKVVVGLTKAVTVCEYKETSSTTAQLTKLASY-RPSTYPVEIAIHGRTIAVADLMKS 895
Query: 1218 IYFLSW--KEQGAQLNLL--AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
I + + E+G Q L+ A+ + S A ++ +G L +D Q N+Q+
Sbjct: 896 ISLVDYIPAEEGGQAKLVERARHYQSAWSTAVGYVQNGLWLE---ADAQGNLQVLRQNVD 952
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
+++ AE ++G V + + + +S P + GT++G
Sbjct: 953 GITEDDRKRMELTAEINLGEMVNRIRSITV--ETSPEALIIPRA--------FLGTVEGG 1002
Query: 1334 I---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP--GPDSIV 1388
I G IAP L Q+K+ D + V + +FR + + A R GP +
Sbjct: 1003 IYMFGTIAP---HALDLLLRFQEKVADVIKAVGDSDNANFRSYRAFKNAERVGHGPFRFL 1059
Query: 1389 DCELLSHY 1396
D ELL +
Sbjct: 1060 DGELLERF 1067
>gi|405117821|gb|AFR92596.1| pre-mRNA-splicing factor RSE1 [Cryptococcus neoformans var. grubii
H99]
Length = 1217
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/285 (22%), Positives = 120/285 (42%), Gaps = 32/285 (11%)
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
LA QG LL G + L++ L + P VV++N+ I++GD+ +S ++
Sbjct: 951 LAGFQGFLLAGVGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010
Query: 1221 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
++ QL + A D + C + +D T++ D+ NI I P +SE
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065
Query: 1278 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
G +L F +GA H T+ + + + TS + G R L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121
Query: 1330 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
+ G++G + P D++ F + + D P G + ++R ++ K
Sbjct: 1122 ISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISP--VGRDHIAYRGYYVPIKG------- 1172
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+VD +L + +LP +Q IA + +L L + ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIATDLDRSVGDVLKKLEQMRTSSAF 1217
>gi|332026090|gb|EGI66238.1| Splicing factor 3B subunit 3 [Acromyrmex echinatior]
Length = 1217
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+ QG +L+ G + L+ +L P VVS+N + I + D+ +S+Y
Sbjct: 951 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1277
+ +K Q QL + A D T ++D T++ +D+ NI + A +++
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067
Query: 1278 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
W QK + A FHVG V + ++ PG ++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116
Query: 1322 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
L++ TL G++G + P ++ F Q L+ + P + G + SFR ++ K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+++D +L + + +Q I+ T S++ L D+ +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217
>gi|322797581|gb|EFZ19622.1| hypothetical protein SINV_00421 [Solenopsis invicta]
Length = 1217
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+ QG +L+ G + L+ +L P VVS+N + I + D+ +S+Y
Sbjct: 951 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1277
+ +K Q QL + A D T ++D T++ +D+ NI + A +++
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067
Query: 1278 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
W QK + A FHVG V + ++ PG ++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116
Query: 1322 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
L++ TL G++G + P ++ F Q L+ + P + G + SFR ++ K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+++D +L + + +Q I+ T S++ L D+ +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217
>gi|157872916|ref|XP_001684981.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
gi|68128051|emb|CAJ06910.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
Length = 1347
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 126/311 (40%), Gaps = 43/311 (13%)
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1161
+ +E + LL IG+++ ++ AR + + R Q L + SK++ GA+
Sbjct: 913 GVSEEEWQHLLLIGSSFTFPDEQRARSGRITWCALREEHQQQRL-HLIASKDIGGALQCC 971
Query: 1162 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1209
A++ +G + + + L+KW + G+ PLY SL + +
Sbjct: 972 AAVPHYKGRIALGVNGCVCLYKWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVL 1029
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDF---GSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
+ D+ S +F+ L +L +D G +D I +L + D+ N
Sbjct: 1030 VALDVRHSAFFIEVDTLQGSLKVLCRDAELRGVMDGH-----IGSDAENLCLFDDSLNFT 1084
Query: 1267 IFYYAPKMSE----------SWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1315
P E S Q + RA+ H+G VT +R A +S A
Sbjct: 1085 ALRVVPLPVEAGDGDAAAAASVTAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAS 1143
Query: 1316 GSDKTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1367
+ NR L+F T G G + P+ T+ L++L+ LV ++ + GL+
Sbjct: 1144 CASAQNRLLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRTLEASLVRTLQPLGGLSH 1203
Query: 1368 RSFRQFHSNGK 1378
++FR+ +G+
Sbjct: 1204 QAFREVLRSGQ 1214
>gi|221508103|gb|EEE33690.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 1878
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 67/292 (22%), Positives = 119/292 (40%), Gaps = 59/292 (20%)
Query: 1112 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1153
LA G E+V GRV LF G D P E+++
Sbjct: 1472 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 1531
Query: 1154 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1207
L G ++ + S + +++ + GP++ +H+ G++ AF DA + V S+ ++N
Sbjct: 1532 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 1590
Query: 1208 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1261
F LLGD K + +SW+ ++ +++ F +L A FL + L ++ +D
Sbjct: 1591 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 1650
Query: 1262 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
N+++F Y + + F +L +L ++ AA K
Sbjct: 1651 DGNVRLFCY-------------------NADKNSPGFEKLDILQCDAEDRCAAGCVVKLQ 1691
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRR--------LQSLQKKLVDSVPHVAGL 1365
+F + T+ S+G A L FR LQ+LQ ++ +P GL
Sbjct: 1692 QFVVDSETV-ASLGEAADSSTLVFRLLASDSYSFLQTLQDRMAQYLPEPLGL 1742
>gi|401426063|ref|XP_003877516.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
gi|322493761|emb|CBZ29051.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 1347
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/308 (22%), Positives = 120/308 (38%), Gaps = 37/308 (12%)
Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1161
+ +E + LL IG+++ ++ AR + + R Q L + SK++ GA+
Sbjct: 913 GVSKEEWQQLLLIGSSFTFPDEQRARSGRITWCALREEHQRQRL-HLIASKDIGGALQCC 971
Query: 1162 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1209
A++ +G + + + L+ W + G+ PLY SL + +
Sbjct: 972 AAVPHYKGRIALGVNGCVCLYNWNTEDQTFVAEERCRIGLTVTKLIPLYDTSL--AASVL 1029
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
+ D+ S +F+ L +L +D D L L D+ N
Sbjct: 1030 VALDVRHSAFFIEVDTLQGSLKVLCRDADLRGVMDGHIGSDAENLCLF--DDSLNFTAMK 1087
Query: 1270 YAPKMSE----------SWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
P E S Q + RA+ H+G VT +R A +S A S
Sbjct: 1088 VVPLPVEAGDGDAAAAGSVTAQYRFEVRAQCHLGDLVT-CVRPGCFAATSLMEAPAACSL 1146
Query: 1319 KTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1370
NR L+F T G G + P+ T+ L++L+ LV ++ V GL+ ++F
Sbjct: 1147 SRNRLLLPGIAGPQLVFATAHGGFGVVTPVQAATYLVLRALEASLVRTLQPVGGLSHQAF 1206
Query: 1371 RQFHSNGK 1378
R+ G+
Sbjct: 1207 REVLRAGQ 1214
>gi|237833631|ref|XP_002366113.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
gi|211963777|gb|EEA98972.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
Length = 2804
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 121/286 (42%), Gaps = 47/286 (16%)
Query: 1112 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1153
LA G E+V GRV LF G D P E+++
Sbjct: 2398 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 2457
Query: 1154 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1207
L G ++ + S + +++ + GP++ +H+ G++ AF DA + V S+ ++N
Sbjct: 2458 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 2516
Query: 1208 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1261
F LLGD K + +SW+ ++ +++ F +L A FL + L ++ +D
Sbjct: 2517 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 2576
Query: 1262 QKNIQIFYY-APKMSESWKGQKLLS-RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1319
N+++F Y A K S ++ +L AE A ++LQ S+ + +
Sbjct: 2577 DGNVRLFCYNADKNSPGFEKLDILQCDAEDRCAAGCV--VKLQQFVVDSETVASL--GEA 2632
Query: 1320 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
+ L+F L D +F LQ+LQ ++ +P GL
Sbjct: 2633 ADSSTLVFRLLAS--------DSYSF--LQTLQDRMAQYLPEPLGL 2668
>gi|221486318|gb|EEE24579.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 2804
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 121/286 (42%), Gaps = 47/286 (16%)
Query: 1112 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1153
LA G E+V GRV LF G D P E+++
Sbjct: 2398 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 2457
Query: 1154 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1207
L G ++ + S + +++ + GP++ +H+ G++ AF DA + V S+ ++N
Sbjct: 2458 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 2516
Query: 1208 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1261
F LLGD K + +SW+ ++ +++ F +L A FL + L ++ +D
Sbjct: 2517 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 2576
Query: 1262 QKNIQIFYY-APKMSESWKGQKLLS-RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1319
N+++F Y A K S ++ +L AE A ++LQ S+ + +
Sbjct: 2577 DGNVRLFCYNADKNSPGFEKLDILQCDAEDRCAAGCV--VKLQQFVVDSETVASL--GEA 2632
Query: 1320 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1365
+ L+F L D +F LQ+LQ ++ +P GL
Sbjct: 2633 ADSSTLVFRLLAS--------DSYSF--LQTLQDRMAQYLPEPLGL 2668
>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
Length = 1286
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 127/604 (21%), Positives = 225/604 (37%), Gaps = 114/604 (18%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++ +L+F +
Sbjct: 41 LLRADKQGKLNLIASKDVFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF I +S N D K V + + G
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPIFASIEQNYEMYD-KQVTNTNEIDGCTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L E +L ++ +++H LP D+ L +P P
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDSSAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE + S+ A H
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSALHRI 295
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + ++ +V+ + + + + I + + F+ + G
Sbjct: 296 R-KFFFILIQSEFGDLYKIEMDHEDGIVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
+ QF+ G + K G A TK+L ++ L D V L +
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411
Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
+ N S Q +L GP L+ +GL I A EL
Sbjct: 412 ILDAKNANSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN------------EL 452
Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG K IWT+ + NA +Y Y+I+S E T++LE + + EV +S+
Sbjct: 453 PGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDSL 498
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ T N+ +IQV + G R ++G + + + P N + + + NST +
Sbjct: 499 --LLTNVTTIHVNILYDNTLIQVHDTGIRHINGKVVHEWVP--PKNKQIKAATSNSTQIV 554
Query: 664 VSIA 667
+S++
Sbjct: 555 ISLS 558
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 111/286 (38%), Gaps = 45/286 (15%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + G K+ ++ +L Y P ++S+ + + I DI +S+
Sbjct: 1021 CFCPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1080
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1266
+ L L++ D +E L + ++ L V +E K +
Sbjct: 1081 IFFYDSNMNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1140
Query: 1267 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+Y M+ S K ++L FHVG VT ++++ TSS+
Sbjct: 1141 NKCWYGGEIMAGSNKNRRLEHIMSFHVGEIVTSLQKVKLSPTSSE--------------C 1186
Query: 1325 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+++ T+ G+IG P D ELT Q L+ L P + G FR ++ +
Sbjct: 1187 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1241
Query: 1380 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
++D +L + LP + Q ++A T IL L D+
Sbjct: 1242 ------HVIDGDLCEQFSSLPYDVQRKVAADLERTPDDILRKLEDI 1281
>gi|119173562|ref|XP_001239205.1| hypothetical protein CIMG_10227 [Coccidioides immitis RS]
Length = 1208
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 75/344 (21%), Positives = 152/344 (44%), Gaps = 48/344 (13%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1137
I ++ +E A++V V +++++ET L +GT G+D+ ++ G + ++
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQE 924
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1197
+ + ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 925 DGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1255
+V L + I++ D+ +S+ ++ +K Q +L A D + C A ++D T++
Sbjct: 981 LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037
Query: 1256 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1307
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1364
+T G R L++ L G++G + P +++ F QSL+ +L P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+ +R +++ K +D +L Y LP +++L IA
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIA 1186
>gi|225560964|gb|EEH09245.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus G186AR]
Length = 1209
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 97/456 (21%), Positives = 192/456 (42%), Gaps = 62/456 (13%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+Q+ IPL+ TP +F + YPL V++ N +LS ++ + D N +
Sbjct: 785 LQETIPLQYTPR---HFIKHPEYPLFY---VIEAENNILSPGTRTKLLNDSDAVNGDTTP 838
Query: 1055 LHRTYTVEEYE-----------VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNT 1103
L EE+ ++I++P + + + I ++ +E A++V V
Sbjct: 839 L----PPEEFGYPRGTGHWASCIQIVDPINSK---RVISQIELEENEAAVSVAAVPF--- 888
Query: 1104 TTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISA 1160
+++++ET L +GT G+D+ R R + + L ++ +++ A
Sbjct: 889 SSQDDETFLVVGT----GKDMVVNPRSCTAGFIHIYRFQEEGKEL-EFIHKTKVEQPPMA 943
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
L QG LL G + ++ ++ P VV L + I++ D+ +S+ +
Sbjct: 944 LLGFQGRLLAGIGTDLRIYDLGMKQMLRKCQASVVPHLVVGLQTQGSRIIVSDVQESLTY 1003
Query: 1221 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW-- 1278
+ +K Q +L D S T ++D T++ D+ N+ + K SE
Sbjct: 1004 VVYKYQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGDKFGNLWLLRCPAKASEEADE 1060
Query: 1279 --KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
G L+ ++ GA ++ Q L TS + G R L++ L G
Sbjct: 1061 DGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTG----GRDILVWTGLQG 1116
Query: 1333 SIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
++ + P +E+ F QSL+ +L P +AG + +R +++ K +D
Sbjct: 1117 TVSMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLIYRSYYAPAKG-------TID 1167
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+L Y +LP +++ +IA + + +I + D+
Sbjct: 1168 GDLCETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203
>gi|302406266|ref|XP_003000969.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
gi|261360227|gb|EEY22655.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
Length = 1059
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 129/574 (22%), Positives = 214/574 (37%), Gaps = 133/574 (23%)
Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
V + + G + S+A G++ +D +ILA + +I+++E+ + + + + F
Sbjct: 59 VLSHDVFGIIRSMAAFRIAGSN----KDYLILATDSGRIAIIEYLPAQNRFQRLHLETFG 114
Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-----MIILKASQGGSGLVGDED 218
K G G + DP+GR L+ L+ ++ + SQ E
Sbjct: 115 -------KSGIRRVVPGEFLACDPKGRA--CLIASLEKNKLVYVLNRNSQA-------EL 158
Query: 219 TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
T S A HV+++ LD+ GY PV L E + T A +
Sbjct: 159 TISSP--LEAHKPGVHVLSMVALDV----------GYANPVFAAL-ETDYTEADQDPTGQ 205
Query: 279 HTCMISALSISTTLKQHPL------IWSAMNLPHDAYKLLAVPSPIG-----GVLVVGAN 327
+AL + T L + L + + P D L P G GVLV G
Sbjct: 206 -----AALDVETQLVYYELDLGLNHVVRKWSEPVDNTASLLFQVPGGNDGPSGVLVCGEE 260
Query: 328 TIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTK 382
I Y HS + + + + E P S+ H L+ LL T+
Sbjct: 261 NITYRHSNQEAFRVPVPRRR----GATEDPSRKRSIVAGVMHK--LKGSAGAFFFLLQTE 314
Query: 383 TGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
GDL +T+ DG V+RL + + + S + + + ++ S+ G+
Sbjct: 315 DGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILKSGFLYVASQFGNYQFY 374
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
QF E+ GD + L SS D D E +
Sbjct: 375 QF--------------EKLGD------DDEELEFSSDDFPTDPKQSYEAVFF-------- 406
Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV--------- 542
++ + A+ +S+ ++ PL D DA +A G +S + ++
Sbjct: 407 HPRELENLALVESIDSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLEVNEI 466
Query: 543 ---ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
ELPG +WT+ K SRG D+Y AY+++S T+VL + + E
Sbjct: 467 VASELPGIPSAVWTL--KLSRG------------DQYDAYIVLSFTNATLVLSIGETVEE 512
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
V +S F+ A L G +IQV +G R
Sbjct: 513 VNDS--GFLTSVPTLAAQLLGGEGLIQVHPKGIR 544
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 62/268 (23%), Positives = 115/268 (42%), Gaps = 25/268 (9%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
C G + + Q I I +L T QK IPL TP ++ E ++ I
Sbjct: 761 QCEEGVVGIQGQSLRIFAIEKLSDTLT-------QKSIPLTYTPRRMVKHPEHPMFYTIE 813
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
S P LL D V + D L V+ + I D QT
Sbjct: 814 SDNNTLPPELRAQLLADPSVVNG-DARTLPPVEFGYPRAKGRWASCISVIDPLSEELQTL 872
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG----- 1136
T+ + ++E A++ +V T+++NE+ L +GT G+D+ R F+ G
Sbjct: 873 QTVDLDNNEAAVSAAIVPF---TSQDNESFLVVGT----GKDMIVNPR--QFTEGYIHIY 923
Query: 1137 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1196
R +++ + L ++ +++ +AL + QG L+ G + ++ ++ A D P
Sbjct: 924 RFSEDGREL-EFIHKTKVEEPPTALLAFQGRLVAGVGKTLRIYDLGQKQMLRKAQADVAP 982
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
+VSL+ + I++GD+ + + ++ +K
Sbjct: 983 QLIVSLSTQGSRIVVGDVQQGVTYVVYK 1010
>gi|392578232|gb|EIW71360.1| hypothetical protein TREMEDRAFT_71141 [Tremella mesenterica DSM 1558]
Length = 1250
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 130/309 (42%), Gaps = 46/309 (14%)
Query: 1085 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------------AARGRVLL 1132
P++S E T+ VTL ++ LLA+G A +D A GR++L
Sbjct: 894 PLESREIVTTIAYVTL------GDQNLLAVGIATFSEDDEDLPDDLDMVTISAQSGRLVL 947
Query: 1133 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-------WTGTE 1185
+ + D+ + + E+ S L+ A++ + ++ L +A+G + ++K T
Sbjct: 948 YEPVVDQDSAEPNLIELTSVGLESAVNDIKVIKNLLAVATGSNVTIYKHEKASHLLIPTS 1007
Query: 1186 LNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
AF A L V + + + +++GD +SI+ L E + +D +
Sbjct: 1008 RFASAFV-AKSLVVAPPDKLHPEERLVVGDGMRSIFVLDIDEGTGMIMGDERDMATHSVM 1066
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1303
A E L DG +++V+D NI F +++ + A F + ++ F R +
Sbjct: 1067 AMEGLRDGGQ-AVIVADAHSNISTFRLR---------EEIETAATFGLHEDISVFRRGSL 1116
Query: 1304 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1363
SS +P ++F T+DG +G + L R L LQ+ + +
Sbjct: 1117 APASSAEDVLSP--------EIIFATIDGRLGIVGELTPSAARTLDDLQRNMDRYIRGPG 1168
Query: 1364 GLNPRSFRQ 1372
+ RS+R+
Sbjct: 1169 DIAWRSYRR 1177
>gi|321249291|ref|XP_003191408.1| U2 snRNA binding protein [Cryptococcus gattii WM276]
gi|317457875|gb|ADV19621.1| U2 snRNA binding protein, putative [Cryptococcus gattii WM276]
Length = 1217
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 117/548 (21%), Positives = 203/548 (37%), Gaps = 83/548 (15%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
N+ G SR +++ L+ P + D A++ L C G I + S L
Sbjct: 718 NVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWS-LSAAMCPDGLIGI-SGNTL 775
Query: 979 KICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
+I +P G Q PL TP + + P N V ++
Sbjct: 776 RIFSIPKLGEKLK-----QDSTPLTYTPRKF----------------ISHPFNPVFYMI- 813
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
+ D+ S + R +E E R ++ P + A VRV
Sbjct: 814 ------EADHRTYSKGAIERIVKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRV 867
Query: 1098 V------TLFNTTTKENETLLAIGTAYVQ---GEDVAARG---RVLLFSTGRN------- 1138
+ T+ E+E +I AY + GE G + L G
Sbjct: 868 LDPLANETIMTFDLDEDEAAFSIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVY 927
Query: 1139 ADNPQNLVTEVYSKELKGAIS-ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1197
A Q V E K I LA QG LL G + L++ L + P
Sbjct: 928 AIKEQGRVLEFLHKTKTDDIPLCLAGFQGFLLAGVGKSLRLYEMGKKALLRKCENNGFPT 987
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTL 1254
VV++N+ I++GD+ +S ++ ++ QL + A D + C + +D T+
Sbjct: 988 AVVTINVQGARIIVGDMQESTFYCVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETV 1044
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL---RLQMLAT 1306
+ D+ NI I P +SE G +L F +GA H T+ + + + T
Sbjct: 1045 A--CGDKFGNIFINRLDPSISEKVDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVVT 1102
Query: 1307 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1363
S + G R L++ T+ G++G + P D++ F + +L+ + +
Sbjct: 1103 SITKIPLVAG----GRDVLVYTTISGAVGALVPFVSPDDIEF--MSTLEMHMRTQDISLV 1156
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
G + ++R ++ K +VD +L + +LP +Q IA + +L L
Sbjct: 1157 GRDHIAYRGYYVPIKG-------VVDGDLCESFSLLPYPKQQAIASDLDRSVGDVLKKLE 1209
Query: 1424 DLALGTSF 1431
+ ++F
Sbjct: 1210 QMRTSSAF 1217
>gi|429859776|gb|ELA34542.1| pre-mRNA-splicing factor rse1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1212
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 78/365 (21%), Positives = 159/365 (43%), Gaps = 42/365 (11%)
Query: 1083 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----R 1137
T+ + +E A++ +V+ +++NE L +GT G+D+ R F+ G R
Sbjct: 874 TVDLDDNEAAVSAAIVSF---ASQDNENFLIVGT----GKDMIVNPR--QFTEGYIHVYR 924
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1197
++ L ++ +++ +AL + QG LL G + ++ ++ + D P
Sbjct: 925 FGEDGHEL-EFIHKTKVEEPPTALLAFQGRLLAGVGKTLRIYDLGLRQMLRKSQADVAPQ 983
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
+VSL+ + I++GD+ + ++ +K +L D + T ++D S+
Sbjct: 984 LIVSLSTQGSRIVVGDVQHGVTYVVYKPTTNKLIPFVDDTIARWVTCTT-MVDYE--SVA 1040
Query: 1258 VSDEQKNIQIFYYAPKMS----ESWKGQKLL-SRAEFHVGAHVTKFLR---LQMLATSSD 1309
D+ N+ + K S E G LL +R H H L Q + TS
Sbjct: 1041 GGDKFGNMFLVRCPEKASQEADEEQAGLHLLNTRDYLHGAPHRLNLLSHSYTQDVPTSIT 1100
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1366
+T G LL+ ++G+IG P +++ F Q+L++ + +AG +
Sbjct: 1101 KTSLVVGGQDV----LLWSGINGTIGVFIPFVTREDVDF--FQNLEQHMRTEDAPLAGRD 1154
Query: 1367 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
+R ++ K ++D +L Y +LP +++L IA + + +I ++D+
Sbjct: 1155 HLMYRGYYVPVKG-------VIDGDLCERYTLLPNDKKLMIAGELDRSVREIERKISDIR 1207
Query: 1427 LGTSF 1431
++F
Sbjct: 1208 TRSAF 1212
Score = 44.3 bits (103), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 138/607 (22%), Positives = 226/607 (37%), Gaps = 137/607 (22%)
Query: 74 QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
Q G+KE + R+ + D + L+ H + G + S+A G++ +
Sbjct: 27 QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
D +ILA + +I+++E+ I + + F+ LHL K G G + D
Sbjct: 82 DYLILATDSGRITIVEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130
Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
P+GR V L ++ + +Q E T S A V+++ LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNAQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
GY PV L E E T A G + + T ++ + L
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229
Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLD 350
WS P D L P G GVLV G I Y HS + + + A
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL--VLLTVVYDGR-----VVQRLDL 403
S + S +L + + LL T+ GDL ++ +V D V+RL +
Sbjct: 287 SRKRHIVSGVMHKLKGSAGAFF----FLLQTEDGDLFKAVIDMVEDADGNPTGEVKRLKI 342
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
+ ++S + + + + S+ G+ QF E+ GD + +
Sbjct: 343 KYFDTVPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDEEK-- 386
Query: 464 TKRLRRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
SS D D G + + Y N A+ +S+ ++ PL D
Sbjct: 387 ----EFSSDDFPADPKAGYDAVYFYPRPLEN---------LALVESIDSMNPLLDCKVAN 433
Query: 523 RINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADS 565
DA +A G +S + ++ ELPG +WT+ K SRG
Sbjct: 434 LTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLSRG----- 486
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
D+Y AY+++S T+VL + + EV++S F+ A L G +IQ
Sbjct: 487 -------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQ 537
Query: 626 VFERGAR 632
V +G R
Sbjct: 538 VHPKGIR 544
>gi|402595041|gb|EJW88967.1| hypothetical protein WUBG_00126 [Wuchereria bancrofti]
Length = 621
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 139/361 (38%), Gaps = 100/361 (27%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A +++VP P+GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEANIVISVPEPVGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLT 412
+ +D +L D+A G+L +L L+L K N +V+
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLL----------LELGKDQEQDENSAVIV 300
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTC--GSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
D+ LG++ + + C +G + S FGD +L R
Sbjct: 301 RDMKV---------ESLGETCIAECMCYLDNGVCFIGS----RFGD--------SQLIRL 339
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-- 528
S++ A T ++ DS N+ P++D + +R N
Sbjct: 340 STEP---------------------RADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQI 377
Query: 529 -SATGISKQSNY----------EL--VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
+ +G K EL VEL G K ++T+ +RG DE+
Sbjct: 378 LTCSGAYKDGTIRIIRNGIGIEELASVELKGIKNMFTL---RTRG------------DEF 422
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
YLI+S ++ T VL E TE + V G T+ AG LF + ++QV ++D
Sbjct: 423 DDYLILSFDSETHVLFINGEELEDTEITGFAVDGATLWAGCLFHSKTILQVTHGEVILID 482
Query: 636 G 636
G
Sbjct: 483 G 483
>gi|255946770|ref|XP_002564152.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591169|emb|CAP97395.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1209
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 98/461 (21%), Positives = 185/461 (40%), Gaps = 72/461 (15%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
+Q+ IPL TP + E+ L+ +I S VL P + LID + +L
Sbjct: 785 LQESIPLSYTPRRFVKHPEQPLFYVIESDNNVLSPATR--QRLIDDSQAQNGEATDLPPA 842
Query: 1054 DLHRTYTVEEYE--VRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTLFNTTTKEN 1108
D + ++++EP T++ I ++ +E A+++ V+ +++++
Sbjct: 843 DFGYPRATGHWASCIQVVEP------ITTKSVIFNLDLEDNEAAVSLAAVSF---SSQDD 893
Query: 1109 ETLLAIGTA-------------YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1155
ET L +GTA ++ GR L F D P
Sbjct: 894 ETFLVVGTAKDMTVSPPSSSCGFIHIYRFQEDGRELEFIHKTQVDEPP------------ 941
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
AL QG LL GP + ++ +L P +V L + I++ D+
Sbjct: 942 ---LALLGFQGRLLAGIGPILRVYDLGMKQLLRKCQAPVVPKTIVGLQTQGSRIIVSDVR 998
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+S+ ++ +K Q L A D + +T ++D T + D+ N+ + K+S
Sbjct: 999 ESVTYVVYKYQDNVLIPFADDSIARWTSSTT-MVDYETTA--GGDKFGNLWLVRCPSKVS 1055
Query: 1276 ESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLF 1327
E G L+ + H H + + Q + TS +T G R +++
Sbjct: 1056 EQADEDGSGAHLIHEKGYLHGTPHRLELMVHFYAQDIPTSLHKTQLVAG----GRDIVVW 1111
Query: 1328 GTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
G+IG P +++ F Q L+ +L P +AG + +R +++ K
Sbjct: 1112 TGFQGTIGMFVPFASREDVDF--FQLLETQLASQQPPLAGRDHLMYRGYYAPVKG----- 1164
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
++D +L Y +LP + +L IA + + +I ++D+
Sbjct: 1165 --VIDGDLCEMYLLLPNDTKLMIAGELDRSVREIERKISDM 1203
>gi|146420838|ref|XP_001486372.1| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
6260]
Length = 1206
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 120/610 (19%), Positives = 237/610 (38%), Gaps = 96/610 (15%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
+ + ++ +CH ++ G ++++ + +GG++ D +++ + ++S+LEFD
Sbjct: 53 LESGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD----- 103
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
+ F + H K G G + VDPQ R + ++ KA + L
Sbjct: 104 --ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL-- 159
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ +++ I + LD GY P++ + E +A +
Sbjct: 160 -----ELLSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHYDA 201
Query: 276 WKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
+ S L++ + L + A + +P + L+ +P+PIGGV+V G++ I
Sbjct: 202 SLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSFIF 261
Query: 331 YHSQSASCALALNNYAVSLDS-SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
YH+ + L L + L + S +P ++V H N LL + GD +
Sbjct: 262 YHNPTIDQQLYL---PIPLRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFRV 313
Query: 390 TVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSML 446
+ Y D V L + + ++ I F D +L Q G S +
Sbjct: 314 LIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSYI 373
Query: 447 SSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
SS D+ + + + R + AL +++ G+ +ES + +
Sbjct: 374 SSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMIT 433
Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADS 565
S LK ++G+ I+ LV P +V+ +
Sbjct: 434 GHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF----------T 464
Query: 566 SRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+R++A + + Y++IS A +T+VL +++ EV +S FV + G + +
Sbjct: 465 TRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKSL 521
Query: 624 IQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
IQ++ G R + +G + + P T++S S VL+G+S+
Sbjct: 522 IQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSNR 573
Query: 679 SIRLLVGDPS 688
+ DP+
Sbjct: 574 ELCYFEIDPT 583
>gi|357606250|gb|EHJ64976.1| putative Splicing factor 3B subunit [Danaus plexippus]
Length = 1216
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 82/392 (20%), Positives = 156/392 (39%), Gaps = 60/392 (15%)
Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV 1124
++RIL+ G T +P++ +E A+++ VV T ++ + +
Sbjct: 860 QIRILDMSGGVGGCSTVCLLPLEQNEAAVSLCVVRWAALTDNTPHLVVGVAKDALLSPRS 919
Query: 1125 AARGRV---LLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1181
+ G + +++TG+ LV + E G ALA+ G LL G + L+
Sbjct: 920 CSEGSLHVYKIYNTGK-----LELVHKTPIDEYPG---ALAAFNGKLLAGVGRMLRLYDI 971
Query: 1182 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
+L P + + ++ I + D+ +S++ + +K++ QL + A D
Sbjct: 972 GRRKLLRKCENRHIPNLIADIKTIRQRIFVSDVQESVFCVKYKKRENQLIIFADDTNP-R 1030
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWKGQK------LLSRA---- 1287
++D T+++ +D+ N+ + +S E G K LL+ A
Sbjct: 1031 WITNTCILDYDTVAM--ADKFGNVAVLRLPQSVSDDVDEDPTGNKALWDRGLLNGASQKG 1088
Query: 1288 ----EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FHVG VT R ++ PG + ALL+ T+ G++G P
Sbjct: 1089 DITVNFHVGETVTSLQRATLI----------PGGSE----ALLYATVSGALGVFLP---F 1131
Query: 1344 TFRR----LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
T R Q L+ + + G + SFR ++ K +++D +L + L
Sbjct: 1132 TSREDHDFFQHLEMHMRSENSPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQFNSL 1184
Query: 1400 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+Q IA T +++ L D+ +F
Sbjct: 1185 EPAKQKAIAGDLERTPAEVSKKLEDIRTRYAF 1216
>gi|440636768|gb|ELR06687.1| pre-mRNA-splicing factor rse1 [Geomyces destructans 20631-21]
Length = 1212
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 98/463 (21%), Positives = 193/463 (41%), Gaps = 64/463 (13%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+Q IPL TP + E + +I S N +LS Q++ N S
Sbjct: 788 LQHSIPLSYTPRRFVKHPEHPYFYVIQSD------NNILSKSTKQKLLEDPSLQNGDSAV 841
Query: 1055 LHRTYTVEEYE-----------VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNT 1103
L EE+ ++I++P R Q I ++ +E A+++ V+
Sbjct: 842 L----PAEEFGYPRGRGHWASCIQIVDPIREKKVLQQ---IDLEDNEAAVSMATVSF--- 891
Query: 1104 TTKENETLLAIGTAYVQGEDVAARGRV----LLFSTGRNADNPQNLVTEVYSKELKGAIS 1159
++E+E L +GT G+D+ A R + + D + + ++ +++
Sbjct: 892 ASQEDEVFLVVGT----GKDMVASPRSSSGGFIHVYRFHEDGKE--IEFIHKTKVEEPPL 945
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
AL QG LL+ G ++ ++ +L A + +V L + I++ D+ +SI
Sbjct: 946 ALLGFQGRLLVGIGRELRIYDLGMRQLLRKAQTEIAASLIVGLQTQGSRIIVSDVQESIT 1005
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW- 1278
F+ +K Q +L A D + T ++D T++ D+ N+ + K SE
Sbjct: 1006 FVVYKFQENKLIPFADDTIARWTTCTT-MVDYETVA--GGDKFGNLWLLRCPTKASEEAD 1062
Query: 1279 ---KGQKLLSRAEFHVGA-HVTKFLRLQM---LATSSDRTGAAPGSDKTNRFALLFGTLD 1331
G L+ ++ GA H + + TS +T G R LL+ L
Sbjct: 1063 EEGSGAHLVHERQYLQGAPHRVALMAHNFANDIPTSIQKTNLVAG----GRDCLLWSGLQ 1118
Query: 1332 GSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
G+I + P +++ F Q+L++ L +AG + +R ++ K ++
Sbjct: 1119 GTIAIMIPFVSREDVDF--FQTLEQHLRTEDAPLAGRDHLIYRSYYVPVKG-------VI 1169
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
D +L Y +LP ++++ IA + + +I ++D+ +++
Sbjct: 1170 DGDLCERYTLLPTDKKMMIAGEFDRSVREIERKISDMRTRSAY 1212
>gi|367027320|ref|XP_003662944.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
42464]
gi|347010213|gb|AEO57699.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
42464]
Length = 1211
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 94/454 (20%), Positives = 191/454 (42%), Gaps = 45/454 (9%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+QK IPL TP ++ E+ + I S P ++ + L+ Q D L D
Sbjct: 786 IQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTIP-PELRAQLLAQSGAVNGDATVLPPED 844
Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1114
+ I D G + I ++ +E A++ VV ++E E+ L +
Sbjct: 845 FGYPRATGRWASCISIVDPLGDEPKVLQRIDLEGNEAAVSAAVVPF---ASQEGESFLVV 901
Query: 1115 GTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
GT G+D+ R F+ G R ++ + L ++ +++ AL QG LL
Sbjct: 902 GT----GKDMVLNPRK--FTEGYIHVYRFHEDGREL-EFIHKTKVEEPPLALIPFQGRLL 954
Query: 1170 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
G + ++ +L A + P +V+L + I++GD+ + + ++ +K + +
Sbjct: 955 AGIGKMLRVYDLGLRQLLRKAQGEVAPQLIVTLQTQGSRIIVGDVQQGVTYVVYKPESNK 1014
Query: 1230 LNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG-----QKL 1283
L + A D +++ + T ++D S+ D+ N+ I + S+ Q L
Sbjct: 1015 LLVFADD--TINRWTTCTTMVDYE--SVAGGDKFGNVWILRCPERASQESDEPGSEIQLL 1070
Query: 1284 LSRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
+R H + + Q L TS +T G L++ + G++G + P
Sbjct: 1071 HARKYLHGAPNRLDLMVHFYTQDLPTSIVKTNLVVGGQD----VLVWSGIQGTVGVLIPF 1126
Query: 1341 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
+++ F QSL+ + P +AG + +R ++ K ++D +L +
Sbjct: 1127 VSREDVDF--FQSLESHMRAEDPPLAGRDHLIYRGYYVPVKG-------VIDGDLCERFS 1177
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+LP +++ IA + + +I ++D+ ++F
Sbjct: 1178 LLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1211
>gi|308477185|ref|XP_003100807.1| CRE-DDB-1 protein [Caenorhabditis remanei]
gi|308264619|gb|EFP08572.1| CRE-DDB-1 protein [Caenorhabditis remanei]
Length = 1154
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/283 (18%), Positives = 121/283 (42%), Gaps = 21/283 (7%)
Query: 1105 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
T +++ +GT + E+ + GR+++F + ++ + V+ +G+ AL
Sbjct: 834 TNDSKVYYIVGTGLIYPEETDTKFGRIVVFEVD---EVERSKLRRVHDLVCRGSPLALRI 890
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L G L+ A + L +WT + + + + + L ++ + + D+ +S+ LS+
Sbjct: 891 LNGKLVAAINSSVRLFEWTMDKELRLECSNFNHIMALDLKVMNEEVAVADVMRSVSLLSY 950
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE--SWKGQ 1281
+ +AKD+ S EF+ L + ++ +F S + G+
Sbjct: 951 RMLEGNFEEVAKDWNSEWMVTCEFITAEQILG-----GEAHLNLFTVEVDKSRPITDDGR 1005
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAP 1339
+L ++ + + + L D D + +++ ++FGT GSIG +
Sbjct: 1006 YVLEPTGYYYLGELPRVMVRSSLVAQPD--------DCSIQYSQPIMFGTNQGSIGMVVQ 1057
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
+D+ + L +++K + DSV + + ++R F + P
Sbjct: 1058 IDDKWKKFLIAVEKAIADSVKNCMHIEHTTYRSFIFQKRLESP 1100
Score = 45.1 bits (105), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 147/358 (41%), Gaps = 82/358 (22%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ DA L+ VP+PI GVLV+ AN+I Y S N V +S L + F+
Sbjct: 206 SIAADASVLIPVPAPISGVLVLAANSILYKSSDV-------NGDVVPYASPLLDNTVFTC 258
Query: 363 E--LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITT 417
+D + ++ +D T+ L+L+ + +GR V+ + + + + I
Sbjct: 259 HGLVDPSGERFILSD-----TEGRLLMLILNIGEGRSGITVKDMRIEYLGETSIADSINY 313
Query: 418 IGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
I + F+GSRLGDS L++ SG S S + E + +I ++
Sbjct: 314 IDAGVVFVGSRLGDSQLIRLMPTPSGGSY--SVVLETYSNI---------------GPIR 356
Query: 477 DMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
DM+ E ++ ++ T S A +D G L+ G+ I AS
Sbjct: 357 DMIMVE---------SDGQAQLVTCSGAEKD-----GSLRVIRNGIGIEELAS------- 395
Query: 537 SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
VEL G GI+ + S+ + Y+I+SL T VL+
Sbjct: 396 -----VELAGVIGIFPIRLNSTTDN----------------YVIVSLAEETHVLQINGEE 434
Query: 597 TEVTESVDYFVQGRTIAAGNLFG---RRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
E + + + TI A +FG ++QV E+ R + S +++ + P N E
Sbjct: 435 LEDVQLLQICTEMPTIFASTIFGPDNSEVLLQVTEKHVRFMAFSGLSK--IWEPPNGE 490
>gi|317031116|ref|XP_001392900.2| UV-damaged DNA binding protein [Aspergillus niger CBS 513.88]
Length = 1124
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 133/322 (41%), Gaps = 52/322 (16%)
Query: 1112 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTAY+ E+ + RGR+L+F DN + L T+V +KGA ALA L G ++
Sbjct: 816 FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 869
Query: 1171 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
A+ K ++ + + +L +A Y AP V + + N I + D+ KS+ +
Sbjct: 870 AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAVADLMKSVCLV 925
Query: 1222 SWKE--QGA--QLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+ E G+ L +A+ F ++ C A + ++ +D + N+ +
Sbjct: 926 EYSEGENGSPDSLTEVARHFQTVWATGVSCIAKDTFLE--------TDAEGNLIVLRRNL 977
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
E ++L E +G V + + +Q LA+ + A GT+
Sbjct: 978 TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1025
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
+GSI A ++ L LQ + V + + FR F S + + P VD
Sbjct: 1026 EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKE-PYRFVDG 1084
Query: 1391 ELLSHYEMLPLEEQLEIAHQTG 1412
EL+ + Q EI G
Sbjct: 1085 ELIERFLTCEPSLQEEIVDSVG 1106
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347
>gi|440300137|gb|ELP92626.1| DNA damage-binding protein, putative [Entamoeba invadens IP1]
Length = 1086
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 79/337 (23%), Positives = 154/337 (45%), Gaps = 39/337 (11%)
Query: 1107 ENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL---- 1161
E + L +GTAY + GE +RGR ++F + + EV ++ + GA+ ++
Sbjct: 766 EEKELYIVGTAYAKLGEVEPSRGRFIIFEI------HEEKIIEVSNRYVDGAVYSVKRFE 819
Query: 1162 --------ASLQGHLLIAS-GPKIILHKWTGT-ELNGIAFYDAPPLYVVSLNIVKNFILL 1211
A++Q +++ KI+ K+ T E G A L+V +L + IL+
Sbjct: 820 NDVGNYIAATIQKKVVVYQIERKIVDGKFAVTIEEKGGANVKLIGLFVKTLG---HEILV 876
Query: 1212 GDIHKSIYFLSWKEQGAQLNLL--AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
GD+ KSI + E+ + ++ +DF + A EF+ + +S SD Q N+ +F
Sbjct: 877 GDLMKSISVFKFDEKATRNAVVETCRDFYASYTTAVEFMDEHCFMS---SDSQGNLLVFT 933
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
+ + KL + A HVG + + + ++ +T + +LFG
Sbjct: 934 ENTTTTNENEKFKLQNEAHIHVGECINVMCKGSIAVMNN-------AMWETQKKCMLFGG 986
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN-GKAHRPGPDSIV 1388
+ GSIG I ++ T+++L +L+ +++ + V + SF Q+ R +++
Sbjct: 987 ICGSIGGITEINLETYKKLFALESEMLREMKGV--IECESFGQWKMVFDDWKRMEAQNVI 1044
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
D ++ + LP E Q IA + G ++++ L +
Sbjct: 1045 DGNVVELFLDLPKESQKHIAEKIGYAGEELVTVLESM 1081
>gi|326432370|gb|EGD77940.1| splicing factor 3b subunit 3 [Salpingoeca sp. ATCC 50818]
Length = 1232
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/304 (19%), Positives = 120/304 (39%), Gaps = 47/304 (15%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V+ E++ AL G L+ G + ++ +L P VV + ++
Sbjct: 955 VHRTEVEAMPCALTPFAGRLIAGVGNIVRIYDMGRKKLLRKCENKHLPSRVVDIEVMGTR 1014
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1268
+++ D +S++FL +K L++ D C A ++D ST + V+D+ N+ +
Sbjct: 1015 VVVADQRESVFFLKYKPTENVLSVFCDDTTPRWCTAM-LMVDYST--VCVADKFGNVSVL 1071
Query: 1269 YYAPKMSESWK------------------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
++++ + QKL+ A F++G V + + + ++
Sbjct: 1072 RCPDDVTDTLQEDPSGAKAFWARGYLNGAPQKLVQVANFYIGEIVQSLHKTTLTPSGTE- 1130
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
+ + TL GSIG + P ++ F Q+L+ L P + G +
Sbjct: 1131 -------------CIAYTTLSGSIGALMPFSHKEDAEF--FQTLELHLRQEHPPICGRDH 1175
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1427
+FR + K S++D +L Y ML + +IA T ++ L +
Sbjct: 1176 LAFRSAYVPCK-------SVIDGDLCEEYNMLSASLKSDIADGLERTPQEVAKKLEEFRT 1228
Query: 1428 GTSF 1431
+F
Sbjct: 1229 RYAF 1232
>gi|393212467|gb|EJC97967.1| hypothetical protein FOMMEDRAFT_162310 [Fomitiporia mediterranea
MF3/22]
Length = 1161
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 72/136 (52%), Gaps = 10/136 (7%)
Query: 1109 ETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGH 1167
+T A+G+ Y + E +RGR+L+ STG + +++ S E+KGA++AL +QG
Sbjct: 849 DTFFAVGSVYFDETEREPSRGRILIISTGSKRNQTPHILA---STEVKGAVNALTCIQGK 905
Query: 1168 LLIASGPKIILHKWT---GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
L++A + + + T L + ++ L + ++ ++ + I++GD S+ L K
Sbjct: 906 LVVAINTSVDVFRLKHGDNTVLTAVTSWNHNYLVITAV-VMDDLIVIGDAVSSLAVL--K 962
Query: 1225 EQGAQLNLLAKDFGSL 1240
+ +L A+D+ L
Sbjct: 963 LEDDKLTTFARDYSPL 978
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 72/331 (21%), Positives = 128/331 (38%), Gaps = 73/331 (22%)
Query: 306 HDAYKLLAVPSPI-------GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
D+ L+ VP I GGVLV+G +TI ++S ++ S+ ++P++
Sbjct: 224 EDSNLLIPVPPQIKSSWNVNGGVLVLGGSTIAFYSIDRKQKKKNSSSQSKS-STSKIPQA 282
Query: 359 SFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+ A W Q D LL G L LL + + + L + +P +
Sbjct: 283 EVNWPYFDITA-WAQIDEDGLRYLLGDSFGRLALLAINPQYAYLDIVLLGEVSPP---TS 338
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + + ++GS GDS L++ T ++ + + F +I AP + + D+
Sbjct: 339 LTPLASQYIYVGSHFGDSQLIRVTSERSSNGSYLEISDTFKNI---APIMDAVFEDTDDS 395
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
Q + ++ G S G L+ G N DA GI+
Sbjct: 396 GQPTI----ITCSGGEST--------------------GSLRVIRNGANFNEDARIEGIA 431
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--T 592
G+W + + YDD +H Y++++ + T +LE
Sbjct: 432 N-----------ITGMWPIRRQ--------------YDDTFHHYMLVTTDTNTHLLELPN 466
Query: 593 ADLLTEVTESVDY---FVQGRTIAAGNLFGR 620
+ T V+ S D+ + RT+ AGN+ R
Sbjct: 467 SQQETAVSRSNDFSDLTIDSRTLVAGNMLTR 497
>gi|350629921|gb|EHA18294.1| damage-specific DNA binding protein [Aspergillus niger ATCC 1015]
Length = 1140
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 131/322 (40%), Gaps = 52/322 (16%)
Query: 1112 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTAY+ E+ + RGR+L+F DN + L T+V +KGA ALA L G ++
Sbjct: 832 FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 885
Query: 1171 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
A+ K ++ + + +L +A Y AP V + + N I + D+ KS+ +
Sbjct: 886 AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAVADLMKSVCLV 941
Query: 1222 SWKE----QGAQLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+ E L +A+ F ++ C A + ++ +D + N+ +
Sbjct: 942 EYSEGENGMPDSLTEVARHFQTVWATGVSCIAKDTFLE--------TDAEGNLIVLRRNL 993
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
E ++L E +G V + + +Q LA+ + A GT+
Sbjct: 994 TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1041
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
+GSI A ++ L LQ + V + + FR F S + + P VD
Sbjct: 1042 EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVRETKE-PYRFVDG 1100
Query: 1391 ELLSHYEMLPLEEQLEIAHQTG 1412
EL+ + Q EI G
Sbjct: 1101 ELIERFLTCEPSLQEEIVDSVG 1122
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 284
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 285 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 344
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 345 IFVGSHQGDSQVLRIGNGS 363
>gi|134106833|ref|XP_777958.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260658|gb|EAL23311.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1218
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
LA QG LL G + L++ L + P VV++N+ I++GD+ +S ++
Sbjct: 952 LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1011
Query: 1221 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
++ QL + A D + C + +D T++ D+ NI I P +SE
Sbjct: 1012 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1066
Query: 1278 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
G +L F +GA H T+ + + + TS + G R L++ T
Sbjct: 1067 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1122
Query: 1330 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
+ G++G + P D++ F + +L+ + + G + ++R ++ K
Sbjct: 1123 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1173
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+VD +L + +LP +Q IA + +L L + ++F
Sbjct: 1174 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1218
>gi|391867503|gb|EIT76749.1| splicing factor 3b, subunit 3 [Aspergillus oryzae 3.042]
Length = 1034
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 77/357 (21%), Positives = 159/357 (44%), Gaps = 36/357 (10%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1139
+TI ++ +E A++V V T++++ET L +GTA + +A G + ++ R
Sbjct: 695 STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 748
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
++ + L ++ +++ AL QG L+ GP + ++ +L P +
Sbjct: 749 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 807
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T +
Sbjct: 808 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 864
Query: 1260 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1311
D+ NI + K+SE G L+ R H + + + Q + T+ +T
Sbjct: 865 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 924
Query: 1312 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 925 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 978
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 979 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1028
>gi|346327528|gb|EGX97124.1| pre-mRNA splicing factor RSE1 [Cordyceps militaris CM01]
Length = 1206
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 79/368 (21%), Positives = 160/368 (43%), Gaps = 50/368 (13%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR--------VLLFST 1135
+ +++E A++V VV+ +++NE+ L +GT G+D+ R + F
Sbjct: 869 VDFENNEAAVSVAVVSF---ASQDNESFLVVGT----GKDIVLNPRSSSEAYIYIYRFQQ 921
Query: 1136 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
G + ++ +++ +AL QG LL G + ++ +L A +
Sbjct: 922 GGRE------LEFIHKTKIEEPATALLPFQGKLLAGIGKTLRMYDLGMRQLLRKAQAEVV 975
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTL 1254
P +VSLN + I++ D+ + + + +K +L D S+ ++T ++D
Sbjct: 976 PQQIVSLNTQGSRIVVSDVQQGVTLVVYKSASNKLIPFVDD--SIARWSTCTTMVDYE-- 1031
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWK----GQKLL-SRAEFHVGAHVTKFL---RLQMLAT 1306
S+ D+ N+ I K SE G L+ +R H H + + Q + T
Sbjct: 1032 SVAGGDKFGNMFIVRSPAKASEEADEDAAGLHLVNARDYLHGTQHRLELMCHFFTQDILT 1091
Query: 1307 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1363
S ++TG G LL+ + G+IG P ++ F QSL++ L +A
Sbjct: 1092 SINKTGLVVGGQDV----LLWSGIMGTIGVFIPFVSREDTDF--FQSLEQHLRTEDGPLA 1145
Query: 1364 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
G + +R +++ K ++D +L + +LP +++ IA + + +I ++
Sbjct: 1146 GRDHLMYRSYYAPVKG-------VIDGDLCERFSILPNDKKQMIAGELDRSVREIERKIS 1198
Query: 1424 DLALGTSF 1431
D+ ++F
Sbjct: 1199 DIRARSAF 1206
Score = 44.7 bits (104), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 144/639 (22%), Positives = 231/639 (36%), Gaps = 116/639 (18%)
Query: 74 QEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
Q G+KE SG + D + L+ H + G + S+A+ G++ +
Sbjct: 21 QFAGTKEQLIITGSGSQLTLLRPDPAQGKVIALLSH-DIFGILRSIAVFRLAGSN----K 75
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
D IILA + +I++LE+ + M F K G G + DP+GR
Sbjct: 76 DYIILATDSGRITILEYLPGPNRFNRLHMETFG-------KSGIRRVVPGEYLACDPKGR 128
Query: 191 C---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
V L ++ + SQ E T S A VI + LD+
Sbjct: 129 ACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIAMVALDV---- 175
Query: 248 DFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
GY PV L E + G + T ++ + L WS
Sbjct: 176 ------GYANPVFAALEIEYTEVDQDITGEALSEVETQLV-YYELDLGLNHVVRKWSD-- 226
Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPR 357
P D L P G GVLV G I Y HS + + + + E P
Sbjct: 227 -PVDPTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQDALRVPIPRRR----GATEDPS 281
Query: 358 SSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKT 406
++ H L+ LL + GDL +T+ V D VQR+ +
Sbjct: 282 RKRNIVAGVMHK--LKGSAGAFFFLLQSDDGDLFKITIDMVEDEEGAPTGEVQRMKIKYF 339
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF--GDIEADAPST 464
+ + + + + + ++ S+ G+ QF E+F + A P
Sbjct: 340 DTVPVATSLCILKSGFLYVASQFGNYAFYQFEKLGDDDDEVEFSSEDFPVDPLAAYEPVY 399
Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNIGPLKDFSYGLR 523
R + + AL D + L +N T E A + F+ + LK +GL
Sbjct: 400 FYPRLAENLALVDSIPAMNPLLDCKVANLTGEDAPQIFTICGNGARSTFRTLK---HGLE 456
Query: 524 INADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
+N ++ ELPG +WT+ S D++Y AY+++S
Sbjct: 457 VNEIVAS------------ELPGVPSAVWTLKLNS--------------DEQYDAYIVLS 490
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQ 641
T+VL + + EV++S + TIAA L G +IQV RG R I +G
Sbjct: 491 FTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLLGTDGLIQVHPRGIRHIRNG----- 543
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
N S ++ ++++ S V + +S G I
Sbjct: 544 -------NVNEWSAPQHRSIVAASTNSHQVAIALSSGEI 575
>gi|58258783|ref|XP_566804.1| U2 snRNA binding protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|338819361|sp|P0CR23.1|RSE1_CRYNB RecName: Full=Pre-mRNA-splicing factor RSE1
gi|338819362|sp|P0CR22.1|RSE1_CRYNJ RecName: Full=Pre-mRNA-splicing factor RSE1
gi|57222941|gb|AAW40985.1| U2 snRNA binding protein, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1217
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
LA QG LL G + L++ L + P VV++N+ I++GD+ +S ++
Sbjct: 951 LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010
Query: 1221 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
++ QL + A D + C + +D T++ D+ NI I P +SE
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065
Query: 1278 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
G +L F +GA H T+ + + + TS + G R L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121
Query: 1330 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
+ G++G + P D++ F + +L+ + + G + ++R ++ K
Sbjct: 1122 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1172
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+VD +L + +LP +Q IA + +L L + ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1217
>gi|261329035|emb|CBH12013.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
gambiense DAL972]
Length = 1270
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 122/292 (41%), Gaps = 28/292 (9%)
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNAD--NPQNLVTEVYSKELKGAISALA---SLQ 1165
++ IGT +V ++ +R ++ T A + L+ SK+++GA+ +
Sbjct: 852 VVLIGTTFVFPDEQLSRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYA 911
Query: 1166 GHLLIASGPKIILHKWTGTELNGIA--FYDAPPLYVVSLNIVK---NFILLGDIHKSIYF 1220
G + + G ++L+ W + +A L V + +++ ++I+ D S +F
Sbjct: 912 GRVALGIGGCVVLYSWNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFF 971
Query: 1221 LSWKEQGAQLNLLAKD---FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ LN++A+D G +DC ++ S + + D+ N + ++ S
Sbjct: 972 VRIDTIQGSLNIVARDPELRGVMDCAILQY---ESRHDVCLGDDLFNFFCVSHVEPLANS 1028
Query: 1278 -------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LL 1326
+KL + A++H+G +T + A S P R ++
Sbjct: 1029 SGVSAPAMPTKKLQTSAQYHMGDLIT-VMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIV 1087
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+GT G+ G I P+ TF L+ L+ + VP + G SFR+ G+
Sbjct: 1088 YGTSHGAFGAITPISSETFILLKGLEVSVASVVPPLGGFTHASFREVLRVGQ 1139
>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
Length = 1230
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 127/607 (20%), Positives = 220/607 (36%), Gaps = 120/607 (19%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++++L+F +
Sbjct: 41 LLRADKQGKLNLIASKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLTILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF + +S N LD K V + + Y
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEALD-KQVTNTSEIDSYTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L W + H + P DA L +P P
Sbjct: 209 TLLSL------WEMDLGLNH-------------------VIRKYTFPIDASAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE S L
Sbjct: 244 GQQGPSGVIVCCDNFLVYKKVDHADVYCA-----YPRRLETGQEKNLSIVCSTLHRIRKF 298
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + ++ VV+ + + + + I + + F+ + G
Sbjct: 299 FF----ILIQSELGDLYKIEMEHEDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFTCGSG----TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--L 484
+ QF+ G G +M +S K G A TK+L ++ L D V L
Sbjct: 355 NHFFYQFS-GIGDEDNEAMCTS--KHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPIL 408
Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYEL 541
+ + N S Q +L GP L+ +GL I A
Sbjct: 409 DMKVIDAKNASSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN----------- 450
Query: 542 VELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
ELPG K IWT+ + NA +Y Y+I+S E T++LE + + EV
Sbjct: 451 -ELPGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVV 495
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
+S+ + T N+ +IQV + G R ++G + + + P N + + + N
Sbjct: 496 DSL--LLTNVTTIHVNILYDNSLIQVHDAGIRHINGKVIHEWVP--PKNKQIKAATSNCA 551
Query: 661 VLSVSIA 667
+ +S++
Sbjct: 552 QIVISLS 558
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 111/286 (38%), Gaps = 45/286 (15%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + G K+ ++ +L Y P ++S+ + + I DI +S+
Sbjct: 965 CFCPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1024
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1266
+ L L++ D +E L + ++ L V +E K +
Sbjct: 1025 IFFYDANMNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1084
Query: 1267 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+Y M+ S K ++L FHVG VT ++++ TSS+
Sbjct: 1085 NKCWYGGEIMAGSNKNRRLEHIMSFHVGEIVTSLQKVKLSPTSSE--------------C 1130
Query: 1325 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+++ T+ G+IG P D ELT Q L+ L P + G FR ++ +
Sbjct: 1131 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1185
Query: 1380 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
++D +L + LP + Q ++A T IL L D+
Sbjct: 1186 ------HVIDGDLCEQFSSLPYDVQRKVAADLERTPDDILRKLEDI 1225
>gi|72390667|ref|XP_845628.1| damage-specific DNA binding protein [Trypanosoma brucei TREU927]
gi|62359843|gb|AAX80271.1| damage-specific DNA binding protein, putative [Trypanosoma brucei]
gi|70802163|gb|AAZ12069.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 1270
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 122/292 (41%), Gaps = 28/292 (9%)
Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNAD--NPQNLVTEVYSKELKGAISALA---SLQ 1165
++ IGT +V ++ +R ++ T A + L+ SK+++GA+ +
Sbjct: 852 VVLIGTTFVFPDEQLSRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYA 911
Query: 1166 GHLLIASGPKIILHKWTGTELNGIA--FYDAPPLYVVSLNIVK---NFILLGDIHKSIYF 1220
G + + G ++L+ W + +A L V + +++ ++I+ D S +F
Sbjct: 912 GRVALGIGGCVVLYSWNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFF 971
Query: 1221 LSWKEQGAQLNLLAKD---FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
+ LN++A+D G +DC ++ S + + D+ N + ++ S
Sbjct: 972 VRIDTIQGSLNIVARDPELRGVMDCAILQY---ESRHDVCLGDDLFNFFCVSHVEPLANS 1028
Query: 1278 -------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LL 1326
+KL + A++H+G +T + A S P R ++
Sbjct: 1029 SGVSAPAMPTKKLQTTAQYHMGDLIT-VMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIV 1087
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
+GT G+ G I P+ TF L+ L+ + VP + G SFR+ G+
Sbjct: 1088 YGTSHGAFGAITPISSETFILLKGLEVSVASVVPPLGGFTHASFREVLRVGQ 1139
>gi|225558618|gb|EEH06902.1| DNA damage-binding protein 1a [Ajellomyces capsulatus G186AR]
Length = 1201
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)
Query: 1111 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
L +GT+Y+ GE + RGR+L F N + +V +KGA ALA +Q +
Sbjct: 885 LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 938
Query: 1169 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
+ A ++++ ++ L+ A Y AP V + + N I + D+ KS+
Sbjct: 939 VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 994
Query: 1221 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+ + +QGA L +A+ F +L A + + + L SD + N+ + +
Sbjct: 995 VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 1050
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1334
++L +E +G V + + + + +P + GT++GSI
Sbjct: 1051 TDDDRRRLEVTSEISLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1102
Query: 1335 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
G I P + RLQS +V + G+ FR F N P VD EL
Sbjct: 1103 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQAEEPYRFVDGEL 1158
Query: 1393 LSHYEMLPLEEQLEI 1407
+ + +E Q EI
Sbjct: 1159 IERFLGCSVELQEEI 1173
Score = 45.1 bits (105), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341
Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
W Q D LL+ G L L +V D VQ +LDL P S +
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416
>gi|325096432|gb|EGC49742.1| pre-mRNA-splicing factor Rse1 [Ajellomyces capsulatus H88]
Length = 1209
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 96/456 (21%), Positives = 192/456 (42%), Gaps = 62/456 (13%)
Query: 995 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
+Q+ IPL+ TP +F + +PL V++ N +LS ++ + D N +
Sbjct: 785 LQETIPLQYTPR---HFIKHPEHPLFY---VIEAENNILSPGTRTKLLNDSDAVNGDTTP 838
Query: 1055 LHRTYTVEEYE-----------VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNT 1103
L EE+ ++I++P + + + I ++ +E A++V V
Sbjct: 839 L----PPEEFGYPRGTGHWASCIQIVDPINSK---RVISQIELEENEAAVSVAAVPF--- 888
Query: 1104 TTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISA 1160
+++++ET L +GT G+D+ R R + + L ++ +++ A
Sbjct: 889 SSQDDETFLVVGT----GKDMVVNPRSCTAGFIHIYRFQEEGKEL-EFIHKTKVEQPPMA 943
Query: 1161 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
L QG LL G + ++ ++ P VV L + I++ D+ +S+ +
Sbjct: 944 LLGFQGRLLAGIGTDLRIYDLGMKQMLRKCQASVVPHLVVGLQTQGSRIIVSDVQESLTY 1003
Query: 1221 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW-- 1278
+ +K Q +L D S T ++D T++ D+ N+ + K SE
Sbjct: 1004 VVYKYQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGDKFGNLWLLRCPAKASEEADE 1060
Query: 1279 --KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1332
G L+ ++ GA ++ Q L TS + G R L++ L G
Sbjct: 1061 DGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTG----GRDILVWTGLQG 1116
Query: 1333 SIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
++ + P +E+ F QSL+ +L P +AG + +R +++ K +D
Sbjct: 1117 TVSMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLIYRSYYAPAKG-------TID 1167
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+L Y +LP +++ +IA + + +I + D+
Sbjct: 1168 GDLCETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203
>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
strain H]
Length = 1276
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 112/286 (39%), Gaps = 45/286 (15%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
+ G LL + G K+ ++ +L Y P ++S+ + + I DI +S+
Sbjct: 1011 CFSPFNGRLLASVGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1070
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1266
+ L L++ D +E L + ++ L V +E K +
Sbjct: 1071 VFFYDANMNALRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1130
Query: 1267 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+Y M+ S K ++L FHVG VT ++++ TSS+
Sbjct: 1131 NKCWYGGEMMAGSNKNRRLEHIMNFHVGEIVTSLQKVKLSPTSSE--------------C 1176
Query: 1325 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+++ T+ G+IG P D ELT Q L+ L P + G FR ++ +
Sbjct: 1177 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1231
Query: 1380 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
++D +L + LP + Q ++A T IL L D+
Sbjct: 1232 ------HVIDGDLCEQFSSLPYDIQRKVAADLERTPDDILRKLEDI 1271
Score = 47.8 bits (112), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 123/604 (20%), Positives = 222/604 (36%), Gaps = 114/604 (18%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++ +L+F +
Sbjct: 41 LLRADKQGKLNLIVSKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDNK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF + +S N D K V + +
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEMYD-KQVTNTTEIDACTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L E +L ++ +++H LP D L +P P
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDMSAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE + S+ H
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSTVHRI 295
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + + VV+ + + + + I + + F+ + G
Sbjct: 296 R-KFFFILIQSEYGDLYKIEMDHQDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
+ QF+ G + K G A TK+L ++ L D V L +
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411
Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
+ N S Q ++ R GP L+ +GL I A EL
Sbjct: 412 ILDAKNANSPQ-IYALCGR------GPRSSLRILQHGLSIEELADN------------EL 452
Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG K IWT+ + NA +Y Y+I+S E T++LE + + EV +++
Sbjct: 453 PGRPKYIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDTL 498
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ T N+ +IQV + G R ++G + + + P N + + + N+T +
Sbjct: 499 --LLTNVTTIHVNILYDNSLIQVHDTGIRHINGKVINEWVP--PKNKQVKAATSNATQIV 554
Query: 664 VSIA 667
+S++
Sbjct: 555 ISLS 558
>gi|154342093|ref|XP_001566998.1| putative CPSF-domain protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064323|emb|CAM40524.1| putative CPSF-domain protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 1347
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 67/328 (20%), Positives = 132/328 (40%), Gaps = 43/328 (13%)
Query: 1085 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQ 1143
P + + V +T + ++ + LL +G+++ ++ AR GRV+ F+ + +
Sbjct: 896 PAERCGEQIDVPAITSAGVSEEDWQHLLLVGSSFTFPDEQRARSGRVMWFAL--HEERQG 953
Query: 1144 NLVTEVYSKELKGAISALASL---QGHLLIASGPKIILHKWTGTELN---------GIAF 1191
+ + SK++ GA+ A + +G + + + L+KW + G+
Sbjct: 954 QRLRLIASKDIGGALQCCAEVPYYKGRIALGVNGCVCLYKWNTEDQTFVAEERCRVGLTV 1013
Query: 1192 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD---FGSLDCFATEFL 1248
PLY +L + ++ D+ S +F+ L +L ++ G +D +
Sbjct: 1014 TRLIPLYNTAL--AASVLVALDVRHSAFFIEVDLLQGSLKVLCREGNLRGVMDGY----- 1066
Query: 1249 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ-----------KLLSRAEFHVGAHVTK 1297
+ +L + D+ N P E G + RA++HVG VT
Sbjct: 1067 VGSDAENLCLFDDNLNFTALKVVPLPVEPGDGDAAAAASGTPQCRFEVRAQYHVGDLVTC 1126
Query: 1298 FLRLQMLATSSDRTGAAPGSDKTNRFA-------LLFGTLDGSIGCIAPLDELTFRRLQS 1350
ATS + S + L+F T G G + PL T+ L++
Sbjct: 1127 VRPGSFAATSLMKAPTPSSSVPSPLLLPGIAGPQLVFATAHGGFGVVTPLHAATYLVLRA 1186
Query: 1351 LQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
L+ L ++P + GL+ ++FR+ G+
Sbjct: 1187 LEASLERTLPPLGGLSHQAFREVLRAGQ 1214
>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
Length = 1139
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 121/296 (40%), Gaps = 36/296 (12%)
Query: 1114 IGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTAY+ E + RGR+LLF N ++ +KGA ALA + +++ A
Sbjct: 833 VGTAYLDDETAESIRGRILLFEVDSNRK-----LSLFLEHPVKGACRALAMMGDYIVAAL 887
Query: 1173 GPKIILHKWTGTELNGIAFYDAPPLY-----VVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
+++ + TG G +Y V + + I++ D+ KSI + +
Sbjct: 888 VKTVVIFEVTGQPQTGKYSLQKAAVYRTSTAPVDIAVTDKTIVVADLMKSISIVESNKTD 947
Query: 1228 AQLNLLAKDFGSLDCFATEF---LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
A L + AK+ FAT + + D + +VSD + N+ + ++L
Sbjct: 948 A-LTMEAKEVAR--HFATVWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDRRRLE 1004
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAPLD 1341
+E +G V + + + TS+ P + GT++GSI I P
Sbjct: 1005 VTSELLLGEMVNRIRPVNIPQTST--MAVTPKA--------FLGTVEGSIYLFALINPEH 1054
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNP-RSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ RLQ+ VDS GL P FR F S + P VD EL+ +
Sbjct: 1055 QDFLMRLQTAISAYVDS----PGLMPFNKFRAFRSTVREAEE-PFRFVDGELIERF 1105
Score = 40.4 bits (93), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G I Y +D ++ + S LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKY-----------------IDDAK---NETISNPLDEA 284
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSL 422
W+Q D LL+ G L L +V D + V+ L + S + +G +
Sbjct: 285 TIFVAWVQVDGQRWLLADDYGRLFFLMLVLDSQNEVEGWKLDYLGEASRASVLIYLGAGM 344
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ + GS
Sbjct: 345 TFIGSHQGDSQVIRISEGS 363
>gi|115397303|ref|XP_001214243.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192434|gb|EAU34134.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1140
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 132/322 (40%), Gaps = 51/322 (15%)
Query: 1112 LAIGTAYV-QGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
+GTAY+ + ED + RGR+L+F DN + L T+V +KGA ALA L ++
Sbjct: 831 FVVGTAYLDEDEDRDSIRGRILMF----EVDNGRKL-TKVAELAVKGACRALAMLGDKVV 885
Query: 1170 IASGPKIILHKWTGT-----ELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
A ++++K TG +L +A Y AP V + + N I + D+ KS +
Sbjct: 886 AALVKTVVIYKVTGNNFGAMKLEKLASYRTSTAP----VDITVTDNVIAVSDLMKSSCLV 941
Query: 1222 SW--KEQGA--QLNLLAKDFGS-----LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+ E G L +A+ F + + C A ++ SD + N+ I
Sbjct: 942 EYIEGEDGLPDSLKEVARHFQTVWATGIACIAPHTYLE--------SDAEGNLIILRRNL 993
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
E ++L E +G V + + +Q LA+ + A GT+
Sbjct: 994 SGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1041
Query: 1331 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1390
+GSI A ++ L LQ + + + + FR F S + + P VD
Sbjct: 1042 EGSIYLYAIINPEHQDFLMRLQATMAGKIESLGDMPFNEFRGFRSMVREAKE-PYRFVDG 1100
Query: 1391 ELLSHYEMLPLEEQLEIAHQTG 1412
EL+ + Q +I + G
Sbjct: 1101 ELIERFLTCEPSVQEDIVNSVG 1122
Score = 40.8 bits (94), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 60/146 (41%), Gaps = 25/146 (17%)
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS--LDSSQELPRS 358
A L A L+ VP+P+GG+L++G +I Y NN VS LD +
Sbjct: 237 AQELDLGASHLIPVPAPLGGLLILGETSIKYVDDD-------NNEIVSRLLDEA------ 283
Query: 359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
W Q D LL+ G L L +V D VQ L + S +
Sbjct: 284 -------TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSENQVQGWQLDHLGNTSRASTL 336
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS +++ GS
Sbjct: 337 VYLGGGVIFVGSHQGDSQVLRVGDGS 362
>gi|121700262|ref|XP_001268396.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
gi|119396538|gb|EAW06970.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
Length = 1209
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 98/478 (20%), Positives = 206/478 (43%), Gaps = 49/478 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + Q + ++ S DN +Q+ IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNM-LQESIPLSYTPRRLLKHPEQPLFYVIGSD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQT 1080
VL P + + LI+ + L + + +++++P A
Sbjct: 814 NNVLSPATR--ARLIEDSKARNGEADTLPPEEFGYPRATGHWASCIQVVDPVNAKA---V 868
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRN 1138
+TI ++ +E A+++ V +++++ET L +GTA +A G + ++ R
Sbjct: 869 ISTIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDLTVNPPSSAGGFIHIY---RF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
++ + L ++ +++ AL + QG L+ G + ++ +L P
Sbjct: 923 QEDGKEL-EFIHKTKVEEPPLALLAFQGRLVAGIGSILRIYDLGMKQLLRKCQAPVVPKT 981
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 982 IVGLQTQGSRIIVSDVRESVTYVVYKYQENVLIPFVDDTVSRWMTSTT-MVDYETVA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDR 1310
D+ N+ + K+SE G L+ R H + + + Q + TS +
Sbjct: 1039 GDKFGNLWLVRCPKKISEEADEDGSGAHLIHERGYLHGTPNRLELMIHTYTQDIPTSVHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1099 TQLVAG----GRDILVWTGFQGTIGMLVPFMSREDVDF--FQNLEMQLASQCPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1153 LIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|317143715|ref|XP_001819645.2| pre-mRNA-splicing factor rse1 [Aspergillus oryzae RIB40]
Length = 1209
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 77/357 (21%), Positives = 159/357 (44%), Gaps = 36/357 (10%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1139
+TI ++ +E A++V V T++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
++ + L ++ +++ AL QG L+ GP + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039
Query: 1260 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1311
D+ NI + K+SE G L+ R H + + + Q + T+ +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099
Query: 1312 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
Length = 1143
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 133/323 (41%), Gaps = 52/323 (16%)
Query: 1112 LAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
IGTAY+ +D + A+GR+L+ D LVTE+ ++GA LA G ++
Sbjct: 834 FVIGTAYLDDQDASNAKGRILVLEV--TEDRRLKLVTEI---SVRGACRCLAVSHGRIVA 888
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAP--PLYV-----------VSLNIVKNFILLGDIHKS 1217
A +I++ + Y+ P P V + + + + I + D+ KS
Sbjct: 889 ALIKTVIIYSFE---------YETPSSPAMVKKAAYRTSTAPIDMCVTGDIIAVTDLMKS 939
Query: 1218 IYFL--SWKEQGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1273
+ + + + G NL +A+ F +L A + + L SD + N+ + + K
Sbjct: 940 MSLVQHTLGQAGGPDNLTEVARHFDTLWGTAVANVDENIYLE---SDAEGNLVVLEHDVK 996
Query: 1274 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
++L +E +G V + R+ + +P + T T++GS
Sbjct: 997 GFSEEDRRRLRVTSEILLGEMVNRIRRIDV----------SPTPNATVIPRAFLATVEGS 1046
Query: 1334 I---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-QFHSNGKAHRPGPDSIVD 1389
I IA + R+Q+ ++V S HV R F+ Q G+ GP VD
Sbjct: 1047 IYLFALIAEGKQDLLIRMQNKMAEMVQSPGHVPFAKFRGFKTQVRDMGEE---GPSRFVD 1103
Query: 1390 CELLSHYEMLPLEEQLEIAHQTG 1412
EL+ + + Q E+A + G
Sbjct: 1104 GELIERFLDCDEDVQAEVAKELG 1126
>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 435
Score = 50.1 bits (118), Expect = 0.010, Method: Composition-based stats.
Identities = 52/214 (24%), Positives = 99/214 (46%), Gaps = 10/214 (4%)
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
P +++SL++++N+I++GDI S+ LS+ + L + +D+ ++ C F+ S
Sbjct: 197 PSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSILTEVCRDYSNVWC---TFVCALSKS 253
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1314
+VSD + N +F + KL A F+ G V K L + + SS
Sbjct: 254 HFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVVNKMLPVSL---SSLIEEEE 310
Query: 1315 PGSD-KTNRFALLFGTLDGSIGCIAPLDEL-TFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
P ++ + ++L + +GSI I P L F++ ++ L DS+ + +N S
Sbjct: 311 PQNEILRKKESILCASSEGSISSIIPFSNLANFKKALCIELALNDSLSSIGNINDNSNNT 370
Query: 1373 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
+ N +VD E+ + +P E+Q +
Sbjct: 371 YKMN--LSEKSCKGVVDGEVFKMFFSMPFEKQFK 402
>gi|430813298|emb|CCJ29330.1| unnamed protein product [Pneumocystis jirovecii]
Length = 1197
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 87/367 (23%), Positives = 156/367 (42%), Gaps = 48/367 (13%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNAD 1140
I + +E A ++ +VT N + +E LAIG+ G++V + S R D
Sbjct: 860 IELDDNEAAFSIAMVTFKN---QNDEIFLAIGS----GKNVILAPKSFSAAYISIYRFID 912
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
++ + V+ E+ AL QG LL G + +++ + A P +V
Sbjct: 913 QGKS-IELVHKTEVDDIPLALLGFQGRLLAGLGKMLRIYEMGMKKCLRKCEVRAVPNCIV 971
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
L+ + I++ DI +SI+F +K +L + A D T ++D T++ D
Sbjct: 972 QLHTQGSRIIIADIQESIHFAVYKYLENRLIVFADDVIP-RWTTTSTMLDYETVA--AGD 1028
Query: 1261 EQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSS 1308
+ N I ++SES G +L+ + GA RL+MLA TS
Sbjct: 1029 KFGNFWINRCPLEVSESADEDPSGAQLIHEKSYLFGAAK----RLKMLAHFYIGDTFTSM 1084
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL----DELTFRRLQSLQKKLVDSVPHVAG 1364
+ G R +++ + GSIG P D F++L++L + S+ G
Sbjct: 1085 HKVQLIAGG----RDIIVYTGMMGSIGIFLPFVGREDVDFFQQLEALMRTEDLSL---IG 1137
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
+ +R ++ K S+VD +L + MLP ++ IA++ S+I + D
Sbjct: 1138 RDHLMYRGYYVPVK-------SVVDGDLCERFLMLPYNKKQVIANELDREISEIAKKIED 1190
Query: 1425 LALGTSF 1431
+ + +F
Sbjct: 1191 MRVRVAF 1197
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 121/545 (22%), Positives = 210/545 (38%), Gaps = 94/545 (17%)
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
+HG + +L G + +D +I+ + +I++LE+ + +
Sbjct: 65 VHGIIRTLVGFRLAGTN----KDHLIVGSDSGRITILEYKPDSNAFSKVHQETYG----- 115
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVL-VYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
K G G + VDP+GR + + +++ + + L T S
Sbjct: 116 --KSGVRRVVPGQYLAVDPKGRATMIASIEKNKLVYVLNRDSATNL-----TISSP--LE 166
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMI 283
A S V +L +D+ GY PV L E E +G+ +++ ++
Sbjct: 167 AHKSCSLVFHLIGMDV----------GYENPVFAALEVDYTEAESDPSGK-AYREIQKVL 215
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSAS 337
+ + L WS P D L V P G G LV +I Y H +
Sbjct: 216 TYYELDLGLNHVVRKWSD---PVDRKANLLVTVPGGSDGPSGALVCTEGSIFYKHKGKKT 272
Query: 338 CALALNNYAVSLDSSQ--ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
+ + SL++SQ ++ SS ++ A LQN+ GDL +T+ +
Sbjct: 273 HRIPIPTRIGSLENSQKKQIIVSSVVHKMRGAFFFLLQNE-------DGDLFKVTIDSND 325
Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF-TCGSGTSMLS-SGLKEE 453
V+ L + + +++ ++ + + F+ S G+ L QF G + + S +
Sbjct: 326 GEVESLKIKYFDTVPVSTGLSILKSGFLFVASEYGNHHLYQFEKLGDDNNEIEFSSVDFP 385
Query: 454 FGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVN 511
D+ E PS R R + L D +N + N T E A + ++ R
Sbjct: 386 VLDLNEGYEPSYFRPRSLENLLLVDDLNSMNPLMDSKILNLTDEDAPQIYALCGR----- 440
Query: 512 IGPLKDFS---YGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSR 567
GP F YGL +N + A+G LPG +WT SS
Sbjct: 441 -GPRSTFRTLRYGLEVN-EIVASG-----------LPGSPTAVWTTKLTSS--------- 478
Query: 568 MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVF 627
D+Y AY+++S T+VL + + EV+++ + T+A L G +IQV
Sbjct: 479 -----DQYDAYIVLSFVNGTLVLSIGETVEEVSDT-GFLSSSPTLAVQQL-GDDALIQVH 531
Query: 628 ERGAR 632
+G R
Sbjct: 532 PKGIR 536
>gi|321260749|ref|XP_003195094.1| hypothetical protein CGB_G1120W [Cryptococcus gattii WM276]
gi|317461567|gb|ADV23307.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1276
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 112/266 (42%), Gaps = 42/266 (15%)
Query: 1112 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1158
LA+GTA++ GED V GRVLL + D ++ ++ GA+
Sbjct: 945 LAVGTAFLPPDDGEDSSWDEGNLAVVKEGRVLLLEI-KEGDAGGGWDVKIKAELTTVGAV 1003
Query: 1159 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1209
AL + G L +A+G K+ +H+ EL + + A + SL+ + + +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTMHRLDHNSVELEETSSW-ASAYVISSLSALHPSHTRPEGAL 1062
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD +S+ L+ E + ++ + A L D ++VVSD N+ +
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-AVVVSDAHSNLLTYR 1121
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
QKL A F + VT+F ++ T++ P +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQNGSLVPTTTAPEIIIPD--------VLFAT 1164
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKL 1355
+G +G I L ++ R L LQ+ +
Sbjct: 1165 REGRLGVIGELGTMSSRTLDDLQRNM 1190
>gi|119473054|ref|XP_001258481.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
181]
gi|119406633|gb|EAW16584.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
181]
Length = 1209
Score = 50.1 bits (118), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 102/480 (21%), Positives = 210/480 (43%), Gaps = 53/480 (11%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + +Q + ++ S DN +Q+ IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQAQNL----RIFSIEKLDNNI-LQESIPLSNTPRRMLKHPEQPLFYVIESD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQT 1080
VL P + + LI+ + + L D + ++I++P A
Sbjct: 814 NNVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---V 868
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRN 1138
+TI ++ +E A+++ V +++++ET L +GTA + +A G + ++ R
Sbjct: 869 ISTIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL- 1197
++ + L ++ +++ AL QG LL G + ++ +L + AP +
Sbjct: 923 QEDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRVYDLGMKQL--LRKCQAPVVS 979
Query: 1198 -YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
+V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 980 KTIVGLQTQGSRIIVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETVA- 1037
Query: 1257 VVSDEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSS 1308
D+ N+ + K+SE G L+ R H + + Q + TS
Sbjct: 1038 -GGDKFGNLWLVRCPKKVSEEADEDGSGAHLIHERGYLHGTPNRLDLMIHTYTQDIPTSL 1096
Query: 1309 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGL 1365
+T G R L++ G+IG + P +++ F Q+L+ +L P +AG
Sbjct: 1097 HKTQLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGR 1150
Query: 1366 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ +R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1151 DHLIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|358366432|dbj|GAA83053.1| UV-damaged DNA binding protein [Aspergillus kawachii IFO 4308]
Length = 1643
Score = 49.7 bits (117), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 132/325 (40%), Gaps = 58/325 (17%)
Query: 1112 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTAY+ E+ + RGR+L+F DN + L T+V +KGA ALA L G ++
Sbjct: 772 FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 825
Query: 1171 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1221
A+ K ++ + + +L +A Y AP V + + N I + D+ KS+ +
Sbjct: 826 AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAIADLMKSVCLV 881
Query: 1222 SWKE----QGAQLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
+ E L +A+ F ++ C A + ++ +D + N+ +
Sbjct: 882 EYSEGENGMPDSLTEVARHFQTVWATGVVCIAKDTFLE--------TDAEGNLIVLRRNL 933
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
E ++L E +G V + + +Q LA+ + A GT+
Sbjct: 934 TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 981
Query: 1331 DGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
+GSI I P + RLQ+ V+S+ ++ R FR K P
Sbjct: 982 EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKE----PYRF 1037
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTG 1412
VD EL+ + Q EI G
Sbjct: 1038 VDGELIERFLTCEPSLQEEIVDSVG 1062
Score = 43.9 bits (102), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 105/264 (39%), Gaps = 29/264 (10%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP GR + VY + ++ Q S G + SG E R +D
Sbjct: 62 IDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTTAR-IDEL 120
Query: 245 HVKDFIFVHGYIEP--VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
V+ F+H P + ++ + + +V H++ S+ ++ L +
Sbjct: 121 FVRSSAFLHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATSSTGADAAFEES-LDGFSQ 179
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y V DS++ + R
Sbjct: 180 ELDLGASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP---- 220
Query: 363 ELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITT 417
LD A W Q D LL+ G L L +V D VQ L + S +
Sbjct: 221 -LDEATIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIY 279
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS +++ GS
Sbjct: 280 LGGGVIFVGSHQGDSQVLRIGNGS 303
>gi|330935579|ref|XP_003305038.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
gi|311318228|gb|EFQ86975.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
Length = 1115
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 122/297 (41%), Gaps = 35/297 (11%)
Query: 1112 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
IGTAY+ ++ + RGR+L+ P+ ++ V +KG LA+ +G ++
Sbjct: 809 FVIGTAYLDDQNTTSERGRILILEV-----TPERILKLVTEIAVKGGCRCLATCEGKIVA 863
Query: 1171 ASGPKIILHKWTG-------TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
A I+++ T+L AP + + + + I + D+ KS+ + +
Sbjct: 864 ALIKTIVIYDVEYPTQTPFLTKLATFRCSTAP----IDITVNGSKIAIADLMKSLVVVEY 919
Query: 1224 KEQGAQL-NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
+ A L + L + + + T + SD + N+ + Y P ++
Sbjct: 920 TKGEAGLPDKLVEVARHYQITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRR 979
Query: 1283 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAP 1339
L +E +G V + R+ +L T+SD P + GT++GSI G I+P
Sbjct: 980 LNVSSEMLLGEMVNRIRRIDVL-TASDAV-VIPRA--------FVGTVEGSIYLFGLISP 1029
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ L +LQ L VP ++ FR F NG P VD E + +
Sbjct: 1030 AHQ---NLLMTLQSNLGALVPAPGDMDFAKFRAFK-NGVREEEEPMRFVDGEFVERF 1082
>gi|68531971|ref|XP_723667.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23478038|gb|EAA15232.1| Drosophila melanogaster CG13900 gene product [Plasmodium yoelii
yoelii]
Length = 1235
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 121/295 (41%), Gaps = 43/295 (14%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
+ S +M +S G A T++L+ D + + ++ + + ++N
Sbjct: 362 SGIGNDSNDAMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 419
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWT 552
R SL + +GL I A+ ELPG + IWT
Sbjct: 420 LPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 461
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
V +S EY Y+I+S E T++LE + + EV +S+ + T
Sbjct: 462 VKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 505
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
NL IQV++ G R ++G + + + P N + + + N + + VS++
Sbjct: 506 IHINLLYDNSFIQVYDTGIRHINGKIVQEWIP--PKNKQINAATSNGSQIVVSLS 558
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 60/300 (20%), Positives = 115/300 (38%), Gaps = 59/300 (19%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G ++++ G K+ ++ +L Y P +VS+ + + I DI +S+
Sbjct: 956 CFCPFNGRVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKVSGDRIFASDIRESVL 1015
Query: 1220 FLSWKEQGAQLNLLAKDFG----------------SLDCFATEFLIDGSTLSLVVS---- 1259
+ + L++ D + D F + F++ S L+ ++
Sbjct: 1016 IFFYDSNQNLIRLISDDIIPRWITCSEILDHHTIIAADKFDSVFILRVSLLTFFITPFCH 1075
Query: 1260 -------DEQKNI--QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
E+ I + +Y ++ S K +K+ FH+G VT ++++ TSS+
Sbjct: 1076 LVPEEAKQEEYGIANKCWYGGEVINSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPTSSE- 1134
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGL 1365
+++ T+ G+IG P D ELT Q L+ L + G
Sbjct: 1135 -------------CIIYSTIMGTIGAFIPYDSKEELELT----QHLEIILRTEKHSLCGR 1177
Query: 1366 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
FR ++ + ++D +L + LP E Q +I T +IL L D+
Sbjct: 1178 EHIFFRSYYHPVQ-------HVIDGDLCEQFSSLPFEVQRKIGSDLEKTPDEILRKLEDI 1230
>gi|400597418|gb|EJP65151.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1212
Score = 49.7 bits (117), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 104/512 (20%), Positives = 198/512 (38%), Gaps = 104/512 (20%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
C G + + Q I I +L G T +Q IPL TP + F + PL
Sbjct: 761 QCEDGVVGIQGQSLRIFSIDRL--GETL-----IQSSIPLTYTPKK---FVKHPSEPLFY 810
Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV----RILEPDRAGGP 1077
++ + DNH L DL + V ++L P+ G P
Sbjct: 811 TI--------------------EADNHTLPP-DLQAKLLADPAAVNGDAKVLPPEEFGHP 849
Query: 1078 WQTR-------------------ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 1118
R + +++E A++ VV+ +++NE+ L +GT
Sbjct: 850 RGNRRWASCISVVDPVSEEPSVLQKVDFENNEAAVSAAVVSF---ASQDNESFLVVGT-- 904
Query: 1119 VQGEDVAARGR--------VLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
G+D+ R + F G + ++ +++ AL + QG LL
Sbjct: 905 --GKDMILNPRSSSEAYIYIYRFQEGGRE------LEFIHKTKIEEPAMALLAFQGKLLA 956
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
G + ++ +L A + P +VSLN + I++GD+ + + + +K +L
Sbjct: 957 GIGKTLRMYDLGMRQLLRKAQAEVVPQQIVSLNTQGSRIVVGDVQQGVTLVVYKPASNKL 1016
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLS 1285
A D + T ++D S+ D+ N+ I K SE ++ + +
Sbjct: 1017 IPFADDTIARWTTCTT-MVDYE--SVAGGDKFGNMFIVRSPAKASEEADEEQAGLHLVNA 1073
Query: 1286 RAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1340
R H H + + Q + TS ++T G LL+ + G+IG P
Sbjct: 1074 RDYLHGAQHRLELMCHFFTQDVPTSINKTSLVVGGQDV----LLWSGIMGTIGVFIPFVS 1129
Query: 1341 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
++ F QSL++ L +AG + +R +++ K ++D +L + L
Sbjct: 1130 REDADF--FQSLEQHLRTEDAPLAGRDHLMYRSYYAPVKG-------VIDGDLCERFAAL 1180
Query: 1400 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
P +++ +A + + +I ++D+ ++F
Sbjct: 1181 PNDKKQMMAGELDRSVREIERKISDIRTRSAF 1212
Score = 47.4 bits (111), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 143/614 (23%), Positives = 217/614 (35%), Gaps = 133/614 (21%)
Query: 64 NVIEIYVVRVQEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
NV++ V Q G+KE SG + D + L+ H + G + S+A+
Sbjct: 19 NVVQ--AVLGQFAGTKEQLIITGSGSQLTILRPDPAQGKVIPLLSH-DIFGVLRSIAVFR 75
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G+ +D IILA + +I+VLE+ S + M F K G G
Sbjct: 76 LAGSS----KDYIILATDSGRITVLEYLPSPNRFSRLHMETFG-------KTGIRRVVPG 124
Query: 181 PLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
+ DP+GR V L ++ + SQ E T S A VI
Sbjct: 125 EYLACDPKGRACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIA 175
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLK 293
L LD+ GY PV L E + G + T ++ + L
Sbjct: 176 LTALDV----------GYANPVFAALEIDYTEVDQDNTGEALSEVETHLVY-YELDLGLN 224
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAV 347
WS P D L P G GVLV G + Y HS + + +
Sbjct: 225 HVVRKWSD---PVDPTASLLFQVPGGNDGPSGVLVCGEENVTYRHSNQDALRVPIPRRR- 280
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR----- 396
+ E P ++ H L+ LL T GDL +T+ V D
Sbjct: 281 ---GATEDPSRKRNIVAGVMHK--LKGSAGAFFFLLQTDDGDLFKITIDMVEDEEGAPTG 335
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
VQR+ + + + + + + + ++ S+ G+ QF E+ GD
Sbjct: 336 EVQRMKIKYFDTVPVATSLCILKSGFLYVASQFGNYAFYQF--------------EKLGD 381
Query: 457 IEADAPSTKRLRRSSSDALQD-MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
L SS D D + E + Y + N A+ DS+ + PL
Sbjct: 382 ------DDDELEFSSDDFPVDPLAAYEPVYFYPRPAEN---------LALVDSIPAMNPL 426
Query: 516 KDFSYGLRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSS 558
D DA S G +S + + ELPG +WT+ S
Sbjct: 427 LDCKVANLTGEDAPQIYSICGNGARSTFRTIKHGLEVNEIVASELPGVPSAVWTLKLNS- 485
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
D++Y Y+++S T+VL + + EV++S + TIAA L
Sbjct: 486 -------------DEQYDTYIVLSFTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLL 530
Query: 619 GRRRVIQVFERGAR 632
G +IQV RG R
Sbjct: 531 GTDGLIQVHPRGIR 544
>gi|325094412|gb|EGC47722.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H88]
Length = 1201
Score = 49.3 bits (116), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)
Query: 1111 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
L +GT+Y+ GE + RGR+L F N + +V +KGA ALA +Q +
Sbjct: 885 LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 938
Query: 1169 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
+ A ++++ ++ L+ A Y AP V + + N I + D+ KS+
Sbjct: 939 VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 994
Query: 1221 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+ + +QGA L +A+ F +L A + + + L SD + N+ + +
Sbjct: 995 VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 1050
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1334
++L +E +G V + + + + +P + GT++GSI
Sbjct: 1051 TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1102
Query: 1335 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
G I P + RLQS +V + G+ FR F N P VD EL
Sbjct: 1103 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQTEEPYRFVDGEL 1158
Query: 1393 LSHYEMLPLEEQLEI 1407
+ + +E Q EI
Sbjct: 1159 IERFLNCGVELQEEI 1173
Score = 44.7 bits (104), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341
Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
W Q D LL+ G L L +V D VQ +LDL P S +
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416
>gi|66361481|ref|XP_627314.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
gi|46228697|gb|EAK89567.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
Length = 1317
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 63/295 (21%), Positives = 123/295 (41%), Gaps = 42/295 (14%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V+ ++ + +AL +G LL+ + ++ L + Y P + + +V +
Sbjct: 1040 VHITPIENSATALTGWRGRLLVGINKTLRVYSLGKKRLLRKSEYRNIPQGLTWIKVVNDR 1099
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI- 1267
I GDI + + Q L+AKD + ++D T++ VSD+ NI +
Sbjct: 1100 IFAGDISNGVLVFKFNNTSNQFILVAKDPMPRWLTSACEVLDYHTIA--VSDKFDNIIVS 1157
Query: 1268 ---------FYYAPKMSESWKGQ--------KLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
F + +++ Q ++ + A+FH+G VT + Q+ TS++
Sbjct: 1158 RVPAEASDDFSFVTSFTDNNNSQSSALMRTHQINTVAQFHLGDIVTCLQKSQLTPTSAE- 1216
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
A+++GT+ GSIG ++P L+ L L+ L + +
Sbjct: 1217 -------------AIIYGTVLGSIGSLSPILNNEDIELLSKLEILLRKQKSTLLSRDHLM 1263
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
FR ++S H +++D + + +L + Q EIA + T +I L+D
Sbjct: 1264 FRSYYS--PVH-----NVIDGDFCQTFTILDSQIQSEIASKLDVTVEEIYKKLDD 1311
>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1158
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 74/315 (23%), Positives = 131/315 (41%), Gaps = 44/315 (13%)
Query: 1111 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
L +GT+Y+ GE + RGR+L F N + +V +KGA ALA +Q +
Sbjct: 842 LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 895
Query: 1169 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
+ A ++++ ++ L+ A Y AP + + + N I + D+ KS+
Sbjct: 896 VAALMKTVVVYTISKGQFADYTLSKTASYRTSTAP----IDIAVTGNLIAVADLMKSVSI 951
Query: 1221 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+ + +QG+ L +A+ F +L A + + + L SD + N+ + +
Sbjct: 952 VEY-QQGSNGLPDSLTEVARHFQTLWSTAVAHVAEDTWLE---SDAEGNLVMLHRNVNGV 1007
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1334
++L +E +G V + + + + +P + GT++GSI
Sbjct: 1008 TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1059
Query: 1335 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
G I P + RLQS +V + G+ FR F N P VD EL
Sbjct: 1060 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQAEEPYRFVDGEL 1115
Query: 1393 LSHYEMLPLEEQLEI 1407
+ + +E Q EI
Sbjct: 1116 IERFLSCSVELQEEI 1130
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 63/136 (46%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 258 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 299
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D VQ +LDL P S + +G + F+
Sbjct: 300 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 357
Query: 426 GSRLGDSLLVQFTCGS 441
GS GD L++ T GS
Sbjct: 358 GSHQGDPELIRITEGS 373
>gi|242018509|ref|XP_002429717.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
gi|212514723|gb|EEB16979.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
Length = 1218
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 77/388 (19%), Positives = 149/388 (38%), Gaps = 58/388 (14%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+RI++P +T + ++ +E AL++ +V FN + ++ + Y
Sbjct: 867 IRIIDPVEG----RTDKIVRLEQNEAALSIALVK-FNNHPESLFLVVGVVKEYQLSPRQV 921
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
+ G + F + + + V+ + A +A+ G LL+ G + L+ +
Sbjct: 922 SFGYLYTFRINEDVTD----LELVHKTTVDEAPAAVCPYHGRLLVGVGRMLRLYDLGKKK 977
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
L P +VS+ + + D+ +S+Y + +K Q QL + A D T
Sbjct: 978 LLRKCENKYIPNQIVSICATGQRVFVSDVQESVYMVRYKRQENQLIIFADDTHPRWITCT 1037
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLSRA 1287
L D T++ +D+ NI I + +++ W QK A
Sbjct: 1038 TIL-DYDTVA--TADKFGNIAIIRLSSIITDDVDEDPTGNKALWDRGLLNGASQKADVLA 1094
Query: 1288 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1347
FHVG + ++ PG ++ L++ +L G++G + P T R
Sbjct: 1095 NFHVGETCMSLQKATLI----------PGGSES----LVYTSLSGTVGVLVP---FTSRE 1137
Query: 1348 ----LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
Q L+ + P + G + SFR ++ K +++D +L Y + +
Sbjct: 1138 DHDFFQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQYNSIEPAK 1190
Query: 1404 QLEIAHQTGTTRSQILSNLNDLALGTSF 1431
Q IA S + L D+ +F
Sbjct: 1191 QKSIAEDLDRNPSDVSKKLEDIRTRYAF 1218
Score = 41.6 bits (96), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 68/323 (21%), Positives = 122/323 (37%), Gaps = 65/323 (20%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVMKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E GD AP AL+++V +E+ S +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPR----------ALRNLVQVDEMD-----SLSP 406
Query: 495 ESAQKTFSFAVRDS-----LVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG 546
A + A D+ L GP L+ +GL + S + ELPG
Sbjct: 407 IMACQVADLANEDTPQLYMLCGRGPRSTLRVLRHGLEV------------SEMAVSELPG 454
Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+WTV + ++EY AY+I+S T+VL + + EVT+S
Sbjct: 455 NPNAVWTVKRR--------------VEEEYDAYIIVSFVNATLVLSIGETVEEVTDS--G 498
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
F+ + + G ++QV+ G R + N G + T++ +
Sbjct: 499 FLGTTPTLSCSALGDDALVQVYPDGIRHIRADKRV--------NEWKAPGKK--TIMKCA 548
Query: 666 IADPYVLLGMSDGSIRLLVGDPS 688
+ V++ ++ G + DP+
Sbjct: 549 VNQRQVVIALTAGELVYFEMDPT 571
>gi|240275059|gb|EER38574.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H143]
Length = 1134
Score = 49.3 bits (116), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)
Query: 1111 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
L +GT+Y+ GE + RGR+L F N + +V +KGA ALA +Q +
Sbjct: 818 LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 871
Query: 1169 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
+ A ++++ ++ L+ A Y AP V + + N I + D+ KS+
Sbjct: 872 VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 927
Query: 1221 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+ + +QGA L +A+ F +L A + + + L SD + N+ + +
Sbjct: 928 VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 983
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1334
++L +E +G V + + + + +P + GT++GSI
Sbjct: 984 TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1035
Query: 1335 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
G I P + RLQS +V + G+ FR F N P VD EL
Sbjct: 1036 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQTEEPYRFVDGEL 1091
Query: 1393 LSHYEMLPLEEQLEI 1407
+ + +E Q EI
Sbjct: 1092 IERFLNCGVELQEEI 1106
Score = 44.7 bits (104), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341
Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
W Q D LL+ G L L +V D VQ +LDL P S +
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416
>gi|171685748|ref|XP_001907815.1| hypothetical protein [Podospora anserina S mat+]
gi|170942835|emb|CAP68488.1| unnamed protein product [Podospora anserina S mat+]
Length = 1235
Score = 49.3 bits (116), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 77/365 (21%), Positives = 162/365 (44%), Gaps = 44/365 (12%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1138
+ + ++E A++ VV+ +++ E+ L +GT G+D+ R FS G R
Sbjct: 898 VDLDNNEAAISAAVVSF---ASQDGESFLIVGT----GKDMILSPR--QFSEGYIHVYRF 948
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
D+ ++L ++ +++ AL QG LL G + ++ +L A + P
Sbjct: 949 HDDGRDL-EFIHKTKIEEPPMALIPFQGRLLAGIGKTLRIYDLGLKQLLRKAQAEIAPQL 1007
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLV 1257
+VSL N I++GD+ + I + +K + +L A D +++ + T ++D S+
Sbjct: 1008 IVSLQTQGNRIVVGDVQQGITYAVYKPESNKLLAWADD--TINRWTTCTAMVDYE--SVA 1063
Query: 1258 VSDEQKNIQIFYYAPKMSESWKG-----QKLLSRAEFHVGAHVTKFLR---LQMLATSSD 1309
D+ N+ I + S+ Q + +++ H + T + Q L TS
Sbjct: 1064 GGDKFGNVWILRAPERASQESDEPGSEIQLVHAKSYLHGAPNRTALMAHFYTQDLPTSIT 1123
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1366
+T G LL+ + G++G + P +++ F QSL+ + P +AG +
Sbjct: 1124 KTNLVVGGQDV----LLWSGIQGTVGVLIPFVSREDVDF--FQSLESHMRAEDPPLAGRD 1177
Query: 1367 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
+R ++ K ++D +L + +L +++ IA + + +I ++D+
Sbjct: 1178 HLIYRGYYVPVKG-------VIDGDLCERFALLANDKKQMIAGELDRSVREIERKISDIR 1230
Query: 1427 LGTSF 1431
++F
Sbjct: 1231 TRSAF 1235
>gi|449019082|dbj|BAM82484.1| UV-damaged DNA binding protein [Cyanidioschyzon merolae strain 10D]
Length = 1372
Score = 49.3 bits (116), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 120/302 (39%), Gaps = 58/302 (19%)
Query: 1111 LLAIGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS------ 1163
+ +GTAYV ++ +RGR+L+FS L+ E+Y+ +SALA
Sbjct: 981 VFVVGTAYVLPSEMEPSRGRILVFSR-----EELLLLNELYTPGAVYTMSALADPSDRTC 1035
Query: 1164 ---LQGHLLIASGPK--IILHKWTGT------ELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
+A+G +IL+ W + EL +A + L V+ L + +L+G
Sbjct: 1036 RFPASAARFLAAGVNNVVILYDWGQSGHGDDYELREVARHLGHVL-VLRLEARGDQLLVG 1094
Query: 1213 DIHKSIYFLSW------KEQGAQ--LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
D+ KS+ L GA L +A D+ + A FL + + L+ +D N
Sbjct: 1095 DLMKSLCVLQLVLPEGETSDGASPCLKAVAWDYETAWITACAFLNEDTYLA---ADNSYN 1151
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG-------- 1316
+ P + S L FH+G V F R +++ +S A G
Sbjct: 1152 LLSLQRNPHETRSEFRHALNRAGAFHLGDLVNVFRRGKLVTEASGNEEAGTGNGHSTIDT 1211
Query: 1317 ---------------SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1361
+D +R LLF T G+IG I PLD R L ++K L H
Sbjct: 1212 ESTRDVARASTGTTTADNVSRQTLLFATTAGAIGIIVPLDPAQHRMLSRVEKALRSLTDH 1271
Query: 1362 VA 1363
A
Sbjct: 1272 PA 1273
>gi|189205943|ref|XP_001939306.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187975399|gb|EDU42025.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 1115
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 67/297 (22%), Positives = 121/297 (40%), Gaps = 35/297 (11%)
Query: 1112 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
IGTAY+ + + RGR+L+ P+ ++ V +KG LA+ +G ++
Sbjct: 809 FVIGTAYLDDQSTTSERGRILILEV-----TPERILKLVMEIAVKGGCRCLATCEGKIVA 863
Query: 1171 ASGPKIILHKWTG-------TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
A I+++ T+L AP + + + I++ D+ KS+ + +
Sbjct: 864 ALIKTIVIYDVEYPTQTPFLTKLATFRCSTAP----IDITVNGPKIVIADLMKSLVVVEY 919
Query: 1224 KEQGAQL-NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1282
+ A L + L + + + T + SD + N+ + Y P ++
Sbjct: 920 TKGEAGLPDKLVEVARHYQITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRR 979
Query: 1283 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAP 1339
L +E +G V + R+ +L T+SD P + GT++GSI G I+P
Sbjct: 980 LNVSSEMLLGEMVNRIRRIDVL-TASDAV-VIPRA--------FVGTVEGSIYLFGLISP 1029
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ L +LQ L +P ++ FR F NG P VD E + +
Sbjct: 1030 AHQ---NLLMTLQSNLGALIPAPGDMDFAKFRAFK-NGVRQEEEPMRFVDGEFVERF 1082
>gi|67600754|ref|XP_666354.1| CG13900 gene product [Cryptosporidium hominis TU502]
gi|54657334|gb|EAL36124.1| CG13900 gene product [Cryptosporidium hominis]
Length = 1318
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 63/295 (21%), Positives = 123/295 (41%), Gaps = 42/295 (14%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V+ ++ + +AL +G LL+ + ++ L + Y P + + +V +
Sbjct: 1041 VHITPIENSATALTGWRGRLLVGINKTLRVYSLGKKRLLRKSEYRNIPQGLTWIKVVNDR 1100
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI- 1267
I GDI + + Q L+AKD + ++D T++ VSD+ NI +
Sbjct: 1101 IFAGDISNGVLVFKFNNTSNQFILVAKDPMPRWLTSACEVLDYHTIA--VSDKFDNIIVS 1158
Query: 1268 ---------FYYAPKMSESWKGQ--------KLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
F + +++ Q ++ + A+FH+G VT + Q+ TS++
Sbjct: 1159 RVPVEASDDFSFVTSFTDNNNSQSSALMRTHQINTVAQFHLGDIVTCLQKSQLTPTSAE- 1217
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
A+++GT+ GSIG ++P L+ L L+ L + +
Sbjct: 1218 -------------AIIYGTVLGSIGSLSPILNNEDIELLSKLEILLRKQKSTLLSRDHLM 1264
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
FR ++S H +++D + + +L + Q EIA + T +I L+D
Sbjct: 1265 FRSYYS--PVH-----NVIDGDFCQTFTILDSKIQSEIASKLDVTVEEIYKKLDD 1312
>gi|156086042|ref|XP_001610430.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797683|gb|EDO06862.1| conserved hypothetical protein [Babesia bovis]
Length = 1450
Score = 48.9 bits (115), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
YVVSL+ ++ I++GD+ S+ L W QG +L + KDF S+ C A ID + S V
Sbjct: 1171 YVVSLDAYQDVIVIGDLMNSMRMLQW--QGTELREVCKDFNSVYCTAAA-AIDQT--SCV 1225
Query: 1258 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1300
V+D N +F ++ + K FH G + + R
Sbjct: 1226 VADSSGNFYVFAKRQVVTNDAEAIKAEDVGLFHHGELINRIRR 1268
>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
Length = 1139
Score = 48.5 bits (114), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 115/270 (42%), Gaps = 25/270 (9%)
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1207
+ + ++ L S G L+ A ++ ++T ++ + ++ I+K +
Sbjct: 871 KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 930
Query: 1208 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1267
FIL+GD+ KS+ L + G+ L +A++ + + + D + ++ N +
Sbjct: 931 FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMINDDY---FIGAEASNNFIV 986
Query: 1268 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA---PGSDKTNRFA 1324
+ + + L S +H+G + +S R G+ P SD+
Sbjct: 987 VKKNNDSTNELERELLDSVGHYHIGESI-----------NSMRHGSLVRLPDSDQPIIPT 1035
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
+L+ +++GSIG +A + E F LQK L V V G + ++R F ++ H
Sbjct: 1036 ILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSND--HHTIDS 1093
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
+ +D +L+ + L E QL+ G T
Sbjct: 1094 KNFIDGDLIETFLDLKYESQLKAVADLGIT 1123
Score = 40.8 bits (94), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 84/204 (41%), Gaps = 29/204 (14%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V N+R L+ V D F++G P + +L + + H S T L
Sbjct: 150 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 201
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
P WS N+ Y L VP P+GGVLVV N I Y + + ++ AVS
Sbjct: 202 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSV-----AVSYTKFL 252
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
R V+ D + L G L +L +++ + V L + + S
Sbjct: 253 AFTR----VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 300
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
I+ + + + ++GS GDS L++
Sbjct: 301 SISYLDSGVVYIGSSSGDSQLIRL 324
>gi|341886298|gb|EGT42233.1| CBN-TAG-203 protein [Caenorhabditis brenneri]
Length = 108
Score = 48.5 bits (114), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 30/120 (25%), Positives = 62/120 (51%), Gaps = 16/120 (13%)
Query: 1315 PGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
PG+++ AL++ T+ G+IGC+ DE+ F +L+ + P + G + ++R
Sbjct: 2 PGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNLEMHVRSEYPPLCGRDHLAYR 55
Query: 1372 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+++ K S++D ++ + ++ L +Q E+A + G T S+I L D+ +F
Sbjct: 56 SYYAPCK-------SVIDGDICEQFSLMDLPKQKEVAEELGKTVSEISKKLEDIRTRYAF 108
>gi|167380951|ref|XP_001733297.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
gi|165902459|gb|EDR28278.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
Length = 349
Score = 48.1 bits (113), Expect = 0.033, Method: Composition-based stats.
Identities = 59/226 (26%), Positives = 101/226 (44%), Gaps = 20/226 (8%)
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL---LAKDFGSLDCFATEFLIDGST 1253
LYV ++ N IL+GD+ KSI S+ G N +++DF + A EF+ +
Sbjct: 128 LYVKTMG---NKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYTTAIEFVDENCY 184
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
LS SD N+ +F +ES + +L + A HVG + + + T S
Sbjct: 185 LS---SDSNSNLLVFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHSTY--- 237
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH-VAGLNPRSFRQ 1372
+ + +LFG + G IG I + + L +Q +++ + V P +++
Sbjct: 238 ----ETIQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDEWKK 293
Query: 1373 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+ K R +I+D ++ Y + E+Q EIAH +G QI
Sbjct: 294 VIDDWK--RMPSSNIIDGNIVESYLEMSKEKQCEIAHLSGVNEEQI 337
>gi|68075683|ref|XP_679761.1| splicing factor 3b, subunit 3, 130kD [Plasmodium berghei strain
ANKA]
gi|56500578|emb|CAH95367.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium berghei]
Length = 1216
Score = 48.1 bits (113), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 62/297 (20%), Positives = 120/297 (40%), Gaps = 48/297 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
+ S SM +S G A T++L+ L D + +
Sbjct: 362 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNL---YLVDQIYSLPIVDMKILDAKN 416
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ + ++ R GP L+ +GL I A+ ELPG + I
Sbjct: 417 SNIPQIYALCGR------GPRSSLRILQHGLSIEELANN------------ELPGKPRYI 458
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WT+ +S EY Y+I+S E T++LE + + EV +S+ +
Sbjct: 459 WTIKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNV 502
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
T NL IQV++ G R ++G + + + P N + + + N + + +S++
Sbjct: 503 TTIHINLLYDNSFIQVYDTGIRHINGKIVQEWVP--PKNKQINAATSNGSQIVISLS 557
Score = 43.9 bits (102), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 60/286 (20%), Positives = 110/286 (38%), Gaps = 46/286 (16%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G ++++ G K+ ++ +L Y P +VS+ V N I DI +S+
Sbjct: 952 CFCPFNGKVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIK-VSNRIFASDIRESVL 1010
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1266
+ + L++ D +E L + ++ L V +E K +
Sbjct: 1011 IFFYDSNQNVIRLISDDIIPRWITCSEILDHHTIIAADKFDSVFILRVPEEAKQEEYGIA 1070
Query: 1267 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+Y ++ S K +K+ FH+G VT ++++ SS+
Sbjct: 1071 NKCWYGGEVINSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPVSSE--------------C 1116
Query: 1325 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+++ T+ G+IG P D ELT Q L+ L + G FR ++ +
Sbjct: 1117 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKHALCGREHIFFRSYYHPVQ- 1171
Query: 1380 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
++D +L + LP E Q +I T +IL L D+
Sbjct: 1172 ------HVIDGDLCEQFSSLPFEVQRKIGSDLEKTPDEILRKLEDI 1211
>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA repair
protein E; AltName: Full=UV-damaged DNA-binding protein 1
gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
Length = 1181
Score = 48.1 bits (113), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 115/270 (42%), Gaps = 25/270 (9%)
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1207
+ + ++ L S G L+ A ++ ++T ++ + ++ I+K +
Sbjct: 913 KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 972
Query: 1208 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1267
FIL+GD+ KS+ L + G+ L +A++ + + + D + ++ N +
Sbjct: 973 FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMINDDY---FIGAEASNNFIV 1028
Query: 1268 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA---PGSDKTNRFA 1324
+ + + L S +H+G + +S R G+ P SD+
Sbjct: 1029 VKKNNDSTNELERELLDSVGHYHIGESI-----------NSMRHGSLVRLPDSDQPIIPT 1077
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
+L+ +++GSIG +A + E F LQK L V V G + ++R F ++ H
Sbjct: 1078 ILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSND--HHTIDS 1135
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
+ +D +L+ + L E QL+ G T
Sbjct: 1136 KNFIDGDLIETFLDLKYESQLKAVADLGIT 1165
Score = 40.8 bits (94), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V N+R L+ V D F++G P + +L + + H S T L
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 243
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
P WS N+ Y L VP P+GGVLVV N I Y + + ++A+ +Y L ++
Sbjct: 244 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 298
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
V+ D + L G L +L +++ + V L + + S
Sbjct: 299 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 342
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
I+ + + + ++GS GDS L++
Sbjct: 343 SISYLDSGVVYIGSSSGDSQLIRL 366
>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
24927]
Length = 1133
Score = 48.1 bits (113), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 68/302 (22%), Positives = 123/302 (40%), Gaps = 32/302 (10%)
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1171
+GTA D + GR+L G AD L+TE+ EL GA +LA ++G++L
Sbjct: 818 FVVGTAIGNDSDESEHGRLLFLELG--ADKMLRLITEL---ELPGACHSLAIVKGYILAG 872
Query: 1172 SGPKIILHKWT------GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1225
I L++++ G + I+ A L VSL++ + +GD+ K + L E
Sbjct: 873 LSKSIDLYRFSYTRGSLGASIQQISSIRAATL-PVSLSVYGKRVFVGDLVKGVMVLEVVE 931
Query: 1226 QGAQLN----LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
G + N + + +G A E L + + +S +D N+ + + +
Sbjct: 932 GGGEGNDKLVEVCRQYGVSWVTALEALDEDTCIS---ADSDGNLVLLRRESTGATDEDTR 988
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
++ +E +G V R+ T P + GT+DG + + +
Sbjct: 989 RMRPLSEIRLGEMVNCIRRVNDPITQG--YVVQPKA--------YLGTVDGGLFMLGLIH 1038
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1401
F L Q + + + L+ +R +++ G P VD EL+ + L L
Sbjct: 1039 PDYFDILMKCQVNMAKVIKGIGDLDFNRYRAYNTKG-IQPEEPFRFVDGELVEKF--LDL 1095
Query: 1402 EE 1403
+E
Sbjct: 1096 DE 1097
>gi|389744702|gb|EIM85884.1| hypothetical protein STEHIDRAFT_121882 [Stereum hirsutum FP-91666
SS1]
Length = 1255
Score = 48.1 bits (113), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 115/277 (41%), Gaps = 48/277 (17%)
Query: 1118 YVQGEDVAARGRVLLFSTGRNADNPQNLVT----EVYSKELKGAISALASLQGHLLIASG 1173
Y E + G + +FS G P V EV +E+ G + ALAS +G+++ A
Sbjct: 936 YQLDEKEPSHGELHIFSKG--GTEPDGTVKADLLEVVKQEVSGCVYALASFEGYIVAAIN 993
Query: 1174 PKIILHKW--TGTELNGIAFYDAPPLYVV-SLNIVKNFILLGDIHKSIYFLSW---KEQG 1227
+ K +GTE + ++ Y+V SL + +++L+GD S+ L E G
Sbjct: 994 STVSFFKLDTSGTEATLVKKHEWNHNYLVTSLVVSGSYLLIGDAISSVSVLQVIQVDENG 1053
Query: 1228 A---QLNLLAKDFG-----SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
+L +A+D+G SL + E +I ++ N+ F P +
Sbjct: 1054 EITEKLKTVARDYGPLWPVSLQGWGKEGVIGANS--------DCNLFSFTLQPVTPQ--- 1102
Query: 1280 GQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL----LFGTLDGSI 1334
+ +L R FH+ HV KFL G S+K + LF T G I
Sbjct: 1103 -KTVLDRDGHFHLDDHVNKFLH-----------GTVHSSEKAEDLDIEARSLFFTASGRI 1150
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1371
G + + + + +LQ+ L V V+G + +R
Sbjct: 1151 GLVLDMGKELSLHMTALQRNLNGVVKDVSGTTHKRWR 1187
>gi|384500266|gb|EIE90757.1| hypothetical protein RO3G_15468 [Rhizopus delemar RA 99-880]
Length = 1057
Score = 48.1 bits (113), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 58/257 (22%), Positives = 111/257 (43%), Gaps = 22/257 (8%)
Query: 1121 GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK 1180
G++ GR+L+ +D L++++ + G I + ++G LL + + L++
Sbjct: 760 GKENDGLGRILVLQLA--SDRKLRLISQLKTG---GMIDCVRPIEGKLLASIQGTLYLYR 814
Query: 1181 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1240
W L ++ P + + +NFI+ GD+ S+ + Q QL +A +
Sbjct: 815 WQSQRLVKVSSRRLPSV-TRCMTTHENFIMTGDLAYSVVMFQYDRQSDQLLEVAAHEKTK 873
Query: 1241 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR-AEFHVGAHVTKFL 1299
+ A + ID + LV+ E++ +F E + LL + +H+G V++F
Sbjct: 874 EVLAMK-AIDSN---LVIGAEREG-HLFVLEHCQDEVSADEPLLDVISTWHLGDVVSRFR 928
Query: 1300 --RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1357
L M D + AP +L+F T G+IG IA L ++ L +Q +
Sbjct: 929 FGSLGMNNVDPDSSPIAP--------SLIFATASGAIGVIADLSPERYKLLYQMQCNMCR 980
Query: 1358 SVPHVAGLNPRSFRQFH 1374
V + L+ +R +
Sbjct: 981 VVKGIGELSHTDWRNVN 997
>gi|328858656|gb|EGG07768.1| hypothetical protein MELLADRAFT_105631 [Melampsora larici-populina
98AG31]
Length = 1216
Score = 48.1 bits (113), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 67/323 (20%), Positives = 117/323 (36%), Gaps = 81/323 (25%)
Query: 1114 IGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSK--ELKGAISALASLQGHLLI 1170
IGT +V E + GR+L D NL + ++KG + L L G +
Sbjct: 870 IGTGFVNPNESQSNTGRILTIGLSSKHDQEGNLREFKLKRMTKVKGTVHGLGGLPGGKFV 929
Query: 1171 ASGPKII----------------LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1214
AS + L W G ++ + KN+I++GD+
Sbjct: 930 ASANAFVHAFGINEEEEDEGFEVLDTWGGGFVSQTVLTE------------KNWIIVGDL 977
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+KSI L + + L +L +D+ ++ + D V +D + N+ + +M
Sbjct: 978 YKSIVVLEFDLKKFSLKVLGRDYSAMSVRPIGMISDR---VFVAADTEFNL----FTVEM 1030
Query: 1275 SESWKGQKLLSRAE--------------------------------------FHVGAHVT 1296
E KG K E FH+G +V
Sbjct: 1031 RERQKGLKEEDEDEEGLSVEEEKGDDDEWEEEERRMRVEKVFNDDHLDTVGGFHLGENVN 1090
Query: 1297 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKL 1355
F ++ + G D L+F + G IG I L++L ++ L++L+ +L
Sbjct: 1091 HFKAGSLVKSLKHFY----GQDLKYGGKLIFVSSTGGIGVIIKLEDLKIYKHLKALEDRL 1146
Query: 1356 VDSVPHVAGLNPRSFRQFHSNGK 1378
+ + GL+ FR+F + K
Sbjct: 1147 KKEILSIGGLDSTEFRKFKNKWK 1169
>gi|70992737|ref|XP_751217.1| nuclear mRNA splicing factor [Aspergillus fumigatus Af293]
gi|74670386|sp|Q4WLI5.1|RSE1_ASPFU RecName: Full=Pre-mRNA-splicing factor rse1
gi|66848850|gb|EAL89179.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus Af293]
Length = 1225
Score = 47.8 bits (112), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 99/463 (21%), Positives = 199/463 (42%), Gaps = 49/463 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + +Q + ++ S DN +Q+ IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQAQNL----RIFSIEKLDNNI-LQESIPLSNTPRRMLKHPEQPLFYVIESD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQT 1080
VL P + + LI+ + + L D + ++I++P A
Sbjct: 814 NNVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---V 868
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRN 1138
+TI ++ +E A+++ V +++++ET L +GTA + +A G + ++ R
Sbjct: 869 ISTIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
++ + L ++ +++ AL QG LL G + ++ +L
Sbjct: 923 QEDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRIYDLGMKQLLRKCQAQVVSKT 981
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 982 IVGLQTQGSRIVVSDVRESVTYVVYKYQDNILIPFVDDSVSRWTTSTT-MVDYETVA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDR 1310
D+ N+ + K SE G L+ + GA L + Q + TS +
Sbjct: 1039 GDKFGNLWLVRCPKKASEEADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1099 TQLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1153 LIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188
>gi|159130328|gb|EDP55441.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus A1163]
Length = 1225
Score = 47.8 bits (112), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 99/463 (21%), Positives = 199/463 (42%), Gaps = 49/463 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + +Q + ++ S DN +Q+ IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQAQNL----RIFSIEKLDNNI-LQESIPLSNTPRRMLKHPEQPLFYVIESD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQT 1080
VL P + + LI+ + + L D + ++I++P A
Sbjct: 814 NNVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---V 868
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRN 1138
+TI ++ +E A+++ V +++++ET L +GTA + +A G + ++ R
Sbjct: 869 ISTIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
++ + L ++ +++ AL QG LL G + ++ +L
Sbjct: 923 QEDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRIYDLGMKQLLRKCQAQVVSKT 981
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 982 IVGLQTQGSRIVVSDVRESVTYVVYKYQDNILIPFVDDSVSRWTTSTT-MVDYETVA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDR 1310
D+ N+ + K SE G L+ + GA L + Q + TS +
Sbjct: 1039 GDKFGNLWLVRCPKKASEEADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1099 TQLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1153 LIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188
>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
Length = 1170
Score = 47.8 bits (112), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 127/302 (42%), Gaps = 21/302 (6%)
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL--NGIAFYDAPPL 1197
D +NL ++ ELK + Q LL+ASG I L++ +L + D
Sbjct: 880 DKKKNL-QYIHKTELKYVPQTMEVFQDRLLVASGNSISLYELGQRQLLRKSLTRIDFIQT 938
Query: 1198 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1257
V ++ ILL D SI F + ++ Q +A D + A + L D T+ +
Sbjct: 939 IVKVTPQPRDRILLADSANSIVFAKFDQEENQFVSMADDTVKRNITAWKQL-DYDTV--I 995
Query: 1258 VSDEQKNIQIFYY----APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
D+ NI + + ++ ++W K ++ ++ + V K L T
Sbjct: 996 GGDKFGNIFVSRLDREESKQIDQNWTVLKQAAKNSPNLNSCVYKLQNLCEYYIPDIITSF 1055
Query: 1314 APGS-DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
GS + +++ L G+IG + PL E+ L+ + +VAG N
Sbjct: 1056 QLGSFNLGGEECIIYTGLTGTIGILLPLISKSEIELLHDLQLEISAYNDKVNVAGKNHAK 1115
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1429
R +++ K +I D + L Y LPL+E+L+IA + + ++ LND+ +
Sbjct: 1116 LRSYYNPAK-------NIFDGDFLELYLNLPLDEKLKIAKRLNKSVGEVEKKLNDIRNRS 1168
Query: 1430 SF 1431
SF
Sbjct: 1169 SF 1170
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 81/340 (23%), Positives = 136/340 (40%), Gaps = 48/340 (14%)
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
+P+DA L VP IGGVLV GAN I Y L N ++ L + ++S +
Sbjct: 229 VPNDANYLAPVPGHIGGVLVCGANWIMYDK--------LGNESILLPLLRRKDQTSVIIS 280
Query: 364 LDAAHATWLQND--VALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSDITTIG 419
HA +N LL GDL L + YD +++ ++++ + + ++
Sbjct: 281 -HVTHALKKKNYGFFILLQNDLGDLFRLIIDYDSNRELIKDIEITYFDTIPVCYNLNIFK 339
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
N L F LL QF L EE E D K ++ + ++
Sbjct: 340 NGLCFANCINRSQLLYQF----------EKLGEEIS--EEDIRINKTVQMDNIQLTKEKY 387
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
E L G + ++ S + DS++N L S ++ T +
Sbjct: 388 --FEFKLKGLDNLALIDVVESLS-PITDSILNDDTLVTLSTKSKLKTIVHGTPTTTLVES 444
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII--SLEARTMVLETADLLT 597
+L P I+T + A DDE YL+I +L +T+VL +++
Sbjct: 445 QLPIKP--TNIFTT-----------KTSANAVDDE---YLVITSTLSFKTLVLSLGEVIE 488
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
EV +S FV + A G+ ++Q++ G R ++G+
Sbjct: 489 EVNDS--EFVLDQPTVAVQQVGKSSIVQIYSNGLRHINGN 526
>gi|390357128|ref|XP_001198237.2| PREDICTED: splicing factor 3B subunit 3-like [Strongylocentrotus
purpuratus]
Length = 949
Score = 47.8 bits (112), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 100/259 (38%), Gaps = 39/259 (15%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + + + + + + + F+ S G+ L Q
Sbjct: 34 LAQTEQGDIFKITLETDDDMVTEIRMKYFDTVPVATSMNVLKTGFLFIASEYGNHYLYQI 93
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS E GD AP T R L+++ E LS S
Sbjct: 94 AHLGDDDDEPEFSSATPLEEGDTFFFAPRTLR-------NLEEVDQLESLSPILSCQIAD 146
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTV 553
+++ T V ++ +GL + S + ELPG +WTV
Sbjct: 147 LASEDTPQLYVACGRGPRSSMRVLRHGLEV------------SEMAVSELPGNPNAVWTV 194
Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
KS DDEY AY+I+S T+VL + + EVT+S F+
Sbjct: 195 KKKS--------------DDEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTL 238
Query: 614 AGNLFGRRRVIQVFERGAR 632
+ +L G ++Q++ G R
Sbjct: 239 SSSLIGDDALLQIYPDGIR 257
>gi|358366518|dbj|GAA83139.1| nuclear mRNA splicing factor [Aspergillus kawachii IFO 4308]
Length = 1209
Score = 47.8 bits (112), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 100/478 (20%), Positives = 201/478 (42%), Gaps = 49/478 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + Q + ++ S DN +Q+ IPL TP + E+ L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNM-LQQSIPLSYTPRRFLKHPEQPLFYVIESD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQT 1080
VL P + + L++ D L D + +++++P A
Sbjct: 814 NNVLSPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---V 868
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRN 1138
TI ++ +E A+++ V T++++ET L +GTA + +A G + ++ R
Sbjct: 869 VHTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMSVNPPKSAGGYIHIY---RF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
++ + L ++ +++ AL QG L+ G + ++ +L P
Sbjct: 923 QEDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKT 981
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+V L + I++ D+ +S+ ++ +K Q L D S AT ++D T +
Sbjct: 982 IVGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDR 1310
D+ N+ + K SE G L+ + G + + Q + TS +
Sbjct: 1039 GDKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1099 TQLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1153 LIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|295667673|ref|XP_002794386.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
gi|226286492|gb|EEH42058.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
Length = 1195
Score = 47.4 bits (111), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 125/305 (40%), Gaps = 42/305 (13%)
Query: 1111 LLAIGTAYVQGEDV---AARGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQG 1166
+ +GT+Y+ +DV + RGR+L F TG + +V +KGA ALA +Q
Sbjct: 875 MFVVGTSYL--DDVGEGSIRGRILAFEVTGSRQ------LAKVAELPVKGACRALAVMQD 926
Query: 1167 HLLIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSI 1218
++ A ++++ E LN A Y AP + + + N I + D+ KS+
Sbjct: 927 KIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAP----IDIAVTGNLIAVADLMKSV 982
Query: 1219 YFLSWKE----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+ +K+ Q L +A+ F +L A + + L SD + N+ +
Sbjct: 983 SIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLE---SDAEGNLVVLNQNVNG 1039
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
++L +E +G V R++ ++ P + A L GT++GSI
Sbjct: 1040 VTDDDKRRLEVTSEILLGEMVN---RIRPVSIQGSLPATGPREAVISPKAFL-GTVEGSI 1095
Query: 1335 ---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
G I P + RLQS LV P N FR F N P VD E
Sbjct: 1096 YLFGLINPAYQDLLMRLQSAMAGLV-VTPGAMPFN--KFRAFK-NAVRQAEEPYRFVDGE 1151
Query: 1392 LLSHY 1396
L+ +
Sbjct: 1152 LIERF 1156
Score = 44.7 bits (104), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+LV+G +I Y D++ E S+ LD A
Sbjct: 292 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 331
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
W Q D LL+ G L L ++ D VQ +LDL P S + +G +
Sbjct: 332 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 389
Query: 424 FLGSRLGDSLLVQFTCGS 441
F+GS GDS L++ T GS
Sbjct: 390 FIGSHQGDSQLIRITEGS 407
>gi|328869269|gb|EGG17647.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1194
Score = 47.4 bits (111), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 114/567 (20%), Positives = 211/567 (37%), Gaps = 111/567 (19%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
+ L+ V + G + S+A G +D +I+ + ++ +LE++ S +
Sbjct: 46 SGRLDHVLYSEAFGVIRSIAPFRLTGGS----KDYLIVGSDSGRVVILEYNPSKNVFEKV 101
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLV 214
F + G G + DP+GR G + L I+ + SQ +
Sbjct: 102 HQETFG-------RSGCRRIVPGQYISTDPKGRAFMIGAIEKQKLVYILNRDSQAKLSI- 153
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM--VILHERELTWAG 272
A + V ++ +D+ G+ P+ + + E T
Sbjct: 154 --------SSPLEAHKAHTIVFSMCGVDV----------GFENPIFATISVDYSEETNIE 195
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGGVLVVGANT 328
V H+T +++ + L WS + A +++VP P GGVLV
Sbjct: 196 DVEETHNTKVLTFYELDLGLNNVVRKWSE-EVDRSANLVVSVPGGSDGP-GGVLVCAQGR 253
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQNDVA--LLSTK 382
++Y + + D S +PR + E + +HA+ Q D+ L+ ++
Sbjct: 254 VYYRNIGHA------------DISVSIPRRNGMTEEKSLMIVSHASHKQRDMFFFLVQSE 301
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
GDL +T+ Y G +V + ++ + + IT + N F+ S GD L F
Sbjct: 302 YGDLYKITLDYSGEMVSGMQIAYFDTFPTANCITMLKNGFLFVASEFGDHGLYLFK---- 357
Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
S GL DAP+ + + + L L + S S
Sbjct: 358 ----SLGLD--------DAPTASSAGNTEMVFFEPVFEPRNLVLTATIS----SLSPIVD 401
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------ELPGC-KG 549
F V D L G + ++ +G+S+++N ++ +LPG G
Sbjct: 402 FKVAD-LAQEGTPQMYAL----------SGVSERANLRVLRHGLPITQMVDSQLPGTPAG 450
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE----SVDY 605
IWT+ + N + + Y+++S T+VL + + EV + S
Sbjct: 451 IWTIPQSLTTMRNPQYQGIGTVESPADRYIVVSFVGSTLVLGVGETVEEVQDSGILSTTT 510
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ R++ A NL ++Q+F +G R
Sbjct: 511 TILIRSMGA-NL---DSIVQIFAQGIR 533
>gi|238487250|ref|XP_002374863.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
gi|220699742|gb|EED56081.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
Length = 1210
Score = 47.4 bits (111), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 75/342 (21%), Positives = 152/342 (44%), Gaps = 36/342 (10%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1139
+TI ++ +E A++V V T++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
++ + L ++ +++ AL QG L+ GP + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039
Query: 1260 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1311
D+ NI + K+SE G L+ R H + + + Q + T+ +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099
Query: 1312 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAE 1188
>gi|145240731|ref|XP_001393012.1| pre-mRNA-splicing factor rse1 [Aspergillus niger CBS 513.88]
gi|134077536|emb|CAK96680.1| unnamed protein product [Aspergillus niger]
Length = 1209
Score = 47.4 bits (111), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 100/478 (20%), Positives = 200/478 (41%), Gaps = 49/478 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + Q + ++ S DN +Q+ IPL TP + E+ L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNM-LQQSIPLSYTPRRFLKHPEQPLFYVIESD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQT 1080
VL P + + L++ D L D + +++++P A
Sbjct: 814 NNVLAPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---V 868
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRN 1138
TI ++ +E A+++ V T++++ET L +GTA +A G + ++ R
Sbjct: 869 VHTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMTVNPPGSAGGYIHIY---RF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
++ + L ++ +++ AL QG L+ G + ++ +L P
Sbjct: 923 QEDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKT 981
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+V L + I++ D+ +S+ ++ +K Q L D S AT ++D T +
Sbjct: 982 IVGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDR 1310
D+ N+ + K SE G L+ + G + + Q + TS +
Sbjct: 1039 GDKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1099 TQLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1153 LIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|407919154|gb|EKG12409.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1210
Score = 47.4 bits (111), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 85/393 (21%), Positives = 160/393 (40%), Gaps = 57/393 (14%)
Query: 1065 EVRILEPD-----RAGGPWQT--RATIPMQSSENALTVRV--------VTLFNTTTKENE 1109
+ +IL P+ RA G W + + P+ S E T+ + V L T++ +E
Sbjct: 835 DAKILPPEEFGYPRAEGHWASCIQVVDPISSKEVVHTLELEENESAVSVCLAPFTSQNDE 894
Query: 1110 TLLAIGTAYVQGEDVAAR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1165
T L +GTA + VA R G V ++ N + ++ +++ AL Q
Sbjct: 895 TFLVVGTA--KDLVVAPRSYNCGYVHIYRLQENGRE----LEFIHKTKMEAPPMALLPFQ 948
Query: 1166 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1224
G LL+ + L+ + L + P ++ L + I+ D+ +S+ ++ +K
Sbjct: 949 GKLLVGVEADLRLYDLGLRQLLRKAQALNVVPNILIGLQTQGSRIVCSDVQESVTYVVYK 1008
Query: 1225 EQGAQLNLLAKD--FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW---- 1278
+L D C A ++D T + D+ NI + PK SE
Sbjct: 1009 HLENRLIQFCDDSIHRWTSCTA---MVDYETTA--GGDKFGNIWLVRCPPKASEEADEEG 1063
Query: 1279 KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
G L++ + G + Q + TS +T G R LL+ L G++
Sbjct: 1064 SGLHLINERPYLQGTPNRLDLLAHFYTQDIPTSIQKTALVAG----GRELLLWSGLQGTL 1119
Query: 1335 GCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
G P +++ F QSL+++L P +AG + ++R ++ K ++D +
Sbjct: 1120 GIFIPFVSREDVDF--FQSLEQQLRTEDPPIAGRDHLAYRSYYVPVKG-------VIDGD 1170
Query: 1392 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
L + LP +++ IA + + ++ + D
Sbjct: 1171 LCERFLRLPRDKKETIAAELDRSVREVERKIGD 1203
>gi|312076588|ref|XP_003140928.1| xeroderma Pigmentosum Group E Complementing protein [Loa loa]
Length = 516
Score = 47.4 bits (111), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A ++ VP P GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
+ +D +L D+A G+L +L + + G+ V+ + +
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
+ + + + N + F+GSR GDS L++ + GT +S L + + ++ AP
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362
Query: 465 KRLRRSSSDALQDM----VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
++DM NG++ L S + + + + + L +
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
VEL G K ++T+ + D E+ YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+S ++ T VL E T+ + V G T+ AG LF ++QV ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483
>gi|83767504|dbj|BAE57643.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1270
Score = 47.4 bits (111), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 76/350 (21%), Positives = 155/350 (44%), Gaps = 36/350 (10%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1139
+TI ++ +E A++V V T++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
++ + L ++ +++ AL QG L+ GP + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039
Query: 1260 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1311
D+ NI + K+SE G L+ R H + + + Q + T+ +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099
Query: 1312 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1418
+R +++ K ++D +L Y +LP + ++ IA + + +I
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREI 1196
>gi|449295711|gb|EMC91732.1| hypothetical protein BAUCODRAFT_116696 [Baudoinia compniacensis UAMH
10762]
Length = 1148
Score = 47.4 bits (111), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 134/330 (40%), Gaps = 52/330 (15%)
Query: 1105 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
T E +GTAY+ +GR+L+ + +V E+ LKGA LA
Sbjct: 832 TGETAERFVVGTAYLDDAPQQQTKGRILVLEV--TEERRLKVVAEL---GLKGACRCLAV 886
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAP--PLYV-----------VSLNIVKNFIL 1210
+ G ++ A ++++ Y P P V + + + + I
Sbjct: 887 VLGRIVAALVKTVVIYALE---------YQTPSHPFLVKKAAYRTSTAPIDICVTGSTIA 937
Query: 1211 LGDIHKSIYFLSWKE-QGA---QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
+ D+ KS+ +S+K +G L+ +A+ + +L A + + + L +D + N+
Sbjct: 938 VTDLMKSVSLVSYKPGRGGVPDTLSEIARHYETLWGTAIANVAENTYLE---ADAEGNLV 994
Query: 1267 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1326
+ + ++L +E +G V + + + T++ P +
Sbjct: 995 VLQHEVNGYSDEDRRRLRPVSEMLLGEMVNRIRSISVQPTAT--AVVVPRA--------F 1044
Query: 1327 FGTLDGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-QFHSNGKAHRP 1382
T++GSI I+P + RLQ+L + V S HV R FR Q G
Sbjct: 1045 LATVEGSIYLFALISPGKQDLLMRLQALLAERVKSPGHVPFAKWRGFRSQVRDMGGE--- 1101
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
GP VD EL+ Y P+E Q+++A + G
Sbjct: 1102 GPTRFVDGELVERYLEAPVEVQVDVASELG 1131
>gi|68071595|ref|XP_677711.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497932|emb|CAI04454.1| conserved hypothetical protein [Plasmodium berghei]
Length = 493
Score = 47.4 bits (111), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 98/214 (45%), Gaps = 8/214 (3%)
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
P +++SL++++N+I++GDI S+ LS+ + L + +D+ ++ C F+ S
Sbjct: 254 TPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSTLTEVCRDYSNVWC---TFVCALSK 310
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
+VSD + N +F + KL A F+ G V K L + + +S
Sbjct: 311 SHFLVSDMESNFLVFQKSSIRYNDEDSFKLSRVAFFNHGHVVNKMLPVSL--SSLIEEEE 368
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
A + ++L + +GSI I P LT F++ ++ L DS+ + +N S
Sbjct: 369 AQNEILRKKESILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIGNINNNSNNT 428
Query: 1373 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1406
+ N +VD EL + +P E+Q +
Sbjct: 429 YKMN--LSEKSCKGVVDGELFKMFFSMPFEKQFK 460
>gi|340367935|ref|XP_003382508.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Amphimedon
queenslandica]
Length = 1160
Score = 47.4 bits (111), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 74/386 (19%), Positives = 151/386 (39%), Gaps = 69/386 (17%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1123
+R++ P++ +T + + +E A ++ V + + E + +GTA +
Sbjct: 824 LRVMHPNQG----KTLDIVQFEQNEAAFSLAVCQF--VSKGDLEWFVVVGTAKDMIITPR 877
Query: 1124 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ G +++F + + V++ +L A+A QG LL+ G + ++
Sbjct: 878 AISSGSLIVFRLSPDGSK----LEHVHTTQLDDVPIAMAPFQGRLLVGVGKLLRIYDIGK 933
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
++ P VV + ++ + +GD+ ++++FL ++ QL + A + C
Sbjct: 934 KKMLRKCENKHLPYLVVDIKVMGRRVYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFC- 992
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1285
T ++D +T++ +D+ NI I +++ W QK
Sbjct: 993 TTSCILDYNTVA--SADKFGNITILRLPSDVTDQVDEDPSGSRSLWDRGFLNGATQKANV 1050
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1345
+HVG + ++ ++ PG + L++ TL GSIG + P
Sbjct: 1051 MTSYHVGEGINTLHKVSLI----------PGGSE----VLVYTTLSGSIGILVPFS---- 1092
Query: 1346 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1405
K+ D H+ R SN S++D +L Y L ++
Sbjct: 1093 ------SKEDSDFFQHLE----MHMRSEWSNL--------SVIDGDLCEVYNSLDPSKRR 1134
Query: 1406 EIAHQTGTTRSQILSNLNDLALGTSF 1431
EIA + S++ L DL +F
Sbjct: 1135 EIALDLDRSPSEVAKKLEDLRTRYAF 1160
>gi|392566425|gb|EIW59601.1| hypothetical protein TRAVEDRAFT_167065 [Trametes versicolor FP-101664
SS1]
Length = 1263
Score = 47.4 bits (111), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 117/265 (44%), Gaps = 27/265 (10%)
Query: 1112 LAIGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1169
+GTA ++ E+ + GR+LLFS +++N +T V S +++G + AL + +G +
Sbjct: 948 FCLGTAVIRPEEREPSNGRILLFSL--SSENGVRSLTTVASHKVRGCVYALQHVSEGVIA 1005
Query: 1170 IASGPKIILHKWTGTELNGIAF---YDAPPL-----YVVSLNIVKNFILLGDIHKSIYFL 1221
A ++L+K L G F D +V SL F+L+GD S+ L
Sbjct: 1006 AAINTSVLLYKIREGNL-GEGFDRVLDKAAEWNHNHFVTSLVWDGQFLLVGDAISSVSVL 1064
Query: 1222 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+ +L +A+D+ L A E +G + + +F +A + G
Sbjct: 1065 RVADDATKLESVARDYAPLWPVAIESTGNGGVIG-----ANSDCNLFSFALQRGPQRNG- 1118
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
L +H+ V K ++ + +S+D + D+ + +F T G IG I ++
Sbjct: 1119 -LEKNGVYHIDDVVNKLIKGAL--SSADVS-----QDQAVKAGHVFFTSTGRIGAILDMN 1170
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLN 1366
+ + +LQ+ + S+ G+N
Sbjct: 1171 DTMSLHMTALQRNMAKSLIGPGGVN 1195
>gi|402586182|gb|EJW80120.1| hypothetical protein WUBG_08972 [Wuchereria bancrofti]
Length = 162
Score = 47.0 bits (110), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 36/154 (23%), Positives = 68/154 (44%), Gaps = 26/154 (16%)
Query: 1281 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1340
QKL + A ++G +T S + PG++ L + T+ G IG + P
Sbjct: 32 QKLEAIAHLYIGDAIT----------SMQKASLVPGAND----CLSYTTISGIIGILVPF 77
Query: 1341 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1397
DE F Q+L+ + P + G + ++R ++ K S++D +L Y
Sbjct: 78 MSRDEFEF--FQNLEMHMRVEYPPLCGRDHLAYRSYYFPVK-------SVIDGDLCEQYS 128
Query: 1398 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
++PL++Q + + G ++I L D+ +F
Sbjct: 129 LMPLDKQKSVGEELGRKPTEIHKKLEDIRTRYAF 162
>gi|70954357|ref|XP_746229.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526771|emb|CAH77136.1| hypothetical protein PC000016.02.0 [Plasmodium chabaudi chabaudi]
Length = 372
Score = 47.0 bits (110), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 58/286 (20%), Positives = 111/286 (38%), Gaps = 45/286 (15%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G ++++ G K+ ++ +L Y P +VS+ + + I DI +S+
Sbjct: 107 CFCPFNGRVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKVSGDRIFASDIRESVL 166
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1266
+ + L++ D +E L + ++ L V +E K +
Sbjct: 167 IFFYDSNQNVIRLISDDIIPRWITCSEILDHHTIMAADKFDSVFILRVPEEAKQEEYGIA 226
Query: 1267 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+Y +S S K +K+ FH+G VT ++++ SS+
Sbjct: 227 NKCWYGGEVISSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPASSE--------------C 272
Query: 1325 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1379
+++ T+ G+IG P D ELT Q L+ L + G FR ++ +
Sbjct: 273 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKHALCGREHIFFRSYYHPVQ- 327
Query: 1380 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
++D +L + LP + Q ++A T +IL L D+
Sbjct: 328 ------HVIDGDLCEQFSSLPFDVQRKVASDLEKTPDEILRKLEDI 367
>gi|68060004|ref|XP_671977.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56488645|emb|CAI04030.1| hypothetical protein PB301494.00.0 [Plasmodium berghei]
Length = 346
Score = 47.0 bits (110), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 57/269 (21%), Positives = 108/269 (40%), Gaps = 41/269 (15%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 90 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 149
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
+ S SM +S G A T++L+ D + + ++ + + ++N
Sbjct: 150 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 207
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG-CKGIWT 552
R SL + +GL I A+ ELPG + IWT
Sbjct: 208 IPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 249
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
+ +S EY Y+I+S E T++LE + + EV +S+ + T
Sbjct: 250 IKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 293
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQ 641
NL IQV++ G R ++G + +
Sbjct: 294 IHINLLYDNSFIQVYDTGIRHINGKIVQE 322
>gi|403415203|emb|CCM01903.1| predicted protein [Fibroporia radiculosa]
Length = 1267
Score = 47.0 bits (110), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 80/365 (21%), Positives = 145/365 (39%), Gaps = 61/365 (16%)
Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ--------- 1120
+P+R G P +T +++ + ++ R+ + + +E ++L + ++ V
Sbjct: 889 KPNRIGEPQETTSSLKLL--DDTTFNRIASFACESDEEVTSVLTLSSSDVSSARFCVGTV 946
Query: 1121 ----GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQG---------- 1166
GE + GR+LLFS ++ LV+ S + G + L S+QG
Sbjct: 947 QFKPGETEPSSGRILLFSLNTGPESSFQLVS---STPVSGCVYQLVSIQGMIAAAVNTSV 1003
Query: 1167 ---HLLIASGPKIILHK---WTGTELNGIAFYDAP------PLYVVSLNIVKNFILLGDI 1214
++ IA P + +HK + +LN V L + +++GD
Sbjct: 1004 RTAYVFIAFDPNMTIHKVILFKPEKLNNSTVVLTKVSEWNHNYSVTGLVVHGCMLIVGDA 1063
Query: 1215 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
SI F+ K L +A+D+ L + E + DG + SD +F +A +
Sbjct: 1064 ISSISFV--KVDDTTLESIARDYSPLWPVSVEAM-DGDGVIGANSD----CNLFTFA--L 1114
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGS 1333
S L +++G V KFLR + S+R P LF T G
Sbjct: 1115 QRSGHRSTLERNGSYYLGDMVNKFLRGSLTNIDISERKSIEPKH--------LFFTSTGR 1166
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS--IVDCE 1391
IG I +++ + LQ+ + + G++ +FR +N K H + +D +
Sbjct: 1167 IGVILEMNDKISLHMTGLQRNMGKRIIGPGGVHHATFRA-PANSKGHSDAEAAFGFLDGD 1225
Query: 1392 LLSHY 1396
L Y
Sbjct: 1226 FLEQY 1230
>gi|213407660|ref|XP_002174601.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
yFS275]
gi|212002648|gb|EEB08308.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
yFS275]
Length = 1078
Score = 47.0 bits (110), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 134/301 (44%), Gaps = 29/301 (9%)
Query: 1104 TTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1162
T +NE ++ +GT + D GR+++F D + LVTE K +GAI ++
Sbjct: 768 TLPDNERVV-VGTGFNYPDRDEPDGGRLIVF----RLDEQEKLVTEAVYKT-QGAIFSVE 821
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV---KNFILLGDIHKSIY 1219
+G LL+ + ++ L + P LNI K+ +++GD+ KS+
Sbjct: 822 YQEGKLLVGMNAVLCTFRYENKTLRVVGSTRTP---TYCLNIAASSKDIVVVGDMMKSLT 878
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
+ ++ A+ +A+DFG+L + + L TL + + + + + + K +S +
Sbjct: 879 LYNTEKDTAE--EVARDFGALWVTSVQPL--SETLFFCTTADGEAVTML-WDTKAPQSVE 933
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
+KL ++ + +G V + R + +S R P L+ T++G I I
Sbjct: 934 RKKLRWKSCYRLGDMVNRTRRGCFVLSSPSRL-VKP--------ELMCVTVEGGILLIGD 984
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+ LQ +Q +++VP + GL+ + + +A D +D +LL E L
Sbjct: 985 ASQHADLLLQ-IQHNFLEAVPPLGGLDFYKWHERLFPARASAANKD-FIDGDLLESIEDL 1042
Query: 1400 P 1400
P
Sbjct: 1043 P 1043
>gi|226291941|gb|EEH47369.1| DNA damage-binding protein 1a [Paracoccidioides brasiliensis Pb18]
Length = 1209
Score = 46.6 bits (109), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 125/305 (40%), Gaps = 42/305 (13%)
Query: 1111 LLAIGTAYVQGEDV---AARGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQG 1166
+ +GT+Y+ +DV + RGR+L F TG + +V +KGA ALA +Q
Sbjct: 889 IFVVGTSYL--DDVGEGSIRGRILAFEVTGSRQ------LAKVAELPVKGACRALAVVQD 940
Query: 1167 HLLIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSI 1218
++ A ++++ E LN A Y AP + + + N I + D+ KS+
Sbjct: 941 KIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAP----IDIAVTGNLIAVADLMKSV 996
Query: 1219 YFLSWKE----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1274
+ +K+ Q L +A+ F +L A + + L SD + N+ +
Sbjct: 997 SIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLE---SDAEGNLVVLNRNVNG 1053
Query: 1275 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
++L +E +G V R++ ++ P + A L GT++GSI
Sbjct: 1054 VTDDDKRRLEVTSEILLGEMVN---RIRPVSIQGSLPATGPREAVISPKAFL-GTVEGSI 1109
Query: 1335 ---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1391
G I P + RLQS LV P N FR F N P VD E
Sbjct: 1110 YLFGLINPAYQDLLMRLQSAMAGLV-VTPGAMPFN--KFRAFK-NAVRQAEEPYRFVDGE 1165
Query: 1392 LLSHY 1396
L+ +
Sbjct: 1166 LIERF 1170
Score = 44.7 bits (104), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 74/174 (42%), Gaps = 38/174 (21%)
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
+W+ C I+ LK+ L A L+ VP+P+GG+LV+G +I Y
Sbjct: 279 AWQDTGC-IAVFKALDLLKEE--------LEMGASFLIPVPAPLGGLLVLGETSIRYLD- 328
Query: 335 SASCALALNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLT 390
D++ E S+ LD A W Q D LL+ G L L
Sbjct: 329 ---------------DATNE----CISLPLDEATIFVAWEQVDGQRWLLADDYGRLFFLM 369
Query: 391 VVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
++ D VQ +LDL P S + +G + F+GS GDS L++ T GS
Sbjct: 370 LILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVTFIGSHQGDSQLIRITEGS 421
>gi|118369889|ref|XP_001018147.1| hypothetical protein TTHERM_00279910 [Tetrahymena thermophila]
gi|89299914|gb|EAR97902.1| hypothetical protein TTHERM_00279910 [Tetrahymena thermophila SB210]
Length = 1563
Score = 46.6 bits (109), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 82/179 (45%), Gaps = 27/179 (15%)
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC-----FATEFLIDGSTLSLVVSD 1260
K + L+GDI K + F E ++ L+A++ +++ F+ + + + L +++D
Sbjct: 1343 KQYFLVGDIQKGVQFYEMDEIQSKPRLIAEENININVRQCVLFSIQ---NKNALRALITD 1399
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLSR---AEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1317
E +N+ + + + E ++ +S A FHVG+ + K T+ +
Sbjct: 1400 ESRNVYAYSFIQQQQELPTDKRKISMELVASFHVGSKINK-------ITTDYKVNDTSMQ 1452
Query: 1318 DKTNRFALLFGTLDGSIGCIA----PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
+ + LL DG++ I P D L +Q + + +P++ GL+PR FR+
Sbjct: 1453 ESVSHLLLL--KQDGNLSDIQLIYEPGDNTN---LFDMQNTIFEELPYIGGLDPREFRE 1506
>gi|170090007|ref|XP_001876226.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164649486|gb|EDR13728.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1275
Score = 46.6 bits (109), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 71/322 (22%), Positives = 138/322 (42%), Gaps = 36/322 (11%)
Query: 1069 LEPDRAGGPWQTRATIPM------------QSSENALTVRVVTLFNTTTKENETLLAIGT 1116
+EP+R P +R++ + + T VV+ + +GT
Sbjct: 903 MEPNRVNDPEISRSSFKLLDDTSFANLCQFNCDPDEETTAVVSFSQKIAGKPMPFFCVGT 962
Query: 1117 -AYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1175
Y GE + GR+++F T + + ++ + S ++ G + AL +Q ++ A
Sbjct: 963 YVYKAGEVEPSAGRLMIF-TATTSTSSNLALSLMASTKVPGCVYALTVVQNQIVAAVNSS 1021
Query: 1176 IILHKWTGTELNG----IAFYDAPPLYVV-SLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1230
++L + + + I + Y+V SL + +++GD SI L + ++L
Sbjct: 1022 VMLFRLESSSDSLSPSLIKVSEWHHNYLVTSLGSYADRVVVGDQPSSISLLQVTQ--SKL 1079
Query: 1231 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1290
A+D+G L E L + ++ +++ N+ F M S +L +H
Sbjct: 1080 ISQARDYGPLWPVCVEALDERH---IIGANDSLNLFTFSLEKAMGRS----RLERDGCYH 1132
Query: 1291 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQ 1349
V VTKFLR + +SSD + +P + + +F T G IG + + DE +L
Sbjct: 1133 VADLVTKFLRGSL--SSSDASTTSPLTSEA-----MFFTSSGRIGVVVDVKDEELSLQLT 1185
Query: 1350 SLQKKLVDSVPHVAGLNPRSFR 1371
++Q+ L + + V G + +R
Sbjct: 1186 NMQRNLANVIQGVGGSSHSKYR 1207
>gi|238491136|ref|XP_002376805.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
NRRL3357]
gi|220697218|gb|EED53559.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
NRRL3357]
Length = 1117
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)
Query: 1112 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTAY+ E + + RGR+L+F DN + L T+V +KGA ALA L ++
Sbjct: 809 FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 863
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1226
A I+++K + V + +V N I++ D+ KS+ L +KE
Sbjct: 864 ALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 923
Query: 1227 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
L +A+ F ++ +AT ID T + SD + N+ + E +
Sbjct: 924 ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 979
Query: 1282 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
+L +E +G V + + +Q LA+ + A GT++GSI A
Sbjct: 980 RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1027
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
++ L LQ + V + + FR F S + P VD EL+ +
Sbjct: 1028 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1086
Query: 1400 PLEEQLEIAHQTG 1412
E Q EI + G
Sbjct: 1087 EPELQEEIVNSVG 1099
>gi|169773185|ref|XP_001821061.1| UV-damaged DNA binding protein [Aspergillus oryzae RIB40]
gi|83768922|dbj|BAE59059.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1139
Score = 46.2 bits (108), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)
Query: 1112 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTAY+ E + + RGR+L+F DN + L T+V +KGA ALA L ++
Sbjct: 831 FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 885
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1226
A I+++K + V + +V N I++ D+ KS+ L +KE
Sbjct: 886 ALVKTIVMYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 945
Query: 1227 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
L +A+ F ++ +AT ID T + SD + N+ + E +
Sbjct: 946 ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 1001
Query: 1282 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
+L +E +G V + + +Q LA+ + A GT++GSI A
Sbjct: 1002 RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1049
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
++ L LQ + V + + FR F S + P VD EL+ +
Sbjct: 1050 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1108
Query: 1400 PLEEQLEIAHQTG 1412
E Q EI + G
Sbjct: 1109 EPELQEEIVNSVG 1121
>gi|430814207|emb|CCJ28534.1| unnamed protein product [Pneumocystis jirovecii]
Length = 904
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 64/307 (20%), Positives = 134/307 (43%), Gaps = 34/307 (11%)
Query: 1098 VTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-ELK 1155
V + T +N+ + +GT + + E+ +++GR++LF N V+S+ ++
Sbjct: 588 VQCITSVTIDNQDIFVVGTGFSLPEEEESSKGRIILFGV-------TNKKIWVFSEIQVN 640
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGT--ELNGIAFYDAPPLYVVSLNIVKNFILLGD 1213
A+ + + ++ + ++ + + N IA Y + L +SL + +++GD
Sbjct: 641 DAVYCIGIIDNKIIAGINALVHIYAYDSSLKNFNVIATYRSTTL-CLSLAVHGTHVIIGD 699
Query: 1214 IHKSIYFLSW--KEQGAQLNLLAKDFGSL--DCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
+ KS+ L++ E G +L +AKD L C A +D + ++ + N+ +F+
Sbjct: 700 LMKSVSLLAFINTENGPRLKEVAKDCNPLWMTCVAA---LDNDLY--IGAEAEGNLSLFW 754
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
+ +++ KL +E G V + +L S+ + P + F T
Sbjct: 755 --KDFNTTFEENKLQIISEIKWGELVNQIKPGTILY--SENSIIIPKAT--------FVT 802
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
+DGSIG I + L +LQ + + + LN ++R F N + P +D
Sbjct: 803 VDGSIGIIFTVKREYLEFLVNLQSNMGKIISGIGCLNHSNWRAF-CNRRKKSNEPKCFID 861
Query: 1390 CELLSHY 1396
+ + +
Sbjct: 862 GDFVEIF 868
>gi|391865638|gb|EIT74917.1| damage-specific DNA binding complex, subunit DDB1 [Aspergillus oryzae
3.042]
Length = 1135
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)
Query: 1112 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1170
+GTAY+ E + + RGR+L+F DN + L T+V +KGA ALA L ++
Sbjct: 827 FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 881
Query: 1171 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1226
A I+++K + V + +V N I++ D+ KS+ L +KE
Sbjct: 882 ALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 941
Query: 1227 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
L +A+ F ++ +AT ID T + SD + N+ + E +
Sbjct: 942 ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 997
Query: 1282 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
+L +E +G V + + +Q LA+ + A GT++GSI A
Sbjct: 998 RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1045
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
++ L LQ + V + + FR F S + P VD EL+ +
Sbjct: 1046 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1104
Query: 1400 PLEEQLEIAHQTG 1412
E Q EI + G
Sbjct: 1105 EPELQEEIVNSVG 1117
>gi|291000406|ref|XP_002682770.1| predicted protein [Naegleria gruberi]
gi|284096398|gb|EFC50026.1| predicted protein [Naegleria gruberi]
Length = 1216
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 67/349 (19%), Positives = 137/349 (39%), Gaps = 64/349 (18%)
Query: 1106 KENETLLAIGTAY---VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1162
K NE+L+ +GTA + G + +F + + ++ E++ AL
Sbjct: 897 KSNESLIIVGTAKNMKLYPTRTCDCGYINVFQISEDGK-----LQLIHKTEVEDVPYALH 951
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
+ +G LL+ + ++ +L + P ++ S+ + N I +GDI +S +F+
Sbjct: 952 AFRGRLLVGVKNMLRIYDLGKKKLLRKCENKSFPNFITSIAVDGNRIFVGDITESFHFVK 1011
Query: 1223 WKEQGAQLNLLAKD----------------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1266
+ L + A + D F F+ S L VSDE ++
Sbjct: 1012 FNSSENSLTIFADNTTPRWLTASALVDHNTIAGGDKFGNFFI---SRLPSDVSDELED-- 1066
Query: 1267 IFYYAPKMSESW---KG------QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1317
+ E W +G QK +F+VG+ +T + ++A P
Sbjct: 1067 ----SSTGKEKWIWERGLLNGAPQKATEIVKFYVGSMITSIYKTSLIA-------GGPS- 1114
Query: 1318 DKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1376
L++ T+ G++G P + L+ L + P + G + ++R ++
Sbjct: 1115 ------ILIYTTITGAVGVFFPFTSKKDIEFFTQLEMHLREKNPPLCGRDHLAYRSYYFP 1168
Query: 1377 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
K S+VD +L+ + + L+ + +I+ T ++I + D+
Sbjct: 1169 VK-------SVVDGDLIEQFNDVDLQTKTKISEDLQRTINEIAKKIEDM 1210
>gi|58269920|ref|XP_572116.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
gi|57228352|gb|AAW44809.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1276
Score = 45.8 bits (107), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 110/266 (41%), Gaps = 42/266 (15%)
Query: 1112 LAIGTAYVQGED------------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1158
LA+GTA++ +D V GRVLL + D ++ ++ GA+
Sbjct: 945 LAVGTAFLPADDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGGGWDIKIKAELATVGAV 1003
Query: 1159 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1209
AL + G L +A+G K+ +H+ EL + + A + SL+++ + +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSHIRPEGAL 1062
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD +S+ L+ E + ++ + A L D +V+SD N+ +
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAYSNLLTYR 1121
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
QKL A F + VT+F ++ T++ P +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKL 1355
+G +G I L + R L LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190
>gi|407034933|gb|EKE37449.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
Length = 995
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 92/197 (46%), Gaps = 34/197 (17%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1140
TI ++S+E AL V + + + A+GTA ++ ++ + GR+LL
Sbjct: 692 TTIELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 737
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
+ + L+ V +++ GA+ L Q ++ + + + G +LN PL V
Sbjct: 738 DTEGLIYIVGTEDYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQNM-LLPLIGV 796
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1251
SL+I K++I+ GD+ +S+ ++ L+++ KD GS++ + T FL
Sbjct: 797 SLDICKDYIIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFL--- 853
Query: 1252 STLSLVVSDEQKNIQIF 1268
V+D NI+IF
Sbjct: 854 ------VADVSGNIKIF 864
>gi|390603312|gb|EIN12704.1| hypothetical protein PUNSTDRAFT_97523 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1268
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 106/237 (44%), Gaps = 35/237 (14%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV--AARGRVLLFSTGRNADNPQ 1143
+++ E+ + V++L T + + IGTA +D ++GR+++F D
Sbjct: 933 LEADEDITSAVVLSL--GTAEAYTSHFCIGTADFTSDDQLEVSKGRLVVF------DPST 984
Query: 1144 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1203
+++ V + ++ G + ALAS+QG + A +I+++ E +G F + + + N
Sbjct: 985 KVLSPVATLDVNGCVYALASIQGLVAAAVNSAVIVYRL---ETDGPTFSSKRLVQLANWN 1041
Query: 1204 ---IVKNF------ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
V N I +GD S+ L G L +A+D+G L A E
Sbjct: 1042 HNYFVTNLVTRGSRIFVGDAISSVSILELT--GQALQTVARDYGPLWPVAIE---STGPD 1096
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT 1311
S++ +D + N+ F K+SE KL +H+G V KF+ +LA T
Sbjct: 1097 SVIGADGEFNLFTF----KLSEG----KLERDGSYHLGEQVNKFVPGGLLAADPAHT 1145
>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
Length = 1249
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 98/239 (41%), Gaps = 43/239 (17%)
Query: 95 GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
I L+ +C + + ++SL + G+ ++D +++ + K+++L++D + L
Sbjct: 52 NIDTGKLDKICVHNVFSVIQSLEKVRLTGS----QKDYLVVTSDSGKLAILQYDTGRNRL 107
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ--MIILKASQGGSG 212
+ F+ P H K G GP + DPQ R +L+ L+ +I K G
Sbjct: 108 ----VTVFQEP---HSKTGFRRNTPGPYLLTDPQNR--AILIGALERNKLIYKVHSDDKG 158
Query: 213 LVGDEDTFGSGGGFSARIESS--HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW 270
G S+ +ES H I L + GY PV V +
Sbjct: 159 ----------GMQISSPLESQIRHTITLAMCALDT--------GYENPVFVAIEAEYGAL 200
Query: 271 AGRV----SWKHHTCMISALSISTTLKQHPLIWSAMN--LPHDAYKLLAVPSPIGGVLV 323
+ S H T + ++ + L ++ +N LP A L+ +PSP+GGVL+
Sbjct: 201 DSKEYSIDSQAHQTLLFTSYELDQGLNH--VVRRVVNNKLPISATHLIPLPSPVGGVLI 257
>gi|134113697|ref|XP_774433.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257071|gb|EAL19786.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1276
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 111/266 (41%), Gaps = 42/266 (15%)
Query: 1112 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1158
LA+GTA++ GED V GRVLL + D ++ ++ GA+
Sbjct: 945 LAVGTAFLPPDDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGGGWDIKIKAELATVGAV 1003
Query: 1159 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1209
AL + G L +A+G K+ +H+ EL + + A + SL+++ + +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSHIRPEGAL 1062
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD +S+ L+ E + ++ + A L D +V+SD N+ +
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAYSNLLTYR 1121
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
QKL A F + VT+F ++ T++ P +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKL 1355
+G +G I L + R L LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190
>gi|239613967|gb|EEQ90954.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ER-3]
gi|327353314|gb|EGE82171.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ATCC 18188]
Length = 1199
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 126/314 (40%), Gaps = 42/314 (13%)
Query: 1111 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
L +GT+Y+ GE + RGR+L F P + +V +KGA ALA +Q +
Sbjct: 883 LFIVGTSYLDDFGEG-SIRGRILAFEV-----TPNRQLGKVAEMPVKGACRALAIVQDKI 936
Query: 1169 LIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1220
+ A ++++ + + L A Y AP + + + N I + D+ KS+
Sbjct: 937 VAALMKTVVVYTLSKGQFADYILTKTASYRTSTAP----IDIAVTGNLIAVADLMKSVSI 992
Query: 1221 LSWKEQ----GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
+ +++ L +A+ F +L A + + L SD + N+ +
Sbjct: 993 VEYQQGTDGLSGSLTEVARHFQTLWSTAVAPVAQDTWLE---SDAEGNLVVLRRNVNGVT 1049
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI-- 1334
++L +E +G V + + + A+ +P + GT++GSI
Sbjct: 1050 EDDRRRLEVTSEVLLGEMVNRIRPVNIQASLGTEAAISPRA--------FLGTVEGSIYL 1101
Query: 1335 -GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
G I P + RLQS +V + G+ FR F N P VD EL+
Sbjct: 1102 FGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTVRQAEEPYRFVDGELI 1157
Query: 1394 SHYEMLPLEEQLEI 1407
+ E Q EI
Sbjct: 1158 ERFLGCGAELQEEI 1171
Score = 45.1 bits (105), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
W Q D LL+ G L L ++ D VQ L + S + +G + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400
Query: 428 RLGDSLLVQFTCGS 441
GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414
>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1121
Score = 45.8 bits (107), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 136/318 (42%), Gaps = 46/318 (14%)
Query: 1114 IGTAYVQGEDVA----ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
IGT+++ +D RGR+L+ N V ++ S LKGA L +L H++
Sbjct: 794 IGTSFITDDDAIEENDTRGRILVLGVDENRQ-----VYQIVSHNLKGACRCLGTLGEHIV 848
Query: 1170 IASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW- 1223
++++ + L +A Y P + +SL+I N I + D+ +S+ + +
Sbjct: 849 AGLSKTVVVYHYVEETTVFGSLQKLAAY-RPASFPLSLDISGNIIGVVDLMQSLTLVEFI 907
Query: 1224 -KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
E G++ L +AT +DG + +D Q NI + P+
Sbjct: 908 PSEDGSRAKLEETARHYQPGWATSVAHLDGE--RWLEADAQGNIIVLQRNPEAPTEQDRS 965
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP-------GSDKT-----NRFALLFGT 1329
KL +E ++G + + +L + S++ +P G +T N+ +L
Sbjct: 966 KLEVTSEMNIGEQINQIRKLHV--ASNENAVVSPKAFLGSVGLSETIITCWNQLLMLV-Q 1022
Query: 1330 LDGSI---GCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1385
++G++ G IAP +L L + Q +L D + ++ +R F + + GP
Sbjct: 1023 IEGTLYLFGEIAPNYQDL----LLTFQSRLQDYIYAPGNVSFNLWRAFRNKAR-EGDGPF 1077
Query: 1386 SIVDCELLSHYEMLPLEE 1403
VD E++ + L L+E
Sbjct: 1078 RFVDGEMVERF--LDLDE 1093
>gi|322700233|gb|EFY91989.1| Pre-mRNA-splicing factor rse-1 [Metarhizium acridum CQMa 102]
Length = 1039
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 151/365 (41%), Gaps = 48/365 (13%)
Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNP 1142
++ +E ++ VV T+++ E+ L IGT G+D+ R R D+
Sbjct: 704 LEGNEAGVSAAVVPF---TSQDGESFLIIGT----GKDMIVNPRQSSEGFIHVYRFHDDG 756
Query: 1143 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1202
++L ++ +++ +AL S G LL G + ++ +L A D P ++VSL
Sbjct: 757 RSL-EFIHKTKVEEPPTALLSFHGRLLAGIGKTLRIYDLGMRQLLRKAQADISPQHIVSL 815
Query: 1203 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSD 1260
I++GD+ + + + +L D + C A S+ D
Sbjct: 816 QSQGFRIVVGDVQHGVTMVVYNPVSNKLLPFVDDTIARWTTCLAM-----ADYESVAGGD 870
Query: 1261 EQKNIQIFYYAPKMS----ESWKGQKLLS-RAEFHVGAHVTKFLRLQMLA--------TS 1307
+ NI I K S E +L + ++ H AH RLQ++A TS
Sbjct: 871 KFGNIWIVRCPDKASAEADEPGSDVQLSNGQSYLHGAAH-----RLQLMAHMFVQDIPTS 925
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLN 1366
+T G LL+ L G+IG + PL T Q+L+ + + P +AG +
Sbjct: 926 ICKTSLVVGGQDV----LLWSGLQGTIGVLIPLVTRETADFFQTLEMHMRNEDPPLAGRD 981
Query: 1367 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
+R +H K ++D +L + +L E++ IA + + ++ ++D+
Sbjct: 982 HLMYRGYHVPVKG-------VIDGDLCERFSLLSREKKQMIAGELDRSVREVERRISDVR 1034
Query: 1427 LGTSF 1431
+ + F
Sbjct: 1035 IRSVF 1039
>gi|159486547|ref|XP_001701300.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
[Chlamydomonas reinhardtii]
gi|158271783|gb|EDO97595.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
[Chlamydomonas reinhardtii]
Length = 1078
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 79/355 (22%), Positives = 137/355 (38%), Gaps = 84/355 (23%)
Query: 1109 ETLLAIGTA----YVQGEDVAARGRVL-LFSTGRNADNPQNLVTEVYSKELKGAI-SALA 1162
ET+L +GTA Y+ + AA RV L GR D ++ ++ G + ALA
Sbjct: 765 ETVLLVGTARGLRYMPTDCEAAYIRVYRLGDGGRRLD-------LLHKTQVDGGVPGALA 817
Query: 1163 SLQGHLLIASGPKIILH--------------KWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
+G LL GP + L+ +WT L+ + FY P + S
Sbjct: 818 GFKGRLLAGVGPTLRLYDMGKKKMLRKCEYNRWTNIFLH-VFFYR--PYFRSS------- 867
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFG----------SLDCFATEFLIDGSTLSLVV 1258
+S++ + +K+ + A D D AT + +
Sbjct: 868 ------QESVHMMRYKKADNAFYIFADDVAPRYLSALLPLDYDTIATGDKFGNLVILRLP 921
Query: 1259 SDEQKNIQIFYYAPKMSES-----WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
+ + ++ KM+ + KL +FHVG +T R +M A +
Sbjct: 922 QEASQQVEDDPTGGKMAAASGKLNGAPHKLEELVKFHVGDTITALQRAEMQAGGQE---- 977
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1370
L++ T+ G+IG + P +++ F L+ L P +AG + ++
Sbjct: 978 ----------VLVYSTVMGAIGVVYPFTSREDVDF--FSHLEMHLRQENPPLAGRDHLAY 1025
Query: 1371 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
R A+ P + VD +L S Y +P+++Q IA T ++L L D+
Sbjct: 1026 RS------AYFP-VRNCVDGDLCSQYASIPMKKQQMIAEAMDRTTGEMLKKLEDI 1073
>gi|147787360|emb|CAN64633.1| hypothetical protein VITISV_043788 [Vitis vinifera]
Length = 1143
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 76/181 (41%), Gaps = 28/181 (15%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+RIL+P A T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 822 IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 866
Query: 1126 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1175
+ L F R+ D + ++ +++G AL QG LL G
Sbjct: 867 ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 923
Query: 1176 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1235
+ L+ L P +VS++ ++ I +GDI +S ++ ++ QL + A
Sbjct: 924 LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 983
Query: 1236 D 1236
D
Sbjct: 984 D 984
>gi|183233163|ref|XP_654084.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
gi|169801703|gb|EAL48698.2| damaged DNA binding protein, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449708240|gb|EMD47737.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
Length = 995
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 92/197 (46%), Gaps = 34/197 (17%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1140
TI ++S+E AL V + + + A+GTA ++ ++ + GR+LL
Sbjct: 692 TTIELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 737
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
+ + L+ V +++ GA+ L Q ++ + + + G +LN PL V
Sbjct: 738 DTEGLIYIVGTEDYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQNM-LLPLIGV 796
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1251
SL+I K++I+ GD+ +S+ ++ L+++ KD GS++ + T FL
Sbjct: 797 SLDICKDYIIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFL--- 853
Query: 1252 STLSLVVSDEQKNIQIF 1268
V+D NI+IF
Sbjct: 854 ------VADVSGNIKIF 864
>gi|125977518|ref|XP_001352792.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
gi|54641542|gb|EAL30292.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
Length = 1228
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + S + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP T L+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|402467441|gb|EJW02742.1| hypothetical protein EDEG_02863 [Edhazardia aedis USNM 41457]
Length = 1274
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 50/78 (64%), Gaps = 2/78 (2%)
Query: 1154 LKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLG 1212
+K + + S++G+L+I G +++++K E L IAF+D + VSL+++KNFIL+G
Sbjct: 988 IKSSAISCDSIRGNLVIGQGTRLMIYKIDRLEGLVAIAFHDLSII-AVSLSVIKNFILVG 1046
Query: 1213 DIHKSIYFLSWKEQGAQL 1230
D+ + + F ++ + ++
Sbjct: 1047 DLLRGVTFFYFQTRPVKI 1064
>gi|336263557|ref|XP_003346558.1| hypothetical protein SMAC_04731 [Sordaria macrospora k-hell]
gi|380090453|emb|CCC11749.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1149
Score = 45.4 bits (106), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 138/337 (40%), Gaps = 49/337 (14%)
Query: 1114 IGTAYVQGEDVAA----RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
+GT++++ D A RGR+L+F N D V EL+GA ALA + ++
Sbjct: 823 VGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYL-----VLKHELRGACRALAVMGSKIV 877
Query: 1170 IASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW- 1223
A +++ ++ T L +A Y Y + + + N I + D+ KS + +
Sbjct: 878 AALHKTVVISQYEETSSTEARLVKLASYRCTT-YPIDIAVHGNIIAVADMMKSATLVEYV 936
Query: 1224 -------KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
K + A+L A+ S +AT ++G S + +D N+ + +
Sbjct: 937 QAKTEEEKYEPAKLVECARHRHS--AWATAVAHVEGE--SWLEADANGNLVVLQRNVEGV 992
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1334
+ ++L +E ++G V K +++ +S T P + T +G I
Sbjct: 993 TAEDQRQLRITSELNLGEQVNKIRPIKV--ETSPNTIIIPRA--------FLATAEGGIY 1042
Query: 1335 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR----PGPDSIV 1388
G IA +L R Q KL + V L+ S+R F + + GP +
Sbjct: 1043 LFGTIAREQDLLLR----FQDKLAAVIKTVGELDFNSYRAFRNAERGPETDGTTGPVRFL 1098
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
D ELL + + Q EI G + Q+ + + +L
Sbjct: 1099 DGELLERFLDVDETTQKEICEGLGPSVEQMRNMVEEL 1135
>gi|195586770|ref|XP_002083143.1| GD13507 [Drosophila simulans]
gi|194195152|gb|EDX08728.1| GD13507 [Drosophila simulans]
Length = 1227
Score = 45.4 bits (106), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G+SD ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLSDNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|353232348|emb|CCD79703.1| putative dna repair protein xp-E [Schistosoma mansoni]
Length = 1329
Score = 45.4 bits (106), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S + R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP V +Y GL IIL G + G+ + +IE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
D F++GY P +++E EL H L+ L ++
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
D+ L+ VP P GGV++VG N I YH++ ++ Y +SQ L ++ +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271
Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
D A + L D+ A + T + L+ V G + L P
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+GS LGDS L++
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350
>gi|443896643|dbj|GAC73987.1| predicted DNA methylase [Pseudozyma antarctica T-34]
Length = 1285
Score = 45.4 bits (106), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 62/301 (20%), Positives = 121/301 (40%), Gaps = 43/301 (14%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1208
V+ E+ L + QG LL G + +++ +L + P +V+L+ +
Sbjct: 1007 VHKTEVDDVPLVLRAFQGRLLAGVGKVLRIYELGKKKLLRKCENRSFPTAIVALDAQGSR 1066
Query: 1209 ILLGDIHKSIYFLSWKEQGAQLNLLAKDF----------------GSLDCFATEFL--ID 1250
I++GD+ +S+ F S+K +L A D + D F ++ ID
Sbjct: 1067 IVVGDMQESVIFASYKPLENRLVTFADDIMPRYVTRCTMLDYDTVAAADKFGNVYVVRID 1126
Query: 1251 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1310
T S V ++ + + P + + LL A + VG +T R M+
Sbjct: 1127 ADT-SRSVDEDVTGMTTMHEKPLLMGAAHKATLL--AHYFVGDIITSLSRAVMV------ 1177
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
PG R LL+ + G+IG + P + + + +L+ +L + G + +
Sbjct: 1178 ----PG----GREVLLYTGISGTIGALVPFVSKEDVDTMTTLEMQLRQQSDSLVGRDHLA 1229
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1429
+R ++ K ++D +L + +LP +Q +A + S++ L L G
Sbjct: 1230 YRSSYAPVK-------HVIDGDLCESFGLLPPAKQSAVAQELDRKPSEVNKKLAQLREGA 1282
Query: 1430 S 1430
+
Sbjct: 1283 T 1283
>gi|256088964|ref|XP_002580590.1| DNA repair protein xp-E [Schistosoma mansoni]
Length = 1329
Score = 45.4 bits (106), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S + R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP V +Y GL IIL G + G+ + +IE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
D F++GY P +++E EL H L+ L ++
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
D+ L+ VP P GGV++VG N I YH++ ++ Y +SQ L ++ +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271
Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
D A + L D+ A + T + L+ V G + L P
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+GS LGDS L++
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350
>gi|195169735|ref|XP_002025674.1| GL20829 [Drosophila persimilis]
gi|194109167|gb|EDW31210.1| GL20829 [Drosophila persimilis]
Length = 1225
Score = 45.1 bits (105), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + S + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP T L+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|350630003|gb|EHA18376.1| hypothetical protein ASPNIDRAFT_38018 [Aspergillus niger ATCC 1015]
Length = 1219
Score = 45.1 bits (105), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 98/463 (21%), Positives = 193/463 (41%), Gaps = 49/463 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + Q + ++ S DN +Q+ IPL TP + E+ L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNM-LQQSIPLSYTPRRFLKHPEQPLFYVIESD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQT 1080
VL P + + L++ D L D + +++++P A
Sbjct: 814 NNVLAPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---V 868
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRN 1138
TI ++ +E A+++ V T++++ET L +GTA +A G + ++ R
Sbjct: 869 VHTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMTVNPPGSAGGYIHIY---RF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
++ + L ++ +++ AL QG L+ G + ++ +L P
Sbjct: 923 QEDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKT 981
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+V L + I++ D+ +S+ ++ +K Q L D S AT ++D T +
Sbjct: 982 IVGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMSES----WKGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDR 1310
D+ N+ + K SE G L+ + G + + Q + TS +
Sbjct: 1039 GDKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1099 TQLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1153 LIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188
>gi|401413996|ref|XP_003886445.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325120865|emb|CBZ56420.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 2869
Score = 45.1 bits (105), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 120/292 (41%), Gaps = 59/292 (20%)
Query: 1112 LAIGTAYVQGEDVAARGRVLLF----------STGRNADNPQNLVT-------EVYSK-E 1153
LA G E + GR+ LF S R+AD P + E+++
Sbjct: 2457 LAAGVGVPLSETIECSGRLYLFKLPESAMRLASPPRSADTPGDQAEYGTPERLELFADIV 2516
Query: 1154 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1207
L G ++ + S + +++ + GP++ +H+ ++ AF D+ + V ++ ++N
Sbjct: 2517 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMESSKFLRGAFSDSS-VCVTAVANLRN 2575
Query: 1208 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1261
F LL D K + ++W+ ++ +++ F + A FL + L ++ +D
Sbjct: 2576 FFLLADALKGLNLVAWEYHAEADSRKVTRISRTFPKSNFPVAACSFLAYENLLGMLATDI 2635
Query: 1262 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
N+++F Y + + + F +L +L ++ AA K
Sbjct: 2636 DGNVRLFCY-------------------NADKNASGFEKLDILQCDAEDRCAAGCVVKLQ 2676
Query: 1322 RFALLFGTLDGSIGCIAPLDELTFRR--------LQSLQKKLVDSVPHVAGL 1365
++ + T+ S+G A L FR LQ+LQ +L +P GL
Sbjct: 2677 QYVVDSETV-ASLGEAADGSTLAFRLLSVENHAFLQTLQDRLARYLPEPLGL 2727
>gi|261193401|ref|XP_002623106.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
gi|239588711|gb|EEQ71354.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
Length = 1168
Score = 45.1 bits (105), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
W Q D LL+ G L L ++ D VQ L + S + +G + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400
Query: 428 RLGDSLLVQFTCGS 441
GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414
>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
Length = 1121
Score = 45.1 bits (105), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 137/318 (43%), Gaps = 46/318 (14%)
Query: 1114 IGTAYVQGEDVA----ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1169
IGT++V +D RGR+L+ N V ++ S LKGA L++L H++
Sbjct: 794 IGTSFVTDDDAIEENDTRGRILVLGVDENRQ-----VYQIVSHNLKGACRCLSTLGEHIV 848
Query: 1170 IASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS---IYFL 1221
++++ + L +A Y P + + L+I N I + D+ +S + F+
Sbjct: 849 AGLSKTVVVYNYVEETTVFGSLQKLAAY-RPASFPLGLDISGNIIGVVDLMQSLTLVEFI 907
Query: 1222 SWKEQG-AQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
K+ A+L +A+ + +AT +DG + +D Q NI + P+
Sbjct: 908 PSKDGSRAKLEEVARHYQP--GWATSVTNLDGE--RWLEADAQGNIIVLQRNPEAPTEQD 963
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP-------GSDKTN----RFALLFG 1328
KL +E ++G + + RL + S++ +P G +T L+
Sbjct: 964 RSKLEVTSEINIGEQINQIRRLHV--ASNENAVVSPKAFLGSVGLSETTINCWTQLLILV 1021
Query: 1329 TLDGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1385
++G++ G IAP + L + Q +L D + ++ +R F + + GP
Sbjct: 1022 QIEGTLYLFGEIAPKYQ---DLLLTFQARLQDYIYAPGNVSFNLWRAFRNKAR-EGDGPF 1077
Query: 1386 SIVDCELLSHYEMLPLEE 1403
VD E++ + L L+E
Sbjct: 1078 RFVDGEMVERF--LDLDE 1093
>gi|170589359|ref|XP_001899441.1| Xeroderma Pigmentosum Group E Complementing protein [Brugia malayi]
gi|158593654|gb|EDP32249.1| Xeroderma Pigmentosum Group E Complementing protein, putative
[Brugia malayi]
Length = 521
Score = 44.7 bits (104), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 67/155 (43%), Gaps = 31/155 (20%)
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADA---SATGISKQSNY----------EL-- 541
A T ++ DS N+ P++D + +R N + +G K EL
Sbjct: 345 ADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS 403
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
VEL G K ++T+ + DDE+ YLI+S ++ T VL E TE
Sbjct: 404 VELKGIKNMFTLRTR---------------DDEFDDYLILSFDSETHVLLINGEELEDTE 448
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+ V G T+ AG LF + ++QV ++DG
Sbjct: 449 ITGFTVDGATLWAGCLFHSKTILQVTHGEVILIDG 483
>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
putative [Candida dubliniensis CD36]
gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
Length = 1187
Score = 44.7 bits (104), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 162/395 (41%), Gaps = 62/395 (15%)
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+K+ P ++ LP D ++ +P IGG+LV G+N Y L+ + L
Sbjct: 218 VKKKPASLNSDPLPDDVNYMIPLPGHIGGMLVCGSNWCFYD--------KLDGPRIYLPL 269
Query: 352 SQELPRSSFSVELD-AAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLS--KT 406
+ ++ S+ ++ H +N LL GDL LTV Y D ++ + ++ T
Sbjct: 270 PRRDGQTQESIIVNHVTHVLKKKNFFILLQNTLGDLFKLTVDYDFDKETIKNISITYFDT 329
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
P L+ +I N F+ D LL QF E+ GD A+
Sbjct: 330 IPPALSLNI--FKNGFLFVNVLNNDKLLYQF--------------EKLGDDLAE----NE 369
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RI 524
L +SSD Y S N + TF D+L I L+ S + RI
Sbjct: 370 LVINSSD-------------YDSLDNVRGTDTTTFKLKGLDNLALIDVLETLSPIIDSRI 416
Query: 525 NADASATGISKQSNYELVE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII- 581
N D+ +S S + + +P + + + + + +DE YL+I
Sbjct: 417 N-DSKLVTLSSHSYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDE---YLVIS 472
Query: 582 -SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG---ARILDGS 637
SL ++T+VL +++ +V +S FV ++ + G V+QV+ G R ++G
Sbjct: 473 SSLSSKTLVLSIGEVVEDVEDS--EFVLDQSTISVQQVGIASVVQVYSNGIKHIRTVNGK 530
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
T D F P+ S N+ + +++++ V+
Sbjct: 531 KKTTDW-FPPAGITITHASTNNQQVLIALSNLNVV 564
>gi|195490209|ref|XP_002093045.1| GE20993 [Drosophila yakuba]
gi|194179146|gb|EDW92757.1| GE20993 [Drosophila yakuba]
Length = 1227
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|24654874|ref|NP_728546.1| CG13900, isoform A [Drosophila melanogaster]
gi|23092721|gb|AAF47416.2| CG13900, isoform A [Drosophila melanogaster]
gi|60678131|gb|AAX33572.1| LD01809p [Drosophila melanogaster]
gi|220950356|gb|ACL87721.1| CG13900-PA [synthetic construct]
gi|289803030|gb|ADD20765.1| FI04459p [Drosophila melanogaster]
Length = 1227
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|225680146|gb|EEH18430.1| DNA damage-binding protein [Paracoccidioides brasiliensis Pb03]
Length = 1138
Score = 44.3 bits (103), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+LV+G +I Y D++ E S+ LD A
Sbjct: 318 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 357
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
W Q D LL+ G L L ++ D VQ +LDL P S + +G +
Sbjct: 358 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 415
Query: 424 FLGSRLGDSLLVQFTCGS 441
F+GS GDS L++ T GS
Sbjct: 416 FIGSHQGDSQLIRITEGS 433
>gi|195336406|ref|XP_002034829.1| GM14250 [Drosophila sechellia]
gi|194127922|gb|EDW49965.1| GM14250 [Drosophila sechellia]
Length = 1227
Score = 44.3 bits (103), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|194864680|ref|XP_001971056.1| GG14635 [Drosophila erecta]
gi|190652839|gb|EDV50082.1| GG14635 [Drosophila erecta]
Length = 1227
Score = 44.3 bits (103), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPSGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|60677959|gb|AAX33486.1| RE01065p [Drosophila melanogaster]
Length = 1227
Score = 44.3 bits (103), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|194749950|ref|XP_001957397.1| GF24063 [Drosophila ananassae]
gi|190624679|gb|EDV40203.1| GF24063 [Drosophila ananassae]
Length = 1228
Score = 44.3 bits (103), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1144
Score = 44.3 bits (103), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 21/142 (14%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
NL A L+ VP P+GG+L++G I Y ++N ++L +
Sbjct: 239 NLELGAEILVPVPLPLGGILILGEKCIKYVD-------TISNETITL-----------PL 280
Query: 363 ELDAAHATW--LQNDVALLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIG 419
E + W L N LL+ G L L +V D V+ + + S + +G
Sbjct: 281 EYNTVFVAWEQLDNQRWLLADDYGRLFFLMLVLDSANAVRTWKVDLLGETSRASVLVHLG 340
Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
+ FLGS GDS +++ T GS
Sbjct: 341 GGVVFLGSHQGDSHVIRITEGS 362
>gi|240280498|gb|EER44002.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus H143]
Length = 305
Score = 44.3 bits (103), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 61/283 (21%), Positives = 118/283 (41%), Gaps = 27/283 (9%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
AL QG LL G + ++ ++ P VV L + I++ D+ +S+
Sbjct: 39 ALLGFQGRLLAGIGTDLRIYDLGMKQMLRKCQASVVPHLVVGLQTQGSRIIVSDVQESLT 98
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW- 1278
++ +K Q +L D S T ++D T++ D+ N+ + K SE
Sbjct: 99 YVVYKYQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGDKFGNLWLLRCPAKASEEAD 155
Query: 1279 ---KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1331
G L+ ++ GA ++ Q L TS + G R L++ L
Sbjct: 156 EDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTG----GRDILVWTGLQ 211
Query: 1332 GSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
G++ + P +E+ F QSL+ +L P +AG + +R +++ K +
Sbjct: 212 GTVSMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLIYRSYYAPAKG-------TI 262
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
D +L Y +LP +++ +IA + + +I + D+ ++
Sbjct: 263 DGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADMRTQVAY 305
>gi|320581947|gb|EFW96166.1| hypothetical protein HPODL_2449 [Ogataea parapolymorpha DL-1]
Length = 1203
Score = 44.3 bits (103), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 111/281 (39%), Gaps = 42/281 (14%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
AL + QG LL+ + ++IL+ +L + +V L +++ D+ S+
Sbjct: 944 ALTAFQGKLLVGAKNELILYDIGQKQLVKRSSTRLECYEIVDLKTQGFRVIVSDVRDSVR 1003
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE--- 1276
+ +K D T L+D T+ VV D+ NI + ++SE
Sbjct: 1004 YTVYKPLENSFVDFIDDTMQRHVTRT-LLLDYDTV--VVGDKFGNISVLRCPEQISEMSD 1060
Query: 1277 --------SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+ KL + ++VG T F + S G A ++++G
Sbjct: 1061 EDNHGFLVKMRRTKLDNPVNYYVGDMPTFFQK------GSLTIGGAE--------SIIYG 1106
Query: 1329 TLDGSIGCIAPLDELT----FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
L G +GC+ P+ L+ F+ LQ L + L R + +F K + P
Sbjct: 1107 CLQGQMGCLYPMKSLSEINFFKELQRL------IIHEFTSLTDREYLKF----KGYYNPP 1156
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ +D +L+ Y L E+++ IA + I ++D+
Sbjct: 1157 KNSIDGDLIEEYYRLGPEKRIRIATKMDRLPRDIDRRISDM 1197
>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
Length = 1087
Score = 44.3 bits (103), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 34/143 (23%), Positives = 68/143 (47%), Gaps = 10/143 (6%)
Query: 1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1173
+GT+++ E+ RGR+L+F G NAD ++ S LKG+ + L G ++ A
Sbjct: 118 VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 172
Query: 1174 PKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1232
++++ T L G+ L+ I+ + +++ + +W+ + L
Sbjct: 173 KTVVMNATTAKANLGGLVVLGETKF--TYLDDESKAIVEYALDEAVLWAAWEPIDERTYL 230
Query: 1233 LAKDFGSLDCFATEFLIDGSTLS 1255
L D+G L + L+DG+T++
Sbjct: 231 LGDDYGFL--YILTILVDGATVT 251
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 134/331 (40%), Gaps = 57/331 (17%)
Query: 1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1173
+GT+++ E+ RGR+L+F G NAD ++ S LKG+ + L G ++ A
Sbjct: 765 VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 819
Query: 1174 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
++++ + T L +A Y + + DI KSI + + GA
Sbjct: 820 KTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIA-VADIMKSIALVEYT-PGA 877
Query: 1229 -----QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+L +A+ + F+T + + T + + +D N+ + + ++
Sbjct: 878 DGLPDKLEEVARH--AQQVFSTS-VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRM 934
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL-----FGTLDGSI---G 1335
E ++G V + R+ + +T++ ALL GT +GSI
Sbjct: 935 EVTCEMNLGEMVNRVKRINV---------------ETSKDALLIPRAFLGTTEGSIYLFS 979
Query: 1336 CIAPLDELTFRRLQSLQKKL---------VDSV-PHVAGLNPRS--FRQFHSNGKAHRPG 1383
I P ++ RLQS L DS PH L+P + F ++ S A R
Sbjct: 980 LIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRET 1039
Query: 1384 --PDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
P VD EL+ + L +E Q +A G
Sbjct: 1040 SEPFRFVDGELIERFLDLEVEVQEHVAEGLG 1070
>gi|405121632|gb|AFR96400.1| hypothetical protein CNAG_03173 [Cryptococcus neoformans var. grubii
H99]
Length = 1276
Score = 43.9 bits (102), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 109/266 (40%), Gaps = 42/266 (15%)
Query: 1112 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1158
LA+GT + GED V GRVLL + D +V ++ GA+
Sbjct: 945 LAVGTGILPPDDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGSGWDIKVKAELATVGAV 1003
Query: 1159 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1209
AL + G L +A+G K+ +H+ EL + + A + SL+++ + +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSLMRPEGAL 1062
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
++GD +S+ L+ E + ++ + A L D +V+SD N+ +
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAHSNLLTYR 1121
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
QKL A F + VT+F ++ T++ P +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKL 1355
+G +G I L + R L LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190
>gi|301091539|ref|XP_002895953.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262096049|gb|EEY54101.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 118
Score = 43.9 bits (102), Expect = 0.61, Method: Composition-based stats.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1385
T + + + P+ + FRRL +LQ ++V+++P L+PR FR +N K PD
Sbjct: 6 TSEEGVSALIPVGKGVFRRLFTLQNEMVNTLPQNCALDPREFRMLKTNAKRRCGRPD 62
>gi|83314897|ref|XP_730560.1| multisubunit cleavage/polyadenylation specificity factor subunit A
[Plasmodium yoelii yoelii 17XNL]
gi|23490318|gb|EAA22125.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
Length = 863
Score = 43.9 bits (102), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 97/213 (45%), Gaps = 8/213 (3%)
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
P +++SL++++N+I++GDI S+ LS+ + L + +D+ ++ C F+ S
Sbjct: 624 TPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSTLTEVCRDYSNVWC---TFVCALSK 680
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
+VSD + N +F + KL A F+ G V K L + + +S
Sbjct: 681 SHFLVSDMESNFLVFQKSSIRYNDEDSFKLSRVALFNHGHVVNKMLPVSL--SSLIEEEE 738
Query: 1314 APGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
A + ++L + +GSI I P LT F++ ++ L DS+ + +N S
Sbjct: 739 AQNEILRKKESILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIXNINNNSNNT 798
Query: 1373 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1405
+ N +VD E+ + +P E+Q
Sbjct: 799 YKMN--LSEKSSKGVVDGEVFKMFFSMPFEKQF 829
>gi|116191283|ref|XP_001221454.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
gi|88181272|gb|EAQ88740.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
Length = 979
Score = 43.9 bits (102), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 71/344 (20%), Positives = 144/344 (41%), Gaps = 42/344 (12%)
Query: 1083 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----R 1137
TI +Q +E A ++ + + T++++E+ L + T G D+ R L S G R
Sbjct: 558 TIHLQENEAAASLAIASF---TSQDDESFLVVST----GRDMVLNPRQL--SGGYSYVYR 608
Query: 1138 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPP 1196
D+ ++L ++ +AL + +G L+ G ++++ + L D P
Sbjct: 609 FHDDGRDLEL-IHKTGTTEPPTALEAFRGRLVAGIGKTLVVYDLGLKQMLRKTQANDVVP 667
Query: 1197 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1256
+VSL N I++GD+ + ++++ + QL D + T ++D S+
Sbjct: 668 GLIVSLQTQGNRIVVGDVQHGVAMVAYRTESNQLIPFVDDTIARWTTCTT-MVDYD--SV 724
Query: 1257 VVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAH---VTKFLRLQMLATSSD 1309
D+ N I + S E + L +R H H +T Q + T
Sbjct: 725 AGGDKFGNFWIVRTPQQASLEADEPGAHRLLHAREHLHGAPHRLQLTAHFHTQDIPTGIT 784
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1366
+T G + L++ G++G P ++ F +L++ + P + G +
Sbjct: 785 KTHLVVGGQE----VLVWSGFQGTVGVFVPFVTREDADF--FLALEQHMRGEEPSLIGRD 838
Query: 1367 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
++R ++ K +VD +L Y++LP +++ IA +
Sbjct: 839 HLAYRGYYEPAKG-------VVDGDLCERYQLLPGDKKQRIAAE 875
>gi|453087531|gb|EMF15572.1| splicing factor 3B subunit 3 [Mycosphaerella populorum SO2202]
Length = 1223
Score = 43.9 bits (102), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 74/363 (20%), Positives = 143/363 (39%), Gaps = 36/363 (9%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1139
+T+ M +E AL V ++ E LA+GT G + A G V +F +
Sbjct: 884 STVEMGDNEAALCCACVAF---ESRNWEVFLAVGTGQHMSPGTGLQAAGYVHIFKLEEDG 940
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
+T V+ + + AL G L + G ++ ++ L A A P +
Sbjct: 941 TK----LTFVHKTKFDQPVYALLPFHGRLALGVGNELFIYDIGQKALLRKARGQATPNQI 996
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
VSL I+ GD+ + + ++ +K +L D T +ID T +
Sbjct: 997 VSLESHGQRIICGDVSEGVTYMVYKPGYNRLIPFVDDVVQRWTTGTT-MIDYETTA--GG 1053
Query: 1260 DEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLR----LQMLATSSDRT 1311
D+ N+ + + S E G +++ + GA LR Q + S RT
Sbjct: 1054 DKFGNLWVVRCPEQPSQEADEEGAGGFIMNERSYLGGAPYRLDLRAHYYCQDIPMSLQRT 1113
Query: 1312 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
G + L + L G++G + P +++ F SL+++L P +AG +
Sbjct: 1114 ALVAGGQEV----LFWSGLQGTLGMLVPFVTREDVEF--FTSLEQQLRIEDPPLAGRDHL 1167
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
+R ++ K ++D +L + L + + ++A + + +I + ++
Sbjct: 1168 MYRSYYVPVKG-------VIDGDLCERFMALSYDSKQKVAAEVDRSVKEIEKKVQEMRTR 1220
Query: 1429 TSF 1431
+F
Sbjct: 1221 VAF 1223
>gi|145549784|ref|XP_001460571.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428401|emb|CAK93174.1| unnamed protein product [Paramecium tetraurelia]
Length = 1178
Score = 43.9 bits (102), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 63/284 (22%), Positives = 120/284 (42%), Gaps = 41/284 (14%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
ALA+ +G LL+ +G + +++ + A ++ S+ + ++ I + ++ SI+
Sbjct: 913 ALAAWRGRLLVGAGCNLRVYEMGNQRILKKAEIKNLNSFITSIMVKEDRIYVAEVADSIH 972
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFL-----IDGSTL-SLVVS------DEQKNIQI 1267
L + + LA D A+ L I G ++ VS DE++
Sbjct: 973 LLRYNIRDQTFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHP 1032
Query: 1268 FYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+ Y KM + K+ F+VG +T ++ +++TSS+
Sbjct: 1033 YEYKMKMDQGCMNGAPFKMDQICNFYVGEVITSLQKIALVSTSSE--------------V 1078
Query: 1325 LLFGTLDGSIGCIAPLD---ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+++GT GSI + P D ++ F L ++ V H L+ R QF S A+
Sbjct: 1079 VVYGTSMGSIAALYPFDNKEDIDF----FLHLEMYLRVEH-QPLSGRDHMQFRS---AYG 1130
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
P SI+D +L + + +Q +A + T + I+ L D+
Sbjct: 1131 PC-KSIIDGDLCDQFGNMQYNKQRAVAEEFDRTPADIIKKLEDI 1173
>gi|145510432|ref|XP_001441149.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408388|emb|CAK73752.1| unnamed protein product [Paramecium tetraurelia]
Length = 1174
Score = 43.9 bits (102), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 63/284 (22%), Positives = 120/284 (42%), Gaps = 41/284 (14%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
ALA+ +G LL+ +G + +++ + A ++ S+ + ++ I + ++ SI+
Sbjct: 909 ALAAWRGRLLVGAGCNLRVYEMGNQRILKKAEIKNLNSFITSIMVKEDRIYVAEVSDSIH 968
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFL-----IDGSTL-SLVVS------DEQKNIQI 1267
L + + LA D A+ L I G ++ VS DE++
Sbjct: 969 LLRYNIRDQTFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHP 1028
Query: 1268 FYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+ Y KM + K+ F+VG +T ++ +++TSS+
Sbjct: 1029 YEYKMKMDQGCMNGAPFKMDQICNFYVGEVITSLQKIALVSTSSE--------------V 1074
Query: 1325 LLFGTLDGSIGCIAPLD---ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+++GT GSI + P D ++ F L ++ V H L+ R QF S A+
Sbjct: 1075 VVYGTSMGSIAALYPFDNKEDIDF----FLHLEMYLRVEH-QPLSGRDHMQFRS---AYG 1126
Query: 1382 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
P SI+D +L + + +Q +A + T + I+ L D+
Sbjct: 1127 PC-KSIIDGDLCEQFGNMQYNKQRTVAEEFDRTPADIIKKLEDI 1169
>gi|406700450|gb|EKD03620.1| hypothetical protein A1Q2_02097 [Trichosporon asahii var. asahii CBS
8904]
Length = 1119
Score = 43.9 bits (102), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 141/352 (40%), Gaps = 62/352 (17%)
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQI-DNHNLSSVDLHRTYTVE---EYEVRILEPDRAGGPW 1078
V L N V++ + + + HQ D SSV+L T+E E+++ P+R
Sbjct: 727 VAALPGYNLVVAGTVTRSMDHQTGDVLQSSSVELRNATTLELLSEFQL----PEREA--- 779
Query: 1079 QTRATIPMQSSENALTV--RVVTLFNTTTKENETLLAIGTAYVQGEDVAA-----RGRVL 1131
+S NA+T+ R L T ENE L T EDV + RGR+L
Sbjct: 780 --------VASVNAVTLHGRKYILVGTAIFENEDALEDATL----EDVTSFIATNRGRLL 827
Query: 1132 LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----EL 1186
LF +A +LVT S G + + G L +A+ K+ + + T E
Sbjct: 828 LFQINESAGPSLDLVT---SMTFNGPVYDTVVIHGFLAVATSTKVSILRLTTQPPSLEEA 884
Query: 1187 NGIAF-YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCF 1243
AF ++ L VV ++ K + +GD +SI LS + + +D + + C
Sbjct: 885 ASFAFAFETHHLAVVEIDKEKRLV-VGDAMRSIIVLSVDPESGDIVGDQRDMNAHLVRCL 943
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1303
+ ++ + ++D N+ F G+ + A + VT+ +
Sbjct: 944 SAVHDVEPGVM---IADNYANLLTFRL---------GKGITPAASIGLSEDVTRLQPGTL 991
Query: 1304 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1355
S++ R LL T++G +G I L + + R L LQ+ +
Sbjct: 992 APVSAE--------GDILRADLLCTTVNGRLGVIGELGKGSIRTLDDLQRNM 1035
>gi|384490729|gb|EIE81951.1| hypothetical protein RO3G_06656 [Rhizopus delemar RA 99-880]
Length = 967
Score = 43.9 bits (102), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 72/154 (46%), Gaps = 17/154 (11%)
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
S + + + L+ VP P+GG+LV+G I Y L N +S+D ++ ++
Sbjct: 198 STIKVEASTHALVPVPEPLGGLLVIGEYIITYFD-----PLTNTNRELSIDPAR---VTA 249
Query: 360 FSVELDAAHATWLQND-----VALLSTKTGDLVLLTVVYDGRVV---QRLDLSKTNPSV- 410
+ D ++ L ++ V + T +V L+ + G+V Q ++ +P V
Sbjct: 250 WEFMKDESNRYLLGDEEGYLYVFSIETSHNKVVNLSSTFIGQVPSFNQNIESKANHPQVS 309
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
S I +GN +F++GS GDS L+Q G S
Sbjct: 310 RPSCIVDLGNLMFYIGSTHGDSCLIQLIKGQEKS 343
Score = 42.7 bits (99), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 51/224 (22%), Positives = 108/224 (48%), Gaps = 18/224 (8%)
Query: 1153 ELKGAISALASLQGHLLIASGPKII-LHKWTGTELNG--IAFYDAPPLYVVSLNI---VK 1206
++ G + + S++ ++ A KI L+ + L G I F VV+L++
Sbjct: 746 DMPGVVYRMESIKNTIIAAVDGKIYGLYNFKPDLLKGERIEFKFLLHNNVVALDMDTDNN 805
Query: 1207 NFILLGDIHKSIYFLSWK--EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1264
+ +L+GD+ +S+ L + E+ +L+L A D + A +F+ + L+ +D++ N
Sbjct: 806 DTLLVGDLMESMSLLKVEKDEESLKLSLEAVDNKQVWMTAVKFVNENV---LIGADDRHN 862
Query: 1265 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1324
+ P++ + K KL +H+G V +F R +L D A+ D +++
Sbjct: 863 L-FTMIKPEIRQEGKTCKLELEGGYHLGTLVNRF-RKDIL---RDVENASDNIDSISKYE 917
Query: 1325 --LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1366
F T++GSIG + + +F + +Q+ +++ +P+ L+
Sbjct: 918 SEFTFATVNGSIGTVKTISRESFEFFKGIQEGILNILPNNGNLD 961
>gi|422294117|gb|EKU21417.1| uv-damaged dna-binding protein [Nannochloropsis gaditana CCMP526]
Length = 192
Score = 43.5 bits (101), Expect = 0.79, Method: Composition-based stats.
Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 5/106 (4%)
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
R+G L+FGT G IG I P+ E +R +L K L V V GL+
Sbjct: 29 RSGGEVARGHVQDLGLMFGTQQGMIGSILPISEEDYRFFVALTKCLNKVVKGVGGLSHEE 88
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLP---LEEQLEIAHQTG 1412
+R+F ++ VD +L+ + LP +EE +E+ G
Sbjct: 89 YRRFLTDKAI--SDTQGFVDGDLIESFLELPTQRMEEVVELMRVEG 132
>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1153
Score = 43.5 bits (101), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 58/252 (23%), Positives = 106/252 (42%), Gaps = 37/252 (14%)
Query: 1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1173
+GT+++ +V RGR+L+F G N+D ++ S LKG+ + L G ++ A
Sbjct: 835 VGTSFLHDGEVNIRGRLLIF--GVNSDRTPYIIA---SHTLKGSCRCIGVLNGKIVAALN 889
Query: 1174 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
++++ + T L +A Y + ++I N I + DI KS+ + +
Sbjct: 890 KTVVMYDYEETSRTTANLRKVATYRCATC-PIDIDIRGNIIAVADIMKSVALVEYTPGVD 948
Query: 1229 QLNLLAKDFG--SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1286
L ++ G + FAT + + T + + SD N+ + + +L
Sbjct: 949 GLPDKLEEVGRHAQQVFATS-IAEVDTDTYLESDHDGNLIVLKRNREGVTREDKLRLEVL 1007
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF-----GTLDGSI---GCIA 1338
E ++G V K R+ + +T++ ALL T +GSI I
Sbjct: 1008 CEMNLGEMVNKIKRINV---------------ETSKDALLIPRAFVATTEGSIYLFSLIP 1052
Query: 1339 PLDELTFRRLQS 1350
P ++ RLQS
Sbjct: 1053 PQNQDLLMRLQS 1064
>gi|195428692|ref|XP_002062402.1| GK16677 [Drosophila willistoni]
gi|194158487|gb|EDW73388.1| GK16677 [Drosophila willistoni]
Length = 1273
Score = 43.5 bits (101), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 347 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 406
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 407 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIVTSQ 456
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 457 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 504
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 505 WTVKKR--------------VDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 549
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 550 TLCCAAL-GDDALVQVYPDGIR 570
>gi|226480826|emb|CAX73510.1| glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum]
Length = 332
Score = 43.5 bits (101), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 34/212 (16%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S S E R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVAS----GSVE----DRSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFG-SGGGFSARIESSHVINLRDLD 242
+DP V +Y GL IIL G DT + +S RIE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIGDKIGTDFLDTNQWTVNTYSVRIEEGNIV------ 177
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
D F++GY P +++E EL + +++ + + ++ TL
Sbjct: 178 -----DMAFIYGYSLPTFAMIYEDELVLHMK-TYEIYGREPALRNVQLTLD--------- 222
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
++ D+ L+ VP P GGV++VG N I YH++
Sbjct: 223 SIEPDSKLLIPVPKPYGGVILVGDNIICYHTK 254
>gi|198420618|ref|XP_002125906.1| PREDICTED: similar to Splicing factor 3B subunit 3
(Spliceosome-associated protein 130) (SAP 130)
(Pre-mRNA-splicing factor SF3b 130 kDa subunit)
(SF3b130) (STAF130) [Ciona intestinalis]
Length = 1216
Score = 43.5 bits (101), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 97/470 (20%), Positives = 164/470 (34%), Gaps = 129/470 (27%)
Query: 304 LPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS-- 358
L A L++VP GGVLV N I Y N+ D +PR
Sbjct: 228 LEERANHLISVPGGNDGPGGVLVCAENYITY-----------KNFGDQPDIRTPIPRRRN 276
Query: 359 -------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
V A H T L+ T+ GD+ +T+ D +V + L + +
Sbjct: 277 DLDDPERGMIVVCSATHKTKSMF-FFLIQTEQGDIFKVTLETDEDMVTEIRLKYFDTVPV 335
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPSTKR-- 466
+ + + F+ + +G+ L Q + SS + E GD AP R
Sbjct: 336 SMAMCVLRTGFLFVAAEMGNHCLYQIAHLGDDDDETEFSSAMPLEEGDTFFYAPRALRNL 395
Query: 467 --------LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
L + + D+ N + LY + S+ L+
Sbjct: 396 VLVDELDSLSPIMTCLISDLANEDTPQLYVTCGRGPRSS-----------------LRVL 438
Query: 519 SYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
+GL + S + ELPG +WTV K ++E+ +
Sbjct: 439 RHGLEV------------SEMAVSELPGNPNAVWTVKIKE--------------EEEFDS 472
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
Y+I+S T+VL + + EVT+S F+ + +L G ++QV+ G R +
Sbjct: 473 YIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTLSCSLLGENALVQVYPDGIRHIRAD 530
Query: 638 ----------------------------------YMTQDLSFGPSNSESGSGSENSTVLS 663
Y D S G N + NS V+
Sbjct: 531 KRVNEWKTPGKKTILRCAVNQRQVVIALTGGELVYFEMDQS-GQLNEYTERKEMNSEVVC 589
Query: 664 VSIAD--------PYVLLGMSDGSIRLLVGDPSTC--TVSVQT-PAAIES 702
+ ++ ++ +G++D ++R++ DP+ C +S+Q PA ES
Sbjct: 590 MDLSKVPPTEQRTRFLAVGLADNTVRIISLDPTDCLQPLSMQALPATPES 639
>gi|328700785|ref|XP_001945395.2| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1072
Score = 43.5 bits (101), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 41/202 (20%), Positives = 88/202 (43%), Gaps = 38/202 (18%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
++ +++D F++G+ P ++I++E +A+ + +K+
Sbjct: 188 MEETNIQDIGFLYGFTNPTIIIIYE------------------NAMGRTIKIKKIIDSKK 229
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
++ +A ++ VPSP+ G +++G N+I YH + SC + LP
Sbjct: 230 YKSIEKEASMVIPVPSPLCGAIIIGENSIFYH--NGSCNII------------RLPIRQ- 274
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSDI 415
+E+ L+ LL +G L++L + Y+ + V L L + +
Sbjct: 275 KIEIVCYTRVDLEGTRYLLGDHSGCLLMLFLKYEKTLNGKFKVTDLYLRYFGEISIPISL 334
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
T + N + ++ S+ GDS L++
Sbjct: 335 TYLDNKVIYVASKFGDSQLIKL 356
>gi|195376606|ref|XP_002047087.1| GJ13230 [Drosophila virilis]
gi|194154245|gb|EDW69429.1| GJ13230 [Drosophila virilis]
Length = 1229
Score = 43.5 bits (101), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|302423344|ref|XP_003009502.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
gi|261352648|gb|EEY15076.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
Length = 1119
Score = 43.1 bits (100), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 13/136 (9%)
Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVSLDSS----QELPRSSFSV 362
L +P P L+ V ++ YH + + A A L V+ ++ L + S
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280
Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
LD +W + DV LL+ G+L LLT+ DG +V L L + S + +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340
Query: 419 GNSLFFLGSRLGDSLL 434
GN + FLGS GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356
>gi|195126264|ref|XP_002007593.1| GI12293 [Drosophila mojavensis]
gi|193919202|gb|EDW18069.1| GI12293 [Drosophila mojavensis]
Length = 1227
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|195012560|ref|XP_001983703.1| GH16029 [Drosophila grimshawi]
gi|193897185|gb|EDV96051.1| GH16029 [Drosophila grimshawi]
Length = 1228
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|449437538|ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-like [Cucumis sativus]
Length = 1376
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 90/216 (41%), Gaps = 12/216 (5%)
Query: 1149 VYSKELKGAISALAS-LQGHLLIASGPKIILHKWTGTELNGIAFYDA--PPLYVVSLNIV 1205
VYS L G + A+ L + L ++G + + + + + SL
Sbjct: 1076 VYSTSLPGMVLAICPYLDRYFLASAGNAFYVCGFPNDSFQRVKRFAVGRTRFMITSLTAH 1135
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1265
N I +GD I F S++E +L + D S A L+D T VVSD + +I
Sbjct: 1136 VNRIAVGDCRDGILFFSYQEDAKKLEQIYSD-PSQRLVADCTLLDVDTA--VVSDRKGSI 1192
Query: 1266 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR-----LQMLATSSDRTGAAPGSD-K 1319
I + ++ ++ + L+ + + LR ++ A R A PGSD
Sbjct: 1193 AILSCSDRLEDNASPECNLTLNCAYYMGEIAMTLRKGSFSYKLPADDLLRGCAVPGSDFD 1252
Query: 1320 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1355
++ ++ TL GSI PL + L+++Q KL
Sbjct: 1253 SSHNTIIASTLLGSIVIFTPLSRDEYELLEAVQAKL 1288
>gi|213405251|ref|XP_002173397.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
yFS275]
gi|212001444|gb|EEB07104.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
yFS275]
Length = 1166
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 121/561 (21%), Positives = 202/561 (36%), Gaps = 99/561 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+ L+ +G V ++A L G ++D ++L + + ++LE+D + L
Sbjct: 56 MNLMISQNCYGIVRNIAPLRLTGF----KKDYLVLTSDSGRFTILEYDIGKNKLVSVYQE 111
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGLQ------MII---L 204
F K G G + +D +GR V LVY L + I L
Sbjct: 112 AFG-------KSGIRRIVPGEYLALDAKGRAAMVASTEKNKLVYVLNRDSEANLTISSPL 164
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
+A + G+ D G G+ I ++ + DLD + +
Sbjct: 165 EAHKAGTICF---DLVGLDTGYENPIFAALEVEYSDLDHDPLGEL--------------- 206
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGG 320
+KH +++ + L WS + + AYKL+ VP P G
Sbjct: 207 -----------YKHSEKVLTYYELDLGLNHVVKRWSKV-VDRSAYKLIRVPGGNDGP-SG 253
Query: 321 VLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
V+V+ I Y H Q S + + ++ LP + + A + LL
Sbjct: 254 VIVISTGWISYRHLQRQSHFVPIPTRETKATTNTALP-----IIVSAVMHKMRDSFFYLL 308
Query: 380 STKTGDLVLLTVVYDGRV-VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
GDL+ LT+ D V+ L + + + + + + L F G G+ L QF
Sbjct: 309 QNSDGDLLKLTMELDDHSQVKELRIKYFDTIPFAAILNILKSGLLFAGCEGGNHHLYQFE 368
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ 498
S+ + EF +K + + L + N L S T++
Sbjct: 369 -----SLAIDDDEPEFSSANFSEEQSKHSPKKLTYKLHPLQNISLLDEIPSLFPLTDAIV 423
Query: 499 KTFSFAVRDSLVNI-GPLKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHK 556
S L + G K+ S L + SAT + L ELPG IWTV K
Sbjct: 424 TRTSTDANSQLYTLCGRHKEASLRL-LKRGVSATEVV------LSELPGAPIAIWTVKQK 476
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
+D Y Y+++S T+VL + + EV +S T+
Sbjct: 477 --------------LNDPYDKYMVLSFTNGTLVLSIGETVEEVLDS-GLLSSVSTLNVRQ 521
Query: 617 LFGRRRVIQVFERGARILDGS 637
L GR V+Q+ +G R + +
Sbjct: 522 L-GRSSVVQIHSKGIRCISAN 541
>gi|167539942|ref|XP_001741428.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
gi|165894130|gb|EDR22214.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
Length = 1004
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 92/197 (46%), Gaps = 34/197 (17%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1140
T+ ++S+E AL V + + + A+GTA ++ ++ + GR+LL
Sbjct: 701 TTVELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 746
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
+ + L+ V +++ GA+ L Q ++ + + + G +L+ PL V
Sbjct: 747 DSEGLIYIVGTEDYDGAVYCLKKYQKGIVAFINRNVHVIEKKGKDLSTKQNM-LLPLIGV 805
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1251
SL+I K++I+ GD+ +S+ ++ L+++ KD GS++ + T FL
Sbjct: 806 SLDICKDYIIAGDLARSVSVYRYRNDIEHLDIVGKDNQIVWSSCVGSIESEYGTSFL--- 862
Query: 1252 STLSLVVSDEQKNIQIF 1268
V+D NI+IF
Sbjct: 863 ------VADVSGNIKIF 873
>gi|406868052|gb|EKD21089.1| pre-mRNA-splicing factor rse1 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1236
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 71/339 (20%), Positives = 142/339 (41%), Gaps = 38/339 (11%)
Query: 1084 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR---VLLFSTGRNAD 1140
I + +E A+++ V+ +++++E L IGT G+D+ R R D
Sbjct: 875 IDLDDNEAAVSMAAVSF---SSQDDEVFLVIGT----GKDMIVSPRSSTAGFIHVYRFHD 927
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
N + + ++ +++ AL QG LL+ G + ++ +L A + P +V
Sbjct: 928 NGKE-IEFIHKTKVEEPPMALLGFQGRLLVGIGKDLRIYDLGMRQLLRKAQAEVAPNLIV 986
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
L + I++ D+ +SI + +K Q +L D S T ++D T++ D
Sbjct: 987 GLQTQGSRIVVSDVQESIIMIVYKFQENKLIPFVDDTISRWTSCTT-MVDYETVA--GGD 1043
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLS-----RAEFHVGAHVTKFLR---LQMLATSSDRTG 1312
+ N+ + K SE + S R+ H + Q + S +T
Sbjct: 1044 KFGNLWLLRCPTKASEEADEEGSASHLVHERSYLQGSPHRLTLMAHFFTQDIPMSIQKTN 1103
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
G R +L+ + G++G + P +++ F Q+L++ L +AG +
Sbjct: 1104 LVAG----GRDCILWSGIQGTLGILIPFVSREDVDF--FQTLEQHLRSEDAPLAGRDHLI 1157
Query: 1370 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+R ++ K ++D +L Y +LP +++ IA
Sbjct: 1158 YRSYYVPVKG-------VIDGDLCERYTLLPTDKKQMIA 1189
>gi|281202530|gb|EFA76732.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
Length = 933
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 46/203 (22%), Positives = 88/203 (43%), Gaps = 33/203 (16%)
Query: 1226 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1285
+G QLN + S+ + F +G TL LV E + + P ++G+ L+
Sbjct: 755 KGVQLNPRKVESASIHLY--RFTNNGQTLQLVYKTEVEEV------PYAISHFQGRLLVG 806
Query: 1286 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1342
+ LR+ + +T + G + L++ TL+G+IG + P ++
Sbjct: 807 ---------IANQLRIYEMVNHISKTSLSVGGPE----VLVYATLNGTIGALVPFVSRED 853
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1402
+ F SL+ ++ P + G + ++R ++ K +++D +L Y L
Sbjct: 854 VDF--YTSLELQMRQENPPLCGRDHLAYRSYYFPVK-------NVIDGDLCEQYISLDPT 904
Query: 1403 EQLEIAHQTGTTRSQILSNLNDL 1425
+Q IA + + S+IL L DL
Sbjct: 905 KQQSIAEELSRSPSEILKKLEDL 927
>gi|134077422|emb|CAK45676.1| unnamed protein product [Aspergillus niger]
Length = 1133
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347
>gi|452986188|gb|EME85944.1| hypothetical protein MYCFIDRAFT_59215 [Pseudocercospora fijiensis
CIRAD86]
Length = 1223
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 76/363 (20%), Positives = 149/363 (41%), Gaps = 36/363 (9%)
Query: 1076 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA-YVQ-GEDVAARGRVLLF 1133
G + TI + +E AL+ V +K E LA+GT ++Q G V G V ++
Sbjct: 878 GGREVTCTIELGENEAALSCACVAF---ESKNWEVYLAVGTGQHMQPGTGVQTAGYVHIY 934
Query: 1134 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1193
++ + V+ + + + AL +G L + G ++ ++ L A
Sbjct: 935 KLLKDGAE----LEFVHKTKFELPVYALMPFRGRLALGVGNELFIYDMGMKALLRKARNI 990
Query: 1194 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
A P +VSL N I+ GD+ + + +L +K +L D T ++D T
Sbjct: 991 AVPNQIVSLESQGNRIICGDVSEGVTYLVYKPTFNRLIPFVDDTVQ-RWTTTTTMVDYET 1049
Query: 1254 LSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLR----LQMLA 1305
+ D+ N+ I + S E G +++ + GA LR Q +
Sbjct: 1050 AA--GGDKFGNLWIVRCPEQPSQEADEEGAGGYIMNERSYLNGAPYRLDLRAHYFCQDIP 1107
Query: 1306 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1362
S RT G + L + L G++G + P +++ F +L++++ P +
Sbjct: 1108 MSMQRTALVAGGQEV----LFWSGLQGTLGILIPFVTREDVEF--FTALEQQMRTEDPPL 1161
Query: 1363 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
AG + +R ++ K ++D +LL + L + + +IA + + +I +
Sbjct: 1162 AGRDHLMYRSYYVPVKG-------VIDGDLLERFMGLSYDTKQKIAAEVDRSVKEIEKKV 1214
Query: 1423 NDL 1425
++
Sbjct: 1215 QEM 1217
>gi|115390120|ref|XP_001212565.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
gi|114194961|gb|EAU36661.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
Length = 1217
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 96/461 (20%), Positives = 195/461 (42%), Gaps = 49/461 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS- 1022
C G + + Q + ++ S DN +Q+ IPL TP + E+ L+ +I S
Sbjct: 759 QCLEGMVGIQGQNL----RIFSIEKLDNNM-LQQSIPLSYTPRRFVKHPEQPLFYVIESD 813
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQT 1080
VL P + + L++ D L + + +++++P A
Sbjct: 814 NNVLSPSTR--ARLLEDSKSRNGDTTVLPPEEFGYPRATGHWASCIQVVDPLDAKA---V 868
Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRN 1138
T+ ++ +E A++V V T++++ET L +GT +A G + ++ R
Sbjct: 869 VHTVELEDNEAAVSVAAVPF---TSQDDETFLVVGTVKDMTVNPPSSAGGFIHIY---RF 922
Query: 1139 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1198
++ + L ++ +++ AL + QG L + G + ++ +L P
Sbjct: 923 QEDGREL-EFIHKTKVEEPPLALLAFQGRLAVGLGSLLRIYDLGMKQLLRKCQAHVVPKT 981
Query: 1199 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 982 IVGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSISRWTTSTT-MVDYETVA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDR 1310
D+ N+ + K+SE G L+ R H + + + Q + TS +
Sbjct: 1039 GDKFGNLWLVRCPKKVSEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTSLHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1099 TQLVAG----GRDILVWTGFHGTIGMLVPFVSREDVDF--FQNLEMQLASQHPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1408
+R +++ K ++D +L Y +LP + ++ IA
Sbjct: 1153 LIYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIA 1186
>gi|346970653|gb|EGY14105.1| hypothetical protein VDAG_00787 [Verticillium dahliae VdLs.17]
Length = 1160
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 13/136 (9%)
Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVSLDSS----QELPRSSFSV 362
L +P P L+ V ++ YH + + A A L V+ ++ L + S
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280
Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
LD +W + DV LL+ G+L LLT+ DG +V L L + S + +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340
Query: 419 GNSLFFLGSRLGDSLL 434
GN + FLGS GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356
>gi|154320780|ref|XP_001559706.1| hypothetical protein BC1G_01862 [Botryotinia fuckeliana B05.10]
Length = 238
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 45/182 (24%), Positives = 73/182 (40%), Gaps = 36/182 (19%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
NLVV +++++I+ + + E+ S +K+ RV D G+
Sbjct: 28 NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87
Query: 98 -------AASLELVCHYRLHGNVESL----AILSQGGADNSRRRDSIILAFEDAKISVLE 146
L LV Y L G V SL I S+ G + +I++ F+DAK+S++E
Sbjct: 88 QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQ 208
Q
Sbjct: 202 KQ 203
>gi|451818558|ref|YP_007454759.1| succinate dehydrogenase/fumarate reductase, flavoprotein subunit
[Clostridium saccharoperbutylacetonicum N1-4(HMT)]
gi|451784537|gb|AGF55505.1| succinate dehydrogenase/fumarate reductase, flavoprotein subunit
[Clostridium saccharoperbutylacetonicum N1-4(HMT)]
Length = 493
Score = 42.4 bits (98), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 5/97 (5%)
Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRAT-IPMQSSENALTVRVVTLFNTTTK----ENETL 1111
R Y+ E+ +++P+ G P AT + + +E A + V +F T EN
Sbjct: 122 RAYSDGEFTWHVVKPEGGGVPGPRAATTMTKRMTEKARELGVEIIFETPVNKIIMENGEA 181
Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1148
+ + GE++ ARG+ ++ +TG DNPQ + E
Sbjct: 182 VGVIAKNKAGEEIEARGKAVILATGGFGDNPQMIKEE 218
>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
FGSC A4]
Length = 1132
Score = 42.4 bits (98), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 31/265 (11%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G +DP GR + +Y ++++ Q S G + +G E I R
Sbjct: 117 GSRCMIDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPIITR 176
Query: 240 DLDMKHVKDFIFVHGYI-EPVMVILHE-RELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+D V+ F+H P + +L+E + +V ++ A S T++ +
Sbjct: 177 -IDELFVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESEFTSIADY-- 233
Query: 298 IWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
A L A L+ VP+P+ GG+L++G +I Y A NN VS Q
Sbjct: 234 ---AQELDLGASHLIPVPAPLAAAGGLLILGETSIKYVD-------ADNNEIVS----QP 279
Query: 355 LPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT 412
L ++ V W Q D LL+ G L L +V V+R +L +
Sbjct: 280 LEEATIFV-------AWEQVDSQRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRA 332
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
S + +G + F+GS GDS +++
Sbjct: 333 SVLVYLGGGVVFVGSHQGDSQVIRI 357
>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
ATCC 10500]
gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
ATCC 10500]
Length = 1140
Score = 42.4 bits (98), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 21/137 (15%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G I Y + NN + S+ L ++ V
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKYIDDA-------NNETI----SRPLDEATIFV----- 288
Query: 368 HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSLFF 424
W+Q D LL+ G L L +V D R V+ + + S + +G + F
Sbjct: 289 --AWVQVDGQRWLLADDYGRLFFLMLVLDSRNEVEGWKIDYLGSASRASVLIYLGAGMTF 346
Query: 425 LGSRLGDSLLVQFTCGS 441
+GS GDS +++ + GS
Sbjct: 347 IGSHQGDSQVIRISEGS 363
>gi|82541417|ref|XP_724950.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23479780|gb|EAA16515.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
Length = 2227
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 51/219 (23%), Positives = 92/219 (42%), Gaps = 35/219 (15%)
Query: 1168 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY--FLSWKE 1225
LL + KI +H+ + AF D Y+ + I +NFI++ D++K IY S++E
Sbjct: 2005 LLHCTNSKIYIHEIKNNDFIKGAFLDNN-FYISDIKIFRNFIIISDLYKGIYINMYSYEE 2063
Query: 1226 Q--GAQLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
Q ++ ++K F +L+ + +++ S + ++ D N IF + K
Sbjct: 2064 QYDSRRIISISKTFYNHNLNILSCHYIVYNSNICIIAMDVYNNFFIFGHKNKQD----ID 2119
Query: 1282 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1341
L F+ + KF + L + A S+ DGSI PL+
Sbjct: 2120 NLYIYNYFNFNRRILKF--INELHQNKHSNSALSISN------------DGSIHIYHPLN 2165
Query: 1342 E---LTFRRLQSLQKKLVDSVPHVA-----GLNPRSFRQ 1372
+ + F+ + + KK + P++A L P F Q
Sbjct: 2166 DKAFIFFKHIFKIVKKFI--FPNLALNINSDLKPDIFIQ 2202
>gi|62318969|dbj|BAD94072.1| spliceosomal - like protein [Arabidopsis thaliana]
Length = 165
Score = 41.6 bits (96), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 35/141 (24%), Positives = 64/141 (45%), Gaps = 26/141 (18%)
Query: 1287 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1343
+FHVG VT + M+ S+ ++++GT+ GSIG + D++
Sbjct: 42 VQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDDV 87
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1403
F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 88 DF--FSHLEMHMRQEYPPLRGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 138
Query: 1404 QLEIAHQTGTTRSQILSNLND 1424
Q +IA + T ++IL L D
Sbjct: 139 QRKIADELDRTPAEILKKLED 159
>gi|302820387|ref|XP_002991861.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
gi|300140399|gb|EFJ07123.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
Length = 1292
Score = 41.6 bits (96), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 47/216 (21%), Positives = 85/216 (39%), Gaps = 38/216 (17%)
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFS----YGLRINADASATGISKQSNYELVE----- 543
E Q +F V+ NI P+ DFS YG + + + G ++ + ++
Sbjct: 419 KVEDGQLSFQSFVQ----NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVRIIRNGNSV 474
Query: 544 ---------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
G GIWT+ ++ + D YHA+ +IS T VL
Sbjct: 475 EKLICTPPVYQGVSGIWTMRYR--------------FKDPYHAFFLISFVEETRVLSVGL 520
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
++T++V + Q T+A G L V QV+ ++ + SN S +
Sbjct: 521 NFVDITDAVGFESQVNTLACG-LVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSNPLSTT 579
Query: 655 GSENSTVLSV-SIADPYVLLGMSDGSIRLLVGDPST 689
+ +SV ++ V+L ++ + L++G T
Sbjct: 580 WRKPGYPISVGAVCRSRVILALARPGLLLMLGATQT 615
>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
binding / protein binding [Botryotinia fuckeliana]
Length = 1157
Score = 41.6 bits (96), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 134/331 (40%), Gaps = 57/331 (17%)
Query: 1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1173
+GT+++ E+ RGR+L+F G NAD ++ S LKG+ + L G ++ A
Sbjct: 835 VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 889
Query: 1174 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
++++ + T L +A Y + + DI KSI + + GA
Sbjct: 890 KTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIA-VADIMKSIALVEYT-PGA 947
Query: 1229 -----QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+L +A+ + F+T + + T + + +D N+ + + ++
Sbjct: 948 DGLPDKLEEVARH--AQQVFSTS-VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRM 1004
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL-----FGTLDGSI---G 1335
E ++G V + R+ + +T++ ALL GT +GSI
Sbjct: 1005 EVTCEMNLGEMVNRVKRINV---------------ETSKDALLIPRAFLGTTEGSIYLFS 1049
Query: 1336 CIAPLDELTFRRLQSLQKKL---------VDSV-PHVAGLNPRS--FRQFHSNGKAHRPG 1383
I P ++ RLQS L DS PH L+P + F ++ S A R
Sbjct: 1050 LIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRET 1109
Query: 1384 --PDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
P VD EL+ + L +E Q +A G
Sbjct: 1110 SEPFRFVDGELIERFLDLEVEVQEHVAEGLG 1140
>gi|392591958|gb|EIW81285.1| hypothetical protein CONPUDRAFT_56293 [Coniophora puteana RWD-64-598
SS2]
Length = 1245
Score = 41.6 bits (96), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 105/458 (22%), Positives = 187/458 (40%), Gaps = 96/458 (20%)
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI---- 977
G F GSR ++RLR H L + A LH + + TS+G+
Sbjct: 775 GRTAVFACGSRTSVLFWEKDRLR-HSPLILKEVAAAASLHTHDYRSSLVLATSEGLVIGD 833
Query: 978 ------LKICQLPSGSTYDN-----YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
L I + +G DN + PV K + + H P+ V P +
Sbjct: 834 VQNLEKLHIRSIHTG--LDNPRRISHSPVHKALAVGCVRHT----------PVRVGEPEI 881
Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
+ SV L+ T+++ +L+ D PM
Sbjct: 882 ----------------------SRGSVQLYNDTTLDKLGQVVLDHDEE----------PM 909
Query: 1087 QSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNL 1145
+ AL+VRV +E + +GT + E+ ++ GR+LL + ++
Sbjct: 910 --AIKALSVRV-------AEEAKDCFVVGTVIIDSLENESSSGRLLLVEP--DYSRGESF 958
Query: 1146 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE----LNGIAFYDAPPLYVVS 1201
V S+++KG + A+A++ G ++ A ++++ + L+ + + YVV+
Sbjct: 959 VAVSASEKVKGCVYAVAAVDGLVVAAVNSAVVIYSIEADDHTRALSFVKKVEWNHNYVVA 1018
Query: 1202 LNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1259
N+V N +L+GD S+ L + E+GA N+ A+D+ L + E L + + + +
Sbjct: 1019 -NLVSRGNLLLVGDAISSVTLLQY-ERGALQNV-ARDYSPLWPTSVEMLDERNVIG---A 1072
Query: 1260 DEQKNIQIFYYAPKMSESWKGQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
D N+ +F + +K+L R ++ G V KF+ ++ S + SD
Sbjct: 1073 DNDCNLFMFTL-----QDGAERKVLERNGHYYFGDMVNKFIPGEIYRALS----SFEASD 1123
Query: 1319 KTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKL 1355
LF T GSIG + + DEL+ + SLQ+ L
Sbjct: 1124 IEVEPKQLFFTTTGSIGVVIDMSDELSL-HMSSLQRNL 1160
>gi|313235544|emb|CBY10999.1| unnamed protein product [Oikopleura dioica]
Length = 1185
Score = 41.6 bits (96), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 87/428 (20%), Positives = 157/428 (36%), Gaps = 105/428 (24%)
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE------LDAAHATWL 372
GGV+V N + Y N+ D +PR ++ + AHAT
Sbjct: 246 GGVIVCAENYLIY-----------KNFGDQPDIRFPIPRRRNDLDDPERGMIIVAHATHK 294
Query: 373 QNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ LL T+ GDL +T+ + +V + L + ++S + + F+ G
Sbjct: 295 TRSMFFFLLQTEQGDLFKVTLETEEDIVTEIRLKYFDTVPVSSSLCVLRTGFLFVAGEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRR----SSSDALQDMVNGEELS 485
+ L Q T G E + E + + LR D+L ++N E
Sbjct: 355 NHNLYQITRLGEDDDEPEFSSAEPLEEGETFFFTPRGLRNLALTDEMDSLSPVLNCEVAD 414
Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
L A+ +T T R +L + +GL + S + ELP
Sbjct: 415 L---ANEDTPQLYVTCGRGPRSTL------RVLRHGLEV------------SEMAVSELP 453
Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
G +WTV + D ++ +Y+I+S T+VL + + E+T+S
Sbjct: 454 GNPNAVWTV--------------KTSADADHDSYIIVSFVNATLVLSIGETVEEITDS-G 498
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR-------------------------------I 633
+ T+++G L G ++Q++ G R
Sbjct: 499 FLGTTPTLSSG-LMGEDALVQIYPEGIRHIRSDRRVNEWRAPDRKQIVRCACNRQQVVIA 557
Query: 634 LDGS---YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
L G Y D + G N + S ++++ + D ++ +G+SDG++R+
Sbjct: 558 LTGGEIVYFEMDPT-GQLNEYTERREFGSEIIALDVGDVPAGEQRCRFLAVGLSDGTVRI 616
Query: 683 LVGDPSTC 690
+ DP+ C
Sbjct: 617 ISLDPNDC 624
>gi|340502545|gb|EGR29224.1| hypothetical protein IMG5_160230 [Ichthyophthirius multifiliis]
Length = 315
Score = 41.2 bits (95), Expect = 4.7, Method: Composition-based stats.
Identities = 39/175 (22%), Positives = 80/175 (45%), Gaps = 17/175 (9%)
Query: 1206 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD---CFATEFLIDGSTLSLVVSDEQ 1262
K + L+GDI K + E+ ++ + ++ +++ C F+ + +TL +V+DE
Sbjct: 93 KQYFLVGDIQKGMQLYEMDERENKVKQIGEENANINIRQCLLF-FIQNTNTLRALVADEY 151
Query: 1263 KNIQIFYYAPKMS-ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1321
KN+ + + K ++ A FH+G+ V K + +D+ D
Sbjct: 152 KNLYAYSLIQQQDLNEKKNVQMELIANFHLGSKVNKII--------TDQKQIIEKGDNQQ 203
Query: 1322 RFA---LLFGTLDGSIGCIAPLDELTFRRLQ-SLQKKLVDSVPHVAGLNPRSFRQ 1372
+ A +L + DG+I + + E L LQK + + +P++ L+ R +R+
Sbjct: 204 QEAVSHILLLSQDGNISVLKLIYEQGENTLLFDLQKTIYEELPYIGSLDYREYRE 258
>gi|284028391|ref|YP_003378322.1| NAD+ synthetase [Kribbella flavida DSM 17836]
gi|283807684|gb|ADB29523.1| NAD+ synthetase [Kribbella flavida DSM 17836]
Length = 271
Score = 41.2 bits (95), Expect = 4.9, Method: Composition-based stats.
Identities = 38/137 (27%), Positives = 59/137 (43%), Gaps = 9/137 (6%)
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1313
L + DE + I M ++ K L R EFH G + + A + R G
Sbjct: 100 LDFIRPDETLTVDIKDSTDAMVDAMKHVGLGDRVEFHAGNVKARERMIAQYAVAGVRGGL 159
Query: 1314 APGSDKTNRFALLFGTLDGSIGC-IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
G+D + F T G C + PL LT RR++++ ++L S P + G P
Sbjct: 160 VVGTDHAAEAVMGFYTKWGDGACDVTPLSGLTKRRVRAIGERLGAS-PEITGKVPT---- 214
Query: 1373 FHSNGKAHRPG-PDSIV 1388
++ ++ RPG PD V
Sbjct: 215 --ADLESDRPGIPDETV 229
>gi|154295205|ref|XP_001548039.1| pre-mRNA splicing factor 3b [Botryotinia fuckeliana B05.10]
Length = 1020
Score = 40.8 bits (94), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 129/600 (21%), Positives = 226/600 (37%), Gaps = 94/600 (15%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ + G + ++A G++ +D II+ + +I+++EF + + + F
Sbjct: 62 HDVFGIIRAIAAFRLAGSN----KDYIIITSDSGRITIVEFVPAQNKFNRLHLETFG--- 114
Query: 167 WLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
K G G + VDP+GR V L ++ + SQ E T S
Sbjct: 115 ----KSGVRRVVPGQYLAVDPKGRACLTASVEKNKLVYVLNRNSQA-------ELTISSP 163
Query: 224 GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHH 279
A + V L LD+ GY PV L E + G+ ++
Sbjct: 164 --LEAHKAQTLVFALVALDV----------GYANPVFAALEIDYGESDQDPTGQ-AYDEI 210
Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHY-HSQS 335
+ + L WS + A L VP GVLV G + I Y HS
Sbjct: 211 EKQLVYYELDLGLNHVVRKWSE-PVDRTANILFQVPGGTDGPSGVLVCGEDNITYRHSNQ 269
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV-- 391
+ +A+ + + Q V +L A + LL T GDL +T+
Sbjct: 270 EAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF----FLLQTDDGDLFKITIEM 325
Query: 392 VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
V D V+RL + + + + + + + F+ S G+ QF
Sbjct: 326 VEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEFGNHQFYQFEKLGDDDEE 385
Query: 447 SSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSF 503
+ + ++F G E+ P R + + +L + ++ + +N T E A + +S
Sbjct: 386 TEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMDCKVANLTDEDAPQIYSI 445
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHN 562
+ LK +GL ++ + ELPG +WT K +RG
Sbjct: 446 CGTGARSTFRTLK---HGLEVSEIVES------------ELPGVPSAVWTT--KLTRG-- 486
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
D Y AY+I+S T+VL + + EVT++ + T+A L G
Sbjct: 487 ----------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDS 534
Query: 623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGM-SDGSI 680
+IQV +G R + + + + P + + + N ++V+++ V M SDGS+
Sbjct: 535 LIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSL 593
>gi|406602265|emb|CCH46158.1| Pre-mRNA-splicing factor [Wickerhamomyces ciferrii]
Length = 1123
Score = 40.8 bits (94), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 100/255 (39%), Gaps = 81/255 (31%)
Query: 493 NTESAQKTFSFA-VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE--LPG-CK 548
N ++ K +S + V+DS LK YGL IN E+VE LPG
Sbjct: 406 NDDAFTKIYSLSGVKDS----SSLKILQYGLSIN--------------EIVESDLPGIAN 447
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+WT +DE+ YL+IS T+VL + + E+T+S +
Sbjct: 448 KVWTTKLNK--------------NDEFDKYLVISFMDTTLVLSIGENVEEITDS-GLALN 492
Query: 609 GRTIAAGNLFGRRRVIQVFERGAR------------------ILDGSYMTQDLSFGPSNS 650
TI + G ++Q+ G R IL S + ++ G SN
Sbjct: 493 EETIGIQQI-GINSLVQIHSNGIRNIKNGELINEWQPPAGIKILTTSTTNRQIAIGLSND 551
Query: 651 E---------------SGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDP 687
E + S ++S+S+ D P++++G D +IR+L DP
Sbjct: 552 ELVYFEVDDRDRLIEYNERKELTSRIVSLSLGDIPEGRLRSPFLIVGCQDSTIRVLSTDP 611
Query: 688 STC--TVSVQTPAAI 700
+ +S+Q ++I
Sbjct: 612 GSTLELLSLQALSSI 626
>gi|171691144|ref|XP_001910497.1| hypothetical protein [Podospora anserina S mat+]
gi|170945520|emb|CAP71632.1| unnamed protein product [Podospora anserina S mat+]
Length = 1158
Score = 40.8 bits (94), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 72/188 (38%), Gaps = 53/188 (28%)
Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA 341
+I + +K+H + PH +GGV+VVG + Y
Sbjct: 233 LIPVRKVEEEVKRHNFRNTGSAKPH-----------LGGVIVVGETRLLY---------- 271
Query: 342 LNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
++ +++ +LD A W + +V L+ G L LLT+ DG
Sbjct: 272 ----------IDDVTKATVESKLDKASIFVKWAEYNVQTYFLADDYGSLHLLTINTDGAE 321
Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
V+ + L+K + S++ +GN + F+ S GDS L Q D+
Sbjct: 322 VKGMVLTKIGVTSRASELVYLGNEMLFVASHHGDSRLFQL------------------DL 363
Query: 458 EADAPSTK 465
AD P+ K
Sbjct: 364 SADKPADK 371
>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
Length = 1504
Score = 40.8 bits (94), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 21/135 (15%)
Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD 365
H+ + +GG+LVVG + Y + C + E+P + S+
Sbjct: 562 HNVRNTATATANLGGLLVVGETRLLYIDSTTKCTV-------------EVPLRAASI--- 605
Query: 366 AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN-SL 422
W + D LL+ + G L LLT++ G VV LD+S + S + + + L
Sbjct: 606 --FVAWARYDATHYLLADEYGTLHLLTILVSGAVVDNLDVSPIGKTSRASCLVYLPDRRL 663
Query: 423 FFLGSRLGDSLLVQF 437
F+GS GDS L +
Sbjct: 664 LFVGSHNGDSQLFRL 678
>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1129
Score = 40.4 bits (93), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 102/262 (38%), Gaps = 32/262 (12%)
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP GR + +Y G+ ++ +G GD + G +RIE V + L
Sbjct: 121 LDPTGRFMTLELYEGIVTVVPLTEKGKRK--GDPEVSALGEPVPSRIEEMFVRSSAFLHR 178
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
K + +P++ +L+E + R+ + + + P+
Sbjct: 179 KSPESE-------KPLVALLYEEDEDSKIRLRLRQLAFQTAGTEEQSVAALEPVEGLKEE 231
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
L A L+ VP P GVLV+G I Y N+Y +L + L S+ V
Sbjct: 232 LDLGASHLIPVPGPCYGVLVLGETCITY----------FNDYTKAL-VKKPLQDSTIFV- 279
Query: 364 LDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGR--VVQRLDLSKTNPSVLTSDITTIG 419
W Q N LL+ G L L ++ D VV+ L K + S + +
Sbjct: 280 ------AWEQIDNQRFLLADDFGGLYLFMLLLDDNSGVVEGWRLDKIGETSRASVLVYLD 333
Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ T GS
Sbjct: 334 AGHVFVGSHEGDSQVIRITEGS 355
>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1155
Score = 40.4 bits (93), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 80/373 (21%), Positives = 147/373 (39%), Gaps = 62/373 (16%)
Query: 1068 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-- 1125
+L+P G P++ + E TV L +T + E + +GT ++ E++
Sbjct: 800 VLQP--LGNPFE------LNEGEVVETVIRAQLRDTFGRLAERFI-VGTRFLVDENLVPG 850
Query: 1126 --ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW-- 1181
++GRVL+F Q + S LK LA ++ +++A +++ ++
Sbjct: 851 SNSKGRVLVFGVDEERSPFQ-----IVSHPLKSGCRRLAVMEEMIVVALTKTVVVARYEE 905
Query: 1182 ---TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW--------------K 1224
T +L +A Y Y + + + I +GDI KS+ + +
Sbjct: 906 LTSTSGKLIKVASYQTTS-YAIDVAVEGRLIAVGDIMKSMSLVEFVPPTTVAGDGKAGET 964
Query: 1225 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1284
++ AQL + + + S A S L +D N+ + + + G L
Sbjct: 965 KKPAQLIEVCRHYQSSWSTAVAHFEGESWLE---ADADGNVMV------LGRNTTGVTLE 1015
Query: 1285 SRAEFHVGAHVTKFLRLQMLATSSDRTGA-APGSDKTNRFALLFGTLDGSI---GCIAP- 1339
R + + + + + S TG AP K T +GSI G IAP
Sbjct: 1016 DRRRMEITSEINLGENINRIQKISVETGPNAPIHPKA-----FLSTTEGSIYLVGAIAPQ 1070
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1399
+ +L L +LQ +L D V + + ++FR F N + GP +D E + + +
Sbjct: 1071 MRDL----LLNLQDRLEDYVGTLGNIPFKNFRSFR-NAEREADGPVRFIDGEYIERFLDM 1125
Query: 1400 PLEEQLEIAHQTG 1412
E Q ++ G
Sbjct: 1126 NEETQSQVCRDLG 1138
>gi|341893349|gb|EGT49284.1| hypothetical protein CAEBREN_30765 [Caenorhabditis brenneri]
Length = 213
Score = 40.0 bits (92), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 39/200 (19%), Positives = 82/200 (41%), Gaps = 15/200 (7%)
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1275
+S+ LS++ +AKD+ S EF+ S L +++ P
Sbjct: 2 RSVSLLSYRTLEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVTD 61
Query: 1276 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGS 1333
+ G+ +L + +TK + +L D D + R+ +++GT GS
Sbjct: 62 D---GRYVLEPTGYWYLGELTKVMIRAVLVPQPD--------DNSIRYTQPIMYGTNQGS 110
Query: 1334 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+G + +D++ + L S++K + D+ + + ++R F N + P +D +L+
Sbjct: 111 LGLVVQIDDMYKKFLLSIEKAISDAEKNCMQIEHSTYRSFTYNKRIEPPS--GFIDGDLI 168
Query: 1394 SHYEMLPLEEQLEIAHQTGT 1413
+ +EI + T
Sbjct: 169 ESILDMDRSRAIEILEKANT 188
>gi|380481704|emb|CCF41690.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
Length = 932
Score = 40.0 bits (92), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 14/144 (9%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS-----QELPRSSFS 361
D Y + +P PI V YH + + A A + + + L R+
Sbjct: 179 DPYARIVIPVPI-----VEDEVKRYHKRDTTGAKAQLGGLIVVGETLLVYVDTLTRTVVE 233
Query: 362 VELD--AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
L+ A W D LS G+L LLT+ +G VV L L + S +
Sbjct: 234 SGLNSPAIFVAWAAYDDTNYFLSDDYGNLHLLTIETEGVVVTNLSLRLLGVTSRASCLVH 293
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+GN L FLGS GDS L+Q S
Sbjct: 294 MGNGLLFLGSHYGDSQLLQINMES 317
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 22,133,755,139
Number of Sequences: 23463169
Number of extensions: 940418432
Number of successful extensions: 2152374
Number of sequences better than 100.0: 997
Number of HSP's better than 100.0 without gapping: 461
Number of HSP's successfully gapped in prelim test: 536
Number of HSP's that attempted gapping in prelim test: 2145869
Number of HSP's gapped (non-prelim): 3183
length of query: 1432
length of database: 8,064,228,071
effective HSP length: 156
effective length of query: 1276
effective length of database: 8,698,941,003
effective search space: 11099848719828
effective search space used: 11099848719828
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)