BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000548
(1431 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Vitis vinifera]
Length = 1442
Score = 2441 bits (6327), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1166/1457 (80%), Positives = 1283/1457 (88%), Gaps = 41/1457 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TAAN++E+Y+VRVQE+ S+ES+ S ETKR +M GIS A+LELVC YRLHGNVE++ +L
Sbjct: 61 TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121 SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+ SG SAR+ESS+VI+LRD
Sbjct: 181 PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241 LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301 AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+ SVLTS I IGN
Sbjct: 361 SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
SLFFLGSRLGDSLLVQFT S+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421 SLFFLGSRLGDSLLVQFT-----SILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475
Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
GEELSLYGSA N+TE++QKTFSF+VRDS +N+GPLKDF+YGLRINAD ATGI+KQSNYE
Sbjct: 476 GEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYE 535
Query: 541 LV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
LV ELPGCKGIWTVYHK++RGHNADS++MA DDE
Sbjct: 536 LVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDE 595
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
YHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ RGARIL
Sbjct: 596 YHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARIL 655
Query: 635 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 694
DG++MTQDL SE+STVLSVSIADPYVLL MSDG+I+LLVGDPSTCTVS+
Sbjct: 656 DGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSI 705
Query: 695 QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 754
PA ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG DQGDIY
Sbjct: 706 NIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYC 765
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
VV YESG LEIFDVPNFNCVF+VDKF+SG H+VDT + E +D++ ++ +SEE QG
Sbjct: 766 VVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQG 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
RKEN H++KVVELAMQRWS HSRPFLF ILTDGTILCY AYL+EGPE+T K+++ VS
Sbjct: 826 RKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQ 885
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
SLS+SNVSASRLRNLRF R PLD YTREE G R+T+FKNI G QG FLSGSRP
Sbjct: 886 NSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPL 945
Query: 935 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+YDNYWP
Sbjct: 946 WFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWP 1005
Query: 995 VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1054
VQKIPLK TPHQ+TYFAEKNLYPLIVSVPVLKPLN VLS L+DQE GHQ++N NLSS +L
Sbjct: 1006 VQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDEL 1065
Query: 1055 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG 1114
HR+Y+V+E+EVR+LEP+++G PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG
Sbjct: 1066 HRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG 1125
Query: 1115 TAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
TAYVQGEDVAARGRVLLFS G+N DN QNLV+E+YSKELKGAISA+ASLQGHLLIASGPK
Sbjct: 1126 TAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPK 1185
Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
IILHKWTGTELNG+AF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGAQLNLLAK
Sbjct: 1186 IILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAK 1245
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
DFGSLDCFATEFLIDGSTLSL+VSD+QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV
Sbjct: 1246 DFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1305
Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
TKFLRLQML SSDRT A GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL
Sbjct: 1306 TKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1365
Query: 1355 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
VD+VPHVAGLNPRSFRQF SNGKAHRPGPD+IVDCELL HYEMLP EEQLEIA Q GTTR
Sbjct: 1366 VDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTR 1425
Query: 1415 SQILSNLNDLALGTSFL 1431
QILSNLNDL+LGTSFL
Sbjct: 1426 MQILSNLNDLSLGTSFL 1442
>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
Length = 1448
Score = 2436 bits (6313), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1166/1463 (79%), Positives = 1283/1463 (87%), Gaps = 47/1463 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1 MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TAAN++E+Y+VRVQE+ S+ES+ S ETKR +M GIS A+LELVC YRLHGNVE++ +L
Sbjct: 61 TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121 SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+ SG SAR+ESS+VI+LRD
Sbjct: 181 PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241 LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301 AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+ SVLTS I IGN
Sbjct: 361 SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
SLFFLGSRLGDSLLVQFT S+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421 SLFFLGSRLGDSLLVQFT-----SILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475
Query: 481 GEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
GEELSLYGSA N+TE++Q KTFSF+VRDS +N+GPLKDF+YGLRINAD ATGI+
Sbjct: 476 GEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIA 535
Query: 535 KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
KQSNYELV ELPGCKGIWTVYHK++RGHNADS++M
Sbjct: 536 KQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKM 595
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
A DDEYHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+
Sbjct: 596 ATKDDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYA 655
Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
RGARILDG++MTQDL SE+STVLSVSIADPYVLL MSDG+I+LLVGDPS
Sbjct: 656 RGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPS 705
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
TCTVS+ PA ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG D
Sbjct: 706 TCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQD 765
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
QGDIY VV YESG LEIFDVPNFNCVF+VDKF+SG H+VDT + E +D++ ++ +SE
Sbjct: 766 QGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSE 825
Query: 809 EGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSD 868
E QGRKEN H++KVVELAMQRWS HSRPFLF ILTDGTILCY AYL+EGPE+T K++
Sbjct: 826 EEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTE 885
Query: 869 DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFL 928
+ VS SLS+SNVSASRLRNLRF R PLD YTREE G R+T+FKNI G QG FL
Sbjct: 886 EAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFL 945
Query: 929 SGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
SGSRP W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+
Sbjct: 946 SGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSS 1005
Query: 989 YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
YDNYWPVQKIPLK TPHQ+TYFAEKNLYPLIVSVPVLKPLN VLS L+DQE GHQ++N N
Sbjct: 1006 YDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDN 1065
Query: 1049 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1108
LSS +LHR+Y+V+E+EVR+LEP+++G PWQTRATIPMQSSENALTVRVVTLFNTTTKENE
Sbjct: 1066 LSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1125
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
TLLAIGTAYVQGEDVAARGRVLLFS G+N DN QNLV+E+YSKELKGAISA+ASLQGHLL
Sbjct: 1126 TLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLL 1185
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
IASGPKIILHKWTGTELNG+AF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGAQ
Sbjct: 1186 IASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQ 1245
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
LNLLAKDFGSLDCFATEFLIDGSTLSL+VSD+QKNIQIFYYAPKMSESWKGQKLLSRAEF
Sbjct: 1246 LNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEF 1305
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1348
HVGAHVTKFLRLQML SSDRT A GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ
Sbjct: 1306 HVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1365
Query: 1349 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1408
SLQKKLVD+VPHVAGLNPRSFRQF SNGKAHRPGPD+IVDCELL HYEMLP EEQLEIA
Sbjct: 1366 SLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQ 1425
Query: 1409 QTGTTRSQILSNLNDLALGTSFL 1431
Q GTTR QILSNLNDL+LGTSFL
Sbjct: 1426 QIGTTRMQILSNLNDLSLGTSFL 1448
>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
Length = 1461
Score = 2425 bits (6285), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1175/1461 (80%), Positives = 1296/1461 (88%), Gaps = 30/1461 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELP-SKRGIGPVPNLV 59
MS+AAYKM+HWPTGI +C SG+ITHSRAD+VPQIP IQT+ LDSE P SKRGIGP+PNL+
Sbjct: 1 MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60
Query: 60 VTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119
VTA +V+E+YVVRVQE+GS+ES++S ETKR LMDG+S ASLELVCHYRLHGNVES+ +L
Sbjct: 61 VTAGSVLEVYVVRVQEDGSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVVL 120
Query: 120 SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179
G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEWLHLKRGRESFAR
Sbjct: 121 PTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAR 180
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GPL+KVDPQGRCGG+LVY +QMIIL+A+Q SGLVGD+D SGG SAR++SS+VINLR
Sbjct: 181 GPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLR 240
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
D+DMKHVKDFIF+H YIEPV+VILHERELTWAGRVSWKHHTCMISALSISTTLKQ LIW
Sbjct: 241 DMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIW 300
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
S +NLPHDAYKLLAVP PIGGVLV+ ANTIHYHS+SA+ ALALNNYAVS+DSSQELPR+S
Sbjct: 301 SVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRAS 360
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419
FSVELDA A WL NDVALLS K G+L+LL++VYDGRVVQRLDLSK+ SVLTSDITTIG
Sbjct: 361 FSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIG 420
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
NSLFFLGSRLGDSLLVQFT G G S++SSGLKEE G+IE D PS KRL+RS+SD LQDMV
Sbjct: 421 NSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDMV 480
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
+GEELSLYGS +NNTESAQK+FSFAVRDSL+N+GPLKDFSYGLR N DASATGI+KQSNY
Sbjct: 481 SGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNY 540
Query: 540 ELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 573
+LV +LPGC+GIWTVYHK++RGHN D S+MAA D
Sbjct: 541 DLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAAD 600
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
EYHAYLIIS+EARTMVLETADLL+EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI
Sbjct: 601 EYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 660
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
LDGS+MTQDLS G SNSES GSE++TV SVSIADPYVL+ M+DGSIRLL+GD STC VS
Sbjct: 661 LDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMVS 720
Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA---DGGPLDQG 750
+ TP+A E+S++ VS+CTLYHDKGPEPWLRK STDAWLSTGV EAIDGA DGGP DQG
Sbjct: 721 INTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQG 780
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
DIY +VCYESGALEIFDVPNFN VF+VDKFVSG+TH+ D Y+RE KDS+ + N SEE
Sbjct: 781 DIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEEV 840
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
G GRKEN H+MK VELAMQRWS HHSRPFLF +LTDGTILCY AYLFE P+ TSK++D
Sbjct: 841 AGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTEDS 900
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
VS + + ++SASRLRNLRF R PLD+Y +EET CQRITIF NISGHQGFFL G
Sbjct: 901 VSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLLG 960
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
SRP W MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQLPS S YD
Sbjct: 961 SRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNYD 1020
Query: 991 NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
NYWPVQKIPLK TPHQ+TYF EKNLYPLIVSVPV KP+NQVLS L+DQEVGHQI+NHNLS
Sbjct: 1021 NYWPVQKIPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQVLSSLVDQEVGHQIENHNLS 1080
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1110
S +L +TY+VEE+EVRILE + GGPWQT+ATIPMQSSENALTVRVVTLFN TTKENETL
Sbjct: 1081 SDELLQTYSVEEFEVRILESENGGGPWQTKATIPMQSSENALTVRVVTLFNATTKENETL 1140
Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
LAIGTAYVQGEDVAARGRVLLFS ++ +N Q LV+EVYSKELKGAISALASLQGHLLIA
Sbjct: 1141 LAIGTAYVQGEDVAARGRVLLFSVVKSTENSQVLVSEVYSKELKGAISALASLQGHLLIA 1200
Query: 1171 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
SGPKIILHKWTGTELNG+AFYDAPPLYV S+NIVKNFILLGDIHKSIYFLSWKEQGAQL+
Sbjct: 1201 SGPKIILHKWTGTELNGVAFYDAPPLYVASMNIVKNFILLGDIHKSIYFLSWKEQGAQLS 1260
Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM ESWKGQKLLSRAEFHV
Sbjct: 1261 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFHV 1320
Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1350
GAH+TKF+RL ML+TSSDR+GAAPG DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL
Sbjct: 1321 GAHITKFIRLSMLSTSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1380
Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
Q+KLVD+VPHVAGLNPRSFRQF S+GK HRPGP+SIVDCELLSH+EMLPLEEQLEIA Q
Sbjct: 1381 QRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQQV 1440
Query: 1411 GTTRSQILSNLNDLALGTSFL 1431
GTTR+QILSNLNDL+LGTSFL
Sbjct: 1441 GTTRAQILSNLNDLSLGTSFL 1461
>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1447
Score = 2391 bits (6197), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1148/1459 (78%), Positives = 1290/1459 (88%), Gaps = 40/1459 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
MSFAAYKMM PTGI NC +GF+THSR+D+VP +Q ++LD+E PS+ +G +PNL
Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAEWPSRPRHHVGSLPNL 56
Query: 59 VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAI 118
VVTAANV+E+Y VR+QE+ + K + +++R L+DGI+ ASLELVCHYRLHGNVE++A+
Sbjct: 57 VVTAANVLEVYAVRLQED--QPPKAAADSRRGALLDGIAGASLELVCHYRLHGNVETMAV 114
Query: 119 LSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA 178
LS GG D SRRRDSI+L F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE FA
Sbjct: 115 LSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 174
Query: 179 RGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
RGP+VKVDPQGRCGGVL+Y LQMIILKA+Q GSGLVG++D GS G +ARIESS++INL
Sbjct: 175 RGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINL 234
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
RDLDM+HVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI
Sbjct: 235 RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 294
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
WSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALN+YAV+LDSSQE+PRS
Sbjct: 295 WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRS 354
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
SF+VELDAA+ATWL +DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+ SVL+S ITTI
Sbjct: 355 SFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 414
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
GNSLFFL SRLGDS+LVQF+CGSG SMLSS LKEE GDIEADAPS KRLRRS SDALQDM
Sbjct: 415 GNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQDM 473
Query: 479 VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN 538
V+GEELSLYGSA N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQSN
Sbjct: 474 VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 533
Query: 539 YEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
YEL VELPGCKGIWTVYHKS+R HNADSS+MA D
Sbjct: 534 YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDD 593
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
DEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGR RVIQV+ERGAR
Sbjct: 594 DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653
Query: 633 ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
ILDGS+MTQD+SFG SN ESGS S+++ LSVSIADP+VLL MSDGSIRLL+GDPSTCT+
Sbjct: 654 ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCTI 713
Query: 693 SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
SV +PA+ ESSK VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE IDG DG D GDI
Sbjct: 714 SVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGDI 773
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
Y VVC+++G LEIFDVPNFNCVF+V+ F+SG++H+VD M+E LKDS+ +
Sbjct: 774 YCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVIN 830
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QGRKENI MKVVELAMQRWS HSRPFLF IL+DGTILCY AYL+E P++TSK +D S
Sbjct: 831 QGRKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSAS 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
S+ +S+ + SRLRNLRF R PLDAY RE+T +G PCQ+ITIFKNI ++GFFLSGSR
Sbjct: 891 AGGSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSR 950
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W MV RERLRVHPQLCDGSIVAFTVLHNVNCN G IYVTSQG+LKICQLPSGS YD+Y
Sbjct: 951 PAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSY 1010
Query: 993 WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1052
WPVQKIPLKATPHQ+TYFAEKNLYPLIVS PVLKPLNQV+S L+DQ++ HQ ++ N++
Sbjct: 1011 WPVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVIS-LVDQDINHQNESQNMNPD 1069
Query: 1053 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1112
+ +R Y ++E+EVRI+EP+++GGPWQT+ATIPMQSSENALTVR+VTL NTT+KENETLLA
Sbjct: 1070 EQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLVNTTSKENETLLA 1129
Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
IGTAYVQGEDVAARGR+LLFS G+N DNPQ LV+EVYSKELKGAISALASLQGHLLIASG
Sbjct: 1130 IGTAYVQGEDVAARGRILLFSLGKNTDNPQTLVSEVYSKELKGAISALASLQGHLLIASG 1189
Query: 1173 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
PKIILHKW GTELNGIAF+DAPPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LL
Sbjct: 1190 PKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLL 1249
Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
AKDFGSLDCFATEFLIDGSTLSL+VSD+ +NIQIFYYAPKMSESWKGQKLLSRAEFHVGA
Sbjct: 1250 AKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1309
Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
HVTKFLRLQML+T SDR GA PGSDKTNRFALLFGTLDGSIGCIAPLDE+TFRRLQSLQ+
Sbjct: 1310 HVTKFLRLQMLST-SDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQR 1368
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
KLVD+VPHVAGLNPR+FR F SNGKAHRPGPDSIVDCELL HYEMLPLEEQLEIAHQ GT
Sbjct: 1369 KLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGT 1428
Query: 1413 TRSQILSNLNDLALGTSFL 1431
TRSQILSNL+DL+LGTSFL
Sbjct: 1429 TRSQILSNLSDLSLGTSFL 1447
>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1449
Score = 2367 bits (6134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1147/1460 (78%), Positives = 1292/1460 (88%), Gaps = 40/1460 (2%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDS-ELPSK--RGIGPVPN 57
MSFAAYKMM PTGI NC +GF+THSR+D+VP +Q ++LD+ E PS+ +GP+PN
Sbjct: 1 MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAAEWPSRPRHHVGPLPN 56
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVVTAANV+E+Y VR+QE+ + S +++R L+DGI+ ASLEL CHYRLHGNVE++A
Sbjct: 57 LVVTAANVLEVYAVRLQED-QQPKDASDDSRRGTLLDGIAGASLELECHYRLHGNVETMA 115
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+LS GG D SR+RDSIIL F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE F
Sbjct: 116 VLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 175
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
ARGP+VK+DPQGRCGGVL+Y LQMIILKA+Q GSGLVGD+D FGS G +ARIESS++IN
Sbjct: 176 ARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMIN 235
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDLDM+HVKDF FV+GYIEPVMVILHERELTWAGRVSW HHTCMISALSISTTLKQHPL
Sbjct: 236 LRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPL 295
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+GANTIHYHSQSASCALALNNYAV+LDSSQE+PR
Sbjct: 296 IWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPR 355
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
SSF+VELDAA+ATWL +DVALLSTKTG+L+LL +VYDGRVVQRLDLSK+ SVL+S ITT
Sbjct: 356 SSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITT 415
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
IGNSLFFL SRLGDS+LVQF+CGSG SM+SS LKEE GDIE DAPS KRLRRS SDALQD
Sbjct: 416 IGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQD 474
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
MV+GEELSLYGSA+N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQS
Sbjct: 475 MVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 534
Query: 538 NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYEL VELPGCKGIWTVYHKS+R HNADSS+MA
Sbjct: 535 NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 594
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
DDEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGRRRVIQV+ERGA
Sbjct: 595 DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGA 654
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+MTQD+SFG SNSESGS SE++ LSVSIADP+VLL MSDGSIRLL+GDPSTCT
Sbjct: 655 RILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 714
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
+SV +PA+ ESSK VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG DG D GD
Sbjct: 715 ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGD 774
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVC+++G LEIFD+PNFNCVF+V+ F+SG++H+VD M+E LKDS+ +
Sbjct: 775 IYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVV 831
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
QGRK+NI +MKVVELAMQRWS HSRPFLF IL+DGTILCY AYL+E P+ TSK +D
Sbjct: 832 NQGRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSA 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S S+ +S+ + SRLRNLRF R PLDAY RE+T +G+PCQ+ITIFKNI +QGFFLSGS
Sbjct: 892 SAGGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP W MV RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG+LKICQLPSGS YD+
Sbjct: 952 RPAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDS 1011
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
YWPVQKIPLKATPHQ+TYFAEKNLYPLIVS PVLKPLNQV+S L+DQ+ HQ ++ N++
Sbjct: 1012 YWPVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVIS-LVDQDFNHQNESQNMNP 1070
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
+ +R Y ++E+EVRI+EP+++GGPWQT+ATIPMQSSENALTVR+VTL NTT+KENETLL
Sbjct: 1071 DEQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLL 1130
Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
AIGTAYVQGEDVAARGR+LLFS G+ DNPQ LV+EVYSKELKGAISALASLQGHLLIAS
Sbjct: 1131 AIGTAYVQGEDVAARGRILLFSLGKITDNPQTLVSEVYSKELKGAISALASLQGHLLIAS 1190
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
GPKIILHKW GTELNGIAF+DAPPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+L
Sbjct: 1191 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1250
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
LAKDFGSLDCFATEFLIDGSTLSL+VSD+ +NIQIFYYAPKMSESWKGQKLLSRAEFHVG
Sbjct: 1251 LAKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1310
Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
AHVTKFLRLQML+T SDR G+ PGSDKTNRFALLFGTLDGSIGCIAPLDE+TFRRLQSLQ
Sbjct: 1311 AHVTKFLRLQMLST-SDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQ 1369
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
+KLVD+VPHVAGLNPR+FR F SNGKAHRPGPDSIVDCELL HYEMLPLEEQLEIA+Q G
Sbjct: 1370 RKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIG 1429
Query: 1412 TTRSQILSNLNDLALGTSFL 1431
TTRSQILSNL+DL+LGTSFL
Sbjct: 1430 TTRSQILSNLSDLSLGTSFL 1449
>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
Length = 1455
Score = 2360 bits (6117), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1150/1466 (78%), Positives = 1271/1466 (86%), Gaps = 46/1466 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKR----GIGPVP 56
MS+AAYKMMHWPT I C SGF+THSR++ +P + T++LDS+ PS+R GIGP P
Sbjct: 1 MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + NV+E+YVVRVQEEG++ +SGE KR +MDG++ ASLELVCHYRLHGNVES+
Sbjct: 60 NLIVASGNVLELYVVRVQEEGAR---SSGELKRGGVMDGVAGASLELVCHYRLHGNVESM 116
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+LS G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE P+W HLKRGRES
Sbjct: 117 GVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRES 176
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
FARGPLVKVDPQGRCGGVLVY LQMIILKA+Q GS LV DED FGSG SA I SS++I
Sbjct: 177 FARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYII 236
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
NLRDLDMKHVKDFIFVH YIEPV+V+LHERELTWAGRV WKHHTCMISALSISTTLKQ
Sbjct: 237 NLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPT 296
Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
LIWS NLPHDAYKLLAVPSPIGGVLV+G NTIHYHS+SASCALALN+YA S+DSSQELP
Sbjct: 297 LIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELP 356
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
R++FSVELDAA+ATWL DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+ SVLTSDIT
Sbjct: 357 RATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDIT 416
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
T+GNS FFLGSRLGDSLLVQFT G G+SMLS GLKEE GDIE D PS KRL+ SSSDALQ
Sbjct: 417 TLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDALQ 476
Query: 477 DMVNGEELSLYGSASNNTESAQ-----KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
DMV+GEELSLY SA NN ES+Q KTFSF VRDSL+N+GPLKDF+YGLRINADA+AT
Sbjct: 477 DMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANAT 536
Query: 532 GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
GISKQSNYEL VELPGCKGIWTVYHK++R H+ DS
Sbjct: 537 GISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARIHSVDS 596
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
+MA+ DDEYHAYLIIS+EARTMVLETAD LTEVTESVDYFVQGRTIAAGNLFGRRRV+Q
Sbjct: 597 LKMAS-DDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVVQ 655
Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
VFERGARILDGS+MTQDLSFG SNSE+G SE+STV+ VSI DPYVL+ M+DGSI++LVG
Sbjct: 656 VFERGARILDGSFMTQDLSFGGSNSETGR-SESSTVMHVSIVDPYVLVRMADGSIQILVG 714
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
DPS CTVSV TP+A +SS K VS+CTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGAD G
Sbjct: 715 DPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADSG 774
Query: 746 PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
+QGDIY VVCYE+GALEIFDVPNFN VF VDKFVSG+TH++DT E KD +
Sbjct: 775 AHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDM---MKG 831
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
EE G GRKE+ +MKVVEL M RWS HSRPFLF ILTDGTILCY AYLFEGP+ TS
Sbjct: 832 VKEEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTS 891
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
K +D VS S+ S +SASRLRNLRF R PLD YTREET CQRIT FKNISG+QG
Sbjct: 892 KLEDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQG 951
Query: 926 FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
FFLSGSRP W MVFRERLRVHPQLCDGSIVAFTVLH VNCNHG IYVTSQG LKIC L S
Sbjct: 952 FFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSS 1011
Query: 986 GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1045
S+YDNYWPVQKIPLK TPHQ+TYFAE+NLYPLIVSVPV KP+NQVLS L+DQEVGHQI+
Sbjct: 1012 VSSYDNYWPVQKIPLKGTPHQVTYFAERNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQIE 1071
Query: 1046 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
NHNLSS ++HRTY+V+E+EVRILEP + GPWQ +ATIPMQ+SENALTVR+V+LFNT+TK
Sbjct: 1072 NHNLSSEEIHRTYSVDEFEVRILEP--SNGPWQVKATIPMQTSENALTVRMVSLFNTSTK 1129
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
ENETLLA+GTAYVQGEDVAARGR+LLFS +N +N Q LV+EVYSKELKGAISALASLQG
Sbjct: 1130 ENETLLAVGTAYVQGEDVAARGRILLFSVVKNPENSQILVSEVYSKELKGAISALASLQG 1189
Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
HLLIASGPKIILHKWTGTEL G+AF DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ
Sbjct: 1190 HLLIASGPKIILHKWTGTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1249
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
GAQL+LLAKDF SLDCF+TEFLIDGSTLSLVVSDEQKN+QIFYYAPKMSESWKGQKLLSR
Sbjct: 1250 GAQLSLLAKDFASLDCFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLSR 1309
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
AEFHVGA VTKF+RLQML+ S DR+GAAP SDKTNRFALLFGTLDGSIGCIAPLDELTFR
Sbjct: 1310 AEFHVGALVTKFMRLQMLSPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1369
Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
RLQSLQKKLVD+VPHVAGLNP+SFRQF S+GKAHRPGP+SIVDCE+LS+YEM+PLEEQ+E
Sbjct: 1370 RLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQVE 1429
Query: 1406 IAHQTGTTRSQILSNLNDLALGTSFL 1431
IA Q GTTR+QILSNLNDL LGTSFL
Sbjct: 1430 IAQQIGTTRAQILSNLNDLTLGTSFL 1455
>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
Length = 1444
Score = 2256 bits (5845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1096/1460 (75%), Positives = 1254/1460 (85%), Gaps = 45/1460 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQ-TEELDSELPS-KRGIGPVPNL 58
MSFAA+KMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAFKMMHWPTGVENCASGYITHSLSDSTLQIPIVSGDDDMEAEWPNHKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TA N++E+Y+VR QEEG ++E + KR +MDG+S SLELVCHYRLHGNVES+A
Sbjct: 61 VITAGNILEVYIVRAQEEGNTQELRIPKLVKRGGVMDGVSGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILKASQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKASQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI+TTLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINTTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ +DVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR SSD QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLR-ISSDTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYELV ELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIIS+EARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISVEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG NSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGAPNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SKK +S+CTL+HDKGPEPWLRK STDAWLS+GVGEA+D ADGGP DQGD
Sbjct: 720 VSISSPSVLEGSKKKISACTLFHDKGPEPWLRKASTDAWLSSGVGEAVDSADGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY V+CYESGALEIFDVP FNCVF+VDKF SGR H+ D + E E E+N +SE+
Sbjct: 780 IYCVLCYESGALEIFDVPGFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSED-N 834
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
R E I + KVVEL+MQRWS H+RPFLFA+L DGTILCY AYLFEG ++T K+++ V
Sbjct: 835 ASSRNEEIKNTKVVELSMQRWSGPHTRPFLFAVLADGTILCYHAYLFEGVDST-KAENSV 893
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S+ ++++ +S+LRNL+F R P D TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 894 SSENPAALNSSGSSKLRNLKFLRIPFDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 953
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVTSQ +LKICQLPS S YDN
Sbjct: 954 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIYDN 1013
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
YWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KP+NQVLS L+DQE G QIDNHNLSS
Sbjct: 1014 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPINQVLSSLVDQEAGQQIDNHNLSS 1073
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
DL RTYTVEE+E++ILEP+R+GGPW+T+ATIPMQSSE+ALTVRVVTL N +T ENETLL
Sbjct: 1074 DDLQRTYTVEEFEIQILEPERSGGPWETKATIPMQSSEHALTVRVVTLLNASTGENETLL 1133
Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
A+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1134 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1193
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
GPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK FILLGD+HKSIYFLSWKEQG+QL+L
Sbjct: 1194 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKTFILLGDVHKSIYFLSWKEQGSQLSL 1253
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
LAKDFGSLDCFATEFLIDG+TLSL VSDEQKNIQ+FYYAPKM+ESWKGQKLLSRAEFHVG
Sbjct: 1254 LAKDFGSLDCFATEFLIDGNTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVG 1313
Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
+HVTKFLRLQM+ + G+DKTNRFALLFGTLDGS GCIAPLDE+TFRRLQSLQ
Sbjct: 1314 SHVTKFLRLQMVTS---------GADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQ 1364
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
KKLVD+VPHVAGLNP SFRQF ++GKA R GPDSI+DCELL HYEMLPLEEQLE+AHQ G
Sbjct: 1365 KKLVDAVPHVAGLNPHSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQIG 1424
Query: 1412 TTRSQILSNLNDLALGTSFL 1431
TTRS IL NL +L++GTSFL
Sbjct: 1425 TTRSVILLNLVELSVGTSFL 1444
>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 1504
Score = 2252 bits (5835), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1118/1509 (74%), Positives = 1248/1509 (82%), Gaps = 83/1509 (5%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFAAY+MMHWPTGI NC S +ITHSRAD+VP + +++LDS+ +R IGPVPNLVV
Sbjct: 1 MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVT-SHSDDLDSDWHPRRDIGPVPNLVV 59
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TA NV+E+YVVRV EEG +ESK+SGE +R +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60 TAGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILS 119
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
LD+KHVKDF+FVHGYIEPVMVILHE+ELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWS
Sbjct: 240 LDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWS 299
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
A NLPHDAYKLLAVPSPIGGVLV+ AN+IHY+SQSASC LALNNYAVS DSSQ++PRS+F
Sbjct: 300 ASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNF 359
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
+VELDAA+ATWL NDVALLSTKTG+L+LL +VYDGRVVQRLDLSK+ SVLTS I +IGN
Sbjct: 360 NVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGN 419
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-------------------------- 454
SLFFLGSRLGDSLLVQF+CG G+S L+S LK+E
Sbjct: 420 SLFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEITYYTQNLQKEMVPPTLPSALVHESKP 479
Query: 455 ----GDIEADAPS----------------------TKRLRRSSSDALQDMVNGEELSLYG 488
G IE + + R+ R V G+ELSLYG
Sbjct: 480 TQAKGTIELNNNNLCVENDIVDVVEVDITNMTILGENRIARRDETLTDTQVGGDELSLYG 539
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------ 542
SA+NNTESAQK FSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNYELV
Sbjct: 540 SAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHG 599
Query: 543 --------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
ELPGCKGIWTVYHK++RG ADSSRM DDEYHAYLIIS
Sbjct: 600 KNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIIS 659
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
LEARTMVL T +LLTEVTESVDYFV GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQD
Sbjct: 660 LEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQD 719
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
L+ + +ESG+ SE TVLS SI+DPYVLL M+DGSIRLLVGD S+C+VSV PAA S
Sbjct: 720 LNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVSAPAAFGS 779
Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
SKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG DQGDIY V CY++G
Sbjct: 780 SKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDNGD 839
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
LEIFDVPNF VF VDKFVSG++H+VD + + K SE + NS +E GR E+ +M
Sbjct: 840 LEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRNESSQNM 897
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
KV+E+AMQRWS HSRPFLF ILTDGTILCY AYLFE ++ SK DD VS S+S SN+
Sbjct: 898 KVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNSVSSSNM 957
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
S+SRLRNLRF R PLD RE+ P+G R++IFKNISG+QG FL GSRP W MVFRER
Sbjct: 958 SSSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWFMVFRER 1017
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKA 1002
LRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQK+PLK
Sbjct: 1018 LRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKVPLKG 1077
Query: 1003 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE 1062
TPHQ+TYF EKNLYP+I+S PV KPLNQVLS ++DQ+VGH ++NHNLS+ +L +TY+VEE
Sbjct: 1078 TPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQQTYSVEE 1136
Query: 1063 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED 1122
+E+RILEP+++GGPWQTRATI M SSENALT+RVVTL NTTTKENETLLA+GTAYVQGED
Sbjct: 1137 FEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAYVQGED 1196
Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
VAARGRVLLFS G++ADN Q LV+EVYSKELKGAISALASLQGHLLIASGPKIILHKWTG
Sbjct: 1197 VAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1256
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
ELNGIAFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL+LLAKDFGSLDC+
Sbjct: 1257 AELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCY 1316
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
ATEFLIDGSTLSL VSD+QKNIQIFYYAPK +ESWKGQKLLSRAEFHVGAHVTKFLRLQM
Sbjct: 1317 ATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQM 1376
Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
L+TSSD+ + SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL D+VPHV
Sbjct: 1377 LSTSSDK-ACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHVG 1435
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
GLNPRSFRQFHSNGK HR GPDSIVDCELL HYEMLPLEEQL+IAHQ GTTRSQILSNLN
Sbjct: 1436 GLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLN 1495
Query: 1423 DLALGTSFL 1431
DL+LGTSFL
Sbjct: 1496 DLSLGTSFL 1504
>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
kDa subunit
gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
Length = 1442
Score = 2249 bits (5829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1092/1460 (74%), Positives = 1254/1460 (85%), Gaps = 47/1460 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +S D QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYELV ELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGGP DQGD
Sbjct: 720 VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N +SE+ T
Sbjct: 780 IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836 S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S+ ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 892 SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
YWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+SS
Sbjct: 1012 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1071
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETLL
Sbjct: 1072 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1131
Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
A+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1132 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1191
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
GPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QL+L
Sbjct: 1192 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1251
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
LAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSRAEFHVG
Sbjct: 1252 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1311
Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
AHV+KFLRLQM+++ G+DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSLQ
Sbjct: 1312 AHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQ 1362
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
KKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE+AHQ G
Sbjct: 1363 KKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIG 1422
Query: 1412 TTRSQILSNLNDLALGTSFL 1431
TTR IL +L DL++GTSFL
Sbjct: 1423 TTRYSILKDLVDLSVGTSFL 1442
>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
subunit [Arabidopsis thaliana]
Length = 1442
Score = 2248 bits (5825), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1091/1460 (74%), Positives = 1253/1460 (85%), Gaps = 47/1460 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +S D QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS +N++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPDNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
NYELV ELPGCKGIWTVYHKSSRGHNADSS+MAA
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600 EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660 RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGGP DQGD
Sbjct: 720 VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N +SE+ T
Sbjct: 780 IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836 S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
S ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQGFFLSGS
Sbjct: 892 SPENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952 RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
YWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+SS
Sbjct: 1012 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1071
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETLL
Sbjct: 1072 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1131
Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
A+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QGHLLI+S
Sbjct: 1132 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1191
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
GPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQG+QL+L
Sbjct: 1192 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1251
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
LAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSRAEFHVG
Sbjct: 1252 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1311
Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
AHV+KFLRLQM+++ G+DK NRFALLFGTLDGS GCIAPLDE+TFRRLQSLQ
Sbjct: 1312 AHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQ 1362
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
KKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE+AHQ G
Sbjct: 1363 KKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIG 1422
Query: 1412 TTRSQILSNLNDLALGTSFL 1431
TTR IL +L DL++GTSFL
Sbjct: 1423 TTRYSILKDLVDLSVGTSFL 1442
>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
thaliana]
Length = 1448
Score = 2244 bits (5815), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1092/1466 (74%), Positives = 1254/1466 (85%), Gaps = 53/1466 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR QEEG ++E +N KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +S D QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
+ EELSL+GS NN++SAQ K+FSFAVRDSLVN+GP+KDF+YGLRINADA+AT
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQVTSSVLKSFSFAVRDSLVNVGPVKDFAYGLRINADANAT 539
Query: 532 GISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNADS 565
G+SKQSNYELV ELPGCKGIWTVYHKSSRGHNADS
Sbjct: 540 GVSKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADS 599
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
S+MAA +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQ
Sbjct: 600 SKMAADEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQ 659
Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
VFE GARILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVG
Sbjct: 660 VFEHGARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVG 719
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
DPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D DGG
Sbjct: 720 DPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGG 779
Query: 746 PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
P DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D + E E E+N
Sbjct: 780 PQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNK 835
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
+SE+ T + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T
Sbjct: 836 NSEDNTS---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST- 891
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
K+++ +S+ ++++ +S+LRNL+F R PLD TRE T G QRIT+FKNISGHQG
Sbjct: 892 KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQG 951
Query: 926 FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
FFLSGSRP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS
Sbjct: 952 FFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPS 1011
Query: 986 GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1045
S YDNYWPVQKIPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+D
Sbjct: 1012 ASIYDNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLD 1071
Query: 1046 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
NHN+SS DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T
Sbjct: 1072 NHNMSSDDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTG 1131
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
ENETLLA+GTAYVQGEDVAARGRVLLFS G+N DN QN+VTEVYS+ELKGAISA+AS+QG
Sbjct: 1132 ENETLLAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQG 1191
Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
HLLI+SGPKIILHKW GTELNG+AF+DAPPLYVVS+N+VK+FILLGD+HKSIYFLSWKEQ
Sbjct: 1192 HLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQ 1251
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
G+QL+LLAKDF SLDCFATEFLIDGSTLSL VSDEQKNIQ+FYYAPKM ESWKG KLLSR
Sbjct: 1252 GSQLSLLAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSR 1311
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
AEFHVGAHV+KFLRLQM+++ G+DK NRFALLFGTLDGS GCIAPLDE+TFR
Sbjct: 1312 AEFHVGAHVSKFLRLQMVSS---------GADKINRFALLFGTLDGSFGCIAPLDEVTFR 1362
Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
RLQSLQKKLVD+VPHVAGLNP +FRQF S+GKA R GPDSIVDCELL HYEMLPLEEQLE
Sbjct: 1363 RLQSLQKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLE 1422
Query: 1406 IAHQTGTTRSQILSNLNDLALGTSFL 1431
+AHQ GTTR IL +L DL++GTSFL
Sbjct: 1423 LAHQIGTTRYSILKDLVDLSVGTSFL 1448
>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
Length = 1441
Score = 1877 bits (4863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 944/1472 (64%), Positives = 1123/1472 (76%), Gaps = 72/1472 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ + S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
ICQLPS YD+YWPVQK+PL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQE
Sbjct: 998 ICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQE 1057
Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ ENALTVR+VTL
Sbjct: 1058 SVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTL 1117
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1118 HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSA 1176
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ASLQGHLLIASGPKI L+KWTG EL +AFYDA PL+VVSLNIVKNF+L GDIHKSIYF
Sbjct: 1177 VASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYF 1235
Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
LSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD KN+QIFYYAPKM ESWKG
Sbjct: 1236 LSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKG 1295
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
QKLLSRAEFHVGAH+TKFLRLQML T S+KTNRFALLFG LDG IGCIAP+
Sbjct: 1296 QKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTNRFALLFGNLDGGIGCIAPI 1349
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL+HYEML
Sbjct: 1350 DELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLAHYEMLS 1409
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1410 LDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441
>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
Length = 1441
Score = 1873 bits (4852), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 943/1472 (64%), Positives = 1121/1472 (76%), Gaps = 72/1472 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ + S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
ICQLPS YD+YWPVQK+PL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQE
Sbjct: 998 ICQLPSAYNYDSYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQE 1057
Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ ENALTVR+VTL
Sbjct: 1058 SVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTL 1117
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1118 HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSA 1176
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ASLQGHLLIASGPKI L+KWTG EL +AFYDA PL+VVSLNIVKNF+L GDIHKSIYF
Sbjct: 1177 VASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYF 1235
Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
LSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD KN+QIFYYAPKM ESWKG
Sbjct: 1236 LSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESWKG 1295
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
QKLLSRAEFHVGAH+TKFLRLQML T S+KTNRFALLFG LDG IGCIAP+
Sbjct: 1296 QKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTNRFALLFGNLDGGIGCIAPI 1349
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL YEML
Sbjct: 1350 DELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLCSYEMLS 1409
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1410 LDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 1441
>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 1-like [Brachypodium distachyon]
Length = 1442
Score = 1865 bits (4830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 946/1471 (64%), Positives = 1128/1471 (76%), Gaps = 69/1471 (4%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--ELDSELPSK----RGIGP 54
MS+AAYKMMHWPTGI +C +GFITH +D E D L + + +GP
Sbjct: 1 MSYAAYKMMHWPTGIDHCAAGFITHCPSDAAAFCSAAAASGPEGDVGLVAAARHPKRLGP 60
Query: 55 VPNLVVTAANVIEIYVVRVQEEGS------KESKNSGETKRRVLMDGISAASLELVCHYR 108
PNLVV AANV+E+Y VR + + S +SG + DGIS A LELVCHYR
Sbjct: 61 TPNLVVAAANVLEVYAVRADAAAADGAGGAQPSSSSG-----AVFDGISGARLELVCHYR 115
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
LHGN+ES+AILS G A+N RRDSI LAF DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 LHGNIESMAILSDG-AEN--RRDSIALAFRDAKITCLEFDDAIHGLRTSSMHCFEGPEWQ 172
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
HLKRGRESFA GP++K DP GRCG LVYGLQMIILK++Q G LVG+++ + +
Sbjct: 173 HLKRGRESFAWGPVIKSDPLGRCGAALVYGLQMIILKSAQVGQSLVGEDEPTRALSSAAV 232
Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
RIESS++I+LR LD HVKDF FVHGYIEPV+VILHERE TWAGR+S KHHTCMISA SI
Sbjct: 233 RIESSYLIDLRALDTNHVKDFTFVHGYIEPVLVILHEREPTWAGRISSKHHTCMISAFSI 292
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
S TLKQHP+IWSA N+PHDAY++L+VP PI GVLV+ AN+IHYHSQS SC+LALNN+A
Sbjct: 293 SMTLKQHPMIWSAANIPHDAYQILSVPPPISGVLVICANSIHYHSQSTSCSLALNNFASQ 352
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
D S E+ + +F VELDAA ATWL ND+ + S KTG+++LLTVVYDGR VQ+LDL K+
Sbjct: 353 PDGSPEIHKVNFHVELDAAKATWLSNDIVMFSAKTGEMLLLTVVYDGRTVQKLDLMKSKA 412
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SV++S +TTIG+S FFLGSR+GDSLLVQF+CG TS++ E DIE D P +KRL+
Sbjct: 413 SVISSGVTTIGSSFFFLGSRVGDSLLVQFSCGVPTSVIPDIADERSADIEGDLPFSKRLK 472
Query: 469 RSSSDALQDMVNGEELSLYGSA-SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
R SD LQD+ + EELS + N+ ESAQK S+ VRD+LVN+GPLKDFSYGLR+NAD
Sbjct: 473 RVPSDILQDVTSVEELSFQNNMLPNSLESAQK-ISYVVRDALVNVGPLKDFSYGLRVNAD 531
Query: 528 ASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGH 561
+ATG +KQSNYEL VELP C+GIWTVY+KSSRGH
Sbjct: 532 PNATGNAKQSNYELVCCSGHGKNGALSVLQQSIRPDLITEVELPSCRGIWTVYYKSSRGH 591
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
+ D+EYHAYLIISLE+RTMVLET D L EVTE+VDY+VQG TI AGNLFGRR
Sbjct: 592 TTE-------DNEYHAYLIISLESRTMVLETGDDLGEVTETVDYYVQGATITAGNLFGRR 644
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST-VLSVSIADPYVLLGMSDGSI 680
RVIQV+ GAR+LDGS+MTQ+L+F +SES S V S SIADPYVLL M DG+I
Sbjct: 645 RVIQVYATGARVLDGSFMTQELNFTALSSESSSSGSEPLGVASASIADPYVLLKMVDGTI 704
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LLVGD STC +S+ P+ + S + +S+CTLYHD+GPEPWLRKT DAWLS+GV A+D
Sbjct: 705 QLLVGDHSTCALSINAPSTLTSRGERISACTLYHDRGPEPWLRKTRGDAWLSSGVTVAVD 764
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
+ DQ DIY ++CYESG LEIF+VP+F VF+V F SG + +VD + + +DS
Sbjct: 765 VSGSSSQDQSDIYCIICYESGKLEIFEVPSFRQVFSVGSFFSGESLLVDAFAQGFTEDSA 824
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
+E +KE +++++VELAM RWS SRPFLF +L DGT+LCYQAY +EG
Sbjct: 825 ---EGRQDETKVSLKKEVANNIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYQAYCYEG 881
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
E+ K +S S+ + N S SRL+NLRF R +D +RE+ A RITIF N+
Sbjct: 882 LESNIKGTS-LSPDGSVDLGNASDSRLKNLRFHRVSVDITSREDISSLAR-PRITIFNNV 939
Query: 921 SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
G++G FLSG+RP W MV R+R RVHPQLCDG I AFTVLHNVNC+HG IYVTSQG LKI
Sbjct: 940 GGYEGLFLSGTRPVWVMVCRQRFRVHPQLCDGPIEAFTVLHNVNCSHGLIYVTSQGFLKI 999
Query: 981 CQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
CQLPS YDNYWPVQKIPL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS++ DQE+
Sbjct: 1000 CQLPSAYNYDNYWPVQKIPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSIMADQEM 1059
Query: 1041 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1100
H +DN S+ DL +TYTVEE+EVR+LE ++ GG W+TR+TIPMQS ENALTVR+VTL
Sbjct: 1060 IHHMDNDASSADDLQKTYTVEEFEVRVLELEKPGGRWETRSTIPMQSFENALTVRIVTLH 1119
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
NTTTKENETL+AIGTAYVQGEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+SA+
Sbjct: 1120 NTTTKENETLMAIGTAYVQGEDVAARGRVLLFSFTK-SENSQNLVTEVYSKESKGAVSAV 1178
Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
ASLQGHL+IASGPKI L+KW G+EL +AFYDA PL+VVSLNIVKNF+L GDIHKS+YFL
Sbjct: 1179 ASLQGHLVIASGPKITLNKWNGSELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSVYFL 1237
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
SWKEQG+QL LLAKDFGSLDCFATEFLIDGSTLSLVVSD KN+QIFYYAPKM ESWKGQ
Sbjct: 1238 SWKEQGSQLTLLAKDFGSLDCFATEFLIDGSTLSLVVSDSDKNLQIFYYAPKMVESWKGQ 1297
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
KLLSRAE HVGAH+TKFLRLQML G A S+KTNRFALLFGTLDGSIGCIAP+D
Sbjct: 1298 KLLSRAELHVGAHMTKFLRLQMLPAQ----GLA--SEKTNRFALLFGTLDGSIGCIAPVD 1351
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
ELTFRRLQSLQ+KLVD+V HV GLNPRSFRQF SNGKAHRPGPD+I+D ELL++YE+L L
Sbjct: 1352 ELTFRRLQSLQRKLVDAVSHVCGLNPRSFRQFKSNGKAHRPGPDNIIDFELLTYYEILSL 1411
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
EEQL++A Q GTTR+QILSN +D++LGTSFL
Sbjct: 1412 EEQLDMAQQIGTTRAQILSNFSDISLGTSFL 1442
>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
Length = 1503
Score = 1796 bits (4651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 925/1528 (60%), Positives = 1114/1528 (72%), Gaps = 122/1528 (7%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
MS+AAYKMMHWPTG+ +C +GF+THS +D ++DS + R +G
Sbjct: 1 MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60
Query: 54 PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
P PNLVV AANV+E+Y VR + G++ S +SG ++DGIS A LELVC+Y
Sbjct: 61 PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RLHGN+ES+ +LS G A+N RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116 RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
HLKRGRESFA GP++K DP GRCG L YGLQMIILKA+Q G LVG+++ + +
Sbjct: 173 QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
RIESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+ KHHTCMISA S
Sbjct: 233 VRIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++
Sbjct: 293 ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
D S E+ +S+F VELDAA ATW ND+ + S+K G+++LLTVVYDGRVVQRLDL K+
Sbjct: 353 HPDGSPEISKSNFQVELDAAKATWFSNDIVMFSSKAGEMLLLTVVYDGRVVQRLDLMKSK 412
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S +T+IGNS FFLGSRLGDSLLVQF+ G+ S+L E DIE D P +KRL
Sbjct: 413 ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYGASKSVLQDLTNERSADIEGDLPFSKRL 472
Query: 468 RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
+R SD LQD+ + EELS A N+ ESAQK S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473 KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531
Query: 527 DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
D +A G +KQSNYEL VELP C+GIWTVY+KS RG
Sbjct: 532 DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
A+ D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ TIAAGNLFGR
Sbjct: 592 QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
RRVIQV+ +GAR+LDGS+MTQ+L+F +++ S SE V SIADPYVLL M DGS+
Sbjct: 645 RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
+LL+GD TCT+SV P+ SS + +++CTLY D+GPEPWLRKT +DAWLSTG+ EAID
Sbjct: 704 QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLRKTRSDAWLSTGIAEAID 763
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
G DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG +VD + + +DS
Sbjct: 764 GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E ++ +KE S+++VELAM RWS SRPFLF +L DGT+LCY A+ +E
Sbjct: 824 KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
E+ K P+S S N S SRLRNLRF R +D +RE+ P G P RIT F N
Sbjct: 881 SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938 VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
ICQLPS YDNYWPVQK+PL TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQE
Sbjct: 998 ICQLPSAYNYDNYWPVQKVPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQE 1057
Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ ENALTVR+VTL
Sbjct: 1058 SVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVTL 1117
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N QNLVTEVYSKE KGA+SA
Sbjct: 1118 HNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFMK-SENSQNLVTEVYSKESKGAVSA 1176
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ASLQGHLLIASGPKI L+KWTG EL +AFYDA PL+VVSLNIVKNF+L GDIHKSIYF
Sbjct: 1177 VASLQGHLLIASGPKITLNKWTGAELTAVAFYDA-PLHVVSLNIVKNFVLFGDIHKSIYF 1235
Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI--FYYAPKMSE-- 1275
LSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD KN+Q+ F + +
Sbjct: 1236 LSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQVKNFVLFGDIHKSI 1295
Query: 1276 ---SWKGQ----KLLSRAEFHVGAHVTKFL----RLQMLATSSDRTGA----AP------ 1314
SWK Q LL++ + T+FL L ++A+ SD+ AP
Sbjct: 1296 YFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDSDKNVQIFYYAPKMVESW 1355
Query: 1315 -------------------------------GSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
S+KTNRFALLFG LDG IGCIAP+DELT
Sbjct: 1356 KGQKLLSRAEFHVGAHITKFLRLQMLPTQGLSSEKTNRFALLFGNLDGGIGCIAPIDELT 1415
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
FRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HRPGPD+I+D ELL+HYEML L+EQ
Sbjct: 1416 FRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHRPGPDNIIDFELLAHYEMLSLDEQ 1475
Query: 1404 LEIAHQTGTTRSQILSNLNDLALGTSFL 1431
L++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 1476 LDVAQQIGTTRSQILSNFSDISLGTSFL 1503
>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1452
Score = 1585 bits (4103), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 827/1496 (55%), Positives = 1035/1496 (69%), Gaps = 109/1496 (7%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADY-VPQIPLIQTEELDSELPSKRGIGPVPNLV 59
MS+AA+KM+H PTG+ NC + ++THS + IPL ++L + G G PNLV
Sbjct: 1 MSYAAFKMVHCPTGVDNCVAAYVTHSAGETDSDSIPLP-----GADLIASGGSGFPPNLV 55
Query: 60 VTAANVIEIYVVRVQE------EGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
+T ANV+E++ VR+ E GS N T R LM G+S LEL CHYRLHGNV
Sbjct: 56 ITKANVLEVFHVRLLEGDDSAANGSNGVGNPETTPRGGLMAGLSYVKLELACHYRLHGNV 115
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
ESL +LS A+ + RD+IIL F DAKISVLEFDDS HGLRI S+H FE PEW +LKRG
Sbjct: 116 ESLGVLSYRHAEGRKGRDAIILTFRDAKISVLEFDDSTHGLRIGSLHYFEGPEWQYLKRG 175
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RE FA GP V+ DP GRC GVL+Y Q+++LKA+Q G GL ++++ GG A + +S
Sbjct: 176 REQFASGPSVRADPVGRCAGVLIYNSQLVLLKAAQVGYGLGDEDESLIMGGKLCAHVATS 235
Query: 234 HVINLRDLDMKHVKDFIFVHG--------------YIEPVMVILHERELTWAGRVSWKHH 279
++++LRDLDMKH+KDF+F+HG YIEPV+V+LHE++ TWAGRV+ + H
Sbjct: 236 YIVSLRDLDMKHIKDFVFLHGKLLFLIQYIFAFSSYIEPVLVVLHEKDPTWAGRVAVRRH 295
Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCA 339
TC I+ALSI+TTLKQHP IWSA NLP+DAYKLLAVP+PIGGVLV AN++HYHSQS SCA
Sbjct: 296 TCAITALSINTTLKQHPHIWSATNLPYDAYKLLAVPAPIGGVLVFCANSLHYHSQSGSCA 355
Query: 340 LALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
L LN +AV+ + S E PRS SVELD AHATW+ N+VAL+STK G L+ L +VY+GR VQ
Sbjct: 356 LGLNEFAVAPEGSAEYPRSKMSVELDCAHATWVANEVALISTKNGMLLFLNLVYEGRSVQ 415
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
RL+L+K+ SVLTS + TIG + FFLGSRL DSLLVQ T GS + SS + GDIEA
Sbjct: 416 RLELTKSKASVLTSCMCTIGENFFFLGSRLADSLLVQHTLGSASGRTSSLM----GDIEA 471
Query: 460 D--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVRDSLVNIGPLK 516
D AP+ KRL+R S+ + + EE+SLY S ++ S +KTF+F VRDSLVNI PL+
Sbjct: 472 DLSAPAAKRLKREPSEEEEGVSA-EEMSLYYSTPTASDISQKKTFTFTVRDSLVNICPLR 530
Query: 517 DFSYGLRINADASATGISKQSNYEL--------------------------VELPGCKGI 550
DF+YGLR NAD SATG+ KQSNYEL V LPGC GI
Sbjct: 531 DFAYGLRSNADQSATGLGKQSNYELVACSGHGKNGSLSVLHQSIRPDLINKVALPGCSGI 590
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTVYHK+ R + + + DDE+HAYLIISLE+RTMVLET D L EVTE+V+Y+ +G
Sbjct: 591 WTVYHKTDRDDSNEFDFGTSEDDEFHAYLIISLESRTMVLETGDTLGEVTENVEYYTEGN 650
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS-ENSTVLSVSIADP 669
TIAAGNLFGRR V+QV++ G R+LDG+ M Q+L S E+ S N+ V+ IADP
Sbjct: 651 TIAAGNLFGRRFVVQVYQNGLRLLDGAKMLQELLITNSELENNSSEVANNLVIEAVIADP 710
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA 729
Y+LL M+DGS++L+VGD +S+ P + +++ TLY DKGP WLR+T ++
Sbjct: 711 YMLLKMTDGSLQLVVGDVENTKLSIPQPQGFGITTDAITAFTLYQDKGPHQWLRRTCSEM 770
Query: 730 ------WLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSG 783
W ST DQG +Y +VC SG EI+++P CV+ VD F G
Sbjct: 771 NSDRSQWSSTS-------------DQGYVYCIVCRISGRFEIYELPRMVCVYAVDNFNHG 817
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQGR---KENIHSMKVVELAMQRWSAHHSRPF 840
+ + D + E +S + +EE G ++ S+ V ++ + W RPF
Sbjct: 818 MSVLWDQKVLERRANSNAALKEGAEEDKAPGDALLRDAGLSLHVSQICFESWGEKFGRPF 877
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
L A L+DGT+LCY A+ ++ E++ + R + S SRL +LRF+R P+D
Sbjct: 878 LLATLSDGTMLCYHAFSYDANESSDALE-----FRETATSLKDLSRLTHLRFARIPIDWV 932
Query: 901 TREETPHGAPC---QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
+ +E GA + FKN+ G F++G RP W MV R RLR HPQ CDG+I+ F
Sbjct: 933 SGQED--GAKVLYETKFCSFKNVGSFPGVFVTGLRPTWLMVCRGRLRPHPQFCDGAILGF 990
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYP 1017
T LHNVNC HGFIY+T+QG LKICQLPS YDN WPVQKIPL+ TPHQITY ++ NLY
Sbjct: 991 TPLHNVNCAHGFIYITAQGQLKICQLPSLLFYDNDWPVQKIPLRGTPHQITYHSDVNLYA 1050
Query: 1018 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAGG 1075
LI+S PV +P +QVL GH D +S+ D R T E+YEVRI+EP + GG
Sbjct: 1051 LIISTPVSRPTSQVL-----MGDGHPFDQQQENSIGEDGQRLVTSEDYEVRIIEPAQPGG 1105
Query: 1076 PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
W+ +A I M +ENALTVR+V++ N TT + +TLLAIGT+YVQGEDVAA+GR++L S G
Sbjct: 1106 NWEAKAAIKMHLTENALTVRIVSIKNITTDQTQTLLAIGTSYVQGEDVAAKGRIILVSVG 1165
Query: 1136 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
++ +P + EVYSKELKG+ISA+ASLQGHLLIA GPKIILH W G+ELNG AF+DA P
Sbjct: 1166 KDPQDPGSWAREVYSKELKGSISAIASLQGHLLIAIGPKIILHSWNGSELNGAAFFDA-P 1224
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
LYVVSLNIVKNFIL GDIHKSIYFL WKE GAQL LLAKDFGSLDC+ATEFLIDGSTLSL
Sbjct: 1225 LYVVSLNIVKNFILFGDIHKSIYFLCWKEDGAQLTLLAKDFGSLDCYATEFLIDGSTLSL 1284
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
+VSD +KN+QIF YAPK ESWKGQKLLSRAEFH+GAHV KF RLQML T PG
Sbjct: 1285 LVSDSRKNLQIFSYAPKSMESWKGQKLLSRAEFHLGAHVNKFHRLQMLPT--------PG 1336
Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
S ++NR+A+LFGTLDG+I +APLDELTFRRL +LQ+KLVD V HVAG+NPR+FRQF +
Sbjct: 1337 SARSNRYAVLFGTLDGAIDYLAPLDELTFRRLHTLQRKLVDCVSHVAGVNPRAFRQFRCD 1396
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
GKAHRPGPD+IVDCELLSHY+MLPL+EQLEIA Q GTTR+ +LSNL DLAL TSFL
Sbjct: 1397 GKAHRPGPDNIVDCELLSHYDMLPLDEQLEIARQIGTTRAHVLSNLRDLALSTSFL 1452
>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
Length = 1413
Score = 1455 bits (3766), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 773/1485 (52%), Positives = 989/1485 (66%), Gaps = 126/1485 (8%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AA K++H PTG++ C S FITHS P P S S +PNLV+
Sbjct: 1 MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
ANV+EIY VR QE G ++S GE KR M GI+AA LELVC Y
Sbjct: 51 VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RL G V+S+AIL +G D R RD+IILAF AK SVL FDD+ L+ +SMH FE PEW
Sbjct: 110 RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
+HLKRGRE F GPLV+ D QGRC GVL+Y Q++++KA+Q GLV ++D SG S
Sbjct: 169 IHLKRGREKFPGGPLVRADSQGRCAGVLIYKSQLVMMKAAQEAYGLVEEDDP--SGNIVS 226
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++ TC ++ALS
Sbjct: 227 ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
I+T K+HP +W LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N A
Sbjct: 287 INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
S E+PRS FS+ELDAAHATWL D ALLSTKTG LV L +++DGR VQRL+LSK+
Sbjct: 347 PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVL+S + TIG+ FF+GSRLGDSLLVQF S ++ LS G+ + +KR+
Sbjct: 407 GSVLSSSLCTIGDMFFFVGSRLGDSLLVQFGSASTSNSLSQSYD---GEDDIMVRPSKRM 463
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFS-------- 519
R L D N + L Y SA ++++ F F+VRDSL NIGP++D +
Sbjct: 464 R------LDDDANEQSLYQYKSAVSDSQK-NMNFLFSVRDSLCNIGPIRDITGRSQNPSE 516
Query: 520 -------------YG----LRINADASATGISKQSNYEL------------VELPGCKGI 550
+G L I + + Q+N L V+LPGC G+
Sbjct: 517 QPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGV 576
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTVYH+S + + A DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +G
Sbjct: 577 WTVYHRSGQ--------IPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGP 628
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
+I+AGNLFGRRR+ QV+++G RILDG+ TQDL G E G+ E S S ADPY
Sbjct: 629 SISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFADPY 679
Query: 671 VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
VLL M DGS +L+VGD T TVSV TP + S P+S+CTLY+D+GP PWLR+ + D W
Sbjct: 680 VLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGDVW 739
Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT 790
+ GV +A DQGD+Y +VC SG +E ++P+ C++ V++ G + D
Sbjct: 740 QTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLADN 794
Query: 791 YMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTI 850
R A K ++ EEG + R E + +KVV++ + W + RPF+F +L+DGT+
Sbjct: 795 --RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDGTL 851
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG-- 908
L Y+A+++EG ++ + + D S RNLRF R LD EE +
Sbjct: 852 LSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNADE 897
Query: 909 -APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
Q+I FK++ G QG FL+G +P W M+FRE++R+HPQ DG IVAFT LHNVNC H
Sbjct: 898 VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNCQH 957
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
G IYVT++ LKIC+L + YDN WPVQKIPLK TPHQ+ + + N+Y L++S V P
Sbjct: 958 GLIYVTNEASLKICRLSNILNYDNDWPVQKIPLKGTPHQMAHHPDLNIYVLVLSFSVSVP 1017
Query: 1028 LNQVLSLLIDQEVGHQIDNHNLSS-VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1086
+ VL D GHQID S +D + V+++EVR+LEP G PW+T+ TI Q
Sbjct: 1018 TSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQVDDFEVRLLEPMAQGVPWETKDTIKFQ 1077
Query: 1087 SSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1146
+EN LTVR+V++ N T++ E LLAIGT Y+QGEDVA+RGR++L S G + +P+
Sbjct: 1078 PAENVLTVRIVSIKNAATEQVENLLAIGTGYLQGEDVASRGRIILVSLGEDPSDPKVWAK 1137
Query: 1147 EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
E+YSKELKGAISALA+LQGHLL+A GPKIILH W G+EL G AF+DAP LYVVSLNIVKN
Sbjct: 1138 ELYSKELKGAISALAALQGHLLLAIGPKIILHTWNGSELIGTAFFDAP-LYVVSLNIVKN 1196
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
F+L GD HKSIYFL WKE+GAQL LLAKDFGSLDC+ATEFLIDGSTLSL+VSD +KNIQ+
Sbjct: 1197 FVLFGDFHKSIYFLCWKEEGAQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNIQV 1256
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF 1326
F YAPK +ESWKGQKLL R EFH+G+HVTKFLRLQML T PGS +TNRFAL F
Sbjct: 1257 FSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQMLQT--------PGSSRTNRFALCF 1308
Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1386
GTLDG IG I PLDELTFRRLQ+LQ+KLVD VPHVAGLNP+++RQF +NG+ H+ GPD+
Sbjct: 1309 GTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPDNT 1368
Query: 1387 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
VD E L YE L L++Q+ IA Q GTTR QI +NL D++L TSF
Sbjct: 1369 VDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413
>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
Length = 1413
Score = 1454 bits (3763), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 773/1487 (51%), Positives = 991/1487 (66%), Gaps = 130/1487 (8%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS+AA K++H PTG++ C S FITHS P P S S +PNLV+
Sbjct: 1 MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
ANV+EIY VR QE G ++S GE KR M GI+AA LELVC Y
Sbjct: 51 VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
RL G V+S+AIL +G D R RD+IILAF AK SVL FDD+ L+ +SMH FE PEW
Sbjct: 110 RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
+HLKRGRE F GPLV+ D QGRC GVL+Y Q++++KA+Q GLV ++D SG S
Sbjct: 169 IHLKRGREKFPGGPLVRADSQGRCAGVLIYKCQLVMMKAAQEAYGLVEEDDP--SGNIVS 226
Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++ TC ++ALS
Sbjct: 227 ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
I+T K+HP +W LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N A
Sbjct: 287 INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
S E+PRS FS+ELDAAHATWL D ALLSTKTG LV L +++DGR VQRL+LSK+
Sbjct: 347 PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-GDIEADAPSTKR 466
SVL+S + TIG+ FF+GSRLGDSLLVQF G++ S+ L+ + G+ + +KR
Sbjct: 407 GSVLSSSLCTIGDKFFFVGSRLGDSLLVQF----GSASTSNSLEHSYDGEDDIMVRPSKR 462
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSY----- 520
+R L D + E SLY S ++S + F F+VRDSL NIGP++D +
Sbjct: 463 MR------LDD--DASEQSLYQYKSGVSDSQKNMNFLFSVRDSLCNIGPIRDITCRSQNP 514
Query: 521 --------------------GLRINADASATGISKQSNYEL------------VELPGCK 548
L I + + Q+N L V+LPGC
Sbjct: 515 SEQPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCV 574
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
G+WTVYH+S + + A DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +
Sbjct: 575 GVWTVYHRSGQ--------IPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTE 626
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
G +I+AGNLFGRRR+ QV+++G RILDG+ TQDL G E G+ E S S AD
Sbjct: 627 GPSISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFAD 677
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
PYVLL M DGS +L+VGD T TVSV TP + S P+S+CTLY+D+GP PWLR+ + D
Sbjct: 678 PYVLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGD 737
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
W + GV +A DQGD+Y +VC SG +E ++P+ C++ V++ G +
Sbjct: 738 VWQTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLA 792
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D+ R A K ++ EEG + R E + +KVV++ + W + RPF+F +L+DG
Sbjct: 793 DS--RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDG 849
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
T+L Y+A+++EG ++ + + D S RNLRF R LD EE +
Sbjct: 850 TLLSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNA 895
Query: 909 ---APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
Q+I FK++ G QG FL+G +P W M+FRE++R+HPQ DG IVAFT LHNVNC
Sbjct: 896 DEVRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNC 955
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
HG IYVT++ LKIC+L + YDN WPVQKIPLK TPHQ+ + + N+Y L++S V
Sbjct: 956 QHGLIYVTNEASLKICRLSNILNYDNDWPVQKIPLKGTPHQMAHHPDLNIYVLVLSFSVS 1015
Query: 1026 KPLNQVLSLLIDQEVGHQIDNHNLSS-VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIP 1084
P + VL D GHQID S +D + V+++EVR+LEP G PW+T+ TI
Sbjct: 1016 VPTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQVDDFEVRLLEPMAQGVPWETKDTIK 1075
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1144
Q +EN LTVR+V++ N T++ E LLAIGT Y+QGEDVA+RGR++L S G + +P+
Sbjct: 1076 FQPAENVLTVRIVSIKNAATEQVENLLAIGTGYLQGEDVASRGRIILVSLGEDPSDPKVW 1135
Query: 1145 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
E+YSKELKGAISALA+LQGHLL+A GPKIILH W G+EL G AF+DAP LYVVSLNIV
Sbjct: 1136 AKELYSKELKGAISALAALQGHLLLAIGPKIILHTWNGSELIGTAFFDAP-LYVVSLNIV 1194
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
KNF+L GD HKSIYFL WKE+GAQL LLAKDFGSLDC+ATEFLIDGSTLSL+VSD +KNI
Sbjct: 1195 KNFVLFGDFHKSIYFLCWKEEGAQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNI 1254
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
Q+F YAPK +ESWKGQKLL R EFH+G+HVTKFLRLQML T PGS +TNRFAL
Sbjct: 1255 QVFSYAPKNAESWKGQKLLPRVEFHLGSHVTKFLRLQMLQT--------PGSSRTNRFAL 1306
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
FGTLDG IG I PLDELTFRRLQ+LQ+KLVD VPHVAGLNP+++RQF +NG+ H+ GPD
Sbjct: 1307 CFGTLDGGIGYITPLDELTFRRLQTLQRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPD 1366
Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
+ VD E L YE L L++Q+ IA Q GTTR QI +NL D++L TSF
Sbjct: 1367 NTVDSEQLREYESLSLDKQVAIARQIGTTRQQIFANLRDISLSTSFF 1413
>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Cucumis sativus]
Length = 741
Score = 1203 bits (3113), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 583/735 (79%), Positives = 641/735 (87%), Gaps = 4/735 (0%)
Query: 697 PAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVV 756
PAA SSKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG DQGDIY V
Sbjct: 11 PAAFGSSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVA 70
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
CY++G LEIFDVPNF VF VDKFVSG++H+VD + + K SE + NS +E GR
Sbjct: 71 CYDNGDLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRN 128
Query: 817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
E+ +MKV+E+AMQRWS HSRPFLF ILTDGTILCY AYLFE ++ SK DD VS S
Sbjct: 129 ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNS 188
Query: 877 LSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC 936
+S SN+S+SRLRNLRF R PLD RE+ P+G +R++IFKNISG+QG FL GSRP W
Sbjct: 189 VSSSNMSSSRLRNLRFLRVPLDIQGREDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 248
Query: 937 MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
MVFRERLRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQ
Sbjct: 249 MVFRERLRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 308
Query: 997 KIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHR 1056
K+PLK TPHQ+TYF EKNLYP+I+S PV KPLNQVLS ++DQ+VGH ++NHNLS+ +L +
Sbjct: 309 KVPLKGTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQQ 367
Query: 1057 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA 1116
TY+VEE+E+RILEP+++GGPWQTRATI M SSENALT+RVVTL NTTTKENETLLA+GTA
Sbjct: 368 TYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTA 427
Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKII 1176
YVQGEDVAARGRVLLFS G++ADN Q LV+EVYSKELKGAISALASLQGHLLIASGPKII
Sbjct: 428 YVQGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKII 487
Query: 1177 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
LHKWTG ELNGIAFYD PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL+LLAKDF
Sbjct: 488 LHKWTGAELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDF 547
Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
GSLDC+ATEFLIDGSTLSL VSD+QKNIQIFYYAPK +ESWKGQKLLSRAEFHVGAHVTK
Sbjct: 548 GSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTK 607
Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
FLRLQML+TSSD+ + SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL D
Sbjct: 608 FLRLQMLSTSSDK-ACSTVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGD 666
Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
+VPHV GLNPRSFRQFHSNGK HR GPDSIVDCELL HYEMLPLEEQL+IAHQ GTTRSQ
Sbjct: 667 AVPHVGGLNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQ 726
Query: 1417 ILSNLNDLALGTSFL 1431
ILSNLNDL+LGTSFL
Sbjct: 727 ILSNLNDLSLGTSFL 741
>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
Length = 1423
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 505/1529 (33%), Positives = 762/1529 (49%), Gaps = 204/1529 (13%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFA +K +H PTG+ + + + TH D P PNLVV
Sbjct: 1 MSFAIHKQVHPPTGVDHAVAAYFTHPIGDGGP-----------------------PNLVV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
AN + ++ +R + SG+ G A SLE+V + L+G V S+A++
Sbjct: 38 MQANHLTVFAIRRD----PSADASGDAAL-----GAKAMSLEVVAEFDLNGTVGSIAVMR 88
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR- 179
+ +RD++++A ++K+SV+E+D S + +S+H +E+P G S R
Sbjct: 89 RRSGAPRNQRDALLIAVRESKLSVIEWDPSEMTVVPSSLHSWETPVG---TGGVPSALRV 145
Query: 180 ---GPLVKVDPQGRCGGVLVY--GLQMIIL----KASQGGSGLVGDEDTFGSGGGFSARI 230
PL DP+GRC VL+ G + L A G G +D G G +A +
Sbjct: 146 APLPPLAIADPEGRCAAVLLRAEGRSRLALCPAVDADADADGDGGGDDGDRRGQGPAASV 205
Query: 231 ESSHVINLR-DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S V++L DL + V+D F+HGY EPV++ILHERE TWA R+ + TC+++A+SI+
Sbjct: 206 RKSFVVDLTADLALSGVRDAAFLHGYGEPVVLILHEREPTWAARMPLVNDTCVLTAVSIN 265
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K+ +IW LP Y+L A+P P+GG +V+ N + + SQ +S ALALN A
Sbjct: 266 LDTKRCTVIWQREKLPCTCYRLCAMPDPLGGAIVLSNNFLLHESQESSKALALNPLAGGG 325
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ---RLDLSKT 406
S SVELD+AHA L L++TK G L+LL++ +GR + + L +
Sbjct: 326 TESA----LGVSVELDSAHAAVLSERQVLVTTKQGALMLLSLRVEGRRLAAHGAMHLRRA 381
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--KEEFGDIEADA 461
+VL+S + I L FLGSR+GDSLLV ML + K + G+ E
Sbjct: 382 GGAVLSSGMCLITKRLLFLGSRVGDSLLVSLKKKEAAGAAQMLPAAAPKKRKAGEAEPPK 441
Query: 462 PSTKRLRRSSS----DALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLK 516
P + +S D L+ M+ GE E + + + E ++F VRDS++ I P+
Sbjct: 442 PPPPPQKVGTSQDDEDELEAMLYGEGEAAAKAANAGRKE--DPGYTFTVRDSVLGISPII 499
Query: 517 DFSYGLRINADA---------SATGISKQSNYELVE---------------LPGCKGIWT 552
D + G + +A G K +++ LPG G WT
Sbjct: 500 DLTAGASASVQGDTEERAELVAACGHGKNGALAILQRGIQPELVTEVEAGTLPGLMGTWT 559
Query: 553 VYHKS---SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
VYH+S R + ++ AA D +H+YL+ISLE+ TMVLET + L EV+E+V+
Sbjct: 560 VYHESRDNERLRESGAAAAAANVDPFHSYLVISLESTTMVLETGEELREVSEAVELVTDA 619
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+AAGN+ GR+R+ QV + G RI +G QDLS +G S + +++ + DP
Sbjct: 620 ATLAAGNMHGRKRIAQVHKGGVRICEGPVKIQDLSAA-EMPAAGDVSPDLEIIAAQVLDP 678
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES--SKKPVSSCTLYHDKGP--------- 718
YVL MSDGS+R+L GD +V +P++ + + + ++S L D P
Sbjct: 679 YVLCRMSDGSLRVLKGDEEKGSVEAMSPSSYANLPTGESIASAALVDDSVPAAERPGLTT 738
Query: 719 -EP-WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT 776
EP +LR+T+T STGV P D+ V G LE++ +P+ +++
Sbjct: 739 REPGFLRRTAT----STGV---------LPEDEEGTVLAVTRVGGTLELYALPSCERIWS 785
Query: 777 VDKFVSGRTHIVDTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
D G + + + D + E+ + ++ + ++VE + +
Sbjct: 786 ADGLSEGLNVLAPGGAGDDVNVDGDGEVEPT----------DDYPAPEIVEFRLDAFPRA 835
Query: 836 HSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
H RP L A+ DG++L Y+A+L G N P LRF R
Sbjct: 836 HERPMLTALRGDGSVLVYRAFLCPPGAGNVGHEAKP------------------QLRFCR 877
Query: 895 TPLD------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
P++ + G+ R + G +G F+SG RP W +V R R+ P
Sbjct: 878 VPIELEGGGGGMVDTKALSGSRLTRFERVGDRGGIRGVFVSGPRPLWLLVRRSRVLALPI 937
Query: 949 LCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQI 1007
+ V+FT HNVNC +GF+ T+ G ++ICQ+P Y+ WPV+K+ L+ TPH +
Sbjct: 938 RGEAQRTVSFTPFHNVNCLNGFMLGTAAGGVRICQIPGRMHYEAAWPVRKLALRCTPHHV 997
Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQV------LSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1061
Y + LY L S PV ++V LS LI + + + V
Sbjct: 998 QYLPDFRLYALSTSAPVKWKDHEVNEDDIHLSTLIKVRKANAMAKGGVEQV--------- 1048
Query: 1062 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1121
+ +R+L P WQ + E+ ++R V L NT T +++L +GTA GE
Sbjct: 1049 -FSLRLLVPGTLECAWQ----YTVDPGEHVQSIRNVQLRNTMTGALQSMLVVGTALPGGE 1103
Query: 1122 DVAARGRVLLFS-----TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKII 1176
D RGRVL+F T R LV ++ K A +AL + GHL +A G K+I
Sbjct: 1104 DAPCRGRVLIFEVVWQMTDRGTKWQGQLVC---VRDAKMACTALEGVGGHLAVAIGTKLI 1160
Query: 1177 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ--LNLLAK 1234
+H W G L +AF+D PL+ V++N+VKNFILLGDI K +F WK+ + L +AK
Sbjct: 1161 VHSWDGHSLMPVAFFDT-PLHTVTMNVVKNFILLGDIQKGAFFFRWKDTPDEKLLVQMAK 1219
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
DF +D ATEFL+DGSTLS++ +D N IF Y PK ESWKGQKLL++ FHVG+ V
Sbjct: 1220 DFEGMDILATEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQKLLTKGAFHVGSPV 1279
Query: 1295 TKFLRLQMLATSSDRTGAAPGSD--------KTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
+ +R ++ A + AAPG + NR A+ FGTLDGS+G + P++E
Sbjct: 1280 HRMVRFRLKAPT-----AAPGQTISPAEQKAQANRHAVFFGTLDGSLGILVPIEEAAHAS 1334
Query: 1347 LQSLQKKLVDSVPH--VAGLNPRSFRQFHS-NGKAHR-PGPDSIVDCELLSHYEMLPLEE 1402
LQSLQ+ L + PH +AGLN R+ R + G+ R P P S++D LL+ YE +P +
Sbjct: 1335 LQSLQRYLTYATPHAALAGLNARTHRHPKTVEGRPMRQPAPHSLLDGGLLAVYEHMPWKA 1394
Query: 1403 QLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
Q + A + G TR L +L+ L+ T+F+
Sbjct: 1395 QAKAAKEAGMTRDVALGHLHQLSARTAFM 1423
>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1386
Score = 677 bits (1746), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/1522 (31%), Positives = 747/1522 (49%), Gaps = 227/1522 (14%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MS A ++ +H PTG+ + + + T D G PNL+V
Sbjct: 1 MSHAVHREVHPPTGVDHAVTAYFTRPVGD-----------------------GGDPNLIV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
+AN I +Y V G +ES L++ + G + S+++L
Sbjct: 38 ASANRITVYAV--NRRGDEES-------------------LDVCAEFDAQGAIGSMSVLR 76
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES-----PEWLHLKRGRE 175
+ +RD++++A + K+SV+E+D + + +SMH FES P L+ RE
Sbjct: 77 RRFGAPRNQRDALLIAIRERKLSVVEYDAATGDVCCSSMHSFESALGCNPLGTTLRMSRE 136
Query: 176 SFARGPLVKVDPQGRCGGVLV----YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
+ PLV DP+GRC V++ ++ +L + GG GLV ++D G G +A +
Sbjct: 137 A----PLVVSDPEGRCAAVVLREDGVAGKVRVLPSVDGGLGLVANDDE-GRVRGPAASVR 191
Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
S ++L + + ++D F+HGY EP + +L+E+ TWAGR + TC I ALS+
Sbjct: 192 ESFPLHLPGVRL--IRDACFLHGYGEPALAVLYEKTPTWAGRYNLSKDTCEIVALSVDVD 249
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
++ +IW NLP +YKL A+ P+GG LV + + + SQ +S L LN +
Sbjct: 250 KQKGTVIWRRQNLPSSSYKLTALLPPLGGALVFSQDFLLHESQESSSVLGLNTFGHG--G 307
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
QE + + LD A A+ + D L++TKTG L+LL + DGR ++R+ L + +VL
Sbjct: 308 PQE--GNDAEITLDGAQASVVSEDRVLVTTKTGALLLLALHTDGRSLRRMMLQRAGGAVL 365
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---ML-----------SSGLKEEFGDI 457
+S + + L FLGSR+GDSLLV+FT + ML + K++ ++
Sbjct: 366 SSGMCLLSRDLLFLGSRIGDSLLVKFTPKEEPTAPLMLPDAEDESEDEATEKSKDDDDEL 425
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
EA T + +DA+Q E G A + V+DSL+ + P+ D
Sbjct: 426 EALLYGTTKTETVQTDAVQ-----TEKKREGLAGIIPGLKVAGYDLKVKDSLLGVAPVVD 480
Query: 518 FSYGLRINADASATGISKQSNYELVE-----------------------------LPGCK 548
+ G ++ G +K EL+ LP +
Sbjct: 481 IAVGA-----SAPMGSNKNERTELITACGQGKNGALAILTRGVQPELVTEVESGTLPNLQ 535
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
G+WT+++ R + R + +H +L++S+++ TM++ET + L EV+ S+++
Sbjct: 536 GLWTLHY---RKEGSKEER-----EPFHHHLLLSMKSSTMIMETGEELQEVSASLEFITN 587
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
T+AA N+FG +QV G R+L G QD+ ++ G+ + S I D
Sbjct: 588 QATLAASNIFGHYCSVQVTGTGIRVLKGGVKVQDVGLQDMDAPKGA-----AIASAQILD 642
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK--------GPE- 719
PY+++ +SDGSIRLL GD +VS+ AI +S V++ L D G E
Sbjct: 643 PYIIVRLSDGSIRLLSGDEKQMSVSLMETGAIPTSS--VTAFALVDDSVEAADAAGGGER 700
Query: 720 --PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
W+ + +T+ ++ G GA + + + E G+LE+F +P+ ++
Sbjct: 701 KSGWIHRAATNGTITGLEGNKKSGA----CNNSEAIVALTREGGSLELFSLPSCTRIWCA 756
Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
D G MR + +T +N+ S ++V++ + + H
Sbjct: 757 DGLSEG--------MR--VLSPQTPVNAESS------------VPEIVDIRIDSFQDAHE 794
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
RP L A+ DGT+L Y+ ++ D+P+ + LRFSR +
Sbjct: 795 RPLLTAVRGDGTLLLYKGFIVPAGTTYEGQDEPLEKN--------------ELRFSRVNV 840
Query: 898 D-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLR 944
D A ++ GA RI G QG F++G P W +V R R+
Sbjct: 841 DVEGSGLNVAGIGAAGQLRDSLAGARLTRIGNVGEGQGVQGIFVAGPNPLWLIVRRSRVL 900
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATP 1004
P +G +VAFTV HNVNC HGFI T+ G ++ICQ+PS Y+ WPV+K+ LK TP
Sbjct: 901 ALPTRGEGEVVAFTVFHNVNCPHGFILGTALGGVRICQMPSKMHYEAAWPVRKVALKCTP 960
Query: 1005 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE--- 1061
H ITY + LY L+ S PV + I+Q+ H I L+ V R +
Sbjct: 961 HTITYLPDFKLYALVTSAPV-----PWVEREIEQDNVHGI---ALAKVRRERAKANDDME 1012
Query: 1062 -EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG 1120
+Y VR+L P WQ ++ E+ VR V L + T +LLA+GTA G
Sbjct: 1013 LQYSVRLLVPGSLDSAWQH----ALEPGEHVQCVRNVQLRDINTGALLSLLAVGTAMPGG 1068
Query: 1121 EDVAARGRVLLFST--GRNADNPQNLVTE---VYSKELKGAISALASLQGHLLIASGPKI 1175
ED RGRV+LF R+A++ + +E K A +AL++L GHL++A G K+
Sbjct: 1069 EDTPCRGRVILFQMVWERDAESMDGYRWKGQVCCVREAKMACTALSALDGHLIVAVGTKL 1128
Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL--LA 1233
+H W G ELN +AF+D P++ VS+N+VKNFIL+GD+ K ++F WK G + ++ L+
Sbjct: 1129 TVHTWDGVELNSVAFFDT-PIHTVSINVVKNFILVGDLEKGLHFFRWKANGFEKSIIQLS 1187
Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
KDF +D +TEFLIDG+TLSL+ SD N +IF Y PK ESWKGQKLL R+ +HVG+
Sbjct: 1188 KDFDRMDVVSTEFLIDGATLSLLGSDMSGNARIFGYDPKSLESWKGQKLLVRSAYHVGSP 1247
Query: 1294 VTKFLRLQMLATSSDRTGAAPGS--DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
+++ +R + T++ AAPG TNR A+ FGTLDG++G P DE T+ +L +LQ
Sbjct: 1248 ISRMVRFNVEGTTAK---AAPGERPKGTNRHAVFFGTLDGALGIFMPTDEPTYAKLHALQ 1304
Query: 1352 KKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
++L +V G NPR+FR + P ++D LLS +E L EQ +A +
Sbjct: 1305 RELNTTVRSPIGCNPRTFRTPKVFEGKHVQLLAPLDVLDGGLLSKFETLTFTEQRAVAER 1364
Query: 1410 TGTTRSQILSNLNDLALGTSFL 1431
+G R L + L+ +F+
Sbjct: 1365 SGVDRDLALGLIQHLSASNAFV 1386
>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1469
Score = 662 bits (1707), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/1549 (31%), Positives = 773/1549 (49%), Gaps = 198/1549 (12%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFA +K +H PTG+ + + + TH P+ G G PNLVV
Sbjct: 1 MSFAIHKQVHPPTGVDHACAAYFTH---------PI--------------GSGAPPNLVV 37
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRR---------VLMDGISAA------------ 99
AN + IY +R +G SG + ++ D IS A
Sbjct: 38 LQANRLTIYAIR--RDGDARDNPSGNATKEADDAAIAASLVADAISGAGATASATIDADD 95
Query: 100 ---SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
SLE+V + L+G V S+A L + +RD+++LA ++K+SV+EFD S L
Sbjct: 96 AEVSLEVVAEFDLNGTVGSIATLRRRFGAPREQRDALLLAVRESKLSVVEFDPSTLSLVC 155
Query: 157 TSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY---GLQMIILKASQG 209
+S+H +E+P G S R P+V DP+GRC VL+ G ++ +L
Sbjct: 156 SSLHSWETPPG---AGGVPSALRLAPTPPVVVADPEGRCAAVLLRAEGGTRLALLPTDND 212
Query: 210 GSGLVGDEDTFGSGG----GFSARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILH 264
+ G + + G G G +A ++ S+V++L R++ +++V+D F+HGY EPV+++LH
Sbjct: 213 AMDVDGGDGSEGKGRRTLRGTAAAVKKSYVVDLVREMGVRYVRDVCFLHGYGEPVLLVLH 272
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
E LTWA R + T +SA+S++ ++H +IW LPH Y+L A+P+P+GG +V+
Sbjct: 273 EERLTWAARATLVKDTMRLSAISLNVDARKHTVIWRRSALPHSCYRLTAMPAPLGGAIVL 332
Query: 325 GANTIHYHSQSASCALALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
N + + SQ +S ALALN A D + + ++ + LD A+A + AL++T
Sbjct: 333 SQNFLLHESQESSAALALNPLAGGGRGDDPAAKAAAAASAAALDGAYAAVISEKQALVTT 392
Query: 382 KTGDLVLLTVVYDGRVVQR---LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
K G L LL++ +GR + + L + +VL+S + + L FLGSR+GDSLLV
Sbjct: 393 KAGALYLLSLRIEGRRLATRGGMHLKRAGGAVLSSGMCLVTRRLLFLGSRVGDSLLVS-R 451
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG-EELSLYGSASNNTESA 497
C + + ++ + A + +R D V G SL +A+ +
Sbjct: 452 CSTARASTAAPGRRPRAAAAAATTAAAEVRLLPIRPQIDGVGGVSAASLRAAAAAHRAPD 511
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG----------------------ISK 535
++F VRDS++ I P+ D + G A AS +G + +
Sbjct: 512 HPGYTFTVRDSVLGISPVIDLTVG----ASASVSGDTIERTELIAACGHGKNGALAVLQR 567
Query: 536 QSNYELVE------LPGCKGIWTVYHKSS---RGHNADSSRMAAYDDEYHAYLIISLEAR 586
ELV LPG KG WTV+H S+ R + ++ A D YHAYL+ISL +
Sbjct: 568 GIQPELVTEVESGTLPGLKGTWTVHHDSADNERLRGSAAAAAAQAVDPYHAYLVISLASS 627
Query: 587 TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
TM+LET + L EV+E V+ T+ AGN FGR R++QV+++G R+ G QD++
Sbjct: 628 TMILETGEELKEVSEHVELVTDAATLCAGNAFGRERIVQVYDKGVRVAAGPVKVQDIAST 687
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-------VQTP-- 697
+++G G E +++ I+ PYVL +SDGS+ +L GD + T+ + P
Sbjct: 688 ELVADAGDG-EGIEIVAAEISFPYVLCRLSDGSLAVLKGDEESKTLVKLDVDALARLPPG 746
Query: 698 -----AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGADGGPLDQG 750
A + P ++ HD+ P +L++ +T +T + + D +
Sbjct: 747 GGIACATLVDDSTPAAAHGGLHDRSPG-FLKRATTATATTTTTTASASREDGDDDDDSRR 805
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
++ V GALE++ +P+ + +T + G + +
Sbjct: 806 PMFLAVTRTGGALELYSLPSCDKAWTANGLSEGVAVLSPA--------GSASAALVDRDA 857
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
+ ++VEL + ++ H RP L A+ DG +L Y+A+
Sbjct: 858 AAAADAGADRAPEIVELRVDAFARAHERPLLTALRADGAVLVYRAF-------------- 903
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA------PCQRITIFKNIS--- 921
+ +V+ L LRF+R P++ E GA P R+T F+ +
Sbjct: 904 -----TCAVAGPGGRALTQLRFARVPVEL---EGGGGGAVDLSALPGSRLTRFERVGDRG 955
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKI 980
G +G F+SG +P W + R R+ P + +V+FT HNVNC+ GFI T+ G ++I
Sbjct: 956 GIRGVFVSGPQPLWLLARRSRVLALPVRGEAQRVVSFTAFHNVNCHAGFILGTAAGGVRI 1015
Query: 981 CQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
CQ+P Y+ WPV+K+ L+ TPH + Y + LY L S P + ++ EV
Sbjct: 1016 CQIPGRMHYEAAWPVRKLALRCTPHHVQYLPDFKLYALSTSAP---------AKWVEPEV 1066
Query: 1041 GHQIDNHNLSSVDLHRTYTV------EEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ D H + V R + E++ V++L P G +T + M E+ V
Sbjct: 1067 AEE-DIHAATVVKTRRAKAMARGGVEEQFAVKLLVP----GSLETAWSRTMDPGEHVQAV 1121
Query: 1095 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1154
+ V + N T ++LA+GTA GED RGRV+LF + + +
Sbjct: 1122 KNVQVRNLRTGALHSMLAVGTAMPGGEDTPCRGRVILFEISWQMVDGETRRVPLLLLFFD 1181
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
A++AL+ L+GHL++A G K+I+H W G EL +AF+D P ++ V++N+VKNF+ +GD+
Sbjct: 1182 DALAALSGLEGHLVVAIGTKLIVHAWDGAELIPVAFFDTP-VHTVTINVVKNFVCIGDVQ 1240
Query: 1215 KSIYFLSWKE--QGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1270
K YF WK+ + + NL LAKDF S+D +TEFL+DGSTLSL+ +D N +F Y
Sbjct: 1241 KGAYFFRWKDDPRTGEKNLIQLAKDFESMDVLSTEFLVDGSTLSLLAADTAGNAYVFAYD 1300
Query: 1271 PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML----ATSSDRTGAAPGSDK--TNRFAL 1324
PK SESWKGQKLL++A FHVG+ V + +R ++ A + R P K NR A+
Sbjct: 1301 PKSSESWKGQKLLTKASFHVGSPVHRMVRFKLKTPTGAGNDGRAAPTPAEIKANANRHAV 1360
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN-GKAHR-PG 1382
FGTLDGS+G + P++ T +L+ LQ+ L + AGLN RS+R + G+A R P
Sbjct: 1361 FFGTLDGSLGILVPMESSTHAKLEVLQRWLNYNTAQNAGLNGRSYRAPKTTEGRAMRSPA 1420
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
P +++D E+L +E L +Q E A G TR + L+ L+ L+ T+F+
Sbjct: 1421 PHNLLDGEMLQGFESLAWTKQAEAADAAGMTREEALTYLHTLSAKTAFM 1469
>gi|414587801|tpg|DAA38372.1| TPA: hypothetical protein ZEAMMB73_993613 [Zea mays]
Length = 573
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/569 (58%), Positives = 411/569 (72%), Gaps = 34/569 (5%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--------ELDSELP-SKRG 51
MS+AAYKMMH PTGI +C +GFITHS AD ++DS + R
Sbjct: 1 MSYAAYKMMHLPTGIDHCAAGFITHSPADAAAFSTPAPAPTAAAGPDGDIDSTAARAPRR 60
Query: 52 IGPVPNLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYR-- 108
+GP PNLVV+AANV+E+Y VR + G++++ NS T ++DGIS A LELVCHYR
Sbjct: 61 VGPTPNLVVSAANVLEVYAVRAEVATGAEDAGNSSSTG--TILDGISGARLELVCHYRCK 118
Query: 109 ----------------LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
LHGN+ES+A+LS G RRDSI + F DAKI+ LEFDDSI+
Sbjct: 119 QMALASLHSLLAVNFRLHGNIESMAVLSDG---TENRRDSIAVTFNDAKITCLEFDDSIN 175
Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
GLR +SMHCFE PEW HLKRGRESFA GP++K DPQGRCG VLVYGLQ+IILKA+Q G
Sbjct: 176 GLRTSSMHCFEGPEWFHLKRGRESFAWGPIIKGDPQGRCGAVLVYGLQIIILKAAQVGQS 235
Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
LVG+++ + RIESS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAG
Sbjct: 236 LVGEDEPTRVLSSTAVRIESSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAG 295
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
R+S K TCM+SA SIS LKQHP+IWSA LPHDAY+LLAVP PI G+LV+ AN+IHYH
Sbjct: 296 RISSKSQTCMLSAFSISMGLKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYH 355
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
SQS SC+LALN+++ D S E+ ++SF VELD A ATWL +D+ + S+K G+++LLTVV
Sbjct: 356 SQSTSCSLALNSFSSQPDGSPEILKTSFHVELDVAKATWLSHDIVMFSSKNGEILLLTVV 415
Query: 393 YDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
YDGR VQRLDL K+ SVL+S TT+G+S FLGSRL DSLLVQF+CG TS+L L +
Sbjct: 416 YDGRAVQRLDLMKSKASVLSSGATTLGSSFIFLGSRLADSLLVQFSCGMPTSVLPD-LTD 474
Query: 453 EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E DIE+D P +KRL+R SD LQD+ + EELS + A N + + SF VRD+L+N+
Sbjct: 475 EPADIESDLPFSKRLKRIPSDVLQDVTSVEELSFHNKAVPNIVDSAEKISFVVRDALINV 534
Query: 513 GPLKDFSYGLRINADASATGISKQSNYEL 541
GPLKDF+YGLR N+D +A GI+KQSNYEL
Sbjct: 535 GPLKDFAYGLRTNSDPNAAGIAKQSNYEL 563
>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
Length = 432
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/411 (77%), Positives = 356/411 (86%), Gaps = 8/411 (1%)
Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
SV +PLNQVLS + DQE H +DN S+ LH+TYTV+E+EVRILE ++ GG W+T+
Sbjct: 30 SVCSFRPLNQVLSSMADQESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETK 89
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
+TIPMQ ENALTVR+VTL NTTTKENETLLAIGTAYV GEDVAARGRVLLFS + ++N
Sbjct: 90 STIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SEN 148
Query: 1141 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1200
QNLVTEVYSKE KGA+SA+ASLQGHLLIASGPKI L+KWTG EL +AFYDAP L+VVS
Sbjct: 149 SQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGAELTAVAFYDAP-LHVVS 207
Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
LNIVKNF+L GDIHKSIYFLSWKEQG+QL+LLAKDFGSLDCFATEFLIDGSTLSLV SD
Sbjct: 208 LNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVASDS 267
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
KN+QIFYYAPKM ESWKGQKLLSRAEFHVGAH+TKFLRLQML T S+KTN
Sbjct: 268 DKNVQIFYYAPKMVESWKGQKLLSRAEFHVGAHITKFLRLQMLPTQ------GLSSEKTN 321
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
RFALLFG LDG IGCIAP+DELTFRRLQSLQ+KLVD+VPHV GLNPRSFRQFHSNGK HR
Sbjct: 322 RFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHSNGKGHR 381
Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
PGPD+I+D ELL+HYEML L+EQL++A Q GTTRSQILSN +D++LGTSFL
Sbjct: 382 PGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDISLGTSFL 432
>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
Length = 389
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 311/397 (78%), Positives = 344/397 (86%), Gaps = 8/397 (2%)
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ DQE+G I+N S DL + YTV+E+EVRI+E + G W+TR TIPMQS ENALTV
Sbjct: 1 MADQELGLHIENDVTSGDDLQKVYTVDEFEVRIMELGKPSGHWETRFTIPMQSFENALTV 60
Query: 1095 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1154
R+VTL NT+TKENETL+AIGTAYVQGEDVAARGRVLL+S R ++N QNLVTEVYSKE K
Sbjct: 61 RIVTLQNTSTKENETLMAIGTAYVQGEDVAARGRVLLYSFSR-SENSQNLVTEVYSKESK 119
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
GA+SA+ASLQGHLLIASGPKI L+KWTG+EL +AFYDAP L+VVSLNIVKNF+L GDIH
Sbjct: 120 GAVSAVASLQGHLLIASGPKITLNKWTGSELTAVAFYDAP-LHVVSLNIVKNFVLFGDIH 178
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
KSIYFLSWKEQG+QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD KN+QIFYYAPKM
Sbjct: 179 KSIYFLSWKEQGSQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDSDKNVQIFYYAPKMV 238
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
ESWKGQKLLSRAEFHVGAHV+KFLRLQML T S+KTNRFAL+FGTLDG IG
Sbjct: 239 ESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ------GLASEKTNRFALVFGTLDGGIG 292
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
CIAP+DELTFRRLQSLQ+KLVD+VPHV GLNPRSFR F SNGKAHRPGPD+I+D ELLSH
Sbjct: 293 CIAPVDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELLSH 352
Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
YEML LEEQLEIA Q GTTRSQILSN +D LGTSFL
Sbjct: 353 YEMLSLEEQLEIAQQIGTTRSQILSNFSDFLLGTSFL 389
>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oryzias latipes]
Length = 1456
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 461/1508 (30%), Positives = 732/1508 (48%), Gaps = 227/1508 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + S S + K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSSSDKSSDAKTR-------KEKLEQVASFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +D+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLTGAS----KDALLLSFKDAKLSVIEYDPGTHDLKTLSLHYFEEPE---LRDGFFQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +L+YG ++++L + + DE G G G + S++I
Sbjct: 135 NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKD---TLSDEQEGGVGEGPKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD + ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTSFPLRVQEEVKITLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT+ + T+ FLGSRLG+SLL+++T + G ++ + E P K+
Sbjct: 366 AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPAEDGNDKQ--EKEKQEPPNKK 423
Query: 467 LRRSSSD-----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGP 514
R SS L D V+ E+ +YGS A + T+ A TFSF V DS++NIGP
Sbjct: 424 KRVESSSNWTGCSASYFFVLSDEVD--EIEVYGSEAQSGTQLA--TFSFEVCDSILNIGP 479
Query: 515 LKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGCKG 549
+ S G + N + +G K ++ ELPGC
Sbjct: 480 CANASMGEPAFLSEEFQSNPEPDLEIVVCSGYGKNGALSVLQRSIRPQVVTTFELPGCHD 539
Query: 550 IWTVY----HKSSRG--HNADSSRMAAYDD---------EYHAYLIISLEARTMVLETAD 594
+WTV K S G AD+ + D + H +LI+S E TM+L+T
Sbjct: 540 MWTVISGEDKKESEGGEKEADAEKKEEQDKTEPPLEDDAKKHGFLILSREDSTMILQTGQ 599
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
+ E+ S + QG T+ AGN+ + +IQV G R+L+G + L F P +
Sbjct: 600 EIMELDTS-GFATQGPTVFAGNIGDNQYIIQVSPMGLRLLEG---VKQLHFIPVDL---- 651
Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT-----VSVQTPAAIESSK----- 704
S ++ S+ADPYV++ ++G + + V T +++Q P S+
Sbjct: 652 ---GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDTYMGKTHRLALQKPQISTLSRVIALC 708
Query: 705 -----------KPVSSC---------------TLYHDKG----PEPWLRKTSTDAWLSTG 734
+ SSC T+Y D E + + A ++ G
Sbjct: 709 AYRDVSGMFTTENKSSCSSKEDLILRSNSETETVYQDLSNTVDDEEEMLYGESGASMAAG 768
Query: 735 V-----GEAIDGADGGPLDQGDI----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
G A GG G + V+ E+G +EI+ +P++ VF V F G+
Sbjct: 769 KEEMSRGSAATAPPGGEGSAGKAEPSHWCVLIRENGVMEIYQLPDWRLVFLVKNFPVGQR 828
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+VD+ + S T+ + EE T QG + + +V L R SRP+L +
Sbjct: 829 VLVDS----SSGQSATQGDGKKEEVTRQGEIPLVKEVALVALGNNR-----SRPYLL-VH 878
Query: 846 TDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
+ +L Y+A+ + + P+N K +RF + P RE
Sbjct: 879 VENELLVYEAFPYDQQQPQNNLK-----------------------VRFKKVPHSINFRE 915
Query: 904 ETPH---------GAPCQRITI---------FKNISGHQGFFLSGSRPCWCMVF-RERLR 944
+ P G P + + + F++ISG+ G F+ G P W ++ R LR
Sbjct: 916 KKPKLKKDKKAEGGGPEENVAVKSRISRFRYFEDISGYSGVFICGPSPHWMLITSRGGLR 975
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATP 1004
+HP DG I +F+ HN+NC GF+Y QG L+I LP+ +YD WPV+KIPL+ T
Sbjct: 976 LHPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTV 1035
Query: 1005 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1064
H ++Y E +Y + SV + L I + G + + + + + E++
Sbjct: 1036 HFVSYHVESKVYAVCTSV-------KELCTRIPRMTGEEKEFETIERDERYINPLQEKFS 1088
Query: 1065 VRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGE 1121
++++ P W+T I ++ E+ ++ V L + T + +A GT +QGE
Sbjct: 1089 IQLISPVS----WETIPNTRIDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCVLQGE 1144
Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKII 1176
+V RGR+L+ P +T+ +Y KE KG ++AL G+L+ A G KI
Sbjct: 1145 EVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCHGYLVSAIGQKIF 1204
Query: 1177 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
L +L G+AF D LY+ + +KNFIL D+ KSI L ++E+ L+L+++D
Sbjct: 1205 LWALKDNDLTGMAFIDTQ-LYIHQMISIKNFILAADVMKSISLLRYQEESKTLSLVSRDA 1263
Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
L+ ++ EF++D + L +VSD KN+ ++ Y P+ ES+ G +LL RA+F+ GAH+
Sbjct: 1264 KPLEVYSIEFIVDNNQLGFLVSDRDKNLFVYMYLPEAKESFGGMRLLRRADFNAGAHINS 1323
Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
R M + +G+ N+ F TLDG IG + P+ E T+RRL LQ L
Sbjct: 1324 LWR--MPCRGALDSGSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTT 1381
Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
+PH AGLNP++FR HSN ++ + +I+D ELL+ Y L E+ E+A + GTT+
Sbjct: 1382 MLPHHAGLNPKAFRMMHSNRRSLQNAVKNILDGELLAKYLYLSTMERSELAKKIGTTQDI 1441
Query: 1417 ILSNLNDL 1424
IL +L ++
Sbjct: 1442 ILDDLLEI 1449
>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
chinensis]
Length = 1469
Score = 611 bits (1575), Expect = e-171, Method: Compositional matrix adjust.
Identities = 458/1505 (30%), Positives = 716/1505 (47%), Gaps = 214/1505 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
T + HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQRVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTAG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD AHA ++ D ++S K G++ +LT+V DG R V+ K
Sbjct: 307 TTAFPLRTQDGVRLTLDCAHAAFISYDKMVISLKGGEIYVLTLVTDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFS-- 519
+ SS QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + +
Sbjct: 425 DPTGGWSGSSTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480
Query: 520 ---------------YGLRINADA--SATGI---------SKQSNYELV----------- 542
YGL A+ TG+ S + + E+V
Sbjct: 481 EPAFLSEEVGTGVAEYGLIGQAEGWGRRTGLTPAPVQFQNSPEPDLEIVVCSGYGKNGAL 540
Query: 543 ---------------ELPGCKGIWTVY-------HKSSRGHNADSSRMAAYDD--EYHAY 578
ELPGC +WTV ++ + + R A +D H +
Sbjct: 541 SVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKDEEETPKAEGTEQPRAAEAEDGVRRHGF 600
Query: 579 LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 601 LILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-- 657
Query: 639 MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-----TVS 693
L F P + + ++ ++ADPYV++ ++G + + + T ++
Sbjct: 658 -VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFLLKSDTYGGRHHRLA 709
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG-------------PEPWLRKTSTDAWLSTGVGEAID 740
+ P SK V + LY D EP R + L +D
Sbjct: 710 LHKPPLHHQSK--VITLCLYRDVSGMFTTESRLGGARDEPGARGSCEVEGLGAETSPTVD 767
Query: 741 G------ADGG----------------PLDQGDI--------YSVVCYESGALEIFDVPN 770
D G P D+ + ++ E+G +E++ +P+
Sbjct: 768 DEEEMLYGDSGSLFSPSKEETRRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEMYQLPD 827
Query: 771 FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
+ VF V F G+ +VD+ + T+ + EE T QG + + +V L
Sbjct: 828 WRLVFLVKNFPVGQRVLVDS----SFGQPATQAEARKEEATRQGELPLVKEVLLVALG-- 881
Query: 831 RWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL 890
+ SRP+L + D +L Y+A+ P ++ + N++ +
Sbjct: 882 ---SRQSRPYLL-VHVDQELLLYEAF----PHDSQLGQGNLKVRFKKVPHNINFREKKLK 933
Query: 891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQL 949
+ T E R F++I G+ G F+ G P W +V R LR+HP
Sbjct: 934 PSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMG 993
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
DG I +F HNVNC GF+Y QG L+I LP+ +YD WPV+KIPL+ T H + Y
Sbjct: 994 IDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAY 1053
Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
E +Y + S P + I + G + + + D + E + ++++
Sbjct: 1054 HVESKVYAVATSTNA--PCTR-----IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLIS 1106
Query: 1070 PDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAAR 1126
P W+ A I ++ E+ ++ V+L + T + +A GT +QGE+V R
Sbjct: 1107 PVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCR 1162
Query: 1127 GRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
GR+L+ P +T+ +Y KE KG ++AL GHL+ A G KI L
Sbjct: 1163 GRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLR 1222
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
+EL G+AF D LY+ + VKNFIL D+ KSI L ++E+ L+L+++D L+
Sbjct: 1223 ASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEV 1281
Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1301
++ +F++D + L +VSD +N+ ++ Y P+ ES+ G LL RA+FH+GAHV F R
Sbjct: 1282 YSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGLLLLRRADFHLGAHVNTFWR-- 1339
Query: 1302 MLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
+ GA G K N+ F TLDG IG + P+ E T+RRL LQ L
Sbjct: 1340 -----TPCRGAVEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTT 1394
Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
+PH AGLNPR+FR H + + + +++D ELLS Y L E+ E+A + GTT
Sbjct: 1395 MLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLSRYLYLSTMERSELAKKIGTTPDI 1454
Query: 1417 ILSNL 1421
IL +L
Sbjct: 1455 ILDDL 1459
>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oreochromis niloticus]
Length = 1456
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 455/1509 (30%), Positives = 732/1509 (48%), Gaps = 235/1509 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GN+ S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKADKSSDSKSR-------KEKLEQVASFSLFGNIMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLVGAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +LVYG ++++L + + DE G G G + S++I
Sbjct: 135 NVHIPVVRVDPENRCAVMLVYGTKLVVLPFRKDT---LTDEQESGVGEGPKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD + ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTAFPLRVQDEVKLTLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA---PS 463
SVLT+ + T+ FLGSRLG+SLL+++T + G + + + + D PS
Sbjct: 366 AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQETPAEEGKERQDKEKDKDKQEPPS 425
Query: 464 TKRLRRSSSD----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
K+ SS++ L D V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 426 KKKRVESSTNWTVCVILDFFVLSDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 481
Query: 513 GPLKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGC 547
GP + S G + N + +G K ++ ELPGC
Sbjct: 482 GPCANASMGEPAFLSEEFQSNPEPDLEVVVCSGYGKNGALSVLQRSIRPQVVTTFELPGC 541
Query: 548 KGIWTVYHKSSRGHNADSSRMAAY------------DDEYHAYLIISLEARTMVLETADL 595
+WTV + D + D + H +LI+S E TM+L+T
Sbjct: 542 HDMWTVISSDVKEDKTDKEEVEKEEEEKKTEPPLEDDAKKHGFLILSREDSTMILQTGQE 601
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ S + QG T+ AGN+ + +IQV G R+L+G + L F P +
Sbjct: 602 IMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGLRLLEG---VRQLHFIPVDL----- 652
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDP-----STCTVSVQTPAAIESSKKPVSSC 710
S ++ S+ADPYV++ ++G + + V T +++Q P I S + ++ C
Sbjct: 653 --GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDSYMGKTHRLALQKPQ-IPSQSRVITLC 709
Query: 711 -------------------------------TLYHDKG-----PEPWLRKTSTDAWLSTG 734
T+ HD E L S + +T
Sbjct: 710 AYRDVSGMFTTENKVSCSIKEDTIRSQSEAETIIHDMSNTVDDEEEMLYGDSNAS--ATP 767
Query: 735 VGEAIDGADGGPLDQGDI----------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGR 784
E I+ + P G + ++ E+G +EI+ +P++ VF V F G+
Sbjct: 768 AKEDINRSFVAPTTSGSEATSSKAEPTHWCMIIRENGVMEIYQLPDWRLVFLVKNFPVGQ 827
Query: 785 THIVDTYMREALKDSETEINSSSEEGT-GQGRKENIHSMK----VVELAMQRWSAHHSRP 839
+VD+ SS + T G+G+KE + V E+A+ +HS+P
Sbjct: 828 RVLVDS--------------SSGQSATQGEGKKEEVTRQGEIPLVKEVALVSLGNNHSKP 873
Query: 840 FLFAILTDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
+L + + +L Y+A+ + + P+N K +RF + P
Sbjct: 874 YLL-VHVEQELLIYEAFQYDQQQPQNNLK-----------------------VRFKKVPH 909
Query: 898 DAYTRE----------------ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCMVF 939
+ RE E G + R F++ISG+ G F+ G P W +V
Sbjct: 910 NINFREKKSKLKKDKKAESSATEESSGVKGRIARFRFFEDISGYSGVFICGPSPHWMLVT 969
Query: 940 -RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKI 998
R LR+HP DGSI +F+ HN+NC GF+Y QG L+I LP+ +YD WPV+KI
Sbjct: 970 SRGALRLHPMTIDGSIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1029
Query: 999 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1058
PL+ T H ++Y E +Y + SV +P + I + G + + + + +
Sbjct: 1030 PLRCTVHYVSYHVESKVYAVCTSVK--EPCTR-----IPRMTGEEKEYEVIERDERYIHP 1082
Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAY 1117
E++ ++++ P TR I ++ E+ ++ V L + T + +A GT
Sbjct: 1083 QQEKFSIQLISPVSWEAIPNTR--IDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCL 1140
Query: 1118 VQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASG 1172
+QGE+V RGR+L+ P +N +Y KE KG ++AL G+L+ A G
Sbjct: 1141 MQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIG 1200
Query: 1173 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
KI L +L G+AF D LY+ + +KNFIL D+ KSI L ++E+ L+L+
Sbjct: 1201 QKIFLWVLKDNDLTGMAFIDTQ-LYIHQMFSIKNFILAADLMKSISLLRYQEESKTLSLV 1259
Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
++D L+ ++ EF++D + L +VSD KN+ ++ Y P+ ES+ G +LL RA+F+ GA
Sbjct: 1260 SRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGA 1319
Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
++ F R+ + A D N+ F TLDG IG + P+ E T+RRL LQ
Sbjct: 1320 NINTFWRMPCRGALDASSKKALTWD--NKHITWFATLDGGIGLLLPMQEKTYRRLLMLQN 1377
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
L +PH AGLNP++FR HS+ ++ + +I+D ELL+ Y L + E+ E+A + GT
Sbjct: 1378 ALTTMLPHHAGLNPKAFRMLHSDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGT 1437
Query: 1413 TRSQILSNL 1421
T+ IL +L
Sbjct: 1438 TQDIILDDL 1446
>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Takifugu rubripes]
Length = 1444
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 440/1474 (29%), Positives = 721/1474 (48%), Gaps = 191/1474 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ GA+ RD+++L+F+DAK+SV+E+D H L+ S+H FE L L+ G
Sbjct: 82 ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE---LELRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+ RC +L+YG ++++L + + DE G G G + +++I
Sbjct: 135 NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDT---LTDEQEVGVGEGPKSSFLPTYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD + A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA-----PSTKRL 467
+ + T+ FLGSRLG+SLL+++T L G ++ + E D P +K+
Sbjct: 372 TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQDMPLEEGKDQQDKEKEKDMDKQEEPPSKKK 431
Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
R SS D V+ E+ +YGS A + T+ A T+SF V DS++NIGP + S G
Sbjct: 432 RVESSSNWTDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAFL 487
Query: 522 ---LRINADAS-----ATGISKQSNYELV------------ELPGCKGIWTVY------- 554
+ N + +G K ++ ELPGC +WTV
Sbjct: 488 SEEFQSNPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEPVQK 547
Query: 555 --HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
++ R + A D + H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 548 EQEETEREGKEKTEPPAEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 606
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
AGN+ + +IQV G R+L+G +TQ L F P + S ++ S+ADPYV+
Sbjct: 607 FAGNIGDNKYIIQVSPMGIRLLEG--VTQ-LHFIPVDL-------GSPIVHCSLADPYVV 656
Query: 673 LGMSDGSIRLLVGD-----PSTCTVSVQTPAAIESSK-------KPVS---------SCT 711
+ ++G + + V T +++Q P S+ + VS SC+
Sbjct: 657 IMTAEGVVTMFVLKIDSYMGKTHRLALQKPQISTQSRVIALCAYRDVSGMFTTENKVSCS 716
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGV---------GEAIDGADGGPLDQGDI---------- 752
+ D + LST + G++ G +++
Sbjct: 717 ITEDISIRSQSEAETIIQDLSTNIVDDEEEMLYGDSNTGPSKEEMNRSSFAGPSEGSYSK 776
Query: 753 -----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
+ ++ +SG +EI+ +P++ VF V F G+ +VD+ ++ E E
Sbjct: 777 AEPSHWCLITRDSGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSATQGEKE--GKK 834
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF--EGPENTS 865
EE T QG + + +V L +HSRP+L + D +L Y+A+ + + P+N
Sbjct: 835 EEVTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVDQELLIYEAFPYDQQQPQNNL 888
Query: 866 KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PHGAPCQ----- 912
K +RF + P + RE+ G +
Sbjct: 889 K-----------------------VRFKKVPHNINFREKKSKLRKDKKAEGTAAEDSVAA 925
Query: 913 -----RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCN 966
R F++ISG+ G F+ G P W +V R LR+HP DG I +F+ HN+NC
Sbjct: 926 RGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMSIDGPIESFSPFHNINCP 985
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLK 1026
GF+Y QG L+I LP+ +YD WPV+KIPL+ T H ++Y E +Y + S+
Sbjct: 986 KGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKVYAVCTSL---- 1041
Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1086
+ L I + G + + + + + +++ ++++ P TR I ++
Sbjct: 1042 ---KELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPVSWEAIPNTR--IDLE 1096
Query: 1087 SSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP---- 1141
E ++ V L + T + +A GT +QGE+V RGR+L+ P
Sbjct: 1097 EWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGRILILDVIEVVPEPGQPL 1156
Query: 1142 -QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1200
+N +Y KE KG ++AL G+L+ A G KI L +L G+AF D L++
Sbjct: 1157 TKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDNDLTGMAFIDTQ-LHIHQ 1215
Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
+ +KNFIL D+ KS+ L ++E+ L+L+++D L+ ++ EF++D + L +VSD
Sbjct: 1216 MMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDR 1275
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
KN+ ++ Y P+ ES+ G +LL RA+F+ GA++ F R M + G+ N
Sbjct: 1276 DKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTFWR--MPCRGALEAGSRKAMTWDN 1333
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
+ F TLDG +G + P+ E T+RRL LQ L + H AGLNP++FR H + ++ +
Sbjct: 1334 KHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNPKAFRMLHCDRRSLQ 1393
Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
+I+D ELL+ Y L + E+ E+A + GTT+
Sbjct: 1394 NPVKNILDGELLNKYLYLSMMERSELAKKIGTTQ 1427
>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
Length = 1420
Score = 600 bits (1547), Expect = e-168, Method: Compositional matrix adjust.
Identities = 455/1539 (29%), Positives = 728/1539 (47%), Gaps = 244/1539 (15%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A YK H PTG+ C + +R NLVV
Sbjct: 2 YAIYKETHPPTGVEFCVNCHFYSARES---------------------------NLVVAG 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+ ++ + Q+EGS +++ G + +R LELV + L GN+ESL +
Sbjct: 35 TTEVRVFRLCYQQEGSSSAESGGSSLKR---------KLELVGQHSLFGNIESLHAIRLA 85
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
G RDS++++F+DAK+S++++D H ++ S+H FE + +K + R P+
Sbjct: 86 G----NTRDSLLMSFKDAKLSIVDYDPGKHDIKTRSLHFFEDEK---IKSHCLAQDRAPV 138
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
V++DP+ RC +L YG +++L Q G +D+ S + S++I+++++D
Sbjct: 139 VRIDPERRCAVMLAYGTHLVVLPFRQEGGIDDTAQDSIISSSD-RPPVLPSYIIDVKEID 197
Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
K ++ D F+HGY EP ++IL+E TWAGR++ ++ TC + A+S++ + K HP++W
Sbjct: 198 EKTCNILDIQFLHGYYEPTLLILYEPLKTWAGRLAMRNDTCALVAVSLNMSQKAHPVVWQ 257
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE------ 354
LP D ++ VP PIGGVLV N + Y +QS Y VS++S E
Sbjct: 258 LSCLPFDCIYVMPVPKPIGGVLVCCMNALLYLNQSVP------PYGVSVNSIGENSTVFP 311
Query: 355 -LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
P+ ++ L+ ++A ++ ND + S K G++ ++T++ DG R V+ KT SVLT
Sbjct: 312 LKPQKGVTITLEGSNAIFIANDKLVFSLKGGEIYVVTLIADGVRSVRNFVFDKTAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
S + G+ FLGSRLG+SLLV++T + G + ++ D + +R + +
Sbjct: 372 SCVCECGDGYLFLGSRLGNSLLVKYT--EKPQDIVYGTENNAQSMQCD--NIERWQILNG 427
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----LRIN--- 525
L + + +EL +YG A +++F V DSL+NIGP G L ++
Sbjct: 428 SLLLIVDDLDELEVYG-AQQEAGVELTSYTFEVCDSLLNIGPCSCMDIGEPAFLSVSSYF 486
Query: 526 ADA--------SATGISKQSNYELV------------ELPGCKGIWTVYHKSSRG----- 560
ADA S +G K ++ ELPGC +WTV+ K +
Sbjct: 487 ADAQELDLEVVSCSGYGKNGALTVLQRSIRPQVVTTFELPGCTDMWTVFSKDQKKGAQTN 546
Query: 561 --HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
H S +++YH++LI+S E +M+L+T + EV +S + Q TI AGN
Sbjct: 547 AIHRYPSQPCTQGNEKYHSFLILSREDSSMILKTEQEIMEVDQS-GFSTQCATIYAGNFG 605
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
++QV G R+L+G Q + +SG S ++ S+ DPY +L M+DG
Sbjct: 606 NGSYILQVTPLGVRLLEGVNQLQHIPM-----DSGL----SNIVWCSVCDPYAVLLMADG 656
Query: 679 SIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------------KGPEP 720
S+ L+ + S ++V P+ +SSK V +C Y D K P P
Sbjct: 657 SVILIEFIKSASGPKLTVSRPSLSQSSK--VCACCTYKDMSGLFTTENSNLEEVSKVPSP 714
Query: 721 WLRKTS----------------------TDAWLSTGVGEAIDGADGGPLD------QGDI 752
T+ T L+ E P++ Q
Sbjct: 715 KPEMTAPPRQEKESLTIDEEDELLYGGDTSLTLTFEPPEPSKAESAAPVEVFEEPLQPSY 774
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ +VC E+G +EI+ +P F VF V F IVD+ DS SS E
Sbjct: 775 WCLVCRENGVMEIYSLPGFTRVFFVKNFSKAPRVIVDS------GDSGASTQSSVSEE-- 826
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
S+ V E+ + + R L A++ D +L Y+A+ + E
Sbjct: 827 -------ESLNVREVLLTGLGYKNRRATLVAVM-DQDLLIYEAFSYPTVEGH-------- 870
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-------------CQRITIFKN 919
NLRF + + RE+ P P + +F +
Sbjct: 871 ---------------LNLRFKKLQHNIQIREKKPKQEPKNDSETKSGLDPKVAMLRVFND 915
Query: 920 ISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
IS + G F+ GS P W V R HP DG + F HNVNC GF+Y ++G L
Sbjct: 916 ISSYSGIFVCGSYPFWIFVTNRGAFHWHPMSIDGPVTCFAAFHNVNCPKGFLYFNTRGEL 975
Query: 979 KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI-D 1037
+I LP+ +YD+ WPV+K+PL+ TPH ++Y E Y ++ S +P ++ + D
Sbjct: 976 RISVLPTHLSYDSPWPVRKVPLRYTPHMVSYNRESKTYAIVTSEQ--EPCKKIPRVTAED 1033
Query: 1038 QEVGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+E I D Y + E + ++++ P W+ IP + V
Sbjct: 1034 KEFVDTIR-------DARFIYPSTERFVLQLISPIS----WEV---IPNTRHDLDEWEHV 1079
Query: 1097 VTLFNTTTKENET------LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1147
T+ N ET + +GT + GE++A RGR+L+F P +T+
Sbjct: 1080 TTMKNLLLHSEETHTGRKGFICVGTTQLYGEEIAVRGRILIFDIIEVVPEPGQPLTKNKF 1139
Query: 1148 --VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1205
+Y KE KG ++AL + G+L+ G KI + +T +L G+AF D LY+ SL ++
Sbjct: 1140 KLLYEKEQKGPVTALNQVNGYLVSGIGQKIYIWNFTDNDLVGMAFIDTQ-LYIHSLVTIR 1198
Query: 1206 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
NF++ D+ KSI L +E+ L ++KD +L+ +A +F IDG + +VSD +KN+
Sbjct: 1199 NFVIAADVCKSITLLRLQEETKTLAFVSKDPKNLEVYAADFFIDGPQIGFLVSDVEKNLV 1258
Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
+F Y P+ ES GQ+LL RA+ +VG H+T F R+ A A+ K R
Sbjct: 1259 LFTYQPEAIESQGGQRLLQRADINVGTHITSFFRIAAKA----HLKASGEKSKEMRQLTC 1314
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
FGTLDG++G + P+ E TFRRL LQ KLVD +PHVAGLNP++FR + +
Sbjct: 1315 FGTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVAGLNPKAFRMLQWRKRKLCNPHRN 1374
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +LL Y L E+ E+A + GTT +QI+ ++ D+
Sbjct: 1375 VLDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMMDI 1413
>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
[Crassostrea gigas]
Length = 1369
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 439/1431 (30%), Positives = 700/1431 (48%), Gaps = 180/1431 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+E + + L GN+ S+ + GA RDS++L+F +AK+SV+E+D H L+ TS+H
Sbjct: 5 MECLATFTLFGNIMSMKYVKLPGA----LRDSLLLSFSEAKLSVVEYDPGTHDLQTTSLH 60
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE P +K G + P V+VDP GRC +LVYG M+IL + GD
Sbjct: 61 FFEEPS---MKGGFFTNYCIPEVRVDPDGRCAAMLVYGTHMVILPFRRDVMVEEGD---- 113
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G + I SS++I+LR+ D K +VKDF F+HGY EP + IL E TWAGR + +
Sbjct: 114 NLAGTSKSPILSSYIIDLRNFDEKIINVKDFQFLHGYYEPTVFILFEPLQTWAGRTAVRA 173
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC I A+S++ K HP+IWS +LP D ++LAVP PIGGV+++ N++ Y +QS
Sbjct: 174 DTCSIVAISLNLQEKVHPVIWSLGSLPFDCCQVLAVPRPIGGVIIIAVNSLLYLNQSVP- 232
Query: 339 ALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
Y VSL+S S P + + LD A ++ D +LS K G+L +LT+
Sbjct: 233 -----PYGVSLNSISAQSTLFPLRVQEGVRIALDCCQAAFMSYDKIVLSLKGGELYVLTL 287
Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
V DG R V+ + K+ SVLTS + + FLGSRLG+SLL+++T + + + L
Sbjct: 288 VVDGMRSVRSFNFDKSAASVLTSCMCICEDGFLFLGSRLGNSLLLKYTEKASECLENGDL 347
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVR 506
++ + D P+ K+ + S + V N +L +YGSA N T + +++F V
Sbjct: 348 DKK----KEDEPAAKKKKVEGSTEIASDVSQIENLYDLEVYGSAENPTSTTITSYTFEVC 403
Query: 507 DSLVNIGPL------------KDFSYGLRINADASAT-GISKQSNYELV----------- 542
D++ NIGP ++FS + + T G K ++
Sbjct: 404 DNIWNIGPCGNIVMGEPAFLSEEFSSCEDPDIEMVMTSGYGKNGALSVLQRSIRPQVVTT 463
Query: 543 -ELPGCKGIWTVYH--KSSRGHNADSSRMAAYDDEY---HAYLIISLEARTMVLETADLL 596
ELPGC +WTV + + ++S DD H++LI+S +M+LET +
Sbjct: 464 FELPGCLDMWTVKSLVPKEKSEDKENSMEDDSDDNIEGGHSFLILSRSDSSMILETGQEM 523
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
E+ S + Q TI AGN+ G R ++QV + R+L+G Q + ++GS
Sbjct: 524 NELDHS-GFSTQTTTIFAGNIGGDRYIVQVSDTSLRLLEGVRQIQHIPL-----DTGS-- 575
Query: 657 ENSTVLSVSIADPYVLLGMSDGSI--------------RLLVGDPSTCTVSVQTPAAIES 702
V+ S+ADPY++L +G I RL+VG PS +S + + S
Sbjct: 576 ---PVVQCSLADPYIVLLTQEGQILMFTLRTESVGLGVRLVVGKPS---ISQHSKVEVIS 629
Query: 703 SKKPVSSCTLYHDK------GPEPWLRKTSTDAWLSTGV-----------GEAIDGADGG 745
+ K VS ++ P+ KT T+ S GE
Sbjct: 630 AYKDVSGLFTCMNQMEDVQVTPDTKATKTVTERSFSIDAKTADEEDELLYGETESNVFNS 689
Query: 746 PLDQGDI-------------------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH 786
+ G + ++C E+G LEI+ +P++ V+ V F G+
Sbjct: 690 SFNMGQTAEMESPTKEKKQTEAKPTYWLLLCRENGVLEIYSIPDYKKVYYVKNFPMGQKL 749
Query: 787 IVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILT 846
+VD+ ++ G Q K N + EL M SRP L A +
Sbjct: 750 LVDS----------VQVTDKLSSGERQ-EKVNAECPALKELLMVGLGYKDSRPHLLARVE 798
Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
D Y++E S D R + + R + + + + + +EE
Sbjct: 799 D------DLYIYEAFSYPQSSIDNHLKLRFKKIQHDLILREKRSKSKKKDPEEFQKEEKK 852
Query: 907 HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
G ++ FK+++G+ G F+ G+ P W V R LR+HP DG + F+ HN+NC
Sbjct: 853 VG----KMRYFKDVAGYSGVFVCGAYPHWIFVTSRGSLRIHPMGIDGPVWCFSEFHNINC 908
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
HGF+Y G L+I LP+ TYD WPV+K+PL+ TPH + Y E +Y ++ S P +
Sbjct: 909 PHGFLYFNKMGELRISVLPTHLTYDAPWPVRKVPLRCTPHFVAYHFENKIYAVVTSTPEI 968
Query: 1026 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQT--RAT 1082
N++ + I+ D Y T+ + +++ P W+
Sbjct: 969 --CNKLPKTTTEDREWDTIEK------DERFIYPTIPRFTLQLYSPTS----WEVVPNTK 1016
Query: 1083 IPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1141
I + E+ ++++ + L + T ++ + +GT GE+V +RGRV++ P
Sbjct: 1017 IECEEWEHVVSMKTIRLRSEETLSGFKSYIVMGTNLSLGEEVTSRGRVIIADIIEVVPEP 1076
Query: 1142 -----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
++ + +Y KE KG ++ALA + G L+ A G K+ + + +L G+AF D +
Sbjct: 1077 GMPLTKHKIKTLYEKEQKGPVTALADINGLLITAIGQKLYIWQLKDNDLMGVAFIDT-HI 1135
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
Y+ +L +K+ IL GDI KS+ ++E+ L+++++D L+ + +FLID + L +
Sbjct: 1136 YIHTLVTIKHIILAGDILKSVSVYQYQEEHKVLSIVSRDPRPLEVYTADFLIDNTQLCCL 1195
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSD-RTGAA 1313
VSD KN+ ++ Y P+ ES GQ+L+ +A+F+ G++V+ R+ ++ SSD R A
Sbjct: 1196 VSDRMKNLVVYSYQPEARESHGGQRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTGA 1255
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
P R F TLDGS+G + PL E +RRL LQ LV +PHVAGLNPRS+R
Sbjct: 1256 P----EKRHITYFATLDGSLGFVLPLSEKVYRRLFMLQNALVTHIPHVAGLNPRSYRHVI 1311
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
R +I+D ELL Y L + E++EIA + GT+ QI+ +L ++
Sbjct: 1312 GTFPELRNPQKNILDGELLWKYTNLSIMEKIEIAKRLGTSNDQIMDDLMEI 1362
>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
floridanus]
Length = 1418
Score = 578 bits (1490), Expect = e-162, Method: Compositional matrix adjust.
Identities = 436/1478 (29%), Positives = 717/1478 (48%), Gaps = 197/1478 (13%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV ANVI ++ + + ++ K + ++ LE + Y LHGN+ S+
Sbjct: 30 LVVAGANVIRVFRLIPDIDMTRREKYTENRPPKM--------KLECLAQYTLHGNIMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 82 AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
P+V+VDP+GRC +L+YG ++++L + S + D D S S+ I SS+
Sbjct: 135 HHIPIVRVDPEGRCAIMLIYGRKLVVLPFRKDPS--LDDGDLLDSAKLTSSNKTPILSSY 192
Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+I L+ L+ M +V D F++GY EP ++IL+E T++GR++ + TC + A+S++
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLYGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
+ HP+IWS NLP D Y+++ V P+GG L++ N++ Y +QS ++LN+ A + +
Sbjct: 253 RVHPIIWSVSNLPFDCYQVVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTN 312
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + L+ + ++ D ++S K+G+L +L++ D R V+ K SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISGDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRR 469
LTS + ++ FLGSRLG+SLL++FT ++ + E + E++ K+ ++
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENESEETPAKKAKQ 432
Query: 470 S------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
+SD L D+ + EEL +YGS ++ T ++ F V DSL+NIGP + S G
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSETH-TSIQITSYIFEVCDSLLNIGPCGNISMGEP 490
Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
N D + +G K ++ ELPGC+ +WTV
Sbjct: 491 AFLSEEFLHNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI---- 546
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
G + ++ A + HA+LI+S E TM+L+T + EV +S + QG T+ AGNL
Sbjct: 547 -GTLNNDEQVKAEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFAGNLG 604
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
R ++QV + G R+L G Q + ++ S ADPYV L DG
Sbjct: 605 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLSEDG 654
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP--EPWLRKTSTDAWLST--G 734
+ LL T + A + + + Y D L +T+ D +
Sbjct: 655 QVVLLTLREVRGTARLHAQPANLLFRPQIEALCTYRDVSGIFTTQLSETTDDEQVEEEHN 714
Query: 735 VGEA-----IDGADG-----------------GPLD----------------QGDIYSVV 756
V E ID D PLD + + +V
Sbjct: 715 VEEPSLLSNIDNEDDLLYGDAPAFQMPAPSYQKPLDGVSKKAPWWQRHLQEIKPTYWLLV 774
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
+SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 775 YRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQSAPVNEIPNPE-------- 826
Query: 817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 827 -----MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGYLK---------- 868
Query: 877 LSVSNVSASRLRNLRFSRTP--LDAYTREE-TPHGAPCQRITI---FKNISGHQGFFLSG 930
R + L P L +EE P A RI + F NI+G+ G F+
Sbjct: 869 --------LRFKKLEHGIIPGRLSPKPKEEDMPMNASETRICMMRYFSNIAGYNGVFICC 920
Query: 931 SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
P W + R LR HP DG I +F +NVNC GF+Y + L+IC LP+ +Y
Sbjct: 921 DYPHWIFLTGRGELRTHPMGIDGPITSFAAFNNVNCPQGFLYFNRKEELRICVLPTHLSY 980
Query: 990 DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
D WPV+K+PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 981 DAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YRF 1023
Query: 1050 SSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTL 1099
+ D + +T EE R L P + + W+T I ++ E+ ++ V+L
Sbjct: 1024 NGED--KEFTEEERPERFLYPSQEQFSIVLFSPVSWETIPNTKIELEQWEHVTCLKNVSL 1081
Query: 1100 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKEL 1153
T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE
Sbjct: 1082 AYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 1141
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D+
Sbjct: 1142 KGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIADV 1200
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+KSI L ++E+ L+L+++DF + + E+LID + L ++D + N+ +F Y P+
Sbjct: 1201 YKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFFLADGESNLALFMYQPES 1260
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
ES GQKL+ +A+FH+G V F R++ ++ ++ G+DK R ++ TLDGS
Sbjct: 1261 RESLGGQKLIRKADFHLGQKVNTFFRIRCRVSDPANDKKQFSGADK--RHVTMYATLDGS 1318
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCEL 1391
+G I P+ E T+RRL LQ LV + H+AGLNP+S+RQ + + ++ P I+D +L
Sbjct: 1319 LGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGDL 1378
Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+ Y LP E+ ++A + GT +I+ ++ ++ T+
Sbjct: 1379 VWRYLFLPNNEKTDVAKKIGTRVQEIIEDITEIDRQTA 1416
>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus terrestris]
Length = 1417
Score = 578 bits (1489), Expect = e-161, Method: Compositional matrix adjust.
Identities = 428/1478 (28%), Positives = 709/1478 (47%), Gaps = 198/1478 (13%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L V AN+I I+ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
P+V+VDP+GRC +L+YG ++++L + S + D D + S + I SS++
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
I L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306
Query: 354 EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
E P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 307 ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE-ADAPST 464
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ E + +
Sbjct: 367 AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426
Query: 465 KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
K++++ +SD L D+ + EEL +YGS S Q T + F V DSL+NIGP +
Sbjct: 427 KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485
Query: 518 FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
S G + S+ + ELV ELPGC+ +W
Sbjct: 486 ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV G + ++ + HA+LI+S E TM+L+T + EV +S + QG T
Sbjct: 545 TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I AGNL R ++QV + G R+L G Q + ++ S ADPYV
Sbjct: 599 IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
L DG + LL T + AA + + + Y D+
Sbjct: 649 TLLSEDGQVMLLTLREGRGTAKLHAQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708
Query: 717 GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
PE + T + S G+ + +
Sbjct: 709 APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGISKRTPWWQKHLQEIK 768
Query: 751 DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
Y ++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 769 PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 828 ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL-- 870
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
+ L + LR P+ TR C + F NI+G+ G F+
Sbjct: 871 ---RFKKLDHGIIPGQLKPKLRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 920 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979
Query: 989 YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
YD WPV+K+PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 980 YDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1022
Query: 1049 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1098
+ D + +T EE R + P + + W+T I + E+ ++ V+
Sbjct: 1023 FNGED--KEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVS 1080
Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1152
L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE
Sbjct: 1081 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1140
Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D
Sbjct: 1141 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIAD 1199
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++KSI L ++E+ L+L+++DF + + E+LID + L +V+D + N+ +F Y P+
Sbjct: 1200 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNMALFMYQPE 1259
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
ES GQKL+ +A+FH+G V F R++ L+ ++ G+DK R ++ +LDG
Sbjct: 1260 SRESLGGQKLIRKADFHLGQKVNTFFRIRCRLSDPANDKKHFSGADK--RHVTMYASLDG 1317
Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
S+G I P+ E T+RRL LQ LV + H+AGLNP+++R + S+ + I+D +L
Sbjct: 1318 SLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDL 1377
Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+ Y LP E++++A + GT +I+ +L ++ T+
Sbjct: 1378 VWRYFYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1415
>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
cpsf [Tribolium castaneum]
Length = 1413
Score = 575 bits (1483), Expect = e-161, Method: Compositional matrix adjust.
Identities = 431/1477 (29%), Positives = 708/1477 (47%), Gaps = 208/1477 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV + ANVI+++ + + ET + LE V Y L GN+ S+
Sbjct: 29 SLVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
++ + RD+++LAF+DAK+SV+E+D H L+ S+H FE + +K G
Sbjct: 82 QSVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTH 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESS 233
P+V+ DP+ RC + V+G ++++L + + D D G G A I +S
Sbjct: 135 HYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILAS 194
Query: 234 HVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I L+D + ++ D F+HGY EP ++IL E T+AGRV+ + TC ++A+S++
Sbjct: 195 YMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQ 254
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP D K + + P+GG L+ N + Y +QS + Y VSL+S
Sbjct: 255 QKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNS 308
Query: 352 SQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
E P+ + LD A AT+L++D +LS K G+L +LT++ D R V+
Sbjct: 309 IAENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHF 368
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADA 461
K SVLT+ I+ N+ FLGSRLG+SLL++FT C ++ E
Sbjct: 369 EKAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIE 417
Query: 462 PSTKRLRRSSS----------DALQDMV--------NGEELSLYGSASNNTESAQKTFSF 503
PS KRL+ S+S D L D + + EEL +YG+ + ++ F
Sbjct: 418 PSAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVF 476
Query: 504 AVRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYEL 541
V DSL+NIGP ++FS L ++ + T + K ++
Sbjct: 477 EVCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQI 536
Query: 542 VE---LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
V LPGC +WTV+ + HA+LI+S E TM+L+T D + E
Sbjct: 537 VTTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINE 580
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
+ ++ + T+ AGNL + ++QV R+L G Q + E GS
Sbjct: 581 I-DNTGFATHIPTVYAGNLGNLKYIVQVTSSAVRLLQGINQLQHIPL-----ELGS---- 630
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG- 717
++ V+ DPY+ L +DG + L+ + + + S+ PV++ +Y D
Sbjct: 631 -PIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSG 689
Query: 718 -------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS-- 754
PE ++ ++ T + + + G D P + +Y
Sbjct: 690 LFTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNW 748
Query: 755 --------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
V E+ LEI+ +P+F + + G +VD E++ S
Sbjct: 749 WKKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISA 806
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
+ S++ E Q R+ ++ + VV L H SRP L L + + Y+ + F
Sbjct: 807 STPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF-- 857
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
P K + NVS R D + +E ++ F NI
Sbjct: 858 PRGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNI 906
Query: 921 SGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+G+ G F+ G+ P W M R LR HP DG +++F +NVNC GF+Y + L+
Sbjct: 907 AGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELR 966
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
I LP+ +YD WPV+K+PL+ TPH +TY E Y L+ S+ +P N+
Sbjct: 967 IGVLPTHLSYDAAWPVRKVPLRCTPHFVTYHLESKTYCLVTSIA--EPSNKYYKF----- 1019
Query: 1040 VGHQIDNHNLSSVDLHRTYTV---EEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1094
++ LS D + E++ + + P W I + E+ +
Sbjct: 1020 ---NGEDKELSVEDRGDRFPYPLQEKFSLMLFSP----VSWDVIPNTKIDLDEWEHVNCL 1072
Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEV 1148
+ V+L T+ + +A+GT Y GEDV +RGR+L+F P +N E+
Sbjct: 1073 KNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKEI 1132
Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
Y+K+ KG ++AL+ ++G L+ A G KI + + +L G+AF D +Y + +K+ +
Sbjct: 1133 YAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYTHQILTIKSLL 1191
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
L+ D++KSI L ++E+ L+L+++DF + F+ E++ID +T+ +VSD +KN+ ++
Sbjct: 1192 LVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNLVLYM 1251
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFG 1327
Y P+ ES GQ+LL +A+FH+G V F R++ L + G+DK R ++
Sbjct: 1252 YQPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADK--RHITMYA 1309
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
TLDG +G I P+ E T+RRL LQ LV H+AGLNP++FR + S K S++
Sbjct: 1310 TLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSVI 1369
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
D EL+ +Y L + E+LE++ + GT ++L +L+D+
Sbjct: 1370 DGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDI 1406
>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus impatiens]
Length = 1417
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 427/1478 (28%), Positives = 709/1478 (47%), Gaps = 198/1478 (13%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L V AN+I I+ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
P+V+VDP+GRC +L+YG ++++L + S + D D + S + I SS++
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
I L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306
Query: 354 EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
E P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 307 ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ + + E +
Sbjct: 367 AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426
Query: 465 KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
K++++ +SD L D+ + EEL +YGS S Q T + F V DSL+NIGP +
Sbjct: 427 KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485
Query: 518 FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
S G + S+ + ELV ELPGC+ +W
Sbjct: 486 ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV G + ++ + HA+LI+S E TM+L+T + EV +S + QG T
Sbjct: 545 TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I AGNL R ++QV + G R+L G Q + ++ S ADPYV
Sbjct: 599 IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
L DG + LL T + AA + + + Y D+
Sbjct: 649 TLLSEDGQVMLLTLREGRGTAKLHVQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708
Query: 717 GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
PE + T + S GV + +
Sbjct: 709 APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGVSKRTPWWQKHLQEIK 768
Query: 751 DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
Y ++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 769 PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 828 ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
+ L + R P+ TR C + F NI+G+ G F+
Sbjct: 870 --LRFKKLDHGIIPGQLRPKPRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 920 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979
Query: 989 YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
YD WPV+K+PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 980 YDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1022
Query: 1049 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1098
+ D + +T EE R + P + + W+T I + E+ ++ V+
Sbjct: 1023 FNGED--KEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVS 1080
Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1152
L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE
Sbjct: 1081 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1140
Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D
Sbjct: 1141 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIAD 1199
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++KSI L ++E+ L+L+++DF + + E+LID + L +V+D + N+ +F Y P+
Sbjct: 1200 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNMALFMYQPE 1259
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
ES GQKL+ +A+FH+G V F R++ ++ ++ G+DK R ++ +LDG
Sbjct: 1260 SRESLGGQKLIRKADFHLGQKVNTFFRIKCRVSDPANDKKHFSGADK--RHVTMYASLDG 1317
Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
S+G I P+ E T+RRL LQ LV + H+AGLNP+++R + S+ + I+D +L
Sbjct: 1318 SLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDL 1377
Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+ Y LP E++++A + GT +I+ +L ++ T+
Sbjct: 1378 VWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1415
>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
Length = 1409
Score = 568 bits (1463), Expect = e-158, Method: Compositional matrix adjust.
Identities = 429/1457 (29%), Positives = 701/1457 (48%), Gaps = 178/1457 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LVV N++ ++ + + +K T+RR LE + + L NV S+
Sbjct: 29 SLVVAGKNILRVFQLIPDID---PTKRDAYTERRP-----PKMKLECLSSFSLFANVMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ RD+++L+F +AK+ V+E+D H LR S+H FE + +K G +
Sbjct: 81 QAVSLAGSS----RDALLLSFREAKLCVVEYDPDSHDLRTLSLHYFEEED---MKGGWTN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------GSGGGFSARI 230
P V+VDP+GRC +LVYG +++IL + + D D S A +
Sbjct: 134 HYDIPYVRVDPEGRCAAMLVYGRKLVILPFRRESK--LDDPDIALLDPHSSSVATAKAPV 191
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
SS+ I LR++D +++V D F++GY EP ++IL+E T+AGR++ + TC + A+S+
Sbjct: 192 LSSYTITLREIDEKLENVIDIQFLYGYYEPTLLILYEPLKTFAGRIAVRSDTCAMIAVSL 251
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAV 347
+ + HP IWS NLP + + + VP P+GG L+ N + Y +QS +++N+ A
Sbjct: 252 NIQQRVHPAIWSVGNLPFNCTQAIPVPKPLGGTLIFSVNALIYLNQSIPPFGVSVNSIAE 311
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ + Q + + L+ + AT++ +D +LS KTG+L +L+++ D R V+ K
Sbjct: 312 NSTNFQLKIQEGVKITLEGSQATFISHDRLVLSLKTGELYVLSLLADNIRSVRGFHFDKA 371
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT+ + + FLGSRLG+SLL++FT + L E E PS +R
Sbjct: 372 AASVLTTCLCVCEDKYLFLGSRLGNSLLLRFTEKESSEAPIITLDESIR--EVPVPSKRR 429
Query: 467 LRRSSSDALQ----DMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
+ + D + D+ + +EL +YG+ ++ +F F V DSL+NIGP + S G
Sbjct: 430 RQDALGDWMASDVADIRDLDELEVYGTQEASSSVQITSFMFEVCDSLLNIGPCGNVSMGE 489
Query: 523 RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
+ ++ + ELV ELPGC +WTV
Sbjct: 490 PAFLSEEFSN-NRDPDLELVTTSGHGKNGAICVLQRTIRPQVVTTFELPGCLDMWTVI-- 546
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
G +DS A DD HA+LI+S + TM+L+T + EV S + QG TI AGN
Sbjct: 547 ---GPQSDSGPTQAEDDISHAFLILSQKDSTMILQTGQEINEVDHS-GFNTQGPTIFAGN 602
Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
L + ++QV + G R+L G Q + S+V+ S ADPYV L
Sbjct: 603 LASNKYIVQVSKAGVRLLRGLEQIQHIPL----------DLGSSVVHASTADPYVALLTE 652
Query: 677 DGSI------------RLLVGDPST---------CTVSVQTPAAIESSKKPVSSCTLYHD 715
DG + RL V P+ CT + ++++ + + T D
Sbjct: 653 DGQVVLLTLRESRGQGRLSVFKPTIPTNPRVSKICTYRDVSGLFTLTTEEELQNATFKSD 712
Query: 716 KGPEPWLRKTS--TDAWLSTG---------VGEAIDGADGGPLDQGDIYS---------V 755
++K + D L G + + + P + YS
Sbjct: 713 SKN---MKKEADDEDEMLYGGSEVKFQLLPITNTNEPSPPRPFVRWKKYSQEIKPNYWMF 769
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
V E+G L+I+ +P+F F + + G + D ++ + G
Sbjct: 770 VLRETGTLDIYSLPDFRPSFQIRRIGQGHRVLYDV------------LDMAQTSGMDGSD 817
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTS 874
IH + VV L H R + + T+ ++ YQA+ F +GP +
Sbjct: 818 DPEIHELLVVSLG------HLGRRPILLLRTENDLMIYQAFKFAKGPNLKIRF------- 864
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
R L + + R + Y E A R+ F NISG+ G F+ G P
Sbjct: 865 RRLPQTLILKERKAKFKVK------YENEVESERA--TRLRYFSNISGYNGVFVCGPNPH 916
Query: 935 WC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
W + R LR HP L DG + +F HNVNC GF+Y TS+ L+IC LP+ +YD W
Sbjct: 917 WLFLTARGELRSHPMLIDGRVTSFASFHNVNCPLGFLYFTSKCELRICILPTHLSYDAPW 976
Query: 994 PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
PV+K+PL+ TPH +TY E Y LI S +P N+ ++ H +++ + D
Sbjct: 977 PVRKVPLRCTPHMVTYHLESKTYCLITSSS--EPSNEYFR-FNGEDKEHSVEDRD----D 1029
Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS--ENALTVRVVTL-FNTTTKENETL 1110
+++ + + P W+ M+ E+ V+ V L + T +
Sbjct: 1030 RFPLPLQDKFSIVLFSP----VSWEVIPNTKMELDEWEHVTCVKTVNLSYEGTRSGLKGY 1085
Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1165
+A+GT Y ED+ ++GR+L++ P +N VY+KE KG ++AL + G
Sbjct: 1086 VAVGTNYNYSEDITSKGRILIYDIIEVVPEPGQPLTKNRFKTVYAKEQKGPVTALCHVLG 1145
Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
L+ A G KI + + +L GIAF D +Y+ + VK+ IL+ D++KSI L ++E+
Sbjct: 1146 FLVTAMGQKIYIWQLKDNDLVGIAFIDT-QIYIHQMISVKSLILVADVYKSISLLRFQEE 1204
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
L+L+++DF + +A E L+D + + ++SD + NI ++ Y P+ +S GQKLL +
Sbjct: 1205 YRTLSLVSRDFRPCEVYAIELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSVGGQKLLRK 1264
Query: 1286 AEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
A+FH+G H+ + R++ L ++ G++K R +F TLDG++G + P+ E T+
Sbjct: 1265 ADFHLGQHINSWFRIRCRLGDQAENYDFPIGAEK--RHISMFATLDGALGYLLPIPEKTY 1322
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
RRLQ LQ LV +PH+AGLNP++FR + S K IVD EL+ Y L + E+
Sbjct: 1323 RRLQMLQNILVYHIPHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSLTVMEKQ 1382
Query: 1405 EIAHQTGTTRSQILSNL 1421
++A + G+ I+ ++
Sbjct: 1383 DVAKKMGSKMDDIIEDI 1399
>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Megachile rotundata]
Length = 1415
Score = 567 bits (1462), Expect = e-158, Method: Compositional matrix adjust.
Identities = 429/1468 (29%), Positives = 705/1468 (48%), Gaps = 180/1468 (12%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV N+I ++ + + +K K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGGNIIRVFRLIPDVDITKREKYTESRPPKM--------KLECLAQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG-SGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++F S S + + + E + K+
Sbjct: 369 ASVLTSCVCMCDDNYLFLGSRLGNSLLLRFIEKESENSQNMNENEITIEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS ++ T ++ F V DSL+NIGP + S
Sbjct: 429 VKQDFIGDWMASDVL-DIKDPEELEVYGSETH-TSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 GLRINADASATGISKQSNYELV--------------------------ELPGCKGIWTVY 554
G + S+ + ELV ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI 545
Query: 555 HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAA 614
G + ++ + HA+LI+S E TM+L+T + EV +S + QG T+ A
Sbjct: 546 -----GALNNDEQVRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFA 599
Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
GNL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 600 GNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVSLL 649
Query: 675 MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDKGPE 719
DG + LL T + A + + + Y D+ PE
Sbjct: 650 SEDGQVMLLTLREGRGTAKLHAQTANLLFRPQIEALCAYRDVSGIFTTQLPENVEDEVPE 709
Query: 720 --------PWLRKTSTDAWLSTGVGEAID--------GADG----GPLDQGDI------Y 753
P + + L G G A ++G P Q + Y
Sbjct: 710 EEHNTEEPPIVGNIDNEDDLLYGDGPAFQMPAPSQTKSSEGTSKRAPWWQKHLQEIKPTY 769
Query: 754 SVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ Y +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 770 WLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTAPVNEIPNPE---- 825
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
M+V E+ M H +RP L L D + YQ Y + P+ K
Sbjct: 826 ---------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQTYRY--PKGHLK------ 867
Query: 873 TSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
L + + NLR D ET H C + F NI+G+ G F+
Sbjct: 868 ----LRFKKLDHGIIPGNLRPKPKEEDMSAMNETRH---CM-MRYFSNIAGYNGVFICSD 919
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DG I +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 920 YPHWIFLTGRGELRTHPMGIDGPITSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979
Query: 991 NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
WPV+K+PL+ TPH +TY E Y +I S+ +PL G +
Sbjct: 980 APWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSYYRF-----NGEDKEFTEED 1032
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1107
D + E++ + + P W+T I + E+ ++ V+L T+
Sbjct: 1033 RPDRFIFPSQEQFSIVLFSPVS----WETIPNTKIELDQWEHVTCLKNVSLAYEGTRSGL 1088
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
+ + +GT Y GED+ +RGR+L+F P +N ++Y+KE KG I+A+
Sbjct: 1089 KGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQ 1148
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
+ G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D++KSI L +
Sbjct: 1149 VSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIADVYKSISLLRF 1207
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
+E+ L+L+++DF + + E+LID + L +V+D + NI +F Y P+ ES GQKL
Sbjct: 1208 QEEYRTLSLVSRDFRPAEVYTIEYLIDNNNLGFLVADGESNIALFMYQPESRESLGGQKL 1267
Query: 1283 LSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
+ +A+FH+G V F R++ ++ ++ G+DK R ++ +LDGS+G I P+ E
Sbjct: 1268 IRKADFHLGQKVNTFFRIRCRISDPANDKKHFSGADK--RHVTMYASLDGSLGYILPVPE 1325
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
T+RRL LQ LV + H+AGLNP+++R + S + I+D +L+ Y LP
Sbjct: 1326 KTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYIRTQGNPARGIIDGDLVWRYLYLPNN 1385
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTS 1429
E++++A + GT +I+ +L ++ T+
Sbjct: 1386 EKIDVAKKIGTRVQEIIEDLTEIDRQTA 1413
>gi|414587798|tpg|DAA38369.1| TPA: hypothetical protein ZEAMMB73_163106, partial [Zea mays]
Length = 483
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 270/458 (58%), Positives = 328/458 (71%), Gaps = 10/458 (2%)
Query: 685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
DPSTCT+S+ PA SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID D
Sbjct: 33 ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
D DIY ++CYESG LEIF+VP+F VF+VD FVSG + D + R + KDS
Sbjct: 93 SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+S+ +KE ++K+VELAM RWS SRPFLF +L DGT+LCY AY FEG E+
Sbjct: 153 DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
+ S + N + SRLRNLRF R +D +R++ C RITIF N+
Sbjct: 210 VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDIS----CLVRPRITIFNNVG 265
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G++G FL G RP W V R+R RVHPQLCDG IVAFTVLHNVNC G IYVTSQG LKIC
Sbjct: 266 GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
QLPS YDNYWPVQK+PL TPHQ+TY+ E++LYPLIVSVP ++PLNQVLS + DQE+G
Sbjct: 326 QLPSAYNYDNYWPVQKVPLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQVLSSMADQELG 385
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN 1101
++N S DL YTV+E+EVRI+E ++ G W+TR+TIPMQS ENALTVR+VTL N
Sbjct: 386 LHMENDVTSGGDLQEVYTVDEFEVRIMELGKSNGRWETRSTIPMQSFENALTVRIVTLQN 445
Query: 1102 TTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
T+TKENETL+AIGTAYVQGEDVAARGRVLLFS ++ +
Sbjct: 446 TSTKENETLMAIGTAYVQGEDVAARGRVLLFSFSKSEN 483
>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
tauri]
gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
[Ostreococcus tauri]
Length = 1473
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 390/1293 (30%), Positives = 603/1293 (46%), Gaps = 204/1293 (15%)
Query: 260 MVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+ IL+E+ TWAGR + TC I ALS+ ++ +IW NLP +YKL A+ P+G
Sbjct: 6 LAILYEKTPTWAGRYNLAKDTCEIVALSVDVDKQKSTVIWRRQNLPSSSYKLTALLPPLG 65
Query: 320 GVLVVGANTIHYHSQSASCALALNNYA------------------VSLDSSQELPRS--- 358
GVLV + + + SQ +S AL LN + + D+ P +
Sbjct: 66 GVLVFSQDFLLHESQESSSALCLNTFGRGGPQEGNDAETVARLAGMGEDAVANPPPACAA 125
Query: 359 -----SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
+ LD A A + D L++TK G L LL + DGR ++R+ L + +VL+S
Sbjct: 126 RAVDCGLEITLDGAQAAVVSEDRVLVTTKMGALFLLALHTDGRSLRRMMLQRAGGAVLSS 185
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSSGLKEEFGDIEADAPSTKRLRRS 470
+ + L FLGSR+GDSLLV+FT S + ML G +E E + S KR +
Sbjct: 186 GMCLLSRDLLFLGSRIGDSLLVKFTPKSEPAAPLMLPKGEDDEETVDEVEKGSGKRSKSG 245
Query: 471 SSDALQDMV-----------------NGEELS--LYGS-------ASNNTESAQKT---- 500
A++ + +EL LYG+ T++A+K
Sbjct: 246 DGAAIRKRAKSTEDPPPAPSTPSPEDDDDELEALLYGTTKAESVIGDETTQTAEKKREGL 305
Query: 501 -----------FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQ---- 536
+ F V+DSL+ + P+ D + G + D +A G K
Sbjct: 306 AGVVPGLKVAGYDFKVKDSLLGVAPVVDITVGASAPVGTDTAERTELVTACGQGKNGALA 365
Query: 537 -----------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
+ E LP +G+W ++ + + +R + +H +L++ L+
Sbjct: 366 ILTRGVQPELVTEVEAGTLPTLQGLWALHDRK------EGTR--EVREPFHNHLLLKLQ- 416
Query: 586 RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
EV+ S+++ T+AA N FG +Q+ E RIL QD++
Sbjct: 417 ------------EVSASLEFITDQATLAAANFFGHFCSLQITETSIRILKSGMKVQDVTL 464
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ GS + S I DPY+++ +SDG++RLL GD TVS+ A+ +S +
Sbjct: 465 ADIKAPKGS-----VIASAEILDPYIMIRLSDGTLRLLAGDEKKMTVSLMESGAMPTSSR 519
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
G W+ +++T+ ++ G GA +Q + + E G+LE+
Sbjct: 520 RTRLVEALKKSG---WIHRSATNGTITGLEGSKKSGAS----NQKEAIVAIAREGGSLEL 572
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
F +P+ ++ D G + T SE I ++V
Sbjct: 573 FSLPSCTRIWNADGLSEGSRVLSPTRPVH----SELRIP------------------EIV 610
Query: 826 ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
++ + + H RP L A+ DGT+L Y+ ++ S++P++
Sbjct: 611 DIRIDSFEEAHERPLLTAVRGDGTLLLYRGFIVPAGTTCEGSEEPLARG----------- 659
Query: 886 RLRNLRFSRTPLD-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
LRFSR +D A ++ G RI+ G QG F++G
Sbjct: 660 ---ELRFSRVNIDVEGSGLNVAGVGVAGQVRDSLAGTRLTRISNVGEGQGLQGIFVAGPN 716
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W +V R R+ P +G IVAFT HNVNC +GFI T+ G ++ICQ+PS Y+
Sbjct: 717 PLWLIVRRSRVLALPTRGEGEIVAFTDFHNVNCPYGFILGTAVGGVRICQMPSKMHYEAA 776
Query: 993 WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV-GHQIDNHNLSS 1051
WPV+KI LK TPH + Y + LY L+ S V +D+E+ G + +LS
Sbjct: 777 WPVRKIALKCTPHAVAYLPDFKLYALVTSANVP---------WVDREIDGENVHGLSLSK 827
Query: 1052 VDLHRTYTVE----EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1107
R + +Y VR+L P WQ ++ E+ VR V L + T +
Sbjct: 828 ARRERAKAHDDMELQYSVRLLVPGSLDCVWQHT----LEPGEHVQCVRNVQLKDINTGHS 883
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFST-----GRNADNPQNLVTEVYSKELKGAISALAS 1162
+ LA+GTA GED RGRV LF+ +AD + +E K A +AL
Sbjct: 884 LSYLAVGTAMPGGEDTPCRGRVYLFNMVWERDSESADGYRWKGQVCCVREAKMACTALEG 943
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
L GHL++A G K+ +H W G ELN +AF+D P ++ VS+N+VKNFIL+GD+ K ++F W
Sbjct: 944 LGGHLIVAVGTKLTVHTWDGRELNSVAFFDTP-IHTVSINVVKNFILVGDLEKGLHFFRW 1002
Query: 1223 KEQGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
K+ G + +L L+KDF +D ++EFLIDG+TLSL+ SD N + F Y PK ESWKGQ
Sbjct: 1003 KDTGFEKSLIQLSKDFERMDVVSSEFLIDGTTLSLLGSDMSGNARTFGYDPKSIESWKGQ 1062
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
KLL RA +HVG+ +++ +R + + S NRFA+ FGTLDG++G P D
Sbjct: 1063 KLLPRAAYHVGSPISRMVRFNVEGSKSKMASTDGKPKGANRFAVFFGTLDGALGIFMPTD 1122
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+T+ +L ++Q++L +V G NPR+FR + P ++D LLS +E L
Sbjct: 1123 PVTYEKLLAIQRELTTAVRSPIGCNPRTFRTPKVFEGKHVQLRAPLDVLDGGLLSKFETL 1182
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
EQ++IA R L + L+ +FL
Sbjct: 1183 TFSEQVKIASSAQVDRDLTLGLIQQLSASNAFL 1215
>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
Length = 1392
Score = 551 bits (1420), Expect = e-153, Method: Compositional matrix adjust.
Identities = 423/1477 (28%), Positives = 698/1477 (47%), Gaps = 229/1477 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV + ANVI+++ + + ET + LE V Y L GN+ S+
Sbjct: 29 SLVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
++ + RD+++LAF+DAK+SV+E+D H L+ S+H FE + +K G
Sbjct: 82 QSVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTH 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESS 233
P+V+ DP+ RC + V+G ++++L + + D D G G A I +S
Sbjct: 135 HYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILAS 194
Query: 234 HVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I L+D + ++ D F+HGY EP ++IL E T+AGRV+ + TC ++A+S++
Sbjct: 195 YMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQ 254
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP D K + + P+GG L+ N + Y +QS + Y VSL+S
Sbjct: 255 QKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNS 308
Query: 352 SQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
E P+ + LD A AT+L++D +LS K G+L +LT++ D R V+
Sbjct: 309 IAENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHF 368
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADA 461
K SVLT+ I+ N+ FLGSRLG+SLL++FT C ++ E
Sbjct: 369 EKAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIE 417
Query: 462 PSTKRLRRSSS----------DALQDMV--------NGEELSLYGSASNNTESAQKTFSF 503
PS KRL+ S+S D L D + + EEL +YG+ + ++ F
Sbjct: 418 PSAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVF 476
Query: 504 AVRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYEL 541
V DSL+NIGP ++FS L ++ + T + K ++
Sbjct: 477 EVCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQI 536
Query: 542 VE---LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
V LPGC +WTV+ + HA+LI+S E TM+L+T D + E
Sbjct: 537 VTTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINE 580
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
+ ++ + T+ AG I ++ +L
Sbjct: 581 I-DNTGFATHIPTVYAG-----------------INQLQHIPLELG-------------- 608
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG- 717
S ++ V+ DPY+ L +DG + L+ + + + S+ PV++ +Y D
Sbjct: 609 SPIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSG 668
Query: 718 -------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS-- 754
PE ++ ++ T + + + G D P + +Y
Sbjct: 669 LFTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNW 727
Query: 755 --------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
V E+ LEI+ +P+F + + G +VD E++ S
Sbjct: 728 WKKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISA 785
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
+ S++ E Q R+ ++ + VV L H SRP L L + + Y+ + F
Sbjct: 786 STPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF-- 836
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
P K + NVS R D + +E ++ F NI
Sbjct: 837 PRGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNI 885
Query: 921 SGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+G+ G F+ G+ P W M R LR HP DG +++F +NVNC GF+Y + L+
Sbjct: 886 AGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELR 945
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
I LP+ +YD WPV+K+PL+ TPH +TY E Y L+ S+ +P N+
Sbjct: 946 IGVLPTHLSYDAAWPVRKVPLRCTPHFVTYHLESKTYCLVTSIA--EPSNKYYKF----- 998
Query: 1040 VGHQIDNHNLSSVDLHRTYTV---EEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1094
++ LS D + E++ + + P W I + E+ +
Sbjct: 999 ---NGEDKELSVEDRGDRFPYPLQEKFSLMLFSP----VSWDVIPNTKIDLDEWEHVNCL 1051
Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEV 1148
+ V+L T+ + +A+GT Y GEDV +RGR+L+F P +N E+
Sbjct: 1052 KNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKEI 1111
Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
Y+K+ KG ++AL+ ++G L+ A G KI + + +L G+AF D +Y + +K+ +
Sbjct: 1112 YAKDQKGPVTALSQVKGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYTHQILTIKSLL 1170
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
L+ D++KSI L ++E+ L+L+++DF + F+ E++ID +T+ +VSD +KN+ ++
Sbjct: 1171 LVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNLVLYM 1230
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM-LATSSDRTGAAPGSDKTNRFALLFG 1327
Y P+ ES GQ+LL +A+FH+G V F R++ L + G+DK R ++
Sbjct: 1231 YQPESRESLGGQRLLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADK--RHITMYA 1288
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
TLDG +G I P+ E T+RRL LQ LV H+AGLNP++FR + S K S++
Sbjct: 1289 TLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTNPARSVI 1348
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
D EL+ +Y L + E+LE++ + GT ++L +L+D+
Sbjct: 1349 DGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDI 1385
>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1446
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 430/1516 (28%), Positives = 695/1516 (45%), Gaps = 273/1516 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + ++ S ++K R LE V + L GNV S+
Sbjct: 29 NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ GA+ RD+++L+F+DAK+SV+E+D H L+ S+H FE PE R++
Sbjct: 82 ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPEL------RDT 131
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
DE G G G + +++I
Sbjct: 132 LT-------------------------------------DEQELGVGEGPKSSFLPTYII 154
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + C I A+S++ K
Sbjct: 155 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQAQCSIVAISLNIMQKV 214
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 215 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 274
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD + A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 275 LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 334
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL----KEEFGDIEADAPSTKRLR 468
+ + T+ FLGSRLG+SLL+++T L G KE+ D++ L
Sbjct: 335 TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEMPLEEGKDKQEKEKDNDMDKQV-YVHTLN 393
Query: 469 RSSSDALQDMVNGE--ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
S+ + D E E+ +YGS A + T+ A T+SF V DS++NIGP + S G
Sbjct: 394 SFSAHSQHDFFVDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAF 451
Query: 526 ADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSR 559
G + + + E+V ELPGC +WTV +
Sbjct: 452 LSEEFQG-NPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEVK 510
Query: 560 GHNADSSRMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
++ D + H +LI+S E TM+L+T + E+ S + QG
Sbjct: 511 EDKKVPQSPGSFTATHYSLEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGP 569
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
T+ AGN+ + +IQV G R+L+G + L F P + S ++ S+ADPY
Sbjct: 570 TVFAGNIGDNKYIIQVSPMGIRLLEG---VKQLHFIPVDL-------GSPIVHCSVADPY 619
Query: 671 VLLGMSDGSI---------------RLLVGDPSTCTVSVQTPAAIESSKKPVS------- 708
V++ ++G + RL + P +S Q+ + + VS
Sbjct: 620 VVIMTAEGVVTMFVLKVDSYMGKTHRLALQKPQ---ISTQSRVIALCAYRDVSGMFTTEN 676
Query: 709 --SC---------------TLYHD------KGPEPWLRKTST-------DAWLSTGVGEA 738
SC T+ HD E L S+ + + + V
Sbjct: 677 KVSCAIAEDFNIRSQSETETVIHDLSSNIVDDEEEMLYGDSSSNAGPSKEEMIRSFVAPG 736
Query: 739 IDGADGGPLD-QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALK 797
++GGP + + +V ESG +EI+ +P++ VF V F G+ +VD+ ++
Sbjct: 737 PSVSEGGPSKAEPSHWCLVTRESGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSAT 796
Query: 798 DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
+ E EE T QG + + +V L +HSRP+L + + +L Y+A+
Sbjct: 797 QGDKE--GKKEEMTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVEQELLVYEAFP 848
Query: 858 F--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PH 907
+ + P+N K +RF + P + RE+
Sbjct: 849 YDQQQPQNNLK-----------------------VRFKKVPHNINFREKKSKLRKDKKAE 885
Query: 908 GAPCQ----------RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVA 956
GA + R F++ISG+ G F+ G P W +V R LR+HP DG I +
Sbjct: 886 GAAAEDGVAARGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMTIDGPIES 945
Query: 957 FTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN-- 1014
F+ HN+NC GF+Y QG L+I LP+ +YD WPV+KIPL+ T H ++Y E
Sbjct: 946 FSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKAS 1005
Query: 1015 -----LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
+Y + SV + L I + G + + + + + +++ ++++
Sbjct: 1006 LSHCCVYAVCTSV-------KELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLIS 1058
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGR 1128
P TR I ++ E ++ V L + T + +A GT +QGE+V RGR
Sbjct: 1059 PVSWEAIPNTR--IDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGR 1116
Query: 1129 VLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
+L+ P +T+ +Y KE KG ++AL G+L+ A G KI L
Sbjct: 1117 ILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFLWVLKDN 1176
Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
+L G+AF D LY+ + +KNFIL D+ KS+ L ++E+ L+L+++D L+ ++
Sbjct: 1177 DLTGMAFIDTQ-LYIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYS 1235
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
EF++D S L +VSD KN+ ++ Y P+ ES+ G +LL RA+F+ GA++ F R M
Sbjct: 1236 IEFMVDNSQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTFWR--MP 1293
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
+ G+ N+ F TLDG +G + P+ E T+RRL LQ L + H AG
Sbjct: 1294 CRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAG 1353
Query: 1364 LNPRSF-------------------------RQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
LNP++F R H + ++ + +I+D ELL+ Y L
Sbjct: 1354 LNPKAFRCVGADRTSAAMLSGMLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNKYLYL 1413
Query: 1399 PLEEQLEIAHQTGTTR 1414
+ E+ E+A + GTT+
Sbjct: 1414 SMMERSELAKKIGTTQ 1429
>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
Length = 1410
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 428/1471 (29%), Positives = 687/1471 (46%), Gaps = 205/1471 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV N I +Y + + + ++ ++ ETK + LE V Y L GNV S+
Sbjct: 29 NLVTAGVNQIRVYRLVAESKPVEKESHTTETKS-------AKQKLECVADYELCGNVSSI 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S GA RD+++L FE+AK+S+ ++D L+ S+H FE + L+ G
Sbjct: 82 ESISLVGA----ARDALLLCFEEAKLSLCDYDPDTDDLKTISLHYFEDAD---LENG--C 132
Query: 177 FARG---PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RG V+VDP+GRC +L+YG +I+L + D + S + I S+
Sbjct: 133 CQRGLHHSEVRVDPEGRCAVMLIYGTHLIVLPFRKESPSDEIDATSCAS----KSPIMST 188
Query: 234 HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I+LR LD + +V D F+HGY EP ++IL+E TW RV+ + TC I A+S++
Sbjct: 189 YIIDLRTLDERVTNVVDIQFLHGYYEPTVLILYEPLPTWTCRVAVRKDTCSIVAISLNLQ 248
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP+D + VP PIGGV+V N++ Y +QS Y VSL+S
Sbjct: 249 DKTHPIIWSHSNLPYDCLRTFPVPKPIGGVIVFAVNSLLYLNQS------FPPYGVSLNS 302
Query: 352 SQEL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
P+ + LD A A ++ ND ++S K G+L +LT+V D R V+ L
Sbjct: 303 LTSFNTEFLLKPQEGVRMSLDCAQAEFIDNDKLVISLKGGELYVLTLVIDSMRAVRSFHL 362
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
K SVLT+ + G++ FLGSRLG+SLL+++ SS G+ +
Sbjct: 363 DKAAASVLTTCMCMCGDNYLFLGSRLGNSLLLRYQ--EKKPEASSSSDASPGEEQRKEKM 420
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPL------- 515
T + S + + + +EL +YG S ES T F F V DS++NIGP
Sbjct: 421 TLAIGLVGSSDVSKLDDLDELEVYGRDSQAVESEDITQFMFEVCDSIINIGPCGQVEMGE 480
Query: 516 -----KDFSYGLRINADASATG----------ISKQSNYELV---ELPGCKGIWTVYHKS 557
++FS+ + + T + +Q ++V ELPGC +WTV
Sbjct: 481 PAFLSEEFSHQEDPDLELVTTSGYGKNGAISILQRQIRPQVVTTFELPGCTDVWTVLGSP 540
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+D + HA+L++S +MVLET + E+ S + T+ A N+
Sbjct: 541 DEQQGSDEKLAGS-----HAFLLLSRADSSMVLETGQEIMELDHS-GFCTDAPTVHAANI 594
Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
R ++QV +L G Q L+ S S V+S S+ADP+VLL D
Sbjct: 595 GNGRYIVQVGPNAIWLLKGVERIQHLALDVS----------SPVVSCSLADPHVLLLCED 644
Query: 678 GSIRLLV-----GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG---------PEP--- 720
G + LV DP T+S+ T + SK V + LY D EP
Sbjct: 645 GQLLHLVLSVQGDDP---TLSLLTTKLHQKSK--VIAINLYRDTSGLFVVASSESEPSAT 699
Query: 721 --------------------------------------WLRKTSTDAWLSTGVGEAIDGA 742
W ++ S E +GA
Sbjct: 700 TTTEATETTTPQQQTEEGVDDEDDLLYGDSDISAITSTWQKQESEKEEKKEEEEEEAEGA 759
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
D P ++V+ +G LE++ +P++ F V F +G ++D+ L S
Sbjct: 760 DIQP----TYWAVIIRATGNLELYSLPDWQLCFLVKNFATGNKLLIDSMQAADLSASFVA 815
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
S++E V E+ + + + S+P L A + D + Y+ + G +
Sbjct: 816 PERSTQEVPF-----------VHEVMLHGFGVNGSQPLLMARVHD-ELYIYKVFSHVGSK 863
Query: 863 NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTR-----EETPHGAPCQRITIF 917
+ RL+ +RF R R E+ P R F
Sbjct: 864 --------------------AKGRLQ-VRFKRRSHGLIIRPRDREEKIPENKKWLRP--F 900
Query: 918 KNISGHQGFFLSGSRPCW-CMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ISG+ G F+ GS P W M R LR HP DG+I FT HNVNC GF+Y +S
Sbjct: 901 TDISGYSGVFICGSYPHWLIMTQRGTLRGHPMAIDGTIPCFTAFHNVNCPKGFLYFSSNE 960
Query: 977 ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
L+IC LP+ +YD WPV+K+PL+ TPH + Y + Y ++ S V P Q++ +
Sbjct: 961 ELRICVLPTHLSYDAPWPVRKVPLRCTPHFVVYHPDSKTYSVVSSQQV--PCTQLVRVAG 1018
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
D E +I+ + D + ++ +++ P TR ++ E+ + ++
Sbjct: 1019 DGE--KEIE--AVQKDDRFVFPIMNKFNIQLFSPVSWEPIPNTR--FDLEEWEHVMCIKT 1072
Query: 1097 VTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYS 1150
+ L + T + + +GT EDV++RG++ ++ P +N + VY+
Sbjct: 1073 INLKSEGTLSGLKGYVVVGTNLNYNEDVSSRGKLTIYDVIDVVPEPGQPLTKNKIKVVYN 1132
Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILL 1210
KE KG ++AL +QG L+ A G K+ + + +L GIAF D +Y+ + +KN I++
Sbjct: 1133 KEQKGPVTALDGVQGFLVTAIGQKVYIWQLKDNDLAGIAFIDT-QIYIHKMEALKNLIII 1191
Query: 1211 GDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1270
GD+ KSI L ++E L+L++KD L + +L+D ++L+ +V+D+ KN ++ Y
Sbjct: 1192 GDVCKSISVLRYQEDMKVLSLVSKDVRPLAVYGVAYLVDETSLAFIVADKLKNFLVYCYQ 1251
Query: 1271 PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1330
P + +S GQ+L+ +A+ ++G+ V F R++ SD + + + + TLD
Sbjct: 1252 PDLVQSQGGQRLIRKADINIGSLVNAFFRVK--CRVSDPSTSKTDQSLAMKHITYYVTLD 1309
Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
GSIG + P+ E +RRL LQK L+ V AGLNP+++R + + +I+D +
Sbjct: 1310 GSIGYLLPISESLYRRLYMLQKMLIQQVQQTAGLNPKAYRTCQTEFRQLINIQRNIIDGD 1369
Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
L Y L ++ E+A + GTT QI +L
Sbjct: 1370 LAWKYLALTSHDRAEMAKRIGTTSHQIEDDL 1400
>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Rhipicephalus pulchellus]
Length = 1237
Score = 528 bits (1359), Expect = e-146, Method: Compositional matrix adjust.
Identities = 398/1305 (30%), Positives = 624/1305 (47%), Gaps = 198/1305 (15%)
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+HGY EP ++IL+E TW GR++ + TC I ALS++ + HP+IWS NLP D +
Sbjct: 3 FLHGYYEPTLLILYEPLRTWPGRIAIRQDTCCILALSLNLQQRVHPVIWSYTNLPFDCLR 62
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL-------PRSSFSVE 363
LLAVP P+GGVL++ +++ Y +QS Y VSL+S + P+ +
Sbjct: 63 LLAVPRPLGGVLIMAVDSLLYLNQSVP------PYGVSLNSFTDFSTSFPLKPQEGLKIS 116
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSL 422
LD A A +L D +LS K G+L +LT+ DG R V+ K SVLT+ +T +
Sbjct: 117 LDCAQACFLSYDRLVLSLKGGELYVLTLFNDGMRSVRNFYFDKAAASVLTTSMTLCEDGY 176
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----- 477
FLGSRLG+SLL+ +T + ++E + +A+ P +K+ R DA+ D
Sbjct: 177 LFLGSRLGNSLLLHYT-EKAAEVDDIAKRDEKTESDANDPPSKKKRM---DAIGDWMASD 232
Query: 478 --MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINAD 527
+++ +EL +YGS + T+ +++F V DSL+NIGP G N D
Sbjct: 233 VALIDPDELEVYGSETMATKQL-TSYTFEVCDSLINIGPCGKICMGEPAFLSEEFVQNTD 291
Query: 528 -----ASATGISKQSNYELV------------ELPGCKGIWTVY---------HKSSRGH 561
+ G K ++ ELPGC +WTV K+
Sbjct: 292 PDLELVTTAGYGKNGALCVLQRSVRPQVVTTFELPGCVHMWTVMGPPAEKKPPEKTEESD 351
Query: 562 NADSSRMAAYDD--EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFG 619
+ S AA HA+LI+S +M+L+T + E+ S + Q T+ AGNL
Sbjct: 352 DPASEDKAAEQPLTNTHAFLILSRADSSMILQTDQEINELDHS-GFSTQNPTVFAGNLGD 410
Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
R V+QV G R+LDG+ Q + S++++ S+ADP+V++ ++G
Sbjct: 411 GRYVLQVCPMGVRLLDGTRQLQHIPL----------DVGSSIVAGSLADPHVIIRSAEGL 460
Query: 680 I--RLLVGDPST-CTVSVQTPAAIESSKKPVSSCT------LYHDKGPEPWLRKTSTDAW 730
+ L GDP+ C ++V P K +S C L+ + EP + +
Sbjct: 461 VIHLTLRGDPAAGCRLAVLRPQLTAVKAKILSICVYKDVSGLFTTQYREP--DEPAKPEK 518
Query: 731 LSTGVGEAIDGADGGPLDQGD--------------------------------------- 751
E+ID + G LD D
Sbjct: 519 PLPPPKESIDMSSNGLLDDEDELLYGESEENPIQKEPVRMTSEEAPSVAESMFEIKEVAP 578
Query: 752 -IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEE 809
+ V E+G LEI+ +P + F V F G+ +VD+ A +++E ++ S E
Sbjct: 579 TYWLFVARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHE 638
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
+H + VV L ++ HSRP L A + D +L Y+A+ F
Sbjct: 639 SM-----PVVHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF----------- 676
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIF 917
T R + LRF + D + RE + P ++ + F
Sbjct: 677 -YETQREGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPF 726
Query: 918 KNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ISG+ G FL G RP W M R LR HP DG I F HNVNC GF++ QG
Sbjct: 727 SDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQG 786
Query: 977 ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
L+I LP+ TYD WPV+K+PL+ TPH + Y + Y ++ S P P N ++
Sbjct: 787 ELRISTLPTHLTYDAPWPVRKVPLRCTPHFVNYHVDSKTYCVVTSQP--DPCNHLVRF-- 842
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1094
G + + L + T++++ +++L P W+T + + E+ +
Sbjct: 843 ---TGEEKEYELLERDSRYIFPTMDKFSLQLLSPVS----WETIPNTRVDLDEWEHLTCL 895
Query: 1095 RVVTLFNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEV 1148
+ V L + TT + LA+GT Y GEDV +RGR+++ P +N + V
Sbjct: 896 KNVMLSSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLTKNKIKIV 955
Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
YSKE KG ++AL+ + G LL A G KI + + EL G+AF D +Y+ S+ VKN I
Sbjct: 956 YSKEQKGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDTQ-IYIHSVVTVKNLI 1014
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
L+GD+ KS+ L ++E L+L+++D L+ +A EF ID + +S +V+D ++N+ ++
Sbjct: 1015 LVGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYM 1074
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
Y P+ ES GQ+LL R +FHVG+ V R++ + S R + T
Sbjct: 1075 YQPESRESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMAT 1134
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN----GKAHRPGPD 1384
LDGS+ + P+ E T+RRL LQ LV ++PH AGLNP+++R ++S G H+
Sbjct: 1135 LDGSLAYVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHK---- 1190
Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+I+D EL+ + L E+ E++ + GTT +QI +L ++ T+
Sbjct: 1191 NILDGELIWKFMHLSFMERSELSKKIGTTVTQITDDLLEIETYTA 1235
>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
Length = 1455
Score = 517 bits (1332), Expect = e-143, Method: Compositional matrix adjust.
Identities = 424/1486 (28%), Positives = 697/1486 (46%), Gaps = 190/1486 (12%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E I + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDE-EQIMEEIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPVNYMCAGERVEFEEDGATLRPHAESLQDVKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++ E G
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-----EVG-- 651
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 652 ---SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ G IV
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV--GNGAIV 826
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
T E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 827 LTDAMEFVPISLTTQENSKAGIVQACMPQHANSPLPLELSLTGLGLNGERPLLM-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R L N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEDIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
+GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054
Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+ A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1104 SISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163
Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
+V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1283 LVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341
Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
N+ +++GTLDG++G PL E +RR LQ LV H+ GLNP+ +R S
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLVSYQEHLCGLNPKEYRTLKSF 1399
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1400 KKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445
>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
Short=dCPSF 160
gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
Length = 1455
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 420/1486 (28%), Positives = 698/1486 (46%), Gaps = 190/1486 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 829 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R + N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
+GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054
Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+ A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1104 SITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163
Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
+V+D ++NI ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1283 LVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341
Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S+
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSS 1399
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1400 KKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445
>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
Length = 1455
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 417/1486 (28%), Positives = 699/1486 (47%), Gaps = 190/1486 (12%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YG+ + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGTGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGATLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGNAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + +V +SG LEI+ +P+ V+ V+ +G +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLIVARQSGTLEIYSMPDMKLVYLVNDVGNGAMVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 829 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R L N+ + ++ DA E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DAQEEIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
+GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054
Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+ A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1104 SISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163
Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
+V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1283 LVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341
Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSF 1399
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
K ++D +L+ Y ++ E+ E+A + GT +IL++L
Sbjct: 1400 KKQGINPSRCVIDGDLIWSYRLMANSERNEVAKKIGTRTEEILADL 1445
>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
Length = 1455
Score = 516 bits (1328), Expect = e-143, Method: Compositional matrix adjust.
Identities = 418/1494 (27%), Positives = 691/1494 (46%), Gaps = 206/1494 (13%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEAGQRQKLNPTE------MRVAPKMRLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + + L + + R
Sbjct: 136 GRYFVPVVRVDPDSRCAVMLVYGKRLVVLPFRKDNTLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCQQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK-------- 451
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVDQQADQQL 429
Query: 452 ----------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+E D++ AP+ + RR + EEL +YGS + + +
Sbjct: 430 QRQQSEDQTLDEILDVDQLELAPTQAKSRR---------IEDEELEVYGSGAKASVLQLR 480
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
F F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 481 KFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLNDLKIELVAATGHSKNGAL 540
Query: 538 -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
N +++ EL GC +WTV+ D+++ + D+ H ++++S T
Sbjct: 541 SVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLLSQRNST 591
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 592 LVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 648
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
S V+ VSIADPYV L + +G + L + T + SS V
Sbjct: 649 --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 700
Query: 708 SSCTLYHD-----------------------------KGPEPWLRKTSTDAWLSTGVGEA 738
+ + Y D EP ++ + L G A
Sbjct: 701 VAISAYKDLSGLFTVKADDVNLTGSSSSAFGHSFGGYMKAEPHMKVEDEEDLLYGDAGNA 760
Query: 739 I------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKF 780
D A D + VV +SG LEI+ +P+ V+ V+
Sbjct: 761 FKMNSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV 820
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+G + D E + S T +S ++ +S +EL + + RP
Sbjct: 821 GNGAMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELTVLGLGLNGERPL 878
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
L + T +L YQ +F P+ K R L N+ + ++ D
Sbjct: 879 LL-VRTRVELLIYQ--VFRYPKGHLKI-----RFRKLEQLNLMDHQPSHIELDEN--DER 928
Query: 901 TREETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAF 957
E+ P Q++ F N+ G G + G PC+ + R LR+H L +G + +F
Sbjct: 929 EEMESYQMQPKYVQKLRPFANVGGLSGIMVCGVNPCFVFLTSRGELRIHRLLGNGDVRSF 988
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYP 1017
+NVN +GF+Y + LKI LPS +YD+ WP++K+PL+ TP Q+ Y E +Y
Sbjct: 989 AAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSTWPIRKVPLRCTPRQLVYHRENRVYC 1048
Query: 1018 LIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGG 1075
LI +P+ + D+E+ + Y + ++E+ ++ P+
Sbjct: 1049 LITQNE--EPMTKFYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET--- 1096
Query: 1076 PWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1132
W+ A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1097 -WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1155
Query: 1133 STGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG 1187
P +T EV+ KE KG +SA++ + G L+ G KI + + +L G
Sbjct: 1156 DIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIG 1215
Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF+
Sbjct: 1216 VAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFM 1274
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
+D S L +V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q
Sbjct: 1275 VDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGL 1334
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
+ N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+
Sbjct: 1335 HQRQPFLYE---NKHFVVYGTLDGALGYCLPLPEKLYRRFLMLQNVLLSYQEHLCGLNPK 1391
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+R + K I+D +L+ Y +L E+ E+A + GT +ILS+L
Sbjct: 1392 EYRTIKAVKKQGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILSDL 1445
>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
Length = 1455
Score = 514 bits (1325), Expect = e-142, Method: Compositional matrix adjust.
Identities = 419/1491 (28%), Positives = 695/1491 (46%), Gaps = 200/1491 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
K SVLTS I + + FLGSRLG+SLL+ FT +++++ D+E
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVIT------LDDVEQ 423
Query: 460 DAPSTKRLRRSSSDALQDM-----------------VNGEELSLYGSASNNTESAQKTFS 502
+ +R + L+++ + EEL +YGS + + + F
Sbjct: 424 QSEQQQRNLQDEDQNLEEIFDVDQVEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFI 483
Query: 503 FAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS----- 537
F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 484 FEVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVF 543
Query: 538 ----NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
N +++ EL GC +WTV+ D+++ ++ +D+ H ++ +S T+VL
Sbjct: 544 VNCLNPQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMFLSQRNSTLVL 594
Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 595 QTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----- 648
Query: 651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSC 710
S V+ VSIADPYV L + +G + L + T + SS V +
Sbjct: 649 -----DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAI 703
Query: 711 TLYHD-------KG----------------------PEPWLRKTSTDAWLSTGVGEAI-- 739
+ Y D KG EP ++ + L G A
Sbjct: 704 SAYKDLSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKM 763
Query: 740 ----DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
D A D + VV +SG LEI+ +P+ V+ V+ +G
Sbjct: 764 NSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNG 823
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
+ D E + S T +S ++ +S +EL++ + RP L
Sbjct: 824 AMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL- 880
Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
+ T +L YQ +F P+ K R L N+ + ++ D
Sbjct: 881 VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEEI 931
Query: 904 ETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVL 960
E+ P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F
Sbjct: 932 ESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAF 991
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIV 1020
+NVN +GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E +Y LI
Sbjct: 992 NNVNIPNGFLYFDTTYELKISVLPSYLSYDSIWPVRKVPLRCTPRQLVYHRENRVYCLIT 1051
Query: 1021 SVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQ 1078
+P+ + D+E+ + Y + ++E+ ++ P+ W+
Sbjct: 1052 QTE--EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WE 1098
Query: 1079 T--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1099 IVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDII 1158
Query: 1136 RNADNPQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAF 1190
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF
Sbjct: 1159 EVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAF 1218
Query: 1191 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1250
D +YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D
Sbjct: 1219 IDT-NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDN 1277
Query: 1251 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT 1310
S L +V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1278 SNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQR 1337
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R
Sbjct: 1338 QPFL---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYR 1394
Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
S+ K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1395 TLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1445
>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
Length = 1463
Score = 514 bits (1324), Expect = e-142, Method: Compositional matrix adjust.
Identities = 417/1498 (27%), Positives = 688/1498 (45%), Gaps = 206/1498 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSE------MRVAPKMRLECLATYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA--GAMRDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 137
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +L+YG ++++L + S L + + R
Sbjct: 138 GRYYVPEVRVDPDARCAVMLIYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVTRTP 197
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T AGR+ + TC++ A+S
Sbjct: 198 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDTCVLVAIS 257
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ NLP D +LL + PIGG LV+ N + Y +QS Y V
Sbjct: 258 LNIQQRVHPIIWTVNNLPFDCLRLLPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 311
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 312 SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVVSLRTGDLYVLTLCVDSMRTVR 371
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 372 NFHFHKAASSVLTSCICVCHMEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQQQQQA 431
Query: 448 ----SGLKEEFGDIEAD-------APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES 496
S E G ++ D APS + RR + EEL +YG+ + +
Sbjct: 432 AEEPSEEAEIEGILDMDQLEAATSAPSQAKSRR---------IEDEELEVYGTGAKASVL 482
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQ 536
+ F F V DSL+N+ P+ G R+ + +ATG SK
Sbjct: 483 QLRKFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLQDVKIELVAATGHSKN 542
Query: 537 S---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
N +++ EL GC +WTV+ D+++ + D+ H ++++S +
Sbjct: 543 GALSVFVNCINPQIITSFELEGCLDVWTVFD--------DATKKTSRQDQ-HDFMLLSQK 593
Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
T+VL+T + E+ E+ + V TI GNL R ++QV R R+L G+ + Q++
Sbjct: 594 NSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQNRFIVQVTTRHVRLLQGTRLVQNVP 652
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
S V+ V+IADPYV L + +G + L S T + SS
Sbjct: 653 I----------DVGSPVVQVAIADPYVCLRVFNGQVITLALRESRGTPRLAINKHTISSS 702
Query: 705 KPVSSCTLYHD-------------------------------KGPEPWLRKTSTDAWLST 733
V + Y D EP ++ + L
Sbjct: 703 PAVVAIAAYKDLSGLFTVKSDDILNLTGSGSNSAFGSTFGGYMKSEPHMKVEDEEDLLYG 762
Query: 734 GVGEAI------DGADGGPLDQGDIYS-------------VVCYESGALEIFDVPNFNCV 774
G A D A D + VV +SG LEI+ +P+ V
Sbjct: 763 DAGNAFKMNTMADLAKQSKQKNSDWWRRMLVQAAKPTYWLVVARQSGTLEIYSMPDMKLV 822
Query: 775 FTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSA 834
+ V+ +G + D E + S T +S ++ +S +EL++
Sbjct: 823 YLVNDVGNGAMVLTDAM--EFVPISLTSQENSKAGIVQSCMPQHANSPLPLELSLVGLGL 880
Query: 835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
+ RP L + T +L YQ +F P+ K R + N+ + ++
Sbjct: 881 NGERPLLL-VRTRLELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHVNLDD 932
Query: 895 TPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
+ Q++ F N+ G G + G PC+ + R LR+H L +G
Sbjct: 933 NEENEELESYNMQPKYVQKLRPFNNVGGMSGVMICGVNPCFLFLTSRGELRIHRLLGNGE 992
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEK 1013
+ +F +N+N +GF++ + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E
Sbjct: 993 VRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYDSTWPVRKVPLRCTPRQLVYHREN 1052
Query: 1014 NLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPD 1071
+Y LI +P+ + D+E+ + Y + ++++ ++ P+
Sbjct: 1053 RVYCLITQTE--EPMTKFYRFNGEDKELSEESRGERF-------IYPIGSQFDMVLISPE 1103
Query: 1072 RAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
W+ A+I + E+ ++V L + T + L IGT + ED+ +RG
Sbjct: 1104 ----TWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGN 1159
Query: 1129 VLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
+ ++ P +T EV+ KE KG +SA++ + G L+ G KI + +
Sbjct: 1160 IHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDG 1219
Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
+L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ +
Sbjct: 1220 DLIGVAFIDT-NIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYG 1278
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
EF++D + L +V+D + N+ ++ Y P+ ES GQKLL +A++H+G V R+Q
Sbjct: 1279 IEFMVDNTNLGFLVTDAESNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCH 1338
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
+ N+ +++GTLDG++G PL E +RR LQ L+ H+ G
Sbjct: 1339 QRGLHQRQPFL---YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCG 1395
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
LNP+ +R S+ + I+D +L+ Y +L E+ E+A + GT +IL++L
Sbjct: 1396 LNPKEYRTLKSSKRLGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILADL 1453
>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
Length = 1373
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 413/1533 (26%), Positives = 686/1533 (44%), Gaps = 265/1533 (17%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
FA YK +H PT ++ C T + + NL++
Sbjct: 2 FACYKQLHPPTAVSFCLKARFTSANDE---------------------------NLIIVK 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL-AILSQ 121
N++E+Y+++ + +++ LV + L G ++S+ A+ Q
Sbjct: 35 NNIMEVYLIKP-----------------------NTSNIVLVKVFELFGVIDSIIAVCLQ 71
Query: 122 GGADNSRRRDSIILAFED-AKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G +++ +++ FED AK+SV+EFD+ L+ S+H E L+ G+ F
Sbjct: 72 G-----MKKEMLLINFEDEAKVSVVEFDEKRSDLKTLSLHYLEDD---FLREGKARFFHN 123
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------GLVGDEDTFGSGGGFSA 228
+ +DPQ R V++ +++IL Q G L GD++ G
Sbjct: 124 QPIILDPQNRFATVIICDSKLVILPFRQSGEDVSLSTEDNFLFALSGDQEEANENVGDQK 183
Query: 229 R-----IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
+ ++ +I+L DL +K+VKD+ F++GY EP ++ LHE E TW+GR++ K +T +
Sbjct: 184 KHHQPEVQRQVIIDLNDLGIKNVKDYCFLNGYNEPTILFLHENEQTWSGRLAAKSNTSTV 243
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-GGVLVVGANTIHYHSQSASCALAL 342
+A+S K +P IWS +LPHD KL+ + + GG LV+G N+I + +Q A+ L+
Sbjct: 244 TAVSFDLFRKYYPKIWSVGSLPHDCNKLIPLQEDVAGGALVIGMNSIIHINQCATYGLSF 303
Query: 343 NNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
N++AVS + + + ++F ++ D T++ D L+S K G+L + + G +
Sbjct: 304 NDFAVS-NPNLSINFNTFDGPALFFDTVAYTFIARDKLLVSLKDGELYTMYLESGGSRIN 362
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
+++ KT+ + S + T+ +L FLGS++GDS+L ++ S EE + A
Sbjct: 363 NINIKKTSNTTPASCMCTLKGNLIFLGSKIGDSVLYEYQEKVEVETSSLDTDEEMSSVFA 422
Query: 460 -----DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
+ KR D + EE ++ S S ++ ++ NIGP
Sbjct: 423 AGENFEPEKKKRKLADDDDFFAALEKDEEPTVIESFSKVSKKETTKVELKIKHVFTNIGP 482
Query: 515 LKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTVYHKSSRGHNADSSRMA 569
+ + + + D S G ++N + C GI TV ++S + + +
Sbjct: 483 ISHLTAAVTSSFDMS--GFKSKTNDNQLSAIACSGIGRHGCLTVLNRSLQPDIQSEATLP 540
Query: 570 ---------AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
+ E+ YLI+SLE +T V E+ L EVT + T+ G + R
Sbjct: 541 FLVKQVWTISQKTEHDLYLILSLEDKTKVFESKATLAEVTSKSMFVTNETTLNIGKI--R 598
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
++QV R + +L GS P S++ I DPYVLL DGS+
Sbjct: 599 ESIVQV-TRKSVMLIGS--------EPKQVHHSKKEIRSSI----ILDPYVLLHFYDGSL 645
Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
LL D T IES+ +++ LY PE + G+ E
Sbjct: 646 VLLTHDNGRVT---SKQLDIESNHGKITAVCLYK-TNPE----------FEFFGINEK-- 689
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
+G V + GA EI VP+ CVF+ +F T + D
Sbjct: 690 --------EGKYLCCVYWTDGAFEILSVPDMTCVFSFSQFYQFHTTLFD----------- 730
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
E + + + V E+A++ + P+L ++L+D T+ Y+++L
Sbjct: 731 -------EGQSSNTTQSEVKYPYVTEMALRGIGSDSEMPYLVSVLSDNTVHIYRSFL--- 780
Query: 861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR------TPLDAYTREETPHGAPCQ-- 912
+ T+KS D +RL LRFS+ P+ ++ +
Sbjct: 781 -DRTTKSKD---------------NRLTRLRFSKFQHDDLLPISEIDKKSQTFTLNLKSK 824
Query: 913 -----------RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 961
++ FKNI G+ G F +G +P W LRVHP + FT H
Sbjct: 825 YLFPKSDLGRSQLIPFKNIGGYGGLFKTGEKPFWLFTEHSNLRVHPTQSRDPVTTFTPYH 884
Query: 962 NVNCNHGFIYVT-------SQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN 1014
+ NC HGFIY+T Q L I L + ++ YWP +KI LK+TP+ IT+ + N
Sbjct: 885 HENCPHGFIYLTDKEQDNKKQSKLHISSLNANVKFNAYWPQRKILLKSTPNVITFHQDTN 944
Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
SVPV +L I G + +TV+ + +G
Sbjct: 945 TCLAFTSVPV----KAILPDSIPFPEGK-------CPPPAEQKHTVKLF---------SG 984
Query: 1075 GPWQTRATIPMQSSENALTVRVVTL----------------FNTTTKENETLLAIGTAYV 1118
WQ E+A+ +VV L N+ ++ +++A+GTAYV
Sbjct: 985 HNWQEMDKFEFDLHESAVAAKVVYLSKEEYNDDTDISFEEPLNSRKQDLVSVVAVGTAYV 1044
Query: 1119 QGEDVAARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
Q E RGR+LLF GR + NL++ S +KG I+ L + +++ + G +
Sbjct: 1045 QSERELCRGRLLLFDLDPILGRENEYKLNLIS---STSVKGPITTLEQVDRYIICSVGNR 1101
Query: 1175 IILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
I + W ++ +FYD Y SLN V+NFI+ GDI+KS+ FL WKE+G +L LL
Sbjct: 1102 IYTYYFDWEEKRMHITSFYDTQ-FYTASLNTVRNFIMFGDIYKSVSFLRWKEKGHRLILL 1160
Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
AKD L ++EFL++ L L V D KN+QIF Y P+ ES G+ L+ +FH+G
Sbjct: 1161 AKDNRPLQVVSSEFLVNNDLLGLAVIDTSKNLQIFSYLPQHQESNDGRNLVPVCDFHIGT 1220
Query: 1293 HVTKFLRLQM----------LATSSDRTGAAPGSDKT----NRFALLFGTLDGSIGCIAP 1338
+ +R+++ L +++ + D T N +LFG++DG+IG +AP
Sbjct: 1221 LINSLIRMKVRELPDDNTIRLGNVNEKPKQSGKKDITKTNPNHQFILFGSVDGAIGYVAP 1280
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
++E+T RRL +LQ K+ + AGL+P+SFR + + +I+D +L+ +Y +
Sbjct: 1281 INEVTHRRLFALQLKMYTQLEQAAGLHPKSFRLYKPLERTEYNYKKNIIDGQLIWNYANI 1340
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
Q ++A Q GT IL ++ +L T F
Sbjct: 1341 NTILQRDLARQIGTNSDNILRSIQELNQATFFF 1373
>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
Length = 1459
Score = 511 bits (1316), Expect = e-141, Method: Compositional matrix adjust.
Identities = 418/1504 (27%), Positives = 693/1504 (46%), Gaps = 222/1504 (14%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV AN++++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA +D+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
P+V+VDP RC +LVYG ++++L + S DE F
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192
Query: 230 ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
I +S++I L++LD K +V D F+HGY EP ++IL+E T +GR+ + TC++
Sbjct: 193 RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCSGRIKVRSDTCVLV 252
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A+S++ + HP+IW+ +LP D +++ + PIGG LV+ N + Y +QS
Sbjct: 253 AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306
Query: 345 YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
Y VSL+SS + P+ + LD A+ ++ D ++S +TG+L +LT+ D R
Sbjct: 307 YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
V+ K SVLTS I + FLGSRLG+SLL+ FT +++++ +
Sbjct: 367 TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426
Query: 452 ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
EE D++ AP + RR + EEL +YGS +
Sbjct: 427 EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
+ + F F V DSL+N+ P+ G R+ + +ATG
Sbjct: 478 SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537
Query: 534 SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
SK N +++ EL GC +WTV+ D+++ + D+ H ++++
Sbjct: 538 SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
S T+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q
Sbjct: 589 SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
++ S V+ V+IADPYV L M +G ++R G P
Sbjct: 648 NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697
Query: 694 VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
+PA AI + K T+ D EP ++ + L
Sbjct: 698 TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGTGSGFGHSFGGYMKAEPNMKVEDEEDLL 757
Query: 732 STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
G A L Q + VV +SG LEI+ +P+
Sbjct: 758 YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
V+ ++ +G + D +L S E +S+ G Q ++ +S +EL++
Sbjct: 818 VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ RP L + T +L YQ +F P+ K R L N+ + ++
Sbjct: 875 GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
+ Q++ F N+ G G + G PC+ + R LR+H +
Sbjct: 927 EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFA 1011
G + +F +NVN +GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y
Sbjct: 987 GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHR 1046
Query: 1012 EKNLYPLIVSVPVLKPLNQVLSL------LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1065
E +Y LI +P+ + L ++ G + N S ++E+
Sbjct: 1047 ENRVYCLITQTE--EPMTKYYRFNGEDKELSEESRGERFIYPNGS-----------QFEM 1093
Query: 1066 RILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGED 1122
++ P+ W+ A+I + E+ ++V L + T + L IGT + ED
Sbjct: 1094 VLISPET----WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSED 1149
Query: 1123 VAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIIL 1177
+ +RG + ++ P +T EV+ KE KG +SA++ + G L+ G KI +
Sbjct: 1150 ITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYI 1209
Query: 1178 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
+ +L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF
Sbjct: 1210 WQLRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFN 1268
Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
L+ + EF++D S L +V+D ++N+ ++ Y P+ ES GQKL+ +A++H+G V
Sbjct: 1269 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTM 1328
Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
R+Q + N+ +++GTLDG +G PL E +RR LQ L+
Sbjct: 1329 FRVQCHQRGVHQRQPFLYE---NKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLSY 1385
Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
H+ GLNP+ FR S K I+D +L+ Y +LP ++ E+A + GT +I
Sbjct: 1386 QDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEEI 1445
Query: 1418 LSNL 1421
LS+L
Sbjct: 1446 LSDL 1449
>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
Length = 1459
Score = 510 bits (1313), Expect = e-141, Method: Compositional matrix adjust.
Identities = 418/1504 (27%), Positives = 692/1504 (46%), Gaps = 222/1504 (14%)
Query: 57 NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV AN++++Y + E G ++ N E M LE + Y L+GNV S
Sbjct: 29 NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA +D+++++F+DAK+SVL+ D + L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
P+V+VDP RC +LVYG ++++L + S DE F
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192
Query: 230 ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
I +S++I L++LD K +V D F+HGY EP ++IL+E T GR+ + TC++
Sbjct: 193 RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLV 252
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A+S++ + HP+IW+ +LP D +++ + PIGG LV+ N + Y +QS
Sbjct: 253 AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306
Query: 345 YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
Y VSL+SS + P+ + LD A+ ++ D ++S +TG+L +LT+ D R
Sbjct: 307 YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
V+ K SVLTS I + FLGSRLG+SLL+ FT +++++ +
Sbjct: 367 TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426
Query: 452 ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
EE D++ AP + RR + EEL +YGS +
Sbjct: 427 EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
+ + F F V DSL+N+ P+ G R+ + +ATG
Sbjct: 478 SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537
Query: 534 SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
SK N +++ EL GC +WTV+ D+++ + D+ H ++++
Sbjct: 538 SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
S T+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q
Sbjct: 589 SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
++ S V+ V+IADPYV L M +G ++R G P
Sbjct: 648 NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697
Query: 694 VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
+PA AI + K T+ D EP ++ + L
Sbjct: 698 TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGSGSGFGHSFGGYMKAEPNMKVEDEEDLL 757
Query: 732 STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
G A L Q + VV +SG LEI+ +P+
Sbjct: 758 YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
V+ ++ +G + D +L S E +S+ G Q ++ +S +EL++
Sbjct: 818 VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ RP L + T +L YQ +F P+ K R L N+ + ++
Sbjct: 875 GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
+ Q++ F N+ G G + G PC+ + R LR+H +
Sbjct: 927 EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFA 1011
G + +F +NVN +GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y
Sbjct: 987 GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHR 1046
Query: 1012 EKNLYPLIVSVPVLKPLNQVLSL------LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1065
E +Y LI +P+ + L ++ G + N S ++E+
Sbjct: 1047 ENRVYCLITQTE--EPMTKYYRFNGEDKELSEESRGERFIYPNGS-----------QFEM 1093
Query: 1066 RILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGED 1122
++ P+ W+ A+I + E+ ++V L + T + L IGT + ED
Sbjct: 1094 VLISPET----WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSED 1149
Query: 1123 VAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIIL 1177
+ +RG + ++ P +T EV+ KE KG +SA++ + G L+ G KI +
Sbjct: 1150 ITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKIYI 1209
Query: 1178 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
+ +L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF
Sbjct: 1210 WQLRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFN 1268
Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
L+ + EF++D S L +V+D ++N+ ++ Y P+ ES GQKL+ +A++H+G V
Sbjct: 1269 PLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLIRKADYHLGQVVNTM 1328
Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
R+Q + N+ +++GTLDG +G PL E +RR LQ L+
Sbjct: 1329 FRVQCHQRGVHQRQPFLYE---NKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNVLLSY 1385
Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
H+ GLNP+ FR S K I+D +L+ Y +LP ++ E+A + GT +I
Sbjct: 1386 QDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTRTEEI 1445
Query: 1418 LSNL 1421
LS+L
Sbjct: 1446 LSDL 1449
>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
Length = 1450
Score = 506 bits (1304), Expect = e-140, Method: Compositional matrix adjust.
Identities = 413/1463 (28%), Positives = 685/1463 (46%), Gaps = 190/1463 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E +E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQSLE-EILDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 489 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCLN 548
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 549 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 600 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 649 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 709 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G T +
Sbjct: 769 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGATVLT 828
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 829 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R L N+ + ++ D E+
Sbjct: 886 ELLIYQ--VFRYPKGHLK-----IRFRKLDXXNLLDQQPTHIELDEN--DEQEEIESYQM 936
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 937 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
+GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E +Y LI
Sbjct: 997 PNGFLYFDTTYELKISVLPSYLSYDSIWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1054
Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+ A
Sbjct: 1055 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1103
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1104 SITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1163
Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1164 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDT-N 1222
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1223 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
+V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1283 LVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1341
Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S+
Sbjct: 1342 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSS 1399
Query: 1376 GKAHRPGPDSIVDCELLSHYEML 1398
K I+D +L+ Y ++
Sbjct: 1400 KKQGINPSRCIIDGDLIWSYRLM 1422
>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
1, partial [Ciona intestinalis]
Length = 1370
Score = 504 bits (1299), Expect = e-139, Method: Compositional matrix adjust.
Identities = 409/1476 (27%), Positives = 678/1476 (45%), Gaps = 215/1476 (14%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ C + NL+VTA
Sbjct: 2 YAWYRQIHAPTGVEQCVYCNFASEKEK---------------------------NLLVTA 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
A+ + +Y + E + +++N E + + L+ + ++L GNV + +
Sbjct: 35 ASQLTVYRLERNYEVTTKTENGEE-------NTVVKEKLQQIGSWQLFGNVVRMRSVRLA 87
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
GA + DS++L+F +AK+S++EFD + H ++ TS+H FE + K G P
Sbjct: 88 GA----KLDSVLLSFAEAKLSIIEFDQATHDIKTTSLHYFEDALY---KDGSYQRITLPK 140
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF--SARIESSHVINLRD 240
+ VDP+ RC + + + ++ + L D+ + R +S+ I+L
Sbjct: 141 IAVDPESRCVALQLTTKSVAVVPLRANTAALATDDGAAPQDNVSLQNKRSTTSYTIDLHA 200
Query: 241 LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
+D ++ + D F+HGY EP +++L E TWAGRV+ + TC I A+S++ + HP++
Sbjct: 201 VDARLQRIIDIQFLHGYNEPTLLVLFESLRTWAGRVAMRQDTCNIVAISLNMAEQLHPVV 260
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE---- 354
WS LP D VP PIGGVL+ N+I Y +QS Y SL+S+ E
Sbjct: 261 WSLNGLPFDCKYAYPVPKPIGGVLIFAVNSILYLNQSVP------PYGTSLNSTTENSTS 314
Query: 355 ---LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + LD +HA ++ + ++S K G+L +LT++ D R V+ K+ SV
Sbjct: 315 FPLKPQEDVCMTLDCSHAMFISPESLVISLKNGELYVLTLLVDSMRSVRNFHFDKSASSV 374
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
LTS +T + + FLGSRLG+SLL+++T + + E A KRL +
Sbjct: 375 LTSCLTVLDDGFLFLGSRLGNSLLLKYT-------EARPVFRNCYHTEEPAAKRKRLNTA 427
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ A D N +L +YG + +E ++ F V DSLVNIGP G A S
Sbjct: 428 ADWAASD-TNDIDLQMYGKDTVTSEPL-SSYKFEVCDSLVNIGPCGAAELGEP--AFLSE 483
Query: 531 TGIS-KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
+S ++S+ EL ELPGC +WTV +
Sbjct: 484 EFVSQRESDLELAILSGHGKNGAISVLQRSVKPQVVTTFELPGCIDMWTVKSVCEKTELP 543
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
++ + H+YLI+S E T++LET + EV E+ + + +++ GN+ G + +
Sbjct: 544 TKTQ-----QQQHSYLILSREESTLILETGKEIMEV-ENSGFNTREQSVFVGNIGGDKEL 597
Query: 624 I-QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
I QV G +L G + Q + E GS + SI DPY LL SDG + +
Sbjct: 598 ILQVCASGVWLLAGVKLLQHIPL-----ELGS-----PITQCSICDPYALLLTSDGDLIM 647
Query: 683 --LVGDPST--------CTVSVQTPAAIE--------------------------SSKK- 705
L D + C S+ IE SS K
Sbjct: 648 LTLTNDLDSENGVKLECCNPSINQVPQIEHVCLYKDTSGLFKTASGPSDVFLPEDSSNKG 707
Query: 706 -------------PVSSCTLYHDKGPEPWLRKTSTDAWL----------STGVGEAIDGA 742
P+SS T D+ E ++ D S E +DG
Sbjct: 708 VSDSEIPSSLPRTPLSSKTFTVDEEDELLYGESDPDVIFAPQFAPNVPKSPTQNEPLDGD 767
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
G ++ ++++ E+ LEI+ +P+ + V+T+ F G+ + ++ + S+ +
Sbjct: 768 KEGN-EEFTFWAIIARENRNLEIYSMPSLDLVYTIKNFSFGQKLLTNSGPVHSYSVSKDD 826
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
++S T K I + +V L + +S P L A + + IL Y+ + F PE
Sbjct: 827 KSTS----TRYSDKPRIFEILLVGLGYK-----NSSPHLIARIEE-EILIYEVFKFSAPE 876
Query: 863 NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG 922
K + S + V+ S + R P+ T+ + C R F NI G
Sbjct: 877 KFKKYN-----SLQIRFKKVNHSMM----IRRAPVTHETKTDQLEHRNCLRT--FSNIGG 925
Query: 923 HQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G FL G P W V R L HP DGS+ F HNVNC +GF+Y SQG L+IC
Sbjct: 926 YSGVFLCGPYPYWIFVTIRGALCCHPMSVDGSVSCFVPFHNVNCPNGFLYFNSQGELRIC 985
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
LP YD WP++KI L+ + H + Y E +Y L+ SV +P ++ L + E
Sbjct: 986 MLPPHMKYDTAWPMRKITLRCSVHFLAYSIEHKVYALVTSVS--EPCTRLPYLTFENERE 1043
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-F 1100
+ +L D ++++ V+++ P A A + M E+ ++ V L
Sbjct: 1044 FE----DLEKGDRFIYPHIDKFSVQLISP--ASWDLVPNARLDMGEFEHITCMKNVWLSC 1097
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1155
+ + L +GT V GE++++RG++++ P +N + ++YS+E KG
Sbjct: 1098 GQDSSARQNFLVLGTVNVFGEEMSSRGKIIILEVIEVVPEPGQPLTKNKLKQIYSEEQKG 1157
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
++A+ L+G+LL A G KI + ++ + L G+AF D +Y+ ++F L+GDI
Sbjct: 1158 PVTAVCGLEGNLLTAIGQKIFIWRFDENQSLRGLAFVDT-NVYIHHALSFRSFALVGDIQ 1216
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+SI L ++ L++ ++D L+ + + ++DG+ ++ +VSD +KN+ +F Y P+
Sbjct: 1217 RSITLLRYQTDFKTLSVTSRDVRPLEVYTADLVVDGTGINFLVSDHEKNLVLFAYDPEDH 1276
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
ES G +L RA+ H+G+ R+ A DR+ P + GTLDGSI
Sbjct: 1277 ESHGGSRLTKRADMHIGSRANCMWRVA--ACGVDRSTGLPNQPYAGVHITMMGTLDGSIC 1334
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
+ P+ E +RRL LQ ++ + H+AGLNP++FR
Sbjct: 1335 HVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370
>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
Length = 1417
Score = 503 bits (1294), Expect = e-139, Method: Compositional matrix adjust.
Identities = 415/1461 (28%), Positives = 684/1461 (46%), Gaps = 172/1461 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y R+ + S++ T R M LE + Y L GN+ S+
Sbjct: 29 SLVTGGANVLKVY--RLIPDADATSRDKFTTTRPPNM------KLECMATYTLFGNIMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ +RD+++++F+DAK+SV++FD L+ S+H FE + +K G
Sbjct: 81 QSVSLAGS----QRDALLISFQDAKLSVVQFDPDNFELKTLSLHYFEEED---IKGGWTG 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGG---FSARI 230
P+V+VDP RC +LVYG ++++L K S V D I
Sbjct: 134 HYHTPIVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEVQDVKPMKKAPTQLIAKTPI 193
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S+VI L++ + + +V D F+HGY EP ++IL+E T+ GR++ + TCM+ ALS+
Sbjct: 194 LASYVIELKESEERIDNVIDIQFLHGYYEPTLLILYEPVKTFPGRIAVRSDTCMMVALSL 253
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAV 347
+ + HP+IW+ LP D + +A+ PIGG L++ N + Y +QS ++LN+ A
Sbjct: 254 NIQQRVHPVIWTVNCLPFDCLQAIAISKPIGGCLILSVNALIYLNQSVPPYGVSLNSIAD 313
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P+ + LDAA +++ + +LS K G+L +LT+ D R V+ SK
Sbjct: 314 HCTNFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRSFHFSKA 373
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT I + FLGSRLG+SLL++F + +++ EE + E KR
Sbjct: 374 ASSVLTCCICVVEEEYLFLGSRLGNSLLLRFKEKDESMVITIDDTEEVVEKEP-----KR 428
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI-- 524
LR + EEL +YGS T ++ F V DS++NIGP+ + G RI
Sbjct: 429 LR----------LEQEELEVYGSG-QKTSVQLTSYIFEVCDSILNIGPIGHMAVGERISE 477
Query: 525 ---NADASATGISKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAY--DDEYHA 577
+ + + + + E+V G G V S + S ++ D+ H+
Sbjct: 478 EEQDENKDVQFVPNKLDLEIVTSSGHGKNGALCVLQNSIKPQVITSFGLSGCLDVDDMHS 537
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
++I+S EA TMVL+T D + E+ E+ + TI GN+ G R ++QV + R+L G+
Sbjct: 538 FMILSQEAGTMVLQTGDEINEI-ENTGFATNVPTIHVGNIGGNRFIVQVTTKSIRLLQGT 596
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV-----GDPSTC-- 690
+ Q++ + +VSIADPYV + S+G + L G P
Sbjct: 597 RLLQNIPI----------DLGCPLAAVSIADPYVCVRSSEGRVITLALREGKGTPRLAVN 646
Query: 691 --TVSVQTPAAIE-SSKKPVSS--CTLYHD------------------KGPEPWLRKTST 727
T+S TPA + S K VS T Y D PEP ++
Sbjct: 647 KNTIST-TPAVVAISVYKDVSGMFTTKYEDFYDGSKAGSSAYSSGFGYMKPEPHMKIEDE 705
Query: 728 DAWLSTGVGEAI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVP 769
+ L G + D A D + ++G LEI+ +P
Sbjct: 706 EDLLYGESGRSFKMTSMADMAIETKKKNTDFWRKFMQPVKPTFWLYAVRDNGNLEIYSMP 765
Query: 770 NFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN---SSSEEGTGQGRKENIHSMKVVE 826
+ V+ + +G + D+ L+ +T + +S+ + G N+ +++
Sbjct: 766 DLKLVYLITNIGNGNKVLQDSMEFVPLQVGQTAADADVTSNAFTSPFGFNPNLLPKEILM 825
Query: 827 LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
+A+ H +RP LF L + +L Y+ Y + SK + R S V+
Sbjct: 826 VAL---GHHGTRPMLFVRL-ENDLLVYRVYRY------SKGHLKLRFRR--VPSGVTGPI 873
Query: 887 LRNLRFSRTPLDAYTREETPHGAPCQ-----RITIFKNISGHQGFFLSGSRPCWCMVFRE 941
+ P D + H I F N++G+ G + G +P + M+
Sbjct: 874 FKIAPRQSAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKP-YIMLLTS 932
Query: 942 R--LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
R LR H + F +NVNC +GF+Y Q LKI P +YD+ WPV+KIP
Sbjct: 933 RGELRAHRLYAKTIMKGFAPFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWPVRKIP 992
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT 1059
L+++P QI Y E +Y +++ +EV ++ N +L
Sbjct: 993 LRSSPKQIVYHKENKVYCVVMDA---------------EEVCNKYYRFNGEDKELTEENK 1037
Query: 1060 VE--------EYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENE 1108
E ++ V ++ P W+ +I + E+ + ++ V+L + +
Sbjct: 1038 GERFLYPMAHKFSVVLVTP----SAWEIIPETSINLDEWEHVIALKNVSLSYEGARSGFK 1093
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASL 1163
+A+GT + ED+ +RGR+LL+ P +T EV KE KG +SA+ +
Sbjct: 1094 EYIAVGTNFNYSEDITSRGRLLLYDIIEVVPEPGKPLTRYKFKEVIVKEQKGPVSAITHV 1153
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
G L+ A G K+ L + +L G+AF D ++V L +K+ IL+ D++KS+ L ++
Sbjct: 1154 SGFLVGAVGQKVYLWQLKDDDLVGVAFIDT-NIFVHQLVSIKSLILVADVYKSVSLLRFQ 1212
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E L+L+++D+ L+ F E+++D L +VSDEQ NI + Y P+ ES+ GQ+LL
Sbjct: 1213 EDYRTLSLVSRDYQPLNVFQIEYVVDNHNLGFLVSDEQCNIITYMYQPESRESFGGQRLL 1272
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
+ ++HVG + R+Q D S+ + F TLDG IG + PL E T
Sbjct: 1273 RKCDYHVGQKINSMFRVQCDFHEMDYKR---NSNYECKHTTYFATLDGGIGYVLPLPEKT 1329
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+RRL LQ L+ PH+ GLNP++FR + K +VD +L+ + LP E+
Sbjct: 1330 YRRLFMLQNVLMTHSPHLCGLNPKAFRTIKTVKKLPINPARCVVDGDLIWTFLTLPANEK 1389
Query: 1404 LEIAHQTGTTRSQILSNLNDL 1424
LE+A + GT I ++L ++
Sbjct: 1390 LEVAKKIGTRIDDICADLMEI 1410
>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Acyrthosiphon pisum]
Length = 1335
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 412/1440 (28%), Positives = 674/1440 (46%), Gaps = 214/1440 (14%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV N++ +Y + + + K E + Y L GN+ L
Sbjct: 30 LVVAGVNILRVYRLVPTDTTCQPPK----------------TKFECLAQYTLFGNIMCL- 72
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
Q D+++L+F +AK S++E+D +H LR S+H FE ++ K G
Sbjct: 73 ---QSVTLCPSSPDALLLSFSEAKFSLVEYDRDMHSLRTLSLHYFEDDKF---KNGHTQH 126
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
PL++VDP GRC LVYG ++L G D++ SA++ S+ I
Sbjct: 127 WSPPLIRVDPDGRCVVGLVYGSYFVVLPF-----GRTIDDN------AKSAQVMPSYTIP 175
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
+ +D M ++ DF F+HGY EP ++IL+E T+AGR++ + TC + A+S++ H
Sbjct: 176 ISKIDPKMNNIMDFDFLHGYYEPTLLILYEPVKTFAGRIAVRKDTCAMVAISLNIQQHVH 235
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQE 354
P+IWS +LP+D K++AV PIGGVL++ N++ Y +QS +ALN+ A +L +
Sbjct: 236 PVIWSLDSLPYDCQKVIAVSRPIGGVLIMAVNSLIYLNQSVPPFGVALNSIAKTLTNFPL 295
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTS 413
+ ++ LD A AT++ +D + S GDL ++T+ D R V+ K SVLT+
Sbjct: 296 GQQEDINLVLDRATATFISSDKLVTSLCNGDLYVITLYADSMRAVRSFHFEKCASSVLTT 355
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
IT +S FLGSRLG+SLL+++ S ++ D PS KR + +D
Sbjct: 356 CITVCLDSYLFLGSRLGNSLLLRYYARSQSN--------------DDEPSIKRKKTDETD 401
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+D+V EL +YGS T +++SF V DS++NIGP S G A S
Sbjct: 402 --EDLV---ELEVYGSEVQ-TSICLESYSFEVCDSIINIGPCSQASIGE--PAYISDEFS 453
Query: 534 SKQSNYELVELPG--CKGIWTVYHKSSRGHNADSSRMAAYDD--------EYHAYLIISL 583
S + + EL+ G G +V H+S + + + Y D ++H ++I++
Sbjct: 454 SDEHDVELLCTSGHGKNGALSVLHRSIKPQLVTTFHLDGYKDMWTVHGENDFHTFMILTN 513
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
T++L+T + E+ +S Y + T+ N+ + VIQV R+L+GS Q +
Sbjct: 514 VDSTLILQTGQEINEL-DSSGYATREHTVFVCNM--NKFVIQVLRYSVRLLNGSEQLQSV 570
Query: 644 S--FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------RLLVGDPSTCTV 692
S FG S ++ S +PY +L DG + R+L+ P+
Sbjct: 571 SLDFG------------SPIIHGSSCNPYAVLLTEDGQVIVLTVKSTGRILLMRPTNFEQ 618
Query: 693 SVQTP--------AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
QT + + SS P + L GP K D ++S V + + G
Sbjct: 619 IPQTKTLAVYRDVSGLFSSTMPQAEIPLV---GP-----KLQHDHFVSDSVEDEEEMLYG 670
Query: 745 GPLDQGD--------------------------IYSVVCYESGALEIFDVPNFNCVFTVD 778
D + V+ ++G +EI+ +P+F
Sbjct: 671 DARDPSSRETPHNSVSNKNTMWWLKFLEVPTPTYWVVLTRDNGYMEIYTLPDFKI----- 725
Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK-ENIHSMKVVELAMQRWSAHHS 837
Y + +S + S EEG +K E I + +V L Q
Sbjct: 726 -----------KYRAANIDESPMILKDSLEEGCYFPKKTEIIKEILIVPLGYQ-----DK 769
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
RP +F L D ++ Y + PE T K +RF + +
Sbjct: 770 RPIMFVRL-DNEVVIYGIH--RHPEGTLK-----------------------MRFHK--M 801
Query: 898 DAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGS 953
+ ++ G P + ++ F ++GH G F+ G P ++ R LR HP DG
Sbjct: 802 TSLLTFQSRSGNPLEGTSLLRYFSKVAGHNGVFICGQNPHLILLTVRGELRCHPLHIDGP 861
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEK 1013
I+ F HNVNC+ GF+Y S L+I LP+ +YD WP++K+PL+ TPH I Y E
Sbjct: 862 IMCFAPFHNVNCSQGFLYFNSDHKLRISILPTHLSYDEPWPLRKVPLRKTPHFIAYHLET 921
Query: 1014 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1073
Y ++ S L + D+E+ + + L + H +T+E + EP
Sbjct: 922 KTYCVVTSSSELSASYYRFNGE-DKELTTE-ERDPLFPLPSHEVFTLELFSPASWEP--- 976
Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLF 1132
+I + E+ ++ V L + + +A+GT Y ED+ +RGR+ LF
Sbjct: 977 ----IPDTSIETEDWEHITCLKNVALAYEGARSGLKGYIAMGTNYSYSEDITSRGRIFLF 1032
Query: 1133 STGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG 1187
P +N + +Y+KE KG ++A+ + G L+ A G KI + + +L G
Sbjct: 1033 DIIDVVPEPGKPLTKNKIKMIYAKEQKGPVTAITHVVGFLVTAVGQKIYIWQLKDNDLIG 1092
Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
IAF D +YV + +K+ IL+ D+ KSI L ++E+ L+L+ +D L+ F FL
Sbjct: 1093 IAFIDT-EVYVHQMLSIKSLILVADLFKSITLLRFQEEYRTLSLVCRDSKPLEVFDINFL 1151
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS- 1306
ID + L + SD +N+ ++ Y P ES+ GQ L+ R +F++G++V F RL+ ++
Sbjct: 1152 IDNTELGFLASDRDQNLLLYLYQPMARESYGGQHLVRRGDFNIGSNVNSFFRLRCKQSTV 1211
Query: 1307 -SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
DR A GSDK R ++ TLDGSIG I P+ E +RRL +LQ LV ++ H+AGLN
Sbjct: 1212 APDRREAI-GSDK--RHVTMYTTLDGSIGYIVPIHEKNYRRLLTLQNMLVKNITHLAGLN 1268
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHY-EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
P+++R F + ++D EL+ + + ++ EIA++ G ++L ++ +L
Sbjct: 1269 PKAYRSFKATAPERMNQARRVIDGELVWMFVTCMNARQRNEIANKVGVKTIELLQDIYEL 1328
>gi|414587799|tpg|DAA38370.1| TPA: hypothetical protein ZEAMMB73_163106 [Zea mays]
Length = 461
Score = 494 bits (1271), Expect = e-136, Method: Compositional matrix adjust.
Identities = 240/420 (57%), Positives = 293/420 (69%), Gaps = 10/420 (2%)
Query: 685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
DPSTCT+S+ PA SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID D
Sbjct: 33 ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
D DIY ++CYESG LEIF+VP+F VF+VD FVSG + D + R + KDS
Sbjct: 93 SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+S+ +KE ++K+VELAM RWS SRPFLF +L DGT+LCY AY FEG E+
Sbjct: 153 DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
+ S + N + SRLRNLRF R +D +R++ C RITIF N+
Sbjct: 210 VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDIS----CLVRPRITIFNNVG 265
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G++G FL G RP W V R+R RVHPQLCDG IVAFTVLHNVNC G IYVTSQG LKIC
Sbjct: 266 GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
QLPS YDNYWPVQK+PL TPHQ+TY+ E++LYPLIVSVP ++PLNQVLS + DQE+G
Sbjct: 326 QLPSAYNYDNYWPVQKVPLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQVLSSMADQELG 385
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN 1101
++N S DL YTV+E+EVRI+E ++ G W+TR+TIPMQS ENALTVR+VTL N
Sbjct: 386 LHMENDVTSGGDLQEVYTVDEFEVRIMELGKSNGRWETRSTIPMQSFENALTVRIVTLQN 445
>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
Length = 1434
Score = 487 bits (1254), Expect = e-134, Method: Compositional matrix adjust.
Identities = 408/1485 (27%), Positives = 701/1485 (47%), Gaps = 203/1485 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y RV + +++ R M LE V YRL+GN++S+
Sbjct: 29 SLVTGGANVLKVY--RVIPDADPATRDKYTAARPPNM------KLECVASYRLNGNIKSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ RD+++++F DAK+SV++FD L+ S+H FE + ++ G
Sbjct: 81 QSVSLAGS----LRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
P+V+VDP RC +LVYG ++++L + S L + + A+ I
Sbjct: 134 HYHIPMVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIELQDVKPIKKAPMQLVAKTPI 193
Query: 231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S++I L+DLD K +V D F+HGY EP ++IL+E T+ GR++ + TC + ALS+
Sbjct: 194 LASYIIELKDLDEKIDNVIDIQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 253
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
+ + HP+IW+ +LP D + + + PIGG LV+ N++ Y +QS Y VS
Sbjct: 254 NIQQRVHPVIWTVNSLPFDCIQAIPINKPIGGCLVMCVNSLIYLNQSVP------PYGVS 307
Query: 349 LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQR 400
L+SS + P+ + LDAA +++ + +LS K G+L +LT+ D R V+
Sbjct: 308 LNSSADHSTSFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRN 367
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+K SVLTS I + FLGSRLG+SLL++F + +++ ++ G +E +
Sbjct: 368 FHFNKAAASVLTSCICVCEDEYLFLGSRLGNSLLLRFKEKDESLVITI---DDSGAVEKE 424
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
KR R + +YGS T ++ F V D+++NIGP+ +
Sbjct: 425 P---KRPRLEEEEL----------EVYGSGYK-TSVQLTSYIFEVCDNVLNIGPIAHMAV 470
Query: 521 GLRINADASATG-----ISKQSNYELVE--------------------------LPGCKG 549
G R+ + + + + + E+V L GC
Sbjct: 471 GERVAEEDAENQPDVQIVQNKLDIEVVTSSGHGKNGALCVLQSSIKPQVITSFGLSGCVD 530
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+WTV+ ++ +R A HA++I+S E TMVL+T + + E+ E+ +
Sbjct: 531 VWTVFDEAV-------ARRAEDGPSTHAFMILSQEGGTMVLQTGEEINEI-ENTGFATTV 582
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TI GN+ R ++QV + R+L G+ + Q++ + SV+I DP
Sbjct: 583 PTIHVGNIGTNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVAIVDP 632
Query: 670 YVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTLYHDK--- 716
YV + S+G + L G P T+S TPA + S+ + VS L+ K
Sbjct: 633 YVCVRSSEGRVITLALREGKGTPRLAVNKNTIS-PTPAVVAISAYRDVSG--LFTKKIED 689
Query: 717 --------------------GPEPWLRKTSTDAWLSTGVGE----------AIDGADGGP 746
PEP ++ + L G AI G GG
Sbjct: 690 VYDLSRGGAASAYSSGFGSMKPEPHMKIEDEEDLLYGESGRSFKMTSMADMAIAGKSGGS 749
Query: 747 LD---------QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT--YMREA 795
D + + ++G LEI+ +P+ V+ + +G + D+ ++
Sbjct: 750 ADFWMKYMQQVKPTYWLFAARDNGTLEIYSMPDLKLVYLITNVGNGNKVLSDSMEFVPLP 809
Query: 796 LKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA 855
+ S ++ ++SS G G ++ +++ +A+ ++ SRP LF I + +L Y+
Sbjct: 810 MGKSASQEDASSAFGASFGVSASLLPKEILMVAL---GSYGSRPLLF-IRLEHDLLIYRV 865
Query: 856 YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP----- 910
+ + SK + R LS S V+ R S E+ A
Sbjct: 866 FRY------SKGHLKLRFKR-LSTS-VTCPVFRTPEPSGAGATEAANEQQQARATKVLYE 917
Query: 911 -CQRITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
I F N+SG+ G + G +P + + LR H + AF +NVNC +G
Sbjct: 918 NISMIRYFANVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFNNVNCPNG 977
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
F+Y Q LKI P+ +YD+ WPV+KIPL+++P QI Y E +Y +++ +
Sbjct: 978 FLYFDEQYELKISIFPTYLSYDSVWPVRKIPLRSSPKQIVYHRENKVYCVVMDAEEI--C 1035
Query: 1029 NQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPM 1085
N+ D+E+ + HR + V ++ P W+ +I +
Sbjct: 1036 NKYYRFNGEDKELTEENKGERFLYPMGHR------FSVVLVTP----AAWEVVPETSINL 1085
Query: 1086 QSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1144
+ E+ + ++ V+L + + +A+GT + ED+ +RGR+LL+ P
Sbjct: 1086 EEWEHVIALKNVSLTYEGARSGLKEYIAVGTNFNYSEDITSRGRLLLYDIIEVVPEPGKP 1145
Query: 1145 VT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
+T EV K+ KG +SA++ + G L+ A G K+ L + +L G+AF D ++V
Sbjct: 1146 LTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYLWQMKDDDLVGVAFIDTN-IFVH 1204
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
+ +K+ IL+ D++KS+ L ++E+ L+++++D+ L+ F E+++D + L +VSD
Sbjct: 1205 QMVSIKSLILVADVYKSVSLLRFQEEYRTLSVVSRDYHPLNVFQVEYVVDNANLGFLVSD 1264
Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1319
+Q N+ + Y P+ ES+ GQ+LL ++++H+G V R+Q +D D
Sbjct: 1265 DQCNLITYMYQPESRESFGGQRLLRKSDYHLGQQVNCMFRVQCDFHETDVMKRTLNYD-- 1322
Query: 1320 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
N+ F TLDG IG + PL E T+RRL LQ L+ PH GLNP+++R K
Sbjct: 1323 NKHTTFFATLDGGIGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLNPKAYRTIKQTRKLP 1382
Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+VD +L+ + LP E+ E+A + GT +I ++L ++
Sbjct: 1383 INPSRCVVDGDLVWSFLELPANEKHEVAKKIGTRIEEICADLMEI 1427
>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
Length = 1426
Score = 478 bits (1230), Expect = e-131, Method: Compositional matrix adjust.
Identities = 415/1498 (27%), Positives = 668/1498 (44%), Gaps = 243/1498 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + + K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDAVQRQKLNPSEMRLAPKM------RLECLASYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGA----MRDALLISFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + R
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+ + T
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDT------- 248
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
+ PIGG LV+ N + Y +QS Y V
Sbjct: 249 -----------------------FFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 279
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 280 SLNSSADNSTAFPLKPQDNVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 339
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 340 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEATVEQQT 399
Query: 448 -----SGLKEE--FGDIEA-DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
L EE D+E +AP + RR + EEL +YGS + + +
Sbjct: 400 IEQSPEELAEESPVYDVEQHEAPPQSKSRR---------IEDEELEVYGSGAKASVLQLR 450
Query: 500 TFSFAVRDSLVNIGPLKDFSYG-----------LRINAD---------ASATGISKQS-- 537
F F V DSL+N+ P+ G LR +AD +ATG SK
Sbjct: 451 KFIFEVCDSLINVAPINYMCAGERVEFEEDGATLRPHADNLNDLKIELVAATGHSKNGAL 510
Query: 538 -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
N +++ EL GC +WTV+ ++R A ++R + H ++++S + T
Sbjct: 511 SVFVNCINPQIITSFELEGCLDVWTVFDDATR--KATTAR-----QDQHDFMLLSQRSST 563
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 564 LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 620
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
S V+ VSIADPYV L + +G + L + T + SS V
Sbjct: 621 --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 672
Query: 708 SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
+ Y D EP ++ + L G
Sbjct: 673 VAIAAYKDLSGLFTCKADDVLNLTGSSGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGS 732
Query: 738 AI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDK 779
A D A D + VV +SG LEI+ +P+ V+ V+
Sbjct: 733 AFKLNSMADLAKQSKQKNSDWWRRQLIQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 792
Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
+G + D E + S T+ NS + G ++ +S +EL + H R
Sbjct: 793 IGNGALVLSDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGER 848
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P L + T +L YQ + + + + L + + LD
Sbjct: 849 PLLL-VRTRLELLIYQVFRY------------AKGHLKIRFRKLEQLHLLEQQPTHIELD 895
Query: 899 AYTREETP----HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
EE Q++ F N+ G G + G PC+ + R LR+H L +G
Sbjct: 896 GEDVEEAESYNMQAKYVQKLRYFANVGGLAGIMVCGVNPCFVFLTSRGELRIHRLLGNGD 955
Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEK 1013
+ +F +NVN HGF+Y + LKI LPS +YD WPV+K+PL+ TP Q+ Y E
Sbjct: 956 VRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKVPLRCTPRQLVYHREN 1015
Query: 1014 NLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPD 1071
+Y LI +P+ + D+E+ + Y + +E+ ++ P+
Sbjct: 1016 RVYCLITQKE--EPMTKYYRFNGEDKELSEECRGERF-------IYPIGSLFEMVLISPE 1066
Query: 1072 RAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
W+ A+I + E+ ++V L + T + L IGT + ED+ +RG
Sbjct: 1067 ----TWEIVPDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGN 1122
Query: 1129 VLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
+ ++ P +T EV+ KE KG +SA++ + G L+ G KI + +
Sbjct: 1123 IHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDG 1182
Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
+L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L ++DF ++ F
Sbjct: 1183 DLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLASRDFNPMEVFG 1241
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
EF++D S L +V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q
Sbjct: 1242 IEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCH 1301
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
+ N+ +++G+LDG++G PL E +RR LQ L+ H+ G
Sbjct: 1302 QRGLHQRQPFL---YENKHLVIYGSLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCG 1358
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
LNP+ +R S K I+D +L+ + ML E+ E+A + GT +IL++L
Sbjct: 1359 LNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAHSERNEVAKKIGTRTEEILADL 1416
>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
Length = 1431
Score = 474 bits (1219), Expect = e-130, Method: Compositional matrix adjust.
Identities = 411/1510 (27%), Positives = 668/1510 (44%), Gaps = 262/1510 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + ++ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDATQRQKLNPSEMRLAPKM------RLECLASYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S G RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGG----MRDALLVSFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N + Y +QS Y V
Sbjct: 243 ----------------------KVFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I FLGSRLG+SLL+ FT +++++
Sbjct: 335 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVESAATAAA 394
Query: 448 ----------------SGLKEEFGDIEA-DAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+++ D+E +APS + RR + EEL +YGS
Sbjct: 395 TGAGEQQQQAIDQSPPQMDEDQVYDVEQHEAPSQAKSRR---------IEDEELEVYGSG 445
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASA 530
+ + + F F V DSL+N+ P+ G R+ + +A
Sbjct: 446 AKASVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLTDLKIELVAA 505
Query: 531 TGISKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
TG SK N +++ EL GC +WTV+ ++R + + E H +
Sbjct: 506 TGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFDDATR-------KPSTARQEQHDF 558
Query: 579 LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
+++S + T+VL+T + E+ E+ + V TI GNL +R ++QV R R+L G+
Sbjct: 559 MLLSQRSSTLVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTR 617
Query: 639 MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPA 698
+ Q++ S V+ VSIADPYV L + +G + L + T +
Sbjct: 618 LIQNVPI----------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINK 667
Query: 699 AIESSKKPVSSCTLYHD------------------------------KGPEPWLRKTSTD 728
SS V + Y D EP ++ +
Sbjct: 668 HTISSAPAVVAIAAYKDLSGLFTCKADDVLNLTGSTGAGFANSFGGYMKAEPHMKVEDEE 727
Query: 729 AWLSTGVGEAI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPN 770
L G A D A D + VV +SG LEI+ +P+
Sbjct: 728 DLLYGDAGNAFKLNSMADLAKQSKQKNTDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPD 787
Query: 771 FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAM 829
V+ V+ +G + D E + S T+ NS + G ++ +S +EL++
Sbjct: 788 MKLVYLVNDVGNGALVLTDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELSL 843
Query: 830 QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN 889
H RP L + T +L YQ + + L + +L
Sbjct: 844 VGLGQHGDRPLLL-VRTRLELLIYQVFRY--------------AKGHLKIRFRKLEQLHL 888
Query: 890 LRFSRTPLDAYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPCWC-MVFRE 941
L T ++ EET Q++ F N+ G G + G PC+ + R
Sbjct: 889 LDQQPTHIELINEEETDEAESYNMQPKYVQKLRYFNNVGGLAGIMVCGVNPCFIFLTARG 948
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
LR+H L + + +F +NVN HGF+Y + LKI LP+ +YD WPV+K+PL+
Sbjct: 949 ELRIHRLLGNAEVRSFAAFNNVNIPHGFLYFDTTYELKISVLPTYLSYDAAWPVRKVPLR 1008
Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV 1060
TP Q+ Y E +Y LI +P+ + D+E+ + Y +
Sbjct: 1009 CTPRQLVYHRENRVYCLITQKE--EPMTKYYRFNGEDKELSEESRGERF-------IYPI 1059
Query: 1061 EE-YEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
+E+ ++ P+ W+ A+I + E+ ++V L + T + L IGT
Sbjct: 1060 GSLFEMVLISPE----TWEIVPDASIQFEPWEHVTAFKLVKLSYEGTRSGLKEYLCIGTN 1115
Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIAS 1171
+ ED+ +RG + ++ P +T EV+ KE KG +SA++ + G L+
Sbjct: 1116 FNYSEDITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGL 1175
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
G KI + + +L G+AF D +YV + VK+ I + D++KSI L ++E+ L+L
Sbjct: 1176 GQKIYIWQLRDGDLIGVAFIDTN-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1234
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
++DF L+ F EF++D S L +V+D ++NI ++ Y P+ ES GQKLL +A++H+G
Sbjct: 1235 ASRDFNPLEVFGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLG 1294
Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
V R+Q + N+ +++GTLDG++G PL E +RR LQ
Sbjct: 1295 QVVNTMFRVQCHQRGLHQRQPFLYE---NKHFVIYGTLDGALGYCLPLPEKVYRRFLMLQ 1351
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
L+ H+ GLNP+ +R + K I+D +L+ Y ML E+ E+A + G
Sbjct: 1352 NVLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIG 1411
Query: 1412 TTRSQILSNL 1421
T +IL++L
Sbjct: 1412 TRTEEILADL 1421
>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
Length = 1420
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 408/1488 (27%), Positives = 661/1488 (44%), Gaps = 229/1488 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + + ++ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVDAAQRQKLNPTEMRLAPKM------RLECLASYSLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S G RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQSVSLAGG----MRDALLISFKDAKLSVLQLDADTQALKTLSLHYFEEED---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P+V+VDP RC +LVYG ++++L + S L + + R
Sbjct: 136 GRYHVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I L DLD K +V D F+HGY EP ++IL+E T AGR+
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N I Y +QS Y V
Sbjct: 243 ----------------------KVFPIQKPIGGCLVMTVNAIIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
K SVLTS I FLGSRLG+SLL+ FT +++++ E + +A
Sbjct: 335 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDMENAVEQQA 394
Query: 460 DAPSTKRL----------RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSL 509
+ +L + S A + EEL +YGS + + + F F V DSL
Sbjct: 395 VEQAPPQLDEEQVYDVDQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSL 454
Query: 510 VNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------NYE 540
+N+ P+ G R+ + +ATG SK N +
Sbjct: 455 INVAPINYMCAGERVEFEEDGSTLRPHAESLNEVKIELVAATGHSKNGALSVFVNCINPQ 514
Query: 541 LV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
++ EL GC +WTV+ ++R + E H ++++S + T+VL+T +
Sbjct: 515 IITSFELDGCLDVWTVFDDATR-------KPTTARQEQHDFMLLSQRSSTLVLQTGQEIN 567
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 568 EI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----------DV 616
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-- 715
S V+ VSIADPYV L + +G + L + T + SS V + Y D
Sbjct: 617 GSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAIAAYKDLS 676
Query: 716 ----------------KGP------------EPWLRKTSTDAWLSTGVGEAI------DG 741
GP EP ++ + L G A D
Sbjct: 677 GLFTCKADDVLNLTGSSGPGFVNSFGGYMKAEPHMKVEDEEDLLYGDAGNAFKLNSMADL 736
Query: 742 ADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
A D + VV +SG LEI+ +P+ V+ V+ +G + D
Sbjct: 737 AKQSKQKNSDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGALVLND 796
Query: 790 TYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
E + S T+ NS + G ++ +S +EL + H RP L + T
Sbjct: 797 AM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGERPLLL-VRTRL 851
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP-- 906
+L YQ + + + + L + + + LD EE
Sbjct: 852 ELLIYQVFRY------------AKGHLKIRFRKLEQLHLLDQQPTHIELDGDEAEEAESY 899
Query: 907 --HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNV 963
Q++ F N+ G G + G P + + R LR+H L + + +F +NV
Sbjct: 900 NMQPKYVQKLRYFSNVGGLAGIMVCGMNPVFVFLTARGELRIHRLLGNADVRSFAAFNNV 959
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
N HGF+Y + LKI LPS +YD WPV+K+PL+ TP Q+ Y E +Y LI
Sbjct: 960 NIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKVPLRCTPRQLVYHRENRVYCLITQKE 1019
Query: 1024 VLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQT-- 1079
+P+ + D+E+ + Y + +E+ ++ P+ W+
Sbjct: 1020 --EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSLFEMVLISPET----WEIVP 1066
Query: 1080 RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1138
A+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1067 DASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVV 1126
Query: 1139 DNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
P +T EV+ KE KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1127 PEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKIYIWQLRDGDLIGVAFIDT 1186
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ F EF++D S L
Sbjct: 1187 N-IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVFGIEFMVDNSNL 1245
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
+V+D ++N+ ++ Y P+ ES GQKLL +A++H+G V R+Q
Sbjct: 1246 GFLVTDAERNLIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQRGLHHRQPF 1305
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R
Sbjct: 1306 LYE---NKHLVIYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYRTIK 1362
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ K I+D +L+ Y ML E+ E+A + GT +IL+++
Sbjct: 1363 TVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGTRTEEILADM 1410
>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
Length = 1420
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 410/1486 (27%), Positives = 677/1486 (45%), Gaps = 225/1486 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRI------------- 242
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
K+ + PIGG LV+ N + Y +QS Y V
Sbjct: 243 ----------------------KVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 275 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 335 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 394
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 395 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 453
Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
SL+N+ P+ G R+ + +ATG SK N
Sbjct: 454 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 513
Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
+++ EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T
Sbjct: 514 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 564
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ E+ + V TI GNL +R ++QV R R+L G+ + Q++
Sbjct: 565 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 613
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 614 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 673
Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
KG EP ++ + L G A D
Sbjct: 674 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 733
Query: 741 GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
A D + VV +SG LEI+ +P+ V+ V+ +G +
Sbjct: 734 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 793
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D E + S T +S ++ +S +EL++ + RP L + T
Sbjct: 794 DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 850
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L YQ +F P+ K R + N+ + ++ D E+
Sbjct: 851 ELLIYQ--VFRYPKGHLKI-----RFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 901
Query: 909 AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
P Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN
Sbjct: 902 QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 961
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
+GF+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E +Y LI
Sbjct: 962 PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTE-- 1019
Query: 1026 KPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RA 1081
+P+ + D+E+ + Y + ++E+ ++ P+ W+ A
Sbjct: 1020 EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDA 1068
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
+I + E+ ++V L + T + L IGT + ED+ +RG + ++
Sbjct: 1069 SITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPE 1128
Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
P +T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D
Sbjct: 1129 PGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN- 1187
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+YV + VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L
Sbjct: 1188 IYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1247
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
+V+D ++NI ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 1248 LVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL- 1306
Query: 1316 SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S+
Sbjct: 1307 --YENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSS 1364
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
K I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 1365 KKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 1410
>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
Length = 1503
Score = 470 bits (1209), Expect = e-129, Method: Compositional matrix adjust.
Identities = 398/1499 (26%), Positives = 679/1499 (45%), Gaps = 223/1499 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LV ANV+++Y R+ + ++ R M LE + YRL GN+ SL
Sbjct: 42 SLVTGGANVLKVY--RIIPDADPATREKYSATRPPNM------KLECMASYRLFGNIMSL 93
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S G+ +RD+++++F DAK+SV++FD L+ S+H FE + ++ G
Sbjct: 94 QSVSLAGS----QRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 146
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGGF---SARI 230
PLV+VDP RC +LVYG ++++L K S + D I
Sbjct: 147 HYHIPLVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEMQDVKPIKKTPTLLIAKTPI 206
Query: 231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S++I L+DLD K +V D F+HGY EP ++IL+E T+ GR++ + TC + ALS+
Sbjct: 207 LASYIIELKDLDEKIDNVIDVQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 266
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
+ + HP+IW+ +LP D + + + PIGG LV+ N++ Y +QS Y VS
Sbjct: 267 NIQQRVHPVIWTVNSLPFDCLQAVPISKPIGGCLVMCVNSLIYLNQSVP------PYGVS 320
Query: 349 LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
L+SS + P+ + LDAA +++++ +LS K G+L +LT+ D
Sbjct: 321 LNSSADHSTNFPLKPQDGVRISLDAAQVCFIESEKLVLSLKGGELYVLTLCADS------ 374
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
I FLGSRLG+SLL++F + +++ ++ G +E +
Sbjct: 375 ----------MRSICVCETEYLFLGSRLGNSLLLRFREKDESLVITI---DDSGTVEKEQ 421
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
KR R + +YGS T ++ F V DS++NIGP+ + G
Sbjct: 422 ---KRQRLEEEEL----------EVYGSGYK-TSVQLTSYIFEVCDSVLNIGPIAHMAVG 467
Query: 522 LRINAD-------------------ASATGISKQSNYELVE------------LPGCKGI 550
RI + +A+G K +++ L GC +
Sbjct: 468 ERICEEEMEEGAEVQFVPNKLDVEVVTASGHGKNGALCVLQSSIKPQVITSFGLSGCLDV 527
Query: 551 WTVYHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
WTV+ +++ +R DD HA++I+S E TMVL+T + + E+ E+ +
Sbjct: 528 WTVFDEAAGPGGVTGTRKP--DDAPPPNHAFMILSQEGATMVLQTGEEINEI-ENTGFAT 584
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
TI GN+ R ++QV + R+L G+ + Q++ + SVSI
Sbjct: 585 DVPTIHVGNIGSNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVSIV 634
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD------------ 715
DPYV + S+G + L T + S+ PV + + Y D
Sbjct: 635 DPYVCVRSSEGRVITLALREGKGTPRLAVNKNTISASPPVIAISAYRDVSGMFTRKLEDS 694
Query: 716 ----KG---------------PEPWLRKTSTDAWLSTGVGEAID---GADGGPLDQG--- 750
KG PEP ++ + L G + AD D+G
Sbjct: 695 FDVSKGGGATSAYSSGFGSMKPEPNMKIEDEEDLLYGESGRSFKVTSMADMALADKGGGN 754
Query: 751 -------------DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL- 796
+ + ++G LEI+ +P+ + + +G + D+ L
Sbjct: 755 ADFWLKYMQQIKPTYWLLAARDNGNLEIYSMPDLKLAYLISNVGNGNKVLSDSMEFVPLP 814
Query: 797 --KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
K ++ ++S G G S+ E+ M ++ SRP LF I + +L Y+
Sbjct: 815 MAKPGTSQEEATSAFGASFGSGGVPVSLLPKEILMVALGSYGSRPILF-IRLEQDLLIYR 873
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR- 913
+ + + S+ + V A RL NL + A T P+G Q
Sbjct: 874 VFRYAKGHLKLRFKRLTSSVTCPAFRTVPA-RLANLP-DKPATGATTDATEPNGKDTQEH 931
Query: 914 -----------ITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLH 961
I F N+SG+ G + G +P + + LR H + AF +
Sbjct: 932 ATKVQYENISMIRYFGNVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFN 991
Query: 962 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
NVNC +GF+Y Q LKI LP+ +YD+ WPV+KIPL+++P QI Y E +Y +++
Sbjct: 992 NVNCPNGFLYFDEQYQLKISILPTYLSYDSVWPVRKIPLRSSPKQIVYHRENRVYCVVMD 1051
Query: 1022 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE--------EYEVRILEPDRA 1073
+E+ ++ N +L E ++ V ++ P
Sbjct: 1052 A---------------EEICNKYYRFNGEDKELTEENKGERFLYPMGHQFSVVLVNP--- 1093
Query: 1074 GGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVL 1130
W+ I ++ E+ ++++ V+L + + +A+GT + ED+ +RGR+L
Sbjct: 1094 -AAWEIVPDTAIALEEWEHVVSLKNVSLAYEGARSGLKEYIAVGTNFNYSEDITSRGRLL 1152
Query: 1131 LFSTGRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
L+ P +T EV K+ KG +SA++ + G L+ A G K+ L + +L
Sbjct: 1153 LYDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKVYLWQMKDDDL 1212
Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
G+AF D ++V + +K+ IL+ D++KS+ L ++++ L+L+++D+ L+ + E
Sbjct: 1213 VGVAFIDTN-IFVHQMVSIKSLILVADVYKSVSLLRFQDEFRTLSLVSRDYHPLNVYQVE 1271
Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
+++D + L +V+D+Q N+ + Y P+ ES+ GQ+LL + ++H+G V R+Q
Sbjct: 1272 YVVDNTNLGFLVADDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAMFRVQCDFH 1331
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
SD D N+ F TLDG G + PL E T+RRL LQ L+ PH GLN
Sbjct: 1332 ESDVMRRTLNYD--NKHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSPHTCGLN 1389
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
P+++R + +VD +L+ + LP E+ E+A + GT +I ++L ++
Sbjct: 1390 PKAYRTIKQSRALPINPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICADLMEI 1448
>gi|242075248|ref|XP_002447560.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
gi|241938743|gb|EES11888.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
Length = 374
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 220/359 (61%), Positives = 267/359 (74%), Gaps = 14/359 (3%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-------ELDSELPSK-RGI 52
MS+AAYKMMHWPT I +C +GFITHS AD ++DS S R +
Sbjct: 1 MSYAAYKMMHWPTSIDHCAAGFITHSPADAAAFSSAAPAAAASGPDGDIDSAAASAPRRV 60
Query: 53 GPVPNLVVTAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHG 111
GP PNLVV+AANV+E+Y VR G+++ NS T ++DGIS A LELVCHYRLHG
Sbjct: 61 GPTPNLVVSAANVLEVYAVRADSATGAEDVGNSSSTG--AILDGISGARLELVCHYRLHG 118
Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
N+ES+A+LS G RRDSI + F+DAKI+ +EFDDS +GLR +SMHCFE PEW HLK
Sbjct: 119 NIESMAVLSDG---TENRRDSIAVTFKDAKIACMEFDDSTNGLRTSSMHCFEGPEWFHLK 175
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
RGRESFA GP++K DPQGRCG VLVYGLQMIILKA++ G LVG+++ + RIE
Sbjct: 176 RGRESFAWGPIIKADPQGRCGAVLVYGLQMIILKAAEVGQSLVGEDEPTRMLSSTAVRIE 235
Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
SS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAGR+S K TCM+SA SIS
Sbjct: 236 SSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAGRISSKSQTCMLSAFSISMG 295
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
LKQHP+IWSA LPHDAY+LLAVP PI G+LV+ AN+IHYHSQS SC+LALN+++ D
Sbjct: 296 LKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYHSQSTSCSLALNSFSSQPD 354
>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
Length = 1260
Score = 443 bits (1140), Expect = e-121, Method: Compositional matrix adjust.
Identities = 371/1327 (27%), Positives = 612/1327 (46%), Gaps = 177/1327 (13%)
Query: 57 NLVVTAANVIEIY--VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
NLVV ANV+ ++ + E+ ++ G+ + LE + Y L G V
Sbjct: 29 NLVVAGANVLRVFRLIPNTDEKMLRKESADGQPPK---------MKLECLASYNLFGKVM 79
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A +S G+ +D+I+++F AK+S++E+D L+ S+H FE L G
Sbjct: 80 SIAAVSLPGSS----QDTILMSFAHAKLSLIEYDPVSDNLKTLSLHNFEVVSIL--DEGI 133
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE-SS 233
S + P ++VDP+GRC +L++ + IL F + + SS
Sbjct: 134 GSNHKIPEIRVDPEGRCAALLIFRNTLAILP--------------FRKDSAHDSNVTLSS 179
Query: 234 HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++I L DL+ + +V D F+HGY EP ++IL+E T+ GR++ + TC + A+S++T
Sbjct: 180 YIIKLTDLEERVDNVIDVQFLHGYYEPTLIILYEPVGTFPGRIAVRQDTCNMVAVSLNTQ 239
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLD 350
+ HP+IWS +LP D +LL VP P+GG L++ N++ Y +QS +++N+ A
Sbjct: 240 QRVHPIIWSLNSLPFDCSQLLPVPKPLGGALIMAVNSVIYVNQSVPPYGVSVNSIADHCT 299
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPS 409
S P + LD A A +LQ D +LS K G+L +LT+ D R V++ L K S
Sbjct: 300 SFPLKPYEGSRIGLDCARAAFLQYDRVVLSLKGGELYVLTLFADSMRSVRKFHLEKAAAS 359
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
VLT+ + N LF LGSRLG+SLL+ F + + A P ++
Sbjct: 360 VLTTCLCICDNYLF-LGSRLGNSLLLAFQ--------TKDYNQYATPFAAKKPKMEQFSL 410
Query: 470 SSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L D ++ EE+ LYG +T+S ++ F V DSL+NIGP + G
Sbjct: 411 LFDQEL-DHLDEEEIDNYLYGEDHESTDSKAISYQFEVCDSLLNIGPCGQMAVG---EPA 466
Query: 528 ASATGISKQSNYELVELP-----GCKGIWTVYHKSSRGHNADSSRMAAYDDEY------- 575
++ T K+S VE+ G G V ++ + + + D +
Sbjct: 467 STCTDFDKKSPDPDVEIVTTSGYGKNGAICVLQRTMKPQVVTTFELPEVSDMFTVFASRN 526
Query: 576 ------HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
H YL++S TMVL+T + E+ +S + V TI A NL R ++QV
Sbjct: 527 NEDAIMHTYLLLSRADSTMVLQTGQEINEMDQS-GFSVTSPTILAANLGNNRFIVQVCPT 585
Query: 630 GARILDGS-YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV----------LLGMSDG 678
R+LD + + Q+L + + S S +DPYV LL +G
Sbjct: 586 SVRLLDATATVIQELVM----------DSDFLITSASASDPYVAVLTENGRIGLLTFVEG 635
Query: 679 SIRLLV------GDPSTCTVSVQTPAAIESSKKPVS---SCTLYHDKGPEPWLRKTSTDA 729
S ++ P C + + + ++ P + T H +K D
Sbjct: 636 SQLEMIFPVLSKNSPVVCVCLYRDISGLFNTTIPETDSPETTKLHTANKSLNAKKEMDDE 695
Query: 730 ----WLSTGVGEAIDGADGGPLDQGDIYSVVCY--------------ESGALEIFDVPNF 771
+ T E+ D V+ Y ++G LEI+ +
Sbjct: 696 EDYLYGDTNTEESRPTEDKTHTKFTPQQKVIDYFREIKPTFWLSIIRQNGTLEIYSLAGQ 755
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
+ V+ F + H+ + +K ET + SS+ +VE+ +
Sbjct: 756 S---VVETFQTVHVHLGHRLIFN-MKADETSLPSSTH-------------CNIVEMGIFG 798
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN-- 889
H RP L +D +L Y+A PV S+ + + +L +
Sbjct: 799 LGHLHRRPLLMIRTSDFGVLLYEAI----------PALPVYDSKQKNELKIRFRKLNHSL 848
Query: 890 -LRFSRTPLDAYTREE------TPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRE 941
LR ++T Y R+ P+ + F NI+G+ G F+ G P W M R
Sbjct: 849 LLRETKT----YVRKGGQSVVLEPYAWKTNQFKYFSNIAGYTGVFIGGPYPHWLFMTSRG 904
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
LR+HP DGSI F HNVNC GFIY+ + L+IC LP+ YD WPV+K+PL+
Sbjct: 905 ELRLHPMSIDGSIKCFACFHNVNCAQGFIYLNRKDELRICLLPTLFNYDAPWPVRKVPLR 964
Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT-V 1060
TPH + Y E Y I++ + +P N++ D + +L D Y V
Sbjct: 965 CTPHYLIYHVETKTY--ILATSLAEPTNRIYRFNGDDK------ELSLEERDDRFPYPHV 1016
Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQ 1119
E++ ++++ P TR + + E+ ++ V+L + + LA+ T Y
Sbjct: 1017 EKFAIQLISPVTWEAVPNTR--MDLDDWEHVTCLKTVSLEYEGHASGLKDYLAVSTNYNY 1074
Query: 1120 GEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
GED+ +RGR+ + P +N + +Y+K+ KG ++A++S+ G+L+ A G K
Sbjct: 1075 GEDIISRGRIFILDLIEVVPEPGQPLTKNKIKTLYAKDQKGPVAAISSVCGYLVAAIGQK 1134
Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
I L + +L GIAF D +Y+ L +K+FIL D++KS+ L ++E+ L ++A+
Sbjct: 1135 IYLWQLKNDDLVGIAFIDT-EIYIHQLLNIKSFILAADVYKSVSILRFQEEYRTLCIVAR 1193
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
D+ L+ A ++ ID + L +VSD +KN+ ++ Y P+ ES G +L+ +A+FHVG V
Sbjct: 1194 DYQPLEVMAVDYYIDNTQLGFLVSDAEKNLILYMYQPEARESQGGHRLIRKADFHVGQVV 1253
Query: 1295 TKFLRLQ 1301
+ R++
Sbjct: 1254 STMFRIK 1260
>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
Length = 1626
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 427/1602 (26%), Positives = 653/1602 (40%), Gaps = 375/1602 (23%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +H PT + +C + ++TH++ Q P+P+LVV +
Sbjct: 7 AVCTQVHPPTAVTHCTAAWLTHAQRQ--------QGSGSADGDDGGGSGDPLPDLVVVRS 58
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
+E+Y VR E G + ++ A SL+ + RL G ES+A+L +G
Sbjct: 59 TQLELYSVRGSEAGGPATTHT-------------AQSLDQLASCRLFGVAESVAVL-RGR 104
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
A +RD ++L F DAK+SVL +D H L +S+H FE LK GR F PL
Sbjct: 105 APG--QRDVLLLTFRDAKLSVLHWDAGRHELAPSSLHYFEGDA--SLKLGRTVFPYPPLA 160
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG-------------------- 223
DP GRCG V+++ Q+ +L A D + FG G
Sbjct: 161 VTDPLGRCGAVIIFRHQLAVLPAV--------DSELFGLGLSAAEEDEEEAAATAALGLA 212
Query: 224 --------------------------GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIE 257
+A + +S+V NL +K V+D F+HGY E
Sbjct: 213 PPDGGGAADGEAGAPRGGAAAAAAGLPAAAAAVGNSYVDNLGKAGIKEVRDACFLHGYSE 272
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
PV+++LHE E TWAG + K TC+++ALS++ T K HP IW A LP DAY+L A +P
Sbjct: 273 PVLMVLHEAEPTWAGNLRQKKDTCVLTALSLNLTRKHHPKIWGAQELPSDAYRLSA--AP 330
Query: 318 IGGVLVVGANTIHYHSQSASCALALN---------------------------------- 343
GGVLV+ + + ++ Q + L+
Sbjct: 331 CGGVLVLCQHLVLHYRQGQQSGVVLHPSALPPAAAPPPLLFDPQAMAEAGGPGPASAAYA 390
Query: 344 -NYAVSL------------DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
+AV + D+SQ ++ V D A WL + ALL ++G L+ L
Sbjct: 391 RQHAVDVHPETVPAAVRFCDASQA---AALKVTADGASVCWLSPESALLCLRSGQLLQLA 447
Query: 391 VVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS----------------------LFFLG 426
++ G + L +++ + S ++ + L FLG
Sbjct: 448 LLPQQAGGSARHLAVARAGAAPHPSCCCSLSGAHRAPHMPGSAAAAAAGQAPQPALVFLG 507
Query: 427 SRLGDSLLVQFT----CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--- 479
S GDSLLV+ T G+ ++ D AD P++KRLR +
Sbjct: 508 SAAGDSLLVRATPAAAAGTKRPAEAATGAAGEEDGTADEPASKRLRLEGIEVGSAAAAVE 567
Query: 480 --------------------------------NGEELSLYGSA--------------SNN 493
EE +YG+A +
Sbjct: 568 ATAAAAAAAQGAAAAAAEARAAAGGGPAGSDSEDEEALIYGTALYSSAAGVAPAAAAAVP 627
Query: 494 TESAQ-KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWT 552
T S Q + + V DSL NIGPL+DF+ A A G + G G T
Sbjct: 628 TPSWQLQRYQLKVLDSLANIGPLRDFAVA---EPAAGAGGEAVPPALVGCSGEGKGGTLT 684
Query: 553 VYHKS----------------SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
V +S G + A + +HAYL++S + T VL T + L
Sbjct: 685 VLRRSVVPDVITEHRGAASASGGGSGQAAGEAAGQEGGHHAYLLLSFQGATKVLATGEEL 744
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD-----LSFGPSNSE 651
EVTESV++ V T+AAG++ RR+ Q F +G R+LDG QD L+ + +
Sbjct: 745 REVTESVEFAVDTPTLAAGSVCCGRRIAQAFPQGLRLLDGEESVQDVWASELAAPAAAAA 804
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQT--------------- 696
+G ++S + DPYVLL ++DG+ R L DP C +S +
Sbjct: 805 AGGAPGGGAIVSADMCDPYVLLYLADGTARFLTADPVACRLSAASAAGAGPEAAAAAEAA 864
Query: 697 -----PAAIESSKKPVSSCTLYHD-------KGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
P A E +++C+L+ D + P+ + G A
Sbjct: 865 EAALRPVAAEER---ITACSLFADSCGWLAARLPQTQQQTQQQQQQQGQQDGGTTAQAAA 921
Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
G +Y+VVC SGA +++ +P + VF+ ++G ++ T A +
Sbjct: 922 SGGGCGAVYAVVCRASGACQLYALPAWQPVFSSSTSLAGGPALL-TGSGGAGGVAAAAAA 980
Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-----------------PFLFAILTD 847
+++ E +VVE+ + + + P L A+ D
Sbjct: 981 AAAAAAAAGVEDEMDGPGEVVEVRLVSFGPAAAGRRDAAAARASPAPACEPPLLLALTAD 1040
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-----NLRFSRTPLDAYTR 902
+L YQA+ ++ T R + L LR R
Sbjct: 1041 HQLLAYQAFSASPGSGGTRGSSGSGTPRFRRLRLDLPPLLPPAGGPQLRLRRLHCFEGLG 1100
Query: 903 EETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD---------GS 953
EE P+ G F++G P W + R L HP
Sbjct: 1101 EEAPY----------------SGVFVAGQHPHWLVASRGGLLPHPHFLPQPAGPGAAAVG 1144
Query: 954 IVAFTVLHNVNCNHGFIYVTS--QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFA 1011
FT HNVNC HGFI TS + ++I QLP + D WP Q++ +K TP ++ ++A
Sbjct: 1145 AAGFTPFHNVNCPHGFIVATSGARSGIQISQLPPRTRLDAPWPRQRVSIKGTPLKVAHYA 1204
Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1071
E +++ VLS + G + +E +EVR + P
Sbjct: 1205 EADMF-------------AVLSSRQGRARGRGV---------------MEGHEVRWVWP- 1235
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1131
GG WQ + E AL+V V L + T LLA+G A GED GR+LL
Sbjct: 1236 --GGGWQGVGRHQRRPGERALSVGAVRLKDHATGATVPLLAVGAALPAGEDYPCGGRLLL 1293
Query: 1132 FSTGRNADNP----QNLVTEVYSKELKGAISALASLQGHLLIASGPKI------------ 1175
F R Q +Y++E KG +++++ L+G+LL+ASG +I
Sbjct: 1294 FEVTRGDGGGGGGGQWAGRLIYTREFKGPVTSVSGLEGYLLLASGNRIETCSLSSTTITS 1353
Query: 1176 -----ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
+ T ++ AFYD P L + SLN+VKNF+LLGD S+ F+ +K++G QL+
Sbjct: 1354 TADDGTVAATTTWKVQRSAFYDGPVL-LTSLNVVKNFVLLGDCQHSVQFVRYKDEGRQLS 1412
Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
LL+KDF D AT+FLI+GS+L L D +++ YAP SWKGQ+L++ FHV
Sbjct: 1413 LLSKDFNRADTAATQFLINGSSLHLASCDSAGTLRLLSYAPSHPASWKGQRLVAWGSFHV 1472
Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
G + RL++ +S + D+T R A+L + GS
Sbjct: 1473 GEAASCMRRLRLHPSSPE--------DRTVRQAVLLSSAAGS 1506
>gi|449477808|ref|XP_004155129.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 643
Score = 404 bits (1039), Expect = e-109, Method: Compositional matrix adjust.
Identities = 188/255 (73%), Positives = 220/255 (86%), Gaps = 1/255 (0%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
MSFAAY+MMHWPTGI NC S +ITHSRAD+VP + +++LDS+ +R IGPVPNLVV
Sbjct: 1 MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTS-HSDDLDSDWHPRRDIGPVPNLVV 59
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
TA NV+E+YVVRV EEG +ESK+SGE KR +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60 TAGNVLEVYVVRVLEEGGRESKSSGEVKRGGIMDGVSWASLELVCHYRLHGNVESMAILS 119
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239
Query: 241 LDMKHVKDFIFVHGY 255
LD+KHVKDF+FVH Y
Sbjct: 240 LDVKHVKDFVFVHVY 254
Score = 368 bits (945), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 189/258 (73%), Positives = 208/258 (80%), Gaps = 26/258 (10%)
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
GDIE DA + KR+RRSSSDALQDMV G+ELSLYGSA+NNTESAQK FSFAVRDSL+NIGP
Sbjct: 342 GDIEVDAHTAKRMRRSSSDALQDMVGGDELSLYGSAANNTESAQKIFSFAVRDSLINIGP 401
Query: 515 LKDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCK 548
LKDFSYGLRINAD +ATGI+KQSNYELV ELPGCK
Sbjct: 402 LKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEVELPGCK 461
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
GIWTVYHK++RG ADSSRM DDEYHAYLIISLEARTMVL T +LLTEVTESVDYFV
Sbjct: 462 GIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIISLEARTMVLVTGELLTEVTESVDYFVH 521
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQDL+ + +ESG+ SE TVLS SI+D
Sbjct: 522 GRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNLVVNGNESGNASEGCTVLSASISD 581
Query: 669 PYVLLGMSDGSIRLLVGD 686
PYVLL M+DGSIRLLVG+
Sbjct: 582 PYVLLTMTDGSIRLLVGE 599
>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
Length = 1595
Score = 385 bits (988), Expect = e-103, Method: Compositional matrix adjust.
Identities = 306/1011 (30%), Positives = 469/1011 (46%), Gaps = 163/1011 (16%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQSNY---------ELV 542
+ F+V+DSL+ I P+ D + G + D +A G K ELV
Sbjct: 668 YKFSVKDSLLCISPVVDLTVGASAPVGTDLDPRTELVAACGHGKNGALAILTRGITPELV 727
Query: 543 ------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA--RTMVLETAD 594
LPG + W + N + R D+ + +LI+SL + TMVLET +
Sbjct: 728 TEVESGALPGLRACWAT---RTEDDNDGTVRPKRKDELFDEHLILSLSSTKTTMVLETGE 784
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG- 653
L EV++ VD+ V T+A +F R + QV + R + + F + +
Sbjct: 785 ELREVSKEVDFIVDEETLACERIFNGRAIAQVTKTKIR-----FTRKGKKFAVDDIDLAF 839
Query: 654 -SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-------ESSKK 705
G E + + I + + L +SDGSIR+++GD T T ++ ++
Sbjct: 840 LKGGEGAQITLAIIQNDAIALRLSDGSIRIILGDSKTNTFTLLEKVGELFASDNHSNTGS 899
Query: 706 PVSSCTLYHD----------------KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
V++ TLY D + P WL +T + G E D + +
Sbjct: 900 DVTAFTLYDDSVACTDSFGGGGGGLNRAP-GWLERT------ACGDREEKDESK----EN 948
Query: 750 GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
++ G L ++ +P+ +++ G RE L + T I+S
Sbjct: 949 NNVVFATISRDGTLALYSLPSLKKLWSSGGVSDG---------REILAPNSTGIDSIDFN 999
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
+ K + +++ A +A + RP L DG++L YQA F+ P +
Sbjct: 1000 DECEVEKYTVSDIRLDAFA----NAAYERPLLTCFRADGSVLAYQA--FKSPSSN----- 1048
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT--REETPHGAPCQ---RITIFKNIS--- 921
LRF+R P++ T E T + Q R+T +NI
Sbjct: 1049 -------------------ELRFARVPIEIETAGSELTNNDVSVQGGSRLTRIENIGDGR 1089
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI-VAFTVLHNVNCNHGFIYVTSQGILKI 980
G G F+SG P W +V R R+ P +G +AF HNVNC GFI T++G +++
Sbjct: 1090 GIAGVFVSGLNPIWLIVRRGRVLALPTRGEGGARIAFAPFHNVNCPKGFILATNEGGIRV 1149
Query: 981 CQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
C+LP + WPV+K+ L+ TP ITY + LY L+ S V P D E+
Sbjct: 1150 CRLPGKMHIEAQWPVRKLALRCTPRAITYMNDFKLYALVTSASV--PWK-------DFEI 1200
Query: 1041 GHQIDNHNLSSVDLHRTYT------VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ D+H + + V+++ +R+L P WQ ++ E+ L V
Sbjct: 1201 -DETDSHARALYRFRKEKAKSEGNVVQQFAIRLLVPGTLETAWQK----AVEPGEHILCV 1255
Query: 1095 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST--GRNADNPQNLVTEVY-SK 1151
+ V + + +T ++LAIGTA GED RGR+LLF+ R D E+ K
Sbjct: 1256 KNVQIRDQSTGALLSMLAIGTAMPGGEDTPCRGRILLFAIMWERARDGGVRWRGELKCEK 1315
Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
K A SA+ S+ G ++A G K+ H W G LN IAFYD PLY +L VKNF+L G
Sbjct: 1316 PSKMACSAIESVDGTFMVAIGTKLTAHSWDGKHLNPIAFYDT-PLYTTTLCCVKNFLLCG 1374
Query: 1212 DIHKSIYFLSWKE-QGAQ-LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
D+HKSI F+ WK+ QG + L+ L KD+ LDC A+EF+IDG TLSL+ +D N +F Y
Sbjct: 1375 DLHKSIRFVRWKDSQGEKTLSQLGKDYEVLDCIASEFMIDGGTLSLLAADANGNAHVFQY 1434
Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
APK++ESWKG KLL ++ +H G+ + K +R Q+ G K NR A+ FG+
Sbjct: 1435 APKLAESWKGDKLLPKSAYHAGSLIRKMVRFQI----------GVGEQKQNRHAVFFGSS 1484
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA------GLNPRSFRQFHSN--GKAHRP 1381
DG +G +P+DE TF L+ LQ + ++ + GLN +++R S+ A +
Sbjct: 1485 DGGLGIFSPVDEHTFLNLEKLQDAMRSNIVASSNSINPLGLNSKTYRALKSSEGSVARQT 1544
Query: 1382 GPDSIVDCELLSHYE-MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
P +IVD LLS +E L + Q +A + G TR Q LS + SF+
Sbjct: 1545 PPRTIVDGGLLSKFEHSLSITAQTRVAAKAGLTRDQALSLARTIIAEQSFM 1595
Score = 142 bits (358), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 100/325 (30%), Positives = 163/325 (50%), Gaps = 72/325 (22%)
Query: 181 PLV-KVDPQGRCGGVLVYGLQMIILK----------------ASQGGSGLVGDEDTFGSG 223
P++ + DP+GRC VL+ + +K +S G + ++ G G
Sbjct: 208 PIIGRADPEGRCAAVLLRNEEKAKVKIMPASETSTSSNYIKESSNGSKKMTTKKE--GEG 265
Query: 224 GGF-SARIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKH 278
+ A I SS +++R + V+D F+HGY EPV++IL+E TW+GR+S +
Sbjct: 266 TVYVPATIGSSFDLDVRKILGPSAAFVRDCCFLHGYGEPVLMILYESNPPTWSGRLSLRM 325
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC + A+SI T K++ ++W+ LP AY L VP+P+GGVLV+ + I Y SQS+S
Sbjct: 326 DTCKLVAVSIDCTKKKYTIVWTREKLPSAAYSLFPVPNPLGGVLVLSSGHILYESQSSSA 385
Query: 339 ALALN----------NYAVSLD------------------------SSQELPRSSFSVEL 364
+ N+A + SS E ++ F V+L
Sbjct: 386 TYISDFLGKGGPQEGNFAEEIARNNGVEGQAAHANPVPHVNSNKNVSSYETTQNEFQVQL 445
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTVVYD------------GRVVQRLDLSKTNPSVLT 412
DAA ++ +VA++S+KTG L+ TV+ + GR +R+ + K+ +VL+
Sbjct: 446 DAAKIEMIRENVAIISSKTGQLI--TVILETVGGAASVGSKVGRRCRRIRVLKSGNAVLS 503
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
S + +G L F+GSR+GDSLL+ +
Sbjct: 504 SGLAAVGKDLLFIGSRVGDSLLIGY 528
>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1744
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 372/1498 (24%), Positives = 634/1498 (42%), Gaps = 337/1498 (22%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 278 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 337
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A +
Sbjct: 338 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 397
Query: 352 SQELPRS------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
P S +V L +LQ LL+ +G + +L++ Y+
Sbjct: 398 QSVFPLSEAVYETPEHETVQLNVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSR 457
Query: 395 ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
GR L L S+ S + F+GSR GDS+L
Sbjct: 458 GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 516
Query: 444 SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ K+E I + AP K S A ++ + ++L LYG+A E A
Sbjct: 517 TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 571
Query: 498 QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
+ + + +R D L +IG + G+ NAD+
Sbjct: 572 ATSSTECTNGVGVSSVKTEENGAPEQDTGSYDYELRQIDVLPSIGQITSIELGVENNADS 631
Query: 529 S--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
+ + G + ++ EL GC+ +WTV + R
Sbjct: 632 NEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATRSSDGR- 690
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R++Q+F+
Sbjct: 691 -----SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGSTLAAANLFNKQRIVQIFK 745
Query: 629 RGARIL------------------DGSYM----------------TQDLSFGPSNSESGS 654
+GAR++ +G+ TQ+++ G
Sbjct: 746 QGARVMMEVPEEETSNGQEKSAKTEGAEDEEEDDEDDGPRVKLVCTQEITLEGDVECGGM 805
Query: 655 GSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAI 700
+ S+V +SV + DPY+LL ++DGS+RLL+GD +SV P +
Sbjct: 806 NVDTSSVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDLELSVIDPEIDYAEGISEANGSA 865
Query: 701 ESSKKPVSSCTLYHD----------------------------------------KGPEP 720
+ SK SS L++D P P
Sbjct: 866 DMSKHGSSSACLFYDWAGMFVENAWVEEEQEERHEATQSRAKRAEDDDDMDALYSSKPSP 925
Query: 721 WLRKT-STDAWLSTGVGEAIDGADGGPLDQ---GDIYSVVCYESGALEIFDVPNF----- 771
+ T +T + ST DG+ PL Q + +C+ G+L +F +P+F
Sbjct: 926 KVATTNATKSTPSTATPRNEDGSVSIPLLQQKDAKMMCGMCFGDGSLHVFSLPDFKKRGV 985
Query: 772 ---------NCVFTVDKFVSGRTHIVDTYMREALKDSETEIN-SSSEEGTGQGRKENIHS 821
+ V T++ + GR V L +N S+S G+ +K + +
Sbjct: 986 FPYLTFAPQSLVNTLEHYQVGRNKTVK------LSAPVLGLNASTSSANDGRIKKSHTIN 1039
Query: 822 MKVVELAMQRW--------SAHHSRPFLFAILTDGTILCYQAY-LFEGPENTSKSD-DPV 871
V ++ + R + + SR + L +G +L Y A FE + + + PV
Sbjct: 1040 SPVADIVIHRVGPSEGQHNAQYLSRMVMLVFLANGDLLMYSAAPKFESLKPRANGEIAPV 1099
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI---------TIFKNISG 922
+ ++ L + +A E A ++ T F N++
Sbjct: 1100 FHFVRVGTELITRPFLPPKARTNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTCFHNVNN 1159
Query: 923 HQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNHGFIYVTSQG 976
G F G+ P W + R P +C+ + +++FT H+ NC +GFIY S+G
Sbjct: 1160 MSGAFFRGAHPMWILGDRGHASFVP-MCNAAPRVSVPVLSFTSFHHWNCPNGFIYFHSRG 1218
Query: 977 ILKICQLPSGSTY-----DNYWPVQKIPLKATPHQITYFA-----------EKNLYPLIV 1020
L++C+LPS T + +QK AT H + Y E Y ++
Sbjct: 1219 ALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAEALEAPTYAVVC 1278
Query: 1021 SV--------------------------PVLKPL-NQVLSLLIDQEVGHQIDNHNLSSVD 1053
S P PL + V++ + ++ D
Sbjct: 1279 SARLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFADYETD-------- 1330
Query: 1054 LHRTYTVEE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT------- 1103
H +T E+ YE+R+++ D G W R + + E L+V+++ L++++
Sbjct: 1331 -HMAHTEEDVYELRLVQTDEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEEVA 1388
Query: 1104 ------TKENETLLAIGTAYV--QGEDVAARGRVLLFST----------GRNADNPQNLV 1145
K+ L +GT +V GED + RGR+LL+ G + L
Sbjct: 1389 STSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGATSGKLPKLR 1448
Query: 1146 TEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK 1205
+ +GAIS ++ L ++L A G K+I++++ +L G AFYDA +Y+V+L++VK
Sbjct: 1449 LVFIKEHRQGAISMVSQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MYIVTLSVVK 1507
Query: 1206 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
+F++ GD++KS++FL W+E QL LLAKD+ L ATEF + L+L+ D +N+
Sbjct: 1508 DFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDENLH 1567
Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTN 1320
+ +AP+ ES GQ+LL ++FH+G V+ R ++ A+ S + AAP S+ N
Sbjct: 1568 VMQFAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN 1627
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
+ GT +G +G + P+ E FRRL +LQ +V+++P LNPR FR +N +
Sbjct: 1628 ----VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRC 1683
Query: 1381 PGPDS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
PD+ +D +L + L Q E+A GTT ++ NL ++ TS
Sbjct: 1684 GRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATS 1741
>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 1561
Score = 372 bits (954), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 370/1484 (24%), Positives = 630/1484 (42%), Gaps = 322/1484 (21%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 108 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYYLTVISINMK 167
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 168 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 227
Query: 350 -------DSSQELPR---SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
D+ E P + +V L +LQ+ LL+ G + +L++ Y+
Sbjct: 228 QSVFPLSDAVYETPEHETAQLNVVLYDCQFEYLQDKELLLTMPCGQVYVLSLPYEDTSSR 287
Query: 395 ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
GR L L S+ S + F+GSR GDS+L
Sbjct: 288 GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 346
Query: 444 SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ K+E I + AP K S A ++ + ++L LYG+A E A
Sbjct: 347 TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 401
Query: 498 QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
+ + + +R D L +IG + G+ NAD+
Sbjct: 402 ATSSTECTNGVGVSSVKTEENGAPEQDTGPYDYELRQIDVLPSIGQITSIELGVENNADS 461
Query: 529 S--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
+ + G + ++ EL GC+ +WTV + R
Sbjct: 462 NEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATRSSDGR- 520
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R++Q+F+
Sbjct: 521 -----SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIVQIFK 575
Query: 629 RGARIL------------------DGSYM----------------TQDLSFGPSNSESGS 654
+GAR++ +G+ TQ+++ G
Sbjct: 576 QGARVMMEVPEEETSNGQEKSGKAEGAEDEEEDDEDDGPRVKLVCTQEITLEGDVECGGM 635
Query: 655 GSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAI 700
+ S+V +SV + DPY+LL ++D S+RLL+GD +SV P +
Sbjct: 636 NVDTSSVGIVSVDVVDPYILLLLTDVSVRLLMGDEEDLELSVIDPEIDYAEGISEANGSA 695
Query: 701 ESSKKPVSSCTLYHD-------------KGPEPWLRK-TSTDAWLSTGVGEAIDGADGGP 746
+ SK SS L++D P P + +T + ST DG+ P
Sbjct: 696 DMSKHGSSSACLFYDWAEDDDDMDALYSSKPSPKVATMNATKSMPSTATPRNEDGSVSIP 755
Query: 747 LDQ---GDIYSVVCYESGALEIFDVPNF--------------NCVFTVDKFVSGRTHIVD 789
L Q + +C+ G+L +F +P+F + V T++ + GR V
Sbjct: 756 LLQQKDAKMMCSMCFGDGSLHVFSLPDFKKRGVFPYLTFAPQSLVNTLEHYQVGRNKTVK 815
Query: 790 TYMREALKDSETEIN-SSSEEGTGQGRKENIHSMKVVELAMQRW--------SAHHSRPF 840
L +N S+S G+ +K + + V ++ + R + + SR
Sbjct: 816 ------LSAPALGLNASTSSANDGRIKKSHTINSPVADIVIHRVGPSEGQHNAQYLSRMV 869
Query: 841 LFAILTDGTILCYQAY-LFEGPENTSKSD-DPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
+ L +G +L Y A FE + + + PV + ++ L + +
Sbjct: 870 MLVFLANGDLLMYSAAPKFESLKPRANGEIAPVFHFVRVGTELITRPFLPPKARTNAHNE 929
Query: 899 AYTREETPHGAPCQRI---------TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQL 949
A E A ++ T F N++ G F G+ P W + R P
Sbjct: 930 AGNNPEVNTSAVLAKLRAGFRYPMLTCFYNVNNMSGAFFRGAHPMWILGDRGHASFVPMC 989
Query: 950 CDGS-------------------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
S +++FT H+ +C +GFIY S+G L++C+LPS T
Sbjct: 990 VPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSCPNGFIYFHSRGALRVCELPSSKTST 1049
Query: 990 ----DNYWPVQKIPLKATPHQITYFA-----------EKNLYPLIVSV------------ 1022
+ +QK AT H + Y E Y ++ S
Sbjct: 1050 ILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAEALEAPTYAVVCSARLKPADADRATE 1109
Query: 1023 --------------PVLKPL-NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVR 1066
P PL + V++ + ++ D H +T E+ YE+R
Sbjct: 1110 VEGAEEELEPENLDPNGNPLGSNVMAPTAEMFADYETD---------HMAHTEEDVYELR 1160
Query: 1067 ILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT-------------TKENETLL 1111
+++ D G W R + + E L+V+++ L++++ K+ L
Sbjct: 1161 LVQTDEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEEVASTSPEWNKKKRPYL 1219
Query: 1112 AIGTAYV--QGEDVAARGRVLLFST----------GRNADNPQNLVTEVYSKELKGAISA 1159
+GT +V GED + RGR+LL+ G + L + +GAIS
Sbjct: 1220 VVGTGWVGPHGEDESGRGRLLLYELDYAQYVNEEGGATSGKLPKLRLVFIKEHRQGAISM 1279
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
++ L ++L A G K+I++++ +L G AFYDA +Y+V+L++VK+F++ GD++KS++F
Sbjct: 1280 VSQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MYIVTLSVVKDFVMYGDVYKSVHF 1338
Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
L W+E QL LLAKD+ L ATEF + L+L+ D +N+ + +AP+ ES G
Sbjct: 1339 LRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDENLHVMQFAPQDIESRGG 1398
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q+LL ++FH+G V+ R ++ A+ S + AAP S+ N + GT +G +G
Sbjct: 1399 QRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN----VMGTSEGGVG 1454
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS--------- 1385
+ P+ E FRRL +LQ +V+++P LNPR FR +N + PD+
Sbjct: 1455 ALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRILKTNAQRRCGRPDAWSKKKWKKS 1514
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+D +L + L Q E+A GTT + NL ++ TS
Sbjct: 1515 FLDAFVLFRFLQLDYVAQKELARCIGTTPEVAMHNLLEVQHATS 1558
>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Metaseiulus occidentalis]
Length = 1500
Score = 355 bits (911), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 273/958 (28%), Positives = 443/958 (46%), Gaps = 143/958 (14%)
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
ELPGC +WTV S+R + D ++ H +LI+S TM+L+T + E+ S
Sbjct: 603 ELPGCTDLWTVRSSSTRSPDVD--------EDSHQFLILSRPDSTMILQTGQEINELDHS 654
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
+ Q TI AGNL R +IQV R+L+G Q + S ++
Sbjct: 655 -GFCTQSPTIFAGNLADGRYIIQVCPNSVRLLEGVKQLQQVPI----------DVGSPLV 703
Query: 663 SVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTPAAIESSKKPVSSCTLYHD---- 715
S SIAD +VL+ DG + L GD +T +SV P +K +++ +Y D
Sbjct: 704 SASIADLHVLVMSQDGLVIQLTLRGDDTTGYKLSVLKPQ-FPGAKSKITALCIYKDVSGL 762
Query: 716 -----KGPEPWLR-KTSTDAWLSTGV------------------GEAIDGAD--GGPLDQ 749
+ PE + KT + T V G ++D D G L+
Sbjct: 763 FVTKIQKPEDIAKPKTEAKTKVKTEVAKKVLRSADFDDEDELLYGSSVDIKDLVAGGLNA 822
Query: 750 GDI-----------------------------YSVVCYESGALEIFDVPNFNCVFTVDKF 780
+I + + E+GALEI+ P++ + V F
Sbjct: 823 ANIVPTTQTKDTAEEEDYEENVRKIAPVEPTFWVFLARENGALEIYSFPDYKLRYFVKNF 882
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+ + ++ A +T S+SE KV+E+ + H SRP
Sbjct: 883 -----PLCNKILQNAAATGQTTSASTSEAQLP----------KVMEIFVCALGMHQSRPL 927
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
LFA + D + Y+AY F ++ + RL++ + P Y
Sbjct: 928 LFARV-DSELHIYEAYPF--------------VNQKEGHLKLQFRRLQH-AVTMEPRRVY 971
Query: 901 TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTV 959
++E + I F+++ G+ G F+ G RP W + R LR HP L DG I +F
Sbjct: 972 KQKEGDPTLSLRWIRAFQDVCGYNGVFVCGRRPHWIFLTARGELRAHPMLNDGRIYSFAT 1031
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLI 1019
HNVNC GF++ G L+IC LPS YD WP++KIP+ TPH + Y + Y +
Sbjct: 1032 FHNVNCEKGFLFFNKYGELRICALPSYLNYDAPWPMRKIPIYETPHSVNYHVDSRTYCVA 1091
Query: 1020 VS-------VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDR 1072
S VP L ++ I++E I TV+++ + + P
Sbjct: 1092 TSKEETATCVPKLANEDKEFEP-IERESSRFIPP------------TVDKFALELWSPVS 1138
Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLL 1131
TR +PM+ E V+ V + + T E L+A+GT + GED+ A+GR+LL
Sbjct: 1139 WEAIPNTR--MPMEDWEKITCVKNVMIASEGTTSGEKGLIAVGTIHNFGEDITAKGRILL 1196
Query: 1132 FSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELN 1186
P ++ V + SK ++AL S++GHL+ A G K+ L + +L
Sbjct: 1197 IDIIEVVPEPGQPLTRSKVKTILSKPQNAPVTALCSVKGHLMAAVGQKLFLFQLKDNDLV 1256
Query: 1187 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1246
G+AF D +Y++S +K+FIL+GD+HKSI L ++E+ L +++KD + ++ E+
Sbjct: 1257 GMAFLDTQ-IYILSAISIKSFILIGDVHKSITLLRYQEESKTLAVVSKDTKPVQIYSIEY 1315
Query: 1247 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS 1306
L+D S ++ + +D Q NI ++ Y P+ E++ GQ+L+ R +F++G+ + R++
Sbjct: 1316 LVDNSQMAFLATDAQCNILVYMYQPENRETFGGQRLIRRGDFNIGSRINTMFRIRCRLAE 1375
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1366
R+ SD R L+ +LDG+ G + P+ E T+RRL LQ L HV GLNP
Sbjct: 1376 VPRSERRLLSDLEARHVTLYASLDGAFGYLLPISEKTYRRLLMLQNVLNSYCQHVGGLNP 1435
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++FR ++ +A +IVD +L++ + L E+ E+A + GTT QI +L ++
Sbjct: 1436 KAFRIMQTDVRALSNPQKNIVDGDLINVFMDLNFNEKAEVARKIGTTVHQIQLDLAEI 1493
Score = 184 bits (466), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 199/404 (49%), Gaps = 61/404 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS----AASLELVCHYRLHGN 112
NLVV VI++Y R++ DG++ A LE + GN
Sbjct: 29 NLVVAGGTVIKVY--------------------RLVCDGLNETDDKAKLEHQQTFNCFGN 68
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ + + + RDS++ F++ KIS++E+D + H L+ ++ E E+ K
Sbjct: 69 ISGMEKIRLNAS-----RDSLLFVFKETKISLVEYDPATHELQTLAIRSLEKEEY---KE 120
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFS 227
G +F L+KVDP RC VL+YG + I+ A+ + + T + GF
Sbjct: 121 GFYNFVGNTLIKVDPLNRCAAVLIYGKHLAIIPFVKKDATDLSDPIASSKSTQTNTSGFL 180
Query: 228 ARIESSHVINLRDLD----MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
+ I L DLD + ++ D F++GY EP +++L+E TW GRV+ + TC I
Sbjct: 181 ----EYYTIRLIDLDEEKGVNNIHDMTFLNGYYEPTLLLLYEPIRTWTGRVAIRQDTCSI 236
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
ALS++ + HP +WS LP +++K+L VP PIGGVL++ N + Y +QS
Sbjct: 237 MALSLNVYQRVHPPVWSFSGLPFNSFKVLPVPKPIGGVLILSVNALLYLNQSVPA----- 291
Query: 344 NYAVSLDSSQELPRSSFSVE--------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
Y VSL+ E +SF ++ LD +L LLS GDL +L++ DG
Sbjct: 292 -YGVSLNCFTEC-STSFPLKDQAGPPLTLDCCRCEFLSETKILLSVANGDLYVLSLFTDG 349
Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
R + + + K + + + I+ F+GSR+G+SLL+++T
Sbjct: 350 MRSINQFEFKKIATTTVATCISLCEPGYLFVGSRIGNSLLLRYT 393
>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 1241
Score = 354 bits (908), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 311/1109 (28%), Positives = 496/1109 (44%), Gaps = 156/1109 (14%)
Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+LT+ DG R V+ + K SVLT+ +T FLGSRLG+SLL+ +T M
Sbjct: 198 VLTLFNDGMRSVRNFNFDKAAASVLTTSMTLCEEGYLFLGSRLGNSLLLHYT-EKAAEME 256
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
+G KE+ ++ D +++ +EL +YGS + T+ +++F V
Sbjct: 257 EAGKKED---------------KAEGDVNVALIDPDELEVYGSETLATKQL-TSYTFEVC 300
Query: 507 DSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV----------- 542
DSL+NIGP G N+D + G K ++
Sbjct: 301 DSLINIGPCGKICMGEPAFLSEEFTQNSDPDLELVTTAGYGKNGALCVLQRSVRPQVVTT 360
Query: 543 -ELPGCKGIWTVYHKSSRGHNA------DSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
ELPGC +WTV + D A HA+LI+S +M+L+T
Sbjct: 361 FELPGCVHMWTVMGPPTEKKKKEASEESDEQAADATLTNTHAFLILSRADSSMILQTDQE 420
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ S + Q T+ AGNL R V+QV G R+LDG+ Q +
Sbjct: 421 INELDHS-GFSTQNPTVFAGNLGDGRYVLQVCPMGVRLLDGTRQLQHIPL---------- 469
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTP--AAIESSKKPVSSC 710
S ++ S+ADP+VL+ G + L GDP++ C ++V P A+ S + +C
Sbjct: 470 DVGSPIVGGSLADPHVLIRSEGGLVVHLTLRGDPASGCRLAVLRPQLTAVVSHRANALTC 529
Query: 711 --------------TLYHD----KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
LY D + + +R T+ + + + + P
Sbjct: 530 HCIAVSGVLDDEDELLYGDSEDTRATKEPVRVTAMET--ESETANVFELKEVKP----TF 583
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGT 811
+ V E+G LEI+ +P++ F V F G+ +VD+ A +++E ++ S E
Sbjct: 584 WVFVARENGVLEIYSLPDYKLCFLVKNFPMGQRVLVDSVQMTAPSGTKSEKLSDMSHE-- 641
Query: 812 GQGRKENIHSMKVV-ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
M VV E+ M SRP L A + D +L Y+A+ F +
Sbjct: 642 ---------CMPVVHEILMVGLGVRQSRPLLLARV-DEDLLIYEAFPFYETQREGH---- 687
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
L ++ + R +T EE + + F +ISG+ G FL G
Sbjct: 688 ----LKLRFKKLNHDIILRSRKYKTQKPENEEEEKAFQSRLW-LQPFSDISGYSGVFLCG 742
Query: 931 SRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-----ILK---IC 981
RP W M R LR HP DG + F HNVNC GF++ Q +L +
Sbjct: 743 HRPHWLFMSSRGELRYHPMFVDGPVYCFAPFHNVNCPKGFLHFNKQSDSYALLLHSYWLS 802
Query: 982 QLPSGSTYDNYW----PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
QLPS + P K K H+ +FA + + P L + D
Sbjct: 803 QLPSPKRHGERLLFNCPSHK---KICIHRCHFFALQQKAADFLWPPPFVTTVSPLPFVAD 859
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVR 1095
+ T++++ +++L P W+T + + E+ ++
Sbjct: 860 SR---------------YIFPTMDKFSLQLLSPVS----WETIPNTRVDLDEWEHLTCIK 900
Query: 1096 VVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVY 1149
V L + T + LA+GT Y GEDV +RGR+ + P +N + VY
Sbjct: 901 NVMLSSEGTSTGMKGYLALGTNYCYGEDVTSRGRITILDIIDVVPEPGQPLTKNKIKIVY 960
Query: 1150 SKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1209
SKE KG ++AL+ + G LL A G K+ + + L G+AF D +Y+ S+ VKN IL
Sbjct: 961 SKEQKGPVTALSQVVGFLLSAIGQKMYIWQLKDNGLVGVAFIDTQ-IYIHSVVTVKNLIL 1019
Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
+GD+ KS+ L ++E L+L+++D L+ FA EF ID S +S +V+D ++N+ ++ Y
Sbjct: 1020 VGDVFKSVSLLRYQEASRTLSLVSRDVRPLEVFAVEFFIDNSQMSFLVTDSERNMILYMY 1079
Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
P+ ES GQ+LL R +FH+G+ V R++ + + R + TL
Sbjct: 1080 QPESRESCGGQRLLRRGDFHIGSPVVSMFRIKCRMGEVAKHDRRLAASVDGRHITMLATL 1139
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
DGS+G + P+ E T+RRL LQ LV ++PH AGLNP++FR +HS + +I+D
Sbjct: 1140 DGSLGYVLPVPEKTYRRLLMLQNVLVTNMPHYAGLNPKAFRMYHSQRRVLGNPHKNILDG 1199
Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
EL+ + L E+ E++ + GTT +Q++
Sbjct: 1200 ELIWKFMHLSFMERSELSKKIGTTVTQVV 1228
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
P+++VDP RC +LV+ + ++ + + +E G G + + + L +
Sbjct: 58 PMIRVDPCNRCAAMLVFSRTIAVVPFRKDTAA---EEQETGPTFGNKPPLLDWYPVALTE 114
Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
LD K +V D F+HGY EP ++IL+E TW G+
Sbjct: 115 LDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGK 149
Score = 40.0 bits (92), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 9/103 (8%)
Query: 236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+ L +LD K +V D F+HGY EP ++IL+E TW G V + M S + +
Sbjct: 158 VALTELDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGYVLTLFNDGMRSVRNFNFDKA 217
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
++ ++M L + Y L S +G L+ +HY ++A
Sbjct: 218 AASVLTTSMTLCEEGYLFLG--SRLGNSLL-----LHYTEKAA 253
>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
Length = 1477
Score = 348 bits (892), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 256/898 (28%), Positives = 429/898 (47%), Gaps = 169/898 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGI----------------SAAS 100
NL+V NV+E+Y + E+ + T+ DG+ + S
Sbjct: 30 NLIVAKTNVLEVYALHRHEDSKARPIDRQSTRP---TDGVISLRGEEPKDAPPYAGTQHS 86
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+ LV L GN+ES+A + G +D+++L+F DAKISVLEFD + + LR S+H
Sbjct: 87 MRLVLSSSLFGNIESMAAVRFPGTS----KDALLLSFRDAKISVLEFDIATNDLRTISLH 142
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE +K G + + P ++VDPQ RC +L + ++++L Q S +
Sbjct: 143 YFED---YKVKEGHDHYIHVPELRVDPQQRCAAMLAFDRKLVVLPFRQHASLM-----EI 194
Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
+GG ++ S +++LR + + +VKDF+F+ GY EP ++IL+E TW+GRV+ +T
Sbjct: 195 ENGGQEDQPVKPSFLLDLRAMGIINVKDFVFLQGYYEPTLLILYEPTQTWSGRVAVNRNT 254
Query: 281 CMISALSIST-----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI------ 329
C+ +A+S++ HP++WSA LP+D +L+AVP PIGG L + N++
Sbjct: 255 CVAAAVSLNLWQHRGQTSAHPVVWSAEFLPYDTQRLIAVPGPIGGALALSTNSLLYLNQV 314
Query: 330 -------------------HYHSQSASCALALNNYA-VSLDSSQELP---RSSFSVELDA 366
H+ +Q+++ L LN +A + L P ++ + LDA
Sbjct: 315 SFPYRLILPAHGADVSITSHHDTQASASCLPLNVFADLYLSPQTPFPSAGKNRVGIALDA 374
Query: 367 AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNS----- 421
A +L +D L+S K G+L + ++ DGR V + L+K SV+TS + T+
Sbjct: 375 ARDVFLADDQLLVSLKGGELYIFHLLSDGRTVNDIQLTKAGSSVITSCMATLSGEGADER 434
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSSDA----- 474
FLGSR+GDSLL+Q+T ++ +G + F DI+ + + +A
Sbjct: 435 FLFLGSRVGDSLLLQYTTADASAPKQNGATKGSLFDDIKKEEDNDDDDEDEEEEASGEGE 494
Query: 475 LQDMVNGE-ELSLYGSASNNTESAQK-----TFSFAVRDSLVNIGPLKDFSYGLRIN-AD 527
+++ +GE E+ +G + +K T+ F V DSLVN+GP+ DF+ G + A
Sbjct: 495 VKEEPDGEGEVDEFGRRIREEDRRKKKGLLTTYKFKVCDSLVNVGPITDFAIGESFDPAS 554
Query: 528 ASATGISKQSNYELV---------------------------ELPGCKGIWTVYHKSSRG 560
S Q + E+V +L GCK WT+YH+S
Sbjct: 555 VSMAEQEGQRSVEIVTCSGQGKNGSLCVLQHGVRPELVHASADLAGCKAFWTLYHRSEER 614
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT-ESVDYFVQGRTIAAGNLFG 619
++ EYHAYL++S E +T V+ D L E++ E D+ V T+ AGNLF
Sbjct: 615 QGEEA--------EYHAYLLLSEEEQTRVI-AGDGLDELSNEETDFNVAAPTVDAGNLFE 665
Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
+ R++QV + G +LDG TQ + S + + SIADPYVL+ M+DG+
Sbjct: 666 QTRIVQVHQHGLILLDGVKATQRI------------STPGQIAAASIADPYVLVLMADGA 713
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
+RL DP++ + VQT + + + L++ G A+
Sbjct: 714 LRLYFADPTSSKL-VQTSLQNIHEVRDIMAMHLFY---------------------GGAM 751
Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
G D+ I++ + ++G L+I+ VP F+ VF+ ++ +G I + MR + +
Sbjct: 752 RGKKARTNDE--IFAAIAKDNGRLDIYSVPEFDLVFSAERAANGPRLINNVLMRPPPQSA 809
Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
+ ++ + S ++ E+A+ S P LF L +G +L Y+ +L
Sbjct: 810 AAQQSADTT------------SARIAEIALHSIGNIPSLPHLFLYLDNGELLLYRGFL 855
Score = 305 bits (780), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 191/539 (35%), Positives = 277/539 (51%), Gaps = 62/539 (11%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
+RI F + G F+SGS P W R R++P D + AF HN NC HGFIY
Sbjct: 967 RRIHYFGTVGKSNGVFISGSAPAWVFAQRGYARLYPMKLDTFVRAFAEFHNANCPHGFIY 1026
Query: 972 VTSQGILKICQLPSGSTYDNYWP----VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
+G LKICQLP+ +W V+K+PL TP +I Y Y +V++
Sbjct: 1027 FNHEGTLKICQLPAAEGA-IHWELPGVVRKVPLGRTPREIAYHPPSRTY--VVALATPVT 1083
Query: 1028 LNQVLSLLIDQE-------------VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
D E +G + + E +E+ ++ P
Sbjct: 1084 TVVPTPPETDMERQEREREEEESREMGIEPEEKQRDMGPREIAMMEERHELHLISPRT-- 1141
Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL----LAIGTAYVQGEDVAARGRVL 1130
WQ + ++ E+ LT+ V+ L + ++ N L L I V GE+
Sbjct: 1142 --WQILHHVELEPKEHVLTLSVLKLGDNYSQVNRELRPPHLLIYEIDVTGEE-------- 1191
Query: 1131 LFSTGRNADNPQNLVTEVYSKELK-----GAISALASLQGHLLIASGPKIILHKWTGTEL 1185
Q +T Y K +K G +SA ASLQG+L+IA GPKI + + G
Sbjct: 1192 -----------QCKLTMAYQKPMKEKPMKGPVSAAASLQGYLIIAVGPKIWVFNFDGGST 1240
Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
+AFYDAP Y+VS+ +KNF+L GDI+KSI+FL WK+ +QL LLAKD G + FATE
Sbjct: 1241 EAVAFYDAPH-YIVSIKTLKNFVLCGDIYKSIFFLRWKDSASQLALLAKDVGRVSVFATE 1299
Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
+++D L+L++SDE++N+Q+ YAP +ES GQ L+ R +F+VG + KF+RL M
Sbjct: 1300 YVVDKQNLALLMSDERQNLQVTAYAPHTAESRGGQLLVPRGDFNVGQSINKFVRLPMTLP 1359
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
S G+ R AL FGTL G +G +AP+DE FRRL LQ L+ ++PH AGL+
Sbjct: 1360 S--------GTTSLQRHALWFGTLSGGVGYLAPMDESVFRRLGMLQSALLSAIPHTAGLH 1411
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
P+++R + R +I+D LLS Y L Q +IA + GT+R +IL++L +
Sbjct: 1412 PQAYRALQRE-RLLRNRKHTILDGLLLSRYLALDSATQQQIALKLGTSRERILNDLQGI 1469
>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 820
Score = 345 bits (884), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 220/683 (32%), Positives = 344/683 (50%), Gaps = 48/683 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 162 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 217
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 218 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLVYEAF----PHDSQLGQGNLK 267
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ R + R S+ + E R F++I G+ G F+ G
Sbjct: 268 VRFKKVPHNINF-REKKPRPSKKKAEGGAEEGPGARGRVARFRYFEDIYGYSGVFICGPS 326
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 327 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 386
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S I + G + + +
Sbjct: 387 PWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNTPCTRIPRMTGEEKEFETIER 439
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 440 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 495
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 496 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 555
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 556 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 614
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 615 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 674
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 675 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 727
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 728 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYL 787
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 788 STMERGELAKKIGTTPDIILDDL 810
>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata TFB-10046
SS5]
Length = 1422
Score = 342 bits (878), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 356/1475 (24%), Positives = 647/1475 (43%), Gaps = 198/1475 (13%)
Query: 52 IGPVPNLVVTAANVIEIYVVRVQ-------------EEGSKE---SKNSGETKRRVLMDG 95
+G NLVV N++ ++ VR++ E+G GE + V +G
Sbjct: 40 LGVATNLVVARQNLLRVFEVRIEAAPLPSQEKLLADEQGRGRRGMEAVEGEVEMDVGGEG 99
Query: 96 ISAAS------------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAF 137
+A L LV +RLHG V L + Q A + D ++++F
Sbjct: 100 FVSAGIVKSAGQHARQRQRTVTRLYLVRQHRLHGIVTGLGRV-QTMASLEDKLDRLLVSF 158
Query: 138 EDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLV 196
+DAKI++LE+ + H L S+H +E +P+ L R +++DP RC + +
Sbjct: 159 KDAKIALLEWSEVSHDLSTISIHTYERAPQMLAFDSARALTE----LRIDPNSRCAALTL 214
Query: 197 YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFV 252
G + IL + + L D D GG S + S +++L ++D ++++ D F+
Sbjct: 215 PGDAVAILPFYESQAELDMDVDQ----GGVSRDVPYSPSFILSLPEVDNDIRNIIDIAFL 270
Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
G+ P + +L E + TW GR++ T + L++ + +P+I S LP+D+ +L+
Sbjct: 271 PGFNNPTLAVLFETQRTWTGRLAEFKDTVRLRILTLDVVTRTYPIIGSVDGLPYDSMRLV 330
Query: 313 AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
A P+ +GGV+V+ AN + + QS + A+A N +A + S P L+ + A +
Sbjct: 331 ACPAALGGVIVLTANAVLHIDQSGKNVAVAANGWAARV-SEFPTPAPERDETLEGSRAVF 389
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ + LL + G +V + ++ +GR+V ++D+ + + + + + + + L +GS G
Sbjct: 390 VSDKTFLLVYRDGSIVPVELILEGRMVTKIDMGQRLAQTTIPTVVCAVQDDLVLVGSTAG 449
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-----DMVNGEELS 485
S+L++ T E DI DA S + ++ ++ D ++ E+
Sbjct: 450 PSILLKVT-------------HEEEDITPDAGSARENGAANGNSTNGATYDDPMDSEDED 496
Query: 486 LYGSASNNTESAQKTFS--------------FAVRDSLVNIGPLKDFSYGLRINAD---- 527
LYG S T + A+ DSL GP+ D ++ L N +
Sbjct: 497 LYGGTDMMVTSTSGTLTVGGTAALEKRRILRLALADSLCGHGPISDMAFILGRNGERHVP 556
Query: 528 -------ASATG--------ISKQSNYELVELPGCKGIWTVYHKSS---RGHNADSSRMA 569
TG + + +L + G +G+W+ + + G N + A
Sbjct: 557 ELLAGVGVGHTGGLARFQRDLPARVKRKLHRISGNRGVWSFPVRRAVKVAGMNIERPTGA 616
Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ--GRTIAAGNLFGRRRVIQVF 627
A D +I+S +A + + + + VD + TI AG F R ++QV
Sbjct: 617 ADWD----TVIVSTDATPSPGLSRVAVKDSSTDVDILTRLPAITIGAGPFFQRTAILQVV 672
Query: 628 ERGARIL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
R+L DGS + +DL + + + + SI+DP+V++ D ++ L
Sbjct: 673 NNAIRVLEADGSERQVIKDLD---------GTTPRAKIRACSISDPFVVVVREDDTLGLF 723
Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK------GPEPWLR-KTSTDAWLSTGVG 736
VG+ + + + + + + Y D G L+ K +A ST +
Sbjct: 724 VGETGKGKLRRKDMSMLGDKASRYLAASFYQDHSGLFQVGTARSLKGKEKANAPASTTIE 783
Query: 737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL 796
A+D +G + V+C G +EI+ +P VF+ + DT+
Sbjct: 784 AAMDEG------RGSQWLVLCRPQGVVEIWALPKLTLVFSCGGVSDIPPVMADTF----- 832
Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ S ++ Q ++ E+ + RP L +L GT+ Y
Sbjct: 833 ---DLATPSPVQDPPRQAEDHDVE-----EILISPIGETTPRPHLLVLLRSGTVAVYDTA 884
Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
E P PV+T R + ++ R+ + P++ + P AP I
Sbjct: 885 PVELP--------PVATGREAGL-QLAFVRIMSRAVDTAPIERAEKRGAP--APRHLIPF 933
Query: 917 FKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
++S G FL+G +P W + + +R+ P + + + +FT F+ T +
Sbjct: 934 STSVS---GVFLTGGKPGWILGTDKTAVRLVPAV-NQVVHSFTACSLWGNRGEFLMNTDE 989
Query: 976 GILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
G + +P D P +P + P+ T A + ++++ L+ +
Sbjct: 990 GPCLVEWMPD-LRLDEELPSFFMP-RGRPY--TSIAYEATTGMVIAASSLRS-----RFV 1040
Query: 1036 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1095
+ E G+ + + + T + + +++P+ W T +E TVR
Sbjct: 1041 LFDEDGNTVWKPDAEFIS---DPTTDTSSLELIDPET----WTTVDGFEFAFNEMINTVR 1093
Query: 1096 VVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNLVTEVY---S 1150
V L +T+ ++ +A+GT +GED+A +G +F D+ Q ++
Sbjct: 1094 TVNLETVSTEAGSKDFIAVGTTVFRGEDLAVKGATYIFEVIEVVPDDTQQRRHKLKLWCR 1153
Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFIL 1209
E KG +SAL + G+L+ + G K+ + + E L G+AF D +YV SL +KN +L
Sbjct: 1154 DEAKGPVSALCGINGYLVSSMGQKVFVRAFDLNERLVGVAFMDV-GIYVTSLRTLKNLLL 1212
Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
+GD KS++F++++E +L LL KDF + EF +++V +DEQ ++IF Y
Sbjct: 1213 IGDAVKSVWFVAFQEDPFKLQLLGKDFQRAALTSAEFFFGFGEMTIVSTDEQNVLRIFRY 1272
Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
P +E+ GQKLL + EF+ + +L +SD P S +++
Sbjct: 1273 DPMHAEAQDGQKLLCQTEFNTQSDARG--TTTILRRTSDEDILLPQS------KIMYCGT 1324
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
DGS+ + P++E F+RL LQ ++ ++ HVAGL+P++FR ++ A RP I+D
Sbjct: 1325 DGSLSALLPVEEHVFKRLHLLQGQMTRNIQHVAGLHPKAFRVVRNDFTA-RPLARGILDS 1383
Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
LL+ +E LPL Q+E Q G +R IL + L
Sbjct: 1384 NLLAKFEELPLSRQVEFTKQIGQSREVILGDWMQL 1418
>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Sarcophilus harrisii]
Length = 1449
Score = 342 bits (876), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 225/703 (32%), Positives = 352/703 (50%), Gaps = 81/703 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ ESGA+EI+ +P++ VF V F G+ +VD+ + T+ ++ EE T
Sbjct: 790 WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L ++ +RP+L + D +L Y+A+ +
Sbjct: 846 QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + S L+ +RF + P + RE+ P + R
Sbjct: 887 -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGAEEGAGARGRVARFR 938
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 939 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S L
Sbjct: 999 QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNALCT 1051
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
I + G + + + D + E + ++++ P W+ A I ++ E+
Sbjct: 1052 RIPRMTGEEKEFETIERDDRYIHPLQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1107
Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1108 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1167
Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKN
Sbjct: 1168 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1226
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ +
Sbjct: 1227 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSAQLGFLVSDRDRNLMV 1286
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
+ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K N+
Sbjct: 1287 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPTKKSIVWENK 1339
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1340 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1399
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+++D ELL+ Y L E+ E+A + GTT IL +L ++
Sbjct: 1400 AVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEI 1442
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 224/687 (32%), Positives = 353/687 (51%), Gaps = 99/687 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S +S + E K + LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303
Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
+S + P + + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
K SVLT+ + T+ FLGSRLG+SLL+++T SS + ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSAREAPSREVSDKD 423
Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539
Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
+WTV ++++G + SS D + H +LI+S E TM+L+T +
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGPEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675
>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Felis catus]
Length = 1432
Score = 340 bits (873), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 226/700 (32%), Positives = 350/700 (50%), Gaps = 81/700 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G LEI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 773 WCLLVRENGTLEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 828
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 829 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 871
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
L N+ +RF + P + RE E GA + R
Sbjct: 872 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 921
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 922 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 981
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S + P +
Sbjct: 982 QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNM--PCTR---- 1035
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
I + G + + + D + E + ++++ P W+ A I ++ E+
Sbjct: 1036 -IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1090
Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1091 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1150
Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKN
Sbjct: 1151 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1209
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ +
Sbjct: 1210 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1269
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
+ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K N+
Sbjct: 1270 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENK 1322
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1323 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1382
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1383 AVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILEDL 1422
Score = 300 bits (769), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 211/631 (33%), Positives = 334/631 (52%), Gaps = 76/631 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 57 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 112
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 113 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 166
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 167 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 226
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 227 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 286
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG R
Sbjct: 287 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 346
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
V+ K SVLT+ + T+ FLGSRLG+SLL+++T +S ++E
Sbjct: 347 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA--- 402
Query: 457 IEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDS 508
+ + P +K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS
Sbjct: 403 ADKEEPPSKKKRVDSTVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDS 458
Query: 509 LVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---E 543
++NIGP + + G L I + + + + K ++V E
Sbjct: 459 ILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFE 518
Query: 544 LPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLETA 593
LPGC +WTV ++S+G A+ S + A DD H +LI+S E TM+L+T
Sbjct: 519 LPGCYDMWTVIAPVRKEQEETSKGEGAEQEPSTLEAEDDGRRHGFLILSREDSTMILQTG 578
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 579 QEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL--- 631
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 632 ----GSPIVQCAVADPYVVIMSAEGHVTMFL 658
>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Monodelphis domestica]
Length = 1449
Score = 340 bits (873), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 224/703 (31%), Positives = 352/703 (50%), Gaps = 81/703 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ ESGA+EI+ +P++ VF V F G+ +VD+ + T+ ++ EE T
Sbjct: 790 WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L ++ +RP+L + D +L Y+A+ +
Sbjct: 846 QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + S L+ +RF + P + RE+ P + R
Sbjct: 887 -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGTEEGAGARGRVARFR 938
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 939 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S L
Sbjct: 999 QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNALCT 1051
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
I + G + + + + + E + ++++ P W+ A I ++ E+
Sbjct: 1052 RIPRMTGEEKEFETIERDERYIHPLQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1107
Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1108 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1167
Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKN
Sbjct: 1168 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1226
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ +
Sbjct: 1227 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSAQLGFLVSDRDRNLMV 1286
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
+ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K N+
Sbjct: 1287 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSIVWENK 1339
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1340 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1399
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+++D ELL+ Y L E+ E+A + GTT IL +L ++
Sbjct: 1400 AVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEI 1442
Score = 306 bits (783), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 224/687 (32%), Positives = 354/687 (51%), Gaps = 99/687 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S +S + E K + LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
K HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303
Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
+S + P + + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
K SVLT+ + T+ FLGSRLG+SLL+++T S+ + ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAAREAPSREVSDKD 423
Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539
Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
+WTV ++++G A+ SS D + H +LI+S E TM+L+T +
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGAEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675
>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
putorius furo]
Length = 1399
Score = 340 bits (871), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 218/683 (31%), Positives = 344/683 (50%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 741 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 796
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 797 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 846
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 847 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAAARGRVARFRYFEDIYGYSGVFICGPS 906
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 907 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 966
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S + P + I + G + + +
Sbjct: 967 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFEAIER 1019
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1020 DDRYVHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1075
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1076 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1135
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1136 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1194
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1195 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1254
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1255 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1307
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H++ +A + +++D ELL+ Y L
Sbjct: 1308 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRLLHADRRALQNAVRNVLDGELLNRYLYL 1367
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1368 STMERGELAKKIGTTPDIILDDL 1390
Score = 298 bits (762), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 210/635 (33%), Positives = 329/635 (51%), Gaps = 81/635 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 22 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 77
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVL----VYGLQMIILKASQGGSGLVGD 216
FE PE L+ G P V+VDP GRC +L +YG ++++L + + +
Sbjct: 78 YFEEPE---LRDGFVQNVHAPRVRVDPDGRCAAMLTAMLIYGSRLVVLPFRRES---LAE 131
Query: 217 EDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRV 274
E G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV
Sbjct: 132 EHEGLMGEGQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRV 191
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +Q
Sbjct: 192 AVRQDTCCIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQ 251
Query: 335 SA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
S +ALN + + + LD A A ++ D ++S K G++ +LT++
Sbjct: 252 SVPPYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLIT 311
Query: 394 DG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++ T L
Sbjct: 312 DGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY-----TEKLQEAPAG 366
Query: 453 EFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
+ + D P +K+ R S+ A QD V+ E+ +YGS A + T+ A T+SF
Sbjct: 367 AVRETDKDEPPSKKKRVESAVGWSGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 422
Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
V DS++NIGP + + G L I + + + + K ++V
Sbjct: 423 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 482
Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
ELPGC +WTV + + G + S + A DD H +LI+S E TM+
Sbjct: 483 TTFELPGCYDMWTVIAPARKEQEETPKGDGAEQEPSALEADDDGRRHGFLILSREDSTMI 542
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 543 LQTGQEIMELDTS-GFATQGPTVFAGNIGDGRYIVQVSPLGIRLLEG---VSQLHFIPVD 598
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 599 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 626
>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Bos taurus]
Length = 1490
Score = 340 bits (871), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 217/683 (31%), Positives = 343/683 (50%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 831 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 886
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 887 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 936
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 937 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 996
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 997 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1056
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P +V + G + + +
Sbjct: 1057 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIER 1109
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
+ + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1110 DERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1165
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1166 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1225
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1226 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1284
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1285 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1344
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1345 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1397
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1398 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYL 1457
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1458 STMERGELAKKIGTTPDIILDDL 1480
Score = 295 bits (755), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 209/635 (32%), Positives = 327/635 (51%), Gaps = 82/635 (12%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+
Sbjct: 114 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 169
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLV 214
H FE PE L+ G P V+VDP GRC +L+YG ++++L ++ GLV
Sbjct: 170 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 226
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
G+ G + S++I++R LD K ++ D F+HGY EP ++IL E TW G
Sbjct: 227 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 278
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
+V+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV++ N++ Y
Sbjct: 279 KVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 338
Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
+QS +ALN+ + + + LD A A ++ D ++S K G++ +LT+
Sbjct: 339 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 398
Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
+ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+
Sbjct: 399 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 457
Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
E D E KR+ + S QD V+ E+ +YGS A + T+ A T+SF
Sbjct: 458 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 512
Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
V DS++NIGP + + G L I + + + + K ++V
Sbjct: 513 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 572
Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
ELPGC +WTV + G + A DD H +LI+S E TM+
Sbjct: 573 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 632
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 633 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 688
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 689 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 716
>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Canis lupus familiaris]
Length = 1460
Score = 339 bits (869), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 217/683 (31%), Positives = 342/683 (50%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 801 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 856
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 857 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 906
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 907 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 966
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 967 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1026
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S + P + I + G + + +
Sbjct: 1027 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFETIER 1079
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1080 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1135
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1136 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1195
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1196 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1254
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1255 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1314
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1315 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1367
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1368 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1427
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1428 STMERGELAKKIGTTPDIILDDL 1450
Score = 301 bits (771), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 212/629 (33%), Positives = 331/629 (52%), Gaps = 72/629 (11%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 85 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 140
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 141 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 194
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 195 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 254
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 255 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 314
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG R
Sbjct: 315 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 374
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+ E D
Sbjct: 375 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAAD 432
Query: 457 IEADAPSTKRLRRSS-----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLV 510
E KR+ ++ QD V+ E+ +YGS A + T+ A T+SF V DS++
Sbjct: 433 KEEPPSKKKRVDCAAGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSIL 488
Query: 511 NIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELP 545
NIGP + + G L I + + + + K ++V ELP
Sbjct: 489 NIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELP 548
Query: 546 GCKGIWTVY-------HKSSRGHNA--DSSRMAAYDD-EYHAYLIISLEARTMVLETADL 595
GC +WTV ++S+G A +SS + A DD H +LI+S E TM+L+T
Sbjct: 549 GCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQTGQE 608
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 609 IMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL----- 659
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 660 --GSPIVQCAVADPYVVIMSAEGHVTMFL 686
>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
taurus]
gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
Length = 1444
Score = 338 bits (868), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 217/683 (31%), Positives = 343/683 (50%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 891 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 950
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 951 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1010
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P +V + G + + +
Sbjct: 1011 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIER 1063
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
+ + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1064 DERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1119
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1120 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1179
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1180 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1238
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1239 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1298
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1299 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1351
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1352 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYL 1411
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1412 STMERGELAKKIGTTPDIILDDL 1434
Score = 304 bits (779), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 221/678 (32%), Positives = 345/678 (50%), Gaps = 86/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 309
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 369
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T S+ E D E KR+
Sbjct: 370 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRV 427
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 428 DATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 484 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 543
Query: 557 SSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 544 VRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFA 602
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + S ++ ++
Sbjct: 603 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAV 652
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 653 ADPYVVIMSAEGHVTMFL 670
>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Callithrix jacchus]
Length = 1394
Score = 338 bits (868), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 216/683 (31%), Positives = 342/683 (50%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 735 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 790
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ S +
Sbjct: 791 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLSQGNLK 840
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 841 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 900
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 901 PHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 960
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 961 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIER 1013
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T +
Sbjct: 1014 DERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLK 1069
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P+ +T +Y KE KG ++AL
Sbjct: 1070 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVTEPRQTLTXXKFKVLYEKEQKGPVTALCHC 1129
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1130 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1188
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1189 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1248
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GA G K N+ F TLDG IG + P
Sbjct: 1249 RRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVMWENKHITWFATLDGGIGLLLP 1301
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1302 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYL 1361
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1362 STMERSELAKKIGTTPDIILDDL 1384
Score = 267 bits (682), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 200/672 (29%), Positives = 324/672 (48%), Gaps = 124/672 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSAEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ ++ F C +G +
Sbjct: 367 ASVLTTSVSGTEG----------------FLCAAGGKSVP-------------------- 390
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------ 521
QD +E+ +YGS + + + T+SF V DS++NIGP + + G
Sbjct: 391 --------QD--EXDEIEVYGSETQSG-TQLATYSFEVCDSILNIGPCANAAMGEPAFLS 439
Query: 522 ----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------H 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 440 EEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEE 499
Query: 556 KSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
++ +G + S+ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 500 ENPKGEGTEQEPSTPEADDDSRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 558
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV+
Sbjct: 559 FAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFVPVDL-------GAPIVQCAVADPYVV 608
Query: 673 LGMSDGSIRLLV 684
+ ++G + + +
Sbjct: 609 IMSAEGHVTMFL 620
>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
[Mus musculus]
gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
Length = 1441
Score = 338 bits (868), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1348
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1409 STMERSELAKKIGTTPDIILDDL 1431
Score = 307 bits (787), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Otolemur garnettii]
Length = 1441
Score = 338 bits (867), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 224/700 (32%), Positives = 348/700 (49%), Gaps = 81/700 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 880
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
L N+ +RF + P + RE T GA + R
Sbjct: 881 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGVRGRVARFR 930
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 931 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 990
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S P +
Sbjct: 991 QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR---- 1044
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
I + G + + + D + E + ++++ P W+ A I ++ E+
Sbjct: 1045 -IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1099
Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1100 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1159
Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKN
Sbjct: 1160 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1218
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ +
Sbjct: 1219 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1278
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
+ Y P+ ES+ G +LL RA+FHVGAHV F R + GA G K N+
Sbjct: 1279 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGPSKKSVVWENK 1331
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1332 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQN 1391
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1392 AVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1431
Score = 300 bits (767), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 217/678 (32%), Positives = 343/678 (50%), Gaps = 89/678 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEVLTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASATRESADKEEPPSKKKRV 424
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 425 DPSVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAS 540
Query: 555 -----HKSSRGHNADSSR---MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
++ +G + D H +LI+S E TM+L+T + E+ S +
Sbjct: 541 VRKEEEETPKGEGTEQESGVPEGEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667
>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cricetulus griseus]
Length = 1441
Score = 338 bits (867), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLP 1348
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1409 STMERSELAKKIGTTPDIILDDL 1431
Score = 303 bits (775), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
++ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
griseus]
Length = 1419
Score = 338 bits (867), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 760 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 815
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 816 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 865
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 866 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 925
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 926 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 985
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 986 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1038
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1039 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1094
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1095 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1154
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1155 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1213
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1214 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1273
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1274 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLP 1326
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1327 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1386
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1387 STMERSELAKKIGTTPDIILDDL 1409
Score = 299 bits (765), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 82/671 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA--DAPSTKRLRRS 470
+ + T+ FLGSRLG+SLL+++T SS ++E A + P +K+ R
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS-VREAADKASAHNEEPPSKKKRVD 430
Query: 471 SSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 PTAGWTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGE 486
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 546
Query: 555 ----HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
++ R + + ++ A D H +LI+S E TM+L+T + E+ S +
Sbjct: 547 RKEEEEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 605
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 606 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 655
Query: 668 DPYVLLGMSDG 678
DPYV++ ++G
Sbjct: 656 DPYVVIMSAEG 666
>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
Length = 1439
Score = 338 bits (867), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 218/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1006 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1058
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1059 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1114
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1115 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1174
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1175 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1233
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1234 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1293
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1294 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLP 1346
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1347 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1406
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1407 STMERSELAKKIGTTPDIILDDL 1429
Score = 308 bits (789), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/671 (32%), Positives = 345/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429
Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 EFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
Length = 1431
Score = 337 bits (864), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 378/1513 (24%), Positives = 641/1513 (42%), Gaps = 248/1513 (16%)
Query: 57 NLVVTAANVIEIY-VVRVQ------EEGSKESKNSGETKRRVLMDGI---------SAAS 100
NL+V + ++++ +VRVQ E+ S N + M+ + +
Sbjct: 29 NLIVAKDDFLQVFDIVRVQRDSDDVEDAFGSSMNLRMEENDAFMETNMQLIRTHEHTVYT 88
Query: 101 LELVCHYRLHGNVESLAILSQ--GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L LV R+ G ++ LA++ GG D ++L AK+S+L +D L S
Sbjct: 89 LRLVYQTRVFGTIKDLAVVKPKLGGFTT----DLLVLLTNYAKVSILVWDSLTQQLSTVS 144
Query: 159 MHCFES---PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
MH +ES P+ + E+ A+ + VDP+ C + YG M I+ + +
Sbjct: 145 MHYYESVVPPKPI----AEETPAQ---LIVDPESTCCVLRFYGDMMAIIPFRKPEDLEME 197
Query: 216 DEDTFGSGG-GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
D + S V+ LD + V D F+ GY E + +L+ E T
Sbjct: 198 DANAQSEKPVDVQCVYLPSFVLTASQLDYSIARVLDSKFLEGYREATLALLYCPEQTSTV 257
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY- 331
+ + T ++ +++ + +I S NLP+D Y +L VP+P+GG L++G N I Y
Sbjct: 258 FLPVRKDTVSLAVITLDIEQRASAVITSIHNLPYDIYCILPVPAPLGGSLLLGGNEIIYV 317
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDL 386
S ++ +A+N + + + Q RSSF +EL+ L ++ LL TG L
Sbjct: 318 DSAGSTVGIAVNPFYRNATNFQLEDRSSFQLELEGTIGVPLSSPRTESVSVLLIHPTGQL 377
Query: 387 VLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTC 439
L + DG+ V+ LDL + N ++L S +T + + FLGS+ GDS LVQ++
Sbjct: 378 FYLDFLMDGKNVKNLDLHPASDELNNALLQSGVTCALPVADHELFLGSQTGDSYLVQWSR 437
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
S + + D E DA D L D +Y + S K
Sbjct: 438 RSINNQTQEEGTLTYKDEENDA-------DEEVDELDD--------IYDTGSKEKAKRNK 482
Query: 500 -----TFSFAVRDSLVNIGPLKDFSYG---------------LRINADASATGIS----- 534
V D L N+GP+ +F G L + + TG S
Sbjct: 483 FVELGPLRLEVHDVLSNVGPIIEFCTGKAGSLAYFPQDNHGPLEVTC-VTGTGKSGSLVV 541
Query: 535 -KQSNYELVE----LPGCKGIWTVYHKSSRGHNADSSRMAAY-DDEYHAYLIISLEARTM 588
++S +VE GC+ +WT+ H + R N S Y DDEY YL++S E +
Sbjct: 542 FRRSISPVVEGKFNFEGCQSLWTI-HVTGRLKNPRSHGSERYLDDEYDTYLVVSKEKESF 600
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGP 647
V + EV +S D+ +G TI G L G R++Q+ R+ D + ++ Q ++ G
Sbjct: 601 VFTAGETFDEVEDS-DFNTKGSTINVGGLLGGMRIVQICTTSLRVYDPNIHLVQRINLG- 658
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
+ V++ S+ DPYV+L + G I L D T + + K V
Sbjct: 659 ---------KKQNVVAASVCDPYVVLVLLGGRILLYSMDAETQRL---IKMDLHKQLKNV 706
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWL-----STGVGE-AIDGADGGP--------------- 746
+ +LY +P +++ ++ L S G + +DG D P
Sbjct: 707 KAASLYSTN--DPVMQELFSELDLGRNNSSPGKSDIQMDGVDTQPDRPSMPAGNQVTETN 764
Query: 747 ---LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
LD+ + V ++ G L++ +P ++CV D F + T + + L
Sbjct: 765 VSTLDEQSFAAHFVLFVLHDDGRLKVLHLPTYSCVLECDVF------DLPTVLYDGLSSE 818
Query: 800 E-TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
TE++ SS+E +VE+ L I Y+ ++
Sbjct: 819 RVTEMHESSQE--------------LVEVLATDLGDEAKEAHLLIRSRMNEITVYKPFVC 864
Query: 859 EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET------------- 905
P T K++ LRFS+ P + TRE T
Sbjct: 865 SNPV-THKTE---------------------LRFSKIPQEGMTRESTECSLQDLVAETEQ 902
Query: 906 ---PHGAPCQ------------RITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQL 949
P A Q R+ + I H F++G++P + + + + HP L
Sbjct: 903 ENAPKDASEQKPQKSSSTVDKPRMVALQRIGNHSAVFITGAKPFFLLKTAHSVAKFHPLL 962
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
+ I++ H + G+I+V + IC+ YD+ W +K+ + + H I Y
Sbjct: 963 SECRILSLASFHTEHAPKGYIFVDENYDINICRFQDDINYDHRWGYKKVNVGRSVHGIAY 1022
Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
K +Y + S L P + E G+ + L RT + + ++
Sbjct: 1023 HPTKMVYAIATST--LTPYE------VTDEEGNVVYPLKNEGEYLPRTNS---GMLELVS 1071
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
P W E L VR+V L + TK + +A+GT+ +GED+A RG
Sbjct: 1072 P----LTWTVIDRYKFLDYEIPLCVRLVNLEISDVTKLRKPFIAVGTSITKGEDIAVRGS 1127
Query: 1129 VLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
LF P + T V +E+KG ++ ++ + G+LL G K+I+
Sbjct: 1128 TYLFEIIDVVPQPGHPETRHKLKLVTREEIKGTVAVVSEINGYLLSGQGQKVIVRALEDE 1187
Query: 1184 E-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
+ L G+AF D VV+ ++ +N ++ GDI +SI F+ + E+ ++ L AK L
Sbjct: 1188 DHLVGVAFIDLGSYTVVAKSL-RNLLIFGDIRQSISFVGFAEEPYRMTLFAKGQDPLSVS 1246
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
+ +FL+ G +L V+D + N++I Y P+ ES G++L++R + HVG H+ + L +
Sbjct: 1247 SADFLVQGQSLYFAVADMRGNLRILAYDPENPESHSGERLVTRGDIHVG-HIITAIHL-V 1304
Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
DR G D+ + FA + DGS+ + P+ E +RRL +Q L + + V
Sbjct: 1305 PKMKKDRPGEV-DYDEGDEFACITTNSDGSLQALCPISERVYRRLNIIQNYLANRIETVG 1363
Query: 1363 GLNPRSFRQFHS----NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
GLNPRS+R ++ N HR I+D L+ H+ + + + E+A++ G S I+
Sbjct: 1364 GLNPRSYRLINTVSSLNNATHR-----ILDGGLIEHFSYMSVAHRQEMAYKCGVPISTIM 1418
Query: 1419 SNLNDLALGTSFL 1431
++L +L +++
Sbjct: 1419 NDLVELDEALNYM 1431
>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Desmodus rotundus]
Length = 1444
Score = 337 bits (863), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 223/700 (31%), Positives = 346/700 (49%), Gaps = 81/700 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAFAHD------------- 881
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
S + L+ +RF + P + RE+ P + R
Sbjct: 882 -------SQLGQGNLK-VRFKKVPHNINFREKKPKPSKKKADGGGAEEGAGARGRVARFR 933
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 934 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 993
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S P +
Sbjct: 994 QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR---- 1047
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
I + G + + + D + E + ++++ P W+ A I ++ E+
Sbjct: 1048 -IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVT 1102
Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1103 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1162
Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKN
Sbjct: 1163 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1221
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL D+ KSI L ++E L+L+++D L+ ++ +F++D + L +VSD +N+ +
Sbjct: 1222 FILAADVMKSISLLRYQEDSKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1281
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
+ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K N+
Sbjct: 1282 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENK 1334
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1335 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQN 1394
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+I+D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1395 AVRNILDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1434
Score = 300 bits (767), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 216/675 (32%), Positives = 346/675 (51%), Gaps = 80/675 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTSGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRTTLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---- 468
S + T+ FLGSRLG+SLL+++T +S ++E + + PS+K+ R
Sbjct: 375 SSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEAPASAVREA---ADKEEPSSKKKRVDPT 430
Query: 469 ---RSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 VGWSGGQSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 FLSEEFQNSPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVVAPVRK 546
Query: 555 --HKSSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
++ +G + + D H +LI+S E TM+L+T + E+ S + QG
Sbjct: 547 EQEETPKGEGTEQEPITPETEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + S ++ ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655
Query: 670 YVLLGMSDGSIRLLV 684
V++ ++G + + +
Sbjct: 656 CVVIMSAEGHVAMFL 670
>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Equus caballus]
Length = 1450
Score = 336 bits (862), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 217/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 791 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 846
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 847 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 896
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 897 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 956
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 957 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1016
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1017 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIER 1069
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1070 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1125
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1126 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1185
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1186 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1244
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1245 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1304
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1305 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1357
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1358 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1417
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1418 STMERGELAKKIGTTPDIILDDL 1440
Score = 303 bits (775), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 219/681 (32%), Positives = 350/681 (51%), Gaps = 86/681 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ + T+ FLGSRLG+SLL+++T +S ++E E + P +K+ R S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430
Query: 473 -------------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP +
Sbjct: 431 VGWSGSPRAAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANA 486
Query: 519 SYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
+ G L I + + + + K ++V ELPGC +WTV
Sbjct: 487 AMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 546
Query: 554 Y-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
++ +G + S+ A D H +LI+S E TM+L+T + E+ S
Sbjct: 547 IAPVRKEQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS- 605
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ QG T+ AGN+ R ++QV G R+L+G L F P + S ++
Sbjct: 606 GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQ 655
Query: 664 VSIADPYVLLGMSDGSIRLLV 684
++ADPYV++ ++G + + +
Sbjct: 656 CAVADPYVVIMSAEGHVTMFL 676
>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like isoform 1 [Equus caballus]
Length = 1444
Score = 336 bits (862), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 217/683 (31%), Positives = 341/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 785 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 841 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 891 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 950
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 951 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1010
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1011 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIER 1063
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1064 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1119
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1120 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1179
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1180 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1238
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1239 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1298
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1299 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1351
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1352 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1411
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1412 STMERGELAKKIGTTPDIILDDL 1434
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 219/675 (32%), Positives = 350/675 (51%), Gaps = 80/675 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ + T+ FLGSRLG+SLL+++T +S ++E E + P +K+ R S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430
Query: 473 -------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 431 VGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 487 FLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 546
Query: 555 --HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
++ +G + S+ A D H +LI+S E TM+L+T + E+ S + QG
Sbjct: 547 EQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + S ++ ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655
Query: 670 YVLLGMSDGSIRLLV 684
YV++ ++G + + +
Sbjct: 656 YVVIMSAEGHVTMFL 670
>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Saimiri boliviensis boliviensis]
Length = 1390
Score = 336 bits (862), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 223/700 (31%), Positives = 349/700 (49%), Gaps = 81/700 (11%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 731 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 787 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
LS N+ +RF + P + RE E GA + R
Sbjct: 830 ----LSQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSAEEGAGARGRVARFR 879
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG + +F HN+NC GF+Y
Sbjct: 880 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNR 939
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S P +
Sbjct: 940 QGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR---- 993
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1092
I + G + + + + + E + ++++ P W+ A I +Q E+
Sbjct: 994 -IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVT 1048
Query: 1093 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE---- 1147
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1049 CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1108
Query: 1148 -VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKN
Sbjct: 1109 VLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKN 1167
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ +
Sbjct: 1168 FILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMV 1227
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NR 1321
+ Y P+ ES+ G +LL RA+FHVGAHV F R + GA G K N+
Sbjct: 1228 YMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENK 1280
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1281 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQN 1340
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1341 AVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1380
Score = 273 bits (699), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 203/645 (31%), Positives = 322/645 (49%), Gaps = 105/645 (16%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--------------------------GGV 321
++ T K HP+IWS +LP D + LAVP PI GGV
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGENPGGAEGSAGRGAVSLPTSLCPPPGGV 226
Query: 322 LVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
++ N++ Y +QS +ALN+ + + + LD A AT++ D ++S
Sbjct: 227 VIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVIS 286
Query: 381 TKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 287 LKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTE 346
Query: 440 G----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNT 494
+++ +G KEE + +T QD V+ E+ +YGS A + T
Sbjct: 347 KLQEPPASAVREAGDKEEPPSKKKRVDATAGWSAGGKSVPQDEVD--EIEVYGSEAQSGT 404
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATG 532
+ A T+SF V DS++NIGP + + G L I + + +
Sbjct: 405 QLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSV 462
Query: 533 ISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYL 579
+ K ++V ELPGC +WTV + G + S A DD H +L
Sbjct: 463 LQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEENPKGEGTEQEPSTPEADDDSRRHGFL 522
Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
I+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 523 ILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNVGDDRYIVQVSPLGIRLLEG--- 578
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 579 VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616
>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cavia porcellus]
Length = 1440
Score = 335 bits (860), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 217/683 (31%), Positives = 340/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 781 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 836
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 837 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 887 VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 946
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 947 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1007 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIER 1059
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1060 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1115
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1175
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1176 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1234
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1235 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1294
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GA G K N+ F TLDG IG + P
Sbjct: 1295 RRADFHVGAHVNTFWR-------TPCRGATEGPSKKSVVWENKHITWFATLDGGIGLLLP 1347
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1348 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1407
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1408 STMERGELAKKIGTTPDIILDDL 1430
Score = 271 bits (694), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 209/678 (30%), Positives = 332/678 (48%), Gaps = 90/678 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + + A + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + I ++SV+E+D H L+ S+H FE PE L+ G
Sbjct: 83 ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 480 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 539
Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 540 VRKEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 598
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 599 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 648
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 649 ADPYVVIMSAEGHVTMFL 666
>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
[Mus musculus]
Length = 1450
Score = 335 bits (860), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 215/675 (31%), Positives = 337/675 (49%), Gaps = 47/675 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1348
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408
Query: 1399 PLEEQLEIAHQTGTT 1413
E+ E+A + GTT
Sbjct: 1409 STMERSELAKKIGTT 1423
Score = 307 bits (787), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
Length = 1365
Score = 335 bits (858), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 706 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 761
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 762 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 811
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 812 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 871
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 872 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 931
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 932 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 982
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 983 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1038
Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1039 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1098
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1099 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1157
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +
Sbjct: 1158 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1217
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
LL RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1218 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1270
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1271 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1330
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
L E+ E+A + GTT IL +L
Sbjct: 1331 YLSTMERSELAKKIGTTPDIILDDL 1355
Score = 287 bits (734), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 202/620 (32%), Positives = 322/620 (51%), Gaps = 80/620 (12%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYA 346
++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLT 226
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 227 TGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDK 286
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADA 461
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE +
Sbjct: 287 AAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRV 346
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + +
Sbjct: 347 DATAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAV 402
Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
G L I + + + + K ++V ELPGC +WTV
Sbjct: 403 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 462
Query: 555 --------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVD 604
+ G + S DD+ H +LI+S E TM+L+T + E+ S
Sbjct: 463 PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-G 521
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
+ QG T+ AGN+ R ++QV G R+L+G L F P + + ++
Sbjct: 522 FATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQC 571
Query: 665 SIADPYVLLGMSDGSIRLLV 684
++ADPYV++ ++G + + +
Sbjct: 572 AVADPYVVIMSAEGHVTMFL 591
>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Papio anubis]
Length = 1389
Score = 335 bits (858), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 215/683 (31%), Positives = 339/683 (49%), Gaps = 47/683 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 730 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 785
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 786 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 835
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 836 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGXRGRVARFRYFEDIYGYSGVFICGPS 895
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 896 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 955
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S I + G + + +
Sbjct: 956 PWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNTPCARIPRMTGEEKEFETIER 1008
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T +
Sbjct: 1009 DERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLK 1064
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1065 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1124
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1125 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1183
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1184 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1243
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GA G K N+ F TLDG IG + P
Sbjct: 1244 RRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLP 1296
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1297 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYL 1356
Query: 1399 PLEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1357 STMERSELAKKIGTTPDIILDDL 1379
Score = 272 bits (696), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 202/644 (31%), Positives = 321/644 (49%), Gaps = 104/644 (16%)
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 2 SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 55 VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-------------------------GGVL 322
++ T K HP+IWS +LP D + LAVP PI GGV+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGEYPGSGWGCVEGALSLPTSLCPPPGGVV 226
Query: 323 VVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
V N++ Y +QS +ALN+ + + + LD A AT++ D ++S
Sbjct: 227 VFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRIQEGVRITLDCAQATFISYDKMVISL 286
Query: 382 KTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 287 KGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEK 346
Query: 441 ----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTE 495
+++ + KEE + +T QD V+ E+ +YGS A + T+
Sbjct: 347 LQEPPASAVREAADKEEPPSKKKRVDATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQ 404
Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATGI 533
A T+SF V DS++NIGP + + G L I + + + +
Sbjct: 405 LA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVL 462
Query: 534 SKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
K ++V ELPGC +WTV + G ++ A DD H +LI
Sbjct: 463 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 522
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 523 LSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 578
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 579 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 615
>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Gorilla gorilla gorilla]
Length = 1440
Score = 334 bits (857), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 781 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 836
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 837 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 887 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 946
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 947 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1007 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1057
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1058 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1113
Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1114 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1173
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1174 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1232
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +
Sbjct: 1233 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1292
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
LL RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1293 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1345
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1346 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1405
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
L E+ E+A + GTT IL +L
Sbjct: 1406 YLSTMERSELAKKIGTTPDIILDDL 1430
Score = 287 bits (735), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 210/675 (31%), Positives = 336/675 (49%), Gaps = 84/675 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP G C +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGTCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T +S ++E + + P +K+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---ADKEEPPSKKK 422
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR---DSLVNIGPLKDFSYG--- 521
R ++ G + A + AV DS++NIGP + + G
Sbjct: 423 RVDATAGWSGEGRSRAGQERGQVTQGWSGAGAPLTVAVPQVCDSILNIGPCANAAMGEPA 482
Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 FLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 542
Query: 555 ----HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+ G + S A DD H +LI+S E TM+L+T + E+ S + QG
Sbjct: 543 EEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 601
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
T+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADP
Sbjct: 602 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADP 651
Query: 670 YVLLGMSDGSIRLLV 684
YV++ ++G + + +
Sbjct: 652 YVVIMSAEGHVTMFL 666
>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
[Bos grunniens mutus]
Length = 1417
Score = 334 bits (857), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 214/675 (31%), Positives = 339/675 (50%), Gaps = 47/675 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 775 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 830
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 831 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 880
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 881 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 940
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 941 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1000
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P +V + G + + +
Sbjct: 1001 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIER 1053
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
+ + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1054 DERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1109
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1110 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1169
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1170 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1228
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1229 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1288
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1289 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLP 1341
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1342 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYL 1401
Query: 1399 PLEEQLEIAHQTGTT 1413
E+ E+A + GTT
Sbjct: 1402 SPMERGELAKKIGTT 1416
Score = 280 bits (717), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 206/634 (32%), Positives = 320/634 (50%), Gaps = 89/634 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L SV+E+D H L+ S+H
Sbjct: 66 LELVASFSFFGNVMSMASVQLAGA----KRDALLL-------SVVEYDPGTHDLKTLSLH 114
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
FE PE L+ G P V+VDP GRC +L+YG ++++L ++ GLVG
Sbjct: 115 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVG 171
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ G + S++I++R LD K ++ D F+HGY EP ++IL E TW GR
Sbjct: 172 E--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGR 223
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
V+ + TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +
Sbjct: 224 VAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLN 283
Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
QS +ALN+ + + + LD A A ++ D ++S K G++ +LT++
Sbjct: 284 QSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLI 343
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+
Sbjct: 344 TDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA-- 401
Query: 452 EEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAV 505
E D E KR+ + S QD V+ E+ +YGS A + T+ A T+SF V
Sbjct: 402 REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEV 457
Query: 506 RDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV- 542
DS++NIGP + + G L I + + + + K ++V
Sbjct: 458 CDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVT 517
Query: 543 --ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVL 590
ELPGC +WTV + G + A DD H +LI+S E TM+L
Sbjct: 518 TFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMIL 577
Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
+T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 578 QTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL 633
Query: 651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 634 -------GSPIVQCAVADPYVVIMSAEGHVTMFL 660
>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 3 [Cricetulus griseus]
Length = 1449
Score = 334 bits (857), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 215/675 (31%), Positives = 337/675 (49%), Gaps = 47/675 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1060
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1061 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1116
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1176
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1177 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1235
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1236 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1295
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAP 1338
RA+FHVGAHV F R + GAA G K N+ F TLDG IG + P
Sbjct: 1296 RRADFHVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLP 1348
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1349 MQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYL 1408
Query: 1399 PLEEQLEIAHQTGTT 1413
E+ E+A + GTT
Sbjct: 1409 STMERSELAKKIGTT 1423
Score = 302 bits (774), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
++ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1442
Score = 334 bits (857), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 216/685 (31%), Positives = 341/685 (49%), Gaps = 51/685 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1009 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1059
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1060 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1115
Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1175
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1176 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1234
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +
Sbjct: 1235 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1294
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
LL RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1295 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1347
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1348 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1407
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
L E+ E+A + GTT IL +L
Sbjct: 1408 YLSTMERSELAKKIGTTPDIILDDL 1432
Score = 304 bits (779), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 219/677 (32%), Positives = 347/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G ++ A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
[Heterocephalus glaber]
Length = 1440
Score = 334 bits (856), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 216/682 (31%), Positives = 341/682 (50%), Gaps = 46/682 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 782 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SSGQPTTQGEARKEEATR 837
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 838 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947
Query: 933 PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
P W +V LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 948 PHWLLVTGRGLRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAP 1007
Query: 993 WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1052
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1008 WPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIERD 1060
Query: 1053 DLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ET 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T +
Sbjct: 1061 DRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKG 1116
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQ 1164
+A GT +QGE+V RGRV + P +T+ +Y KE KG ++AL
Sbjct: 1117 YVAAGTCLMQGEEVTCRGRVRDWERIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCN 1176
Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E
Sbjct: 1177 GHLVSAIGQKIFLWSLRASELTGMAFIDT-QLYIHQMISVKNFILAADVMKSISLLRYQE 1235
Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1236 ESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLR 1295
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPL 1339
RA+FHVGAHV F R + GA+ G K N+ F TLDG IG + P+
Sbjct: 1296 RADFHVGAHVNTFWR-------TPCRGASEGPSKKSVVWENKHITWFATLDGGIGLLLPM 1348
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1349 QEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLS 1408
Query: 1400 LEEQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1409 TMERGELAKKIGTTPDIILDDL 1430
Score = 303 bits (775), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 221/684 (32%), Positives = 351/684 (51%), Gaps = 101/684 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEGLTKNDKTTEGKSHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE P
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEE--------PP 418
Query: 464 TKRLRRSSSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
+K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP
Sbjct: 419 SKKKRVDSAAGWAGNKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPC 474
Query: 516 KDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGI 550
+ + G L I + + + + K ++V ELPGC +
Sbjct: 475 ANAAVGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDM 534
Query: 551 WTVYH---------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVT 600
WTV + G + S A DD H +LI+S E TM+L+T + E+
Sbjct: 535 WTVIAPVRKEEEETPKAEGSEQEPSAPEAQDDGRRHGFLILSREDSTMILQTGQEIMELD 594
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
S + QG T+ AGN+ R ++QV G R+L+G L F P + +
Sbjct: 595 TS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAP 643
Query: 661 VLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 644 IVQCAVADPYVVIMSAEGHVTMFL 667
>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pan paniscus]
gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
troglodytes]
Length = 1442
Score = 334 bits (856), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1009 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1059
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1060 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1115
Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1116 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1175
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1176 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1234
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +
Sbjct: 1235 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1294
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
LL RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1295 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1347
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1348 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1407
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
L E+ E+A + GTT IL +L
Sbjct: 1408 YLSTMERSELAKKIGTTPDIILDDL 1432
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
sapiens]
gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
[Homo sapiens]
gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
Length = 1443
Score = 334 bits (856), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 215/685 (31%), Positives = 340/685 (49%), Gaps = 51/685 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 784 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 840 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1010 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1060
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1061 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1116
Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1176
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1177 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1235
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +
Sbjct: 1236 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1295
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCI 1336
LL RA+FHVGAHV F R + GA G K N+ F TLDG IG +
Sbjct: 1296 LLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLL 1348
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y
Sbjct: 1349 LPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYL 1408
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNL 1421
L E+ E+A + GTT IL +L
Sbjct: 1409 YLSTMERSELAKKIGTTPDIILDDL 1433
Score = 305 bits (780), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S DD+ H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669
>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
Length = 1442
Score = 333 bits (855), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 213/680 (31%), Positives = 338/680 (49%), Gaps = 42/680 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 784 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 840 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + E R F++I G+ G F+ G
Sbjct: 890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1010 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFETI 1060
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1107
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T
Sbjct: 1061 ERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSG 1116
Query: 1108 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALA 1161
+ +A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1117 LKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALC 1176
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1177 HCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLR 1235
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +
Sbjct: 1236 YQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMR 1295
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
LL RA+FHVGAHV F R AT + N+ F TLDG IG + P+ E
Sbjct: 1296 LLRRADFHVGAHVNTFWRTPCRATEGLSKKSVVWE---NKHITWFATLDGGIGLLLPMQE 1352
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
T+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L
Sbjct: 1353 KTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTM 1412
Query: 1402 EQLEIAHQTGTTRSQILSNL 1421
E+ E+A + GTT IL +L
Sbjct: 1413 ERSELAKKIGTTPDIILDDL 1432
Score = 305 bits (780), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
+ G + S DD+ H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601
Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651
Query: 667 ADPYVLLGMSDGSIRLLV 684
ADPYV++ ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669
>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
rerio]
Length = 1449
Score = 333 bits (854), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 217/696 (31%), Positives = 357/696 (51%), Gaps = 67/696 (9%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + S T+ EE T
Sbjct: 790 WCLLVRENGVMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTR 845
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG +I +K E+A+ +HSRP+L A + + +L Y+A+ ++ +
Sbjct: 846 QG---DIPLVK--EVALVSLGYNHSRPYLLAHV-EQELLIYEAFPYDQQQ---------- 889
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PHG---------APCQRIT 915
+ S L+ +RF + P + RE+ P G R
Sbjct: 890 ----------AQSNLK-VRFKKMPHNINYREKKVKVRKDKKPEGQGEDTLGVKGRVARFR 938
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++ISG+ G F+ G P W +V R +R+HP DG+I +F+ HN+NC GF+Y
Sbjct: 939 YFQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNK 998
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+I LP+ +YD WPV+KIPL+ T H ++Y E +Y + SV +P +
Sbjct: 999 QGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR---- 1052
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
I + G + + + + + +++ ++++ P TR + ++ E+ +
Sbjct: 1053 -IPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR--VDLEEWEHVTCM 1109
Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1148
+ V L + T + +A+GT +QGE+V RGR+L+ P +T+ +
Sbjct: 1110 KTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVL 1169
Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
Y KE KG ++AL G L+ A G KI L +L G+AF D LY+ + +KNFI
Sbjct: 1170 YEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLTGMAFIDTQ-LYIHQMYSIKNFI 1228
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
L D+ KSI L ++ + L+L+++D L+ ++ EF++D + L +VSD KN+ ++
Sbjct: 1229 LAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVYM 1288
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
Y P+ ES+ G +LL RA+F+VG+HV F R+ T A D N+ F T
Sbjct: 1289 YLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWD--NKHITWFAT 1346
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
LDG +G + P+ E T+RRL LQ L +PH AGLNP++FR H + + + +I+D
Sbjct: 1347 LDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILD 1406
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
ELL+ Y L E+ E+A + GTT IL +L ++
Sbjct: 1407 GELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEI 1442
Score = 292 bits (747), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 209/641 (32%), Positives = 334/641 (52%), Gaps = 78/641 (12%)
Query: 94 DGIS-AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
DG S LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H
Sbjct: 58 DGKSRKEKLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTH 113
Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
L+ S+H FE PE L+ G P+V+VDP+ RC +LVYG +++L +
Sbjct: 114 DLKTLSLHYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT-- 168
Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTW 270
+ DE G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW
Sbjct: 169 -LADEQEGIVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTW 227
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
GRV+ + TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++
Sbjct: 228 PGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLL 287
Query: 331 YHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
Y +QS ++LN+ + P+ + LD + A+++ +D ++S K G++ +L
Sbjct: 288 YLNQSVPPFGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVL 347
Query: 390 TVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
T++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T + +
Sbjct: 348 TLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEE 407
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEELSLYGS-ASNNTESAQKT 500
G + E + + P K+ R S+ A L D ++ E+ +YGS A + T+ A T
Sbjct: 408 GKENEEKEKQ---PPNKKKRVDSNWAGCPGKGNLPDELD--EIEVYGSEAQSGTQLA--T 460
Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV----- 542
+SF V DS++NIGP S G + N + +G K ++
Sbjct: 461 YSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSIR 520
Query: 543 -------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIISL 583
ELPGC +WTV + + G + + + D + H +LI+S
Sbjct: 521 PQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSR 580
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
E TM+L+T + E+ S + QG T+ AGN+ + +IQV G R+L+G L
Sbjct: 581 EDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQL 636
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
F P + S ++ S+ADPYV++ ++G + + V
Sbjct: 637 HFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 670
>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit, partial [Rhipicephalus pulchellus]
Length = 726
Score = 332 bits (852), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 224/700 (32%), Positives = 354/700 (50%), Gaps = 74/700 (10%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGTGQG 814
V E+G LEI+ +P + F V F G+ +VD+ A +++E ++ S E
Sbjct: 73 VARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHESMPV- 131
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
+H + VV L ++ HSRP L A + D +L Y+A+ F T
Sbjct: 132 ----VHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF------------YETQ 169
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIFKNISG 922
R + LRF + D + RE + P ++ + F +ISG
Sbjct: 170 REGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPFSDISG 220
Query: 923 HQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G FL G RP W M R LR HP DG I F HNVNC GF++ QG L+I
Sbjct: 221 YSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRIS 280
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
LP+ TYD WPV+K+PL+ TPH + Y + Y ++ S P P N ++ G
Sbjct: 281 TLPTHLTYDAPWPVRKVPLRCTPHFVNYHVDSKTYCVVTSQP--DPCNHLVRF-----TG 333
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL 1099
+ + L + T++++ +++L P W+T + + E+ ++ V L
Sbjct: 334 EEKEYELLERDSRYIFPTMDKFSLQLLSP----VSWETIPNTRVDLDEWEHLTCLKNVML 389
Query: 1100 FNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKEL 1153
+ TT + LA+GT Y GEDV +RGR+++ P +N + VYSKE
Sbjct: 390 SSEGTTTGMKGYLALGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLTKNKIKIVYSKEQ 449
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
KG ++AL+ + G LL A G KI + + EL G+AF D +Y+ S+ VKN IL+GD+
Sbjct: 450 KGPVTALSQVVGFLLSAIGQKIYIWQLKDNELVGVAFIDTQ-IYIHSVVTVKNLILVGDV 508
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
KS+ L ++E L+L+++D L+ +A EF ID + +S +V+D ++N+ ++ Y P+
Sbjct: 509 FKSVSLLRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPES 568
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
ES GQ+LL R +FHVG+ V R++ + S R + TLDGS+
Sbjct: 569 RESCGGQRLLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSL 628
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN----GKAHRPGPDSIVDC 1389
+ P+ E T+RRL LQ LV ++PH AGLNP+++R ++S G H+ +I+D
Sbjct: 629 AYVLPVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHK----NILDG 684
Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
EL+ + L E+ E++ + GTT +QI +L ++ T+
Sbjct: 685 ELIWKFMHLSFMERSELSKKIGTTVTQITDDLLEIETYTA 724
>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
Length = 1105
Score = 332 bits (851), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 217/696 (31%), Positives = 356/696 (51%), Gaps = 67/696 (9%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + S T+ EE T
Sbjct: 446 WCLLVRENGVMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTR 501
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG +I +K E+A+ HSRP+L A + + +L Y+A+ ++ +
Sbjct: 502 QG---DIPLVK--EVALVSLGYSHSRPYLLAHV-EQELLIYEAFPYDQQQ---------- 545
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PHG---------APCQRIT 915
+ S L+ +RF + P + RE+ P G R
Sbjct: 546 ----------AQSNLK-VRFKKMPHNINYREKKVKVRKDKKPEGQGEDSLGVKGRVARFR 594
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++ISG+ G F+ G P W +V R +R+HP DG+I +F+ HN+NC GF+Y
Sbjct: 595 YFQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNK 654
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+I LP+ +YD WPV+KIPL+ T H ++Y E +Y + SV +P +
Sbjct: 655 QGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR---- 708
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
I + G + + + + + +++ ++++ P TR + ++ E+ +
Sbjct: 709 -IPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR--VDLEEWEHVTCM 765
Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1148
+ V L + T + +A+GT +QGE+V RGR+L+ P +T+ +
Sbjct: 766 KTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVL 825
Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
Y KE KG ++AL G L+ A G KI L +L G+AF D LY+ + +KNFI
Sbjct: 826 YEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKYNDLTGMAFIDTQ-LYIHQMYSIKNFI 884
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
L D+ KSI L ++ + L+L+++D L+ ++ EF++D + L +VSD KN+ ++
Sbjct: 885 LAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVYM 944
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
Y P+ ES+ G +LL RA+F+VG+HV F R+ T A D N+ F T
Sbjct: 945 YLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWD--NKHITWFAT 1002
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
LDG +G + P+ E T+RRL LQ L +PH AGLNP++FR H + + + +I+D
Sbjct: 1003 LDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILD 1062
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
ELL+ Y L E+ E+A + GTT IL +L ++
Sbjct: 1063 GELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEI 1098
Score = 100 bits (250), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 159/342 (46%), Gaps = 62/342 (18%)
Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++T +
Sbjct: 2 VLTLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT----EKLQ 57
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG------EELSLYGS-ASNNTESAQK 499
+ ++E + E + + +R S+ G +E+ +YGS A + T+ A
Sbjct: 58 ETPMEEGKENEEKEKEPPNKKKRVDSNWAGCPKKGNLPDELDEIEVYGSEAQSGTQLA-- 115
Query: 500 TFSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV---- 542
T+SF V DS++NIGP S G + N + +G K ++
Sbjct: 116 TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSI 175
Query: 543 --------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIIS 582
ELPGC +WTV + + G + + + D + H +LI+S
Sbjct: 176 RPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILS 235
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
E TM+L+T + E+ S + QG T+ AGN+ + +IQV G R+L+G
Sbjct: 236 REDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQ 291
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + S ++ S+ADPYV++ ++G + + V
Sbjct: 292 LHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 326
>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
Length = 1003
Score = 328 bits (841), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 268/962 (27%), Positives = 429/962 (44%), Gaps = 152/962 (15%)
Query: 543 ELPGCKGIWTVY------HKSSRGHNADSSRMAAYDDEY-----------------HAYL 579
+LPGC +WTV G A+S+ + H +L
Sbjct: 107 DLPGCLDMWTVIGIPPESKPQEEGEKAESAGSEEKPEGEKEETKEEGPPDVDLTNSHGFL 166
Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
I+S E TMVL+T + E+ S + QG T+ AGN+ + +IQV G R+L G
Sbjct: 167 ILSREDSTMVLQTGKEIMELDHS-GFSTQGPTVYAGNIGNNKYIIQVSPYGIRLLQGVKQ 225
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTP 697
Q L F S+ + S+ADPY L+ DG I LL V DP +
Sbjct: 226 LQHLPFD---------SKGPAFVLASVADPYALVMSEDGQILLLTLVNDPYGSGHRLSAK 276
Query: 698 AAIESSKKPVSSCTLYHDKG------------PEPWLRKTSTDAW------LSTGV---- 735
+ K + Y D P P + K + + + TGV
Sbjct: 277 KIDMAGKSQAITVCAYRDTSGLFTVSSPSTTTPAPEVEKDAAEPAAEDAVAMETGVDDED 336
Query: 736 ----GEAIDGADGGPLDQGDI-------------------YSVVCYESGALEIFDVPNFN 772
GE G + + ++ + V+C E+G+LEI+++P+F+
Sbjct: 337 EMLYGEPSAKPSGPAVVREEVKPSTSTVQEPVVKEVEPTHWCVICRENGSLEIYNLPDFS 396
Query: 773 CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V+ V F +G +VD++ + + K+ V E+ M
Sbjct: 397 LVYLVKNFPTGMKLLVDSFQSTSSASTSQS------------DKQGDQLASVKEILMVGL 444
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
SRP L A + D +L Y+A+ P + S P T + V + + R
Sbjct: 445 GHKGSRPHLLARV-DEDLLIYEAF----PYHLS----PSYTMLKIRFKKVQHNLILRERK 495
Query: 893 SRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQL 949
A +EE+ G+ Q F +ISG+ G F+ GS P W M R LR+HP
Sbjct: 496 GGKTKKAGDQEESDGQTGSRIQHFRTFTDISGYSGLFICGSSPHWLFMTSRGALRIHPMS 555
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
DG++ F+ HNVNC GF+Y G L+I LP+ +YD WPV+K+PL+ TPH + Y
Sbjct: 556 IDGAVTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKVPLRCTPHFVAY 615
Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
E +Y V+ + N++ + D++ ++ D + ++++ ++++
Sbjct: 616 HMECKVY--AVAASTFEMCNRIPRMAGDEKEYDAVEKD-----DRYIYPMLDKFNIQLMS 668
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRV 1129
P TR MQ EN E +G +V + G++
Sbjct: 669 PVSWEIIPNTRG---MQLEENY-------------AECTCSFLVGINFV----LFVAGQI 708
Query: 1130 LLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
++ P +N + E+Y KE KG +SAL G+LL A G KI L ++ +
Sbjct: 709 VILDVIEVVPEPGQPLTKNKIKELYGKEQKGPVSALCGCNGYLLSAIGQKIFLWEFRNND 768
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
L G+AF D +Y+ + +KN+++L D+ KSI L ++ D L+ +
Sbjct: 769 LIGVAFIDTQ-VYIHTAISIKNYVILADVFKSISLLRYQ-----------DMRPLETYCV 816
Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QM 1302
EF +D + + +VSD QKN ++ Y P+ ES+ GQ+L+ RA+F+VG+HV F R+ ++
Sbjct: 817 EFFVDNAQIGFLVSDAQKNFLLYSYQPEARESYGGQRLVRRADFNVGSHVNTFFRVRCKI 876
Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
+ S +R A K R +F TLDG +G + P+ E T+RRL LQ L+ +P A
Sbjct: 877 MDPSGERRRDADTVAK--RHVTMFATLDGGLGALLPMAEKTYRRLLMLQNTLMTHMPFPA 934
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
GLNP++FR N ++ +I+D ELL + L + E+ E+A + GT+ I +L
Sbjct: 935 GLNPKAFRMLKHNHRSLINACRNILDGELLWKFLHLSVVERSELARKIGTSPETITEDLM 994
Query: 1423 DL 1424
D+
Sbjct: 995 DI 996
>gi|215701517|dbj|BAG92941.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 265
Score = 323 bits (828), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 161/246 (65%), Positives = 191/246 (77%), Gaps = 1/246 (0%)
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
GYIEPV+VILHE+E TWAGR+ KHHTCMISA SIS TLKQHP+IWSA NLPHDAY+LLA
Sbjct: 19 GYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLKQHPVIWSAANLPHDAYQLLA 78
Query: 314 VPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
VP PI GVLV+ AN+IHYHSQS SC+L LNN++ D S E+ +S+F VELDAA ATWL
Sbjct: 79 VPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQVELDAAKATWLS 138
Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
ND+ + STK G+++LLTVVYDGRVVQRLDL K+ SVL+S +T+IGNS FFLGSRLGDSL
Sbjct: 139 NDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNSFFFLGSRLGDSL 198
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
LVQF+ + S+L E DIE D P +KRL+R SD LQD+ + EELS A N
Sbjct: 199 LVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSVEELSFQNIIAPN 258
Query: 493 NTESAQ 498
+ ESAQ
Sbjct: 259 SLESAQ 264
>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
Length = 1468
Score = 317 bits (813), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 348/1472 (23%), Positives = 594/1472 (40%), Gaps = 269/1472 (18%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVL----------------MDGI 96
N++V ++++I+ E ++K G+ RR+L
Sbjct: 29 NVLVAKTSLLQIFTTTTYETELNSALADAKQPGDIDRRILDADEEQTFAADIALQRSQVE 88
Query: 97 SAASLELVCHYRLHGNV---ESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
S L LV Y L G+V + + +LS GG ++++ +F+DAK S++E+D
Sbjct: 89 SVTKLVLVAEYPLSGSVTGLQRIKLLSTRSGG-------EAVLASFKDAKCSLMEWDPET 141
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGVLVYGLQMII 203
+ + S+H +E RE F P+V DP RC + G + I
Sbjct: 142 NSITTISLHYYE----------REEFC-SPVVSDGLPTELVADPGSRCAALRFSGDMLAI 190
Query: 204 LKASQGG-----------------------------SGLVGDEDTFGSGG---------G 225
+ Q +G EDT G G
Sbjct: 191 IPFRQREDEELSLGRGDADEVMGDEDGDNDDWDPEMAGTARGEDTIMGEGDVKTTDATEG 250
Query: 226 FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
S V+++ LD + HV F+H Y EP IL+ TW G ++ + I
Sbjct: 251 KDRPYHPSFVLSVSQLDDAISHVISLTFLHEYREPTFGILYSPRRTWTGLLAAEGRKDTI 310
Query: 284 SALSISTTLKQHP--LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
S + I+ L+Q I S LP+D +K++ + P GG L+VG N IH + +
Sbjct: 311 SYIVITLDLEQKASTPILSVSGLPYDIFKVVPLAPPTGGSLLVGGNELIHVDQAGKTTGV 370
Query: 341 ALNNYAVSLDSSQELP-RSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
A+N + L +S +EL+ + L+++ LL TK G+ V++ DGR
Sbjct: 371 AVNPFCRRSTGFAGLADQSDLCLELEGSQVVELESEGGDMLLFTKRGEGVIVGFRMDGRN 430
Query: 398 VQRLDLSKTNP---SVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
V + ++K N S++ ++T +G F+G GD+ ++++ G+K
Sbjct: 431 VSGVKITKLNNHPGSIVGGRVSTAVGLGGRRLFVGCIEGDARVLKWRRKGERKKAGEGIK 490
Query: 452 EE----------FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
EE +G +E SS + NG N+ +Q +
Sbjct: 491 EEVLENEDEDDVYGALEDMDDDLYGGGGDSSFRKDSLTNGRR--------NSEAKSQGEY 542
Query: 502 SFAVRDSLVNIGPLKDFSYG------------------LRINADASATGISKQSNYELV- 542
F D L N+GP +D + G L + + + S+ S ++
Sbjct: 543 IFQTHDRLTNLGPFRDITLGKPTFPEESRERQKGVSPELELVTTSGPSNTSEDSGISIIR 602
Query: 543 -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE-----YHAYLIISLEAR 586
+ P C+ +WTV +S+ NA DD + +L ++
Sbjct: 603 KSISPTIVGRFDFPQCQALWTVRARSANTSNAAVGLGGEEDDRSVEESFDRFLFVTKNDE 662
Query: 587 TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
+ V D EV D+ +G TI G + R++QV R+ D +
Sbjct: 663 SQVFRVGDTFEEV-RGTDFESEGETIEVGVVGNGMRIVQVVSEQVRVYDCDLQLSQII-- 719
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
P E +G E V + DPY+LL DGS + D + ++ + AI+ K
Sbjct: 720 PMFDEE-TGEEGPNVHRARVCDPYILLIKVDGSPAVYKMDSTNLELAEERADAIKFDKYQ 778
Query: 707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ-GDIYSVVCYESGALEI 765
S C KG + +D P++ D + G L+I
Sbjct: 779 -SGCIYASTKG-----------------IFIPLDA----PVENVKDYLLFLLTVEGGLQI 816
Query: 766 FDVPN-FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKV 824
+D+ N +F+ + F +T D+ T ++ E+ K+ I + V
Sbjct: 817 YDLSNPVTPLFSAESF--------NTLYPLLRTDNPTSPTANREK---HRSKQLIIEILV 865
Query: 825 VELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS--KSDDPVSTSRSLSVSNV 882
++ + P+L A ++ + Y+ ++ P KS +P S LS+S
Sbjct: 866 ADMG----DSIFKEPYLIARSSNNDLTFYKPFISSSPSTLRFIKSPNPHIASNELSLSAG 921
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRE 941
+ + R L T N++G+ FL G+ P + + +
Sbjct: 922 TKNIFRPL------------------------TAVYNLAGYSAVFLPGADPSFVIKTAKS 957
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
R+H +L + + + H+ + GF+YV S GI+++ +P+ T+D W +K+
Sbjct: 958 SPRIH-KLAGTGVRSLSSFHSAGADRGFVYVDSLGIVRVALMPAEFTFDGNWGYKKVTPG 1016
Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1061
+ YF N+Y ++S +P + + +E G N++ D ++
Sbjct: 1017 EHVQSLAYFPPMNVY--VISTSKRQPFD------LAEEDG------NIAKDDTTLQPEID 1062
Query: 1062 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQG 1120
+++L P W +E AL V+ ++L + TKE + L+++GTA +G
Sbjct: 1063 SGTLKLLSPQT----WTAVDEYKFAHNEIALVVKTISLEVSEHTKERKQLVSVGTAIFRG 1118
Query: 1121 EDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKI 1175
ED +ARG + +F P T V +E+KG +SA+ + G+LL A G KI
Sbjct: 1119 EDHSARGGIYVFEVIEVVPEPNRPETNRKLKLVTREEVKGTVSAICGVNGYLLAAQGQKI 1178
Query: 1176 ILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
++ + L +AF D LYV + IL GD KS++F + E+ ++ L K
Sbjct: 1179 MVRGLKEDQSLLPVAFLDMC-LYVSVAKNLDGMILFGDFMKSVWFAGFSEEPYKMTLFGK 1237
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
D L+ + EFL DG+ L VV D + NI Y P+ +S GQ+L+ RA+F G +
Sbjct: 1238 DTQKLEIISAEFLPDGNQLYFVVVDAESNIHTLQYDPEHPKSLAGQRLIRRADFFSGHEI 1297
Query: 1295 TKFLRLQM----LATSSDRTGAAPGSDKT------------NRFALLFGTLDGSIGCIAP 1338
+ L L+ SS+ A +D + + +L GT GS+ I
Sbjct: 1298 STLTMLPFSPYSLSASSNSHLPADATDTSPLHHHHQNQQQQQEYFVLAGTQTGSLAMIRT 1357
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
+ E +RRL +Q ++V+ HVAGLNPR +R
Sbjct: 1358 IPETAYRRLNIVQGQIVNGEEHVAGLNPREYR 1389
>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Aspergillus oryzae 3.042]
Length = 1389
Score = 314 bits (805), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 338/1393 (24%), Positives = 592/1393 (42%), Gaps = 196/1393 (14%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
P S +G+ FLGS DS+L+ G S+ SSG K+ +
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E D+ +S D +D + E+ + G + ++F D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
GPLKD ++G + S ELV +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560
Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
C +WT +S H ++ + A + E H Y+++S +A + E +++ +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613
Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
+ F + TI G L G+ RV+Q+ R DG DL P E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
S E +S S+ DPYV + D ++ LL D S V+ I +SK +SC
Sbjct: 669 DTS--EERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
LY DK TG+ +I A L Q + + + L I+ +P+
Sbjct: 725 LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
+ + G + E K S T +E + + V +L
Sbjct: 768 KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
WS S P+L + Y+ ++ T +P + L +N+ R+
Sbjct: 809 WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
+ D + EE P + I NISG F G P + + L
Sbjct: 858 -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFA 1011
G + + C GFI + S+G++ +CQ+P G D W +Q+IP+ + Y +
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQIPIGEQVDHLAYSS 973
Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1071
+Y + S L D E+ + N S V+ ++++ P
Sbjct: 974 SSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRSSLKVVSPK 1022
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
W + + +E+ + V+ ++L + T E + ++ +GTA+ +GED+A+RG V
Sbjct: 1023 T----WTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVY 1078
Query: 1131 LFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KWT 1181
+F + +P+ + V + +KGA++AL+ + QG L++A G K I+ K
Sbjct: 1079 VFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKED 1138
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
G+ L +AF D +++VK ++ D K ++F + E+ +++L AKD
Sbjct: 1139 GSLLP-VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDL 1193
Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
L+ A +FL DG+ L ++V+D N+ + Y P+ +S G +LLSR++FH G ++
Sbjct: 1194 DYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFIST 1253
Query: 1297 FLRLQMLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
L + SS++ A K R +L + +GS+G + + E ++RRL +LQ
Sbjct: 1254 LTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQS 1313
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
+L +++ H GLNPR+FR S+G A R ++D +LL + + + ++EIA + G
Sbjct: 1314 QLTNTIEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKVEIASRVGA 1369
Query: 1413 TRSQILSNLNDLA 1425
+I ++ ++
Sbjct: 1370 NEWEIKADFEAIS 1382
>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1424
Score = 313 bits (801), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 344/1466 (23%), Positives = 619/1466 (42%), Gaps = 186/1466 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
N+VV +NV+ I+ VR +E ++ E RR V MDG
Sbjct: 47 NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105
Query: 96 ----------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKI 142
+ + V + LHG V E++ I+S N D ++++F+DAKI
Sbjct: 106 GTVKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFKDAKI 161
Query: 143 SVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQM 201
++LE+ D IH L S+H +E +P+ + L +S ++VDP RC + + +
Sbjct: 162 ALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLPKDAI 217
Query: 202 IILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIE 257
IL Q + L V ++D S +++L D +++HV DF+F+ G+
Sbjct: 218 AILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLPGFNN 274
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + +L + E TW+GR+ T + ++ +P+I + LP D L+ +
Sbjct: 275 PTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVPCVAS 334
Query: 318 IGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAAHATW 371
+GGV+++ +NTI Y S AL +N ++ S S +P +S ++ L+ HA
Sbjct: 335 LGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGCHAVL 393
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLG 430
+ + + K G + + +V DG+ V +L ++ + + S + I FLGS +G
Sbjct: 394 VDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLGSIVG 453
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ L ++A + ++ + + +
Sbjct: 454 ASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH---------S 504
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG------------ 532
+N + ++ DSL GP+ D ++ L D +ATG
Sbjct: 505 NNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFTLFQR 564
Query: 533 -ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-- 588
+ ++ +L + G +GIW+ + R + R + + +IIS +A
Sbjct: 565 DLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDANPSPG 624
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLSFGP 647
V A ++ ++ + G TI AG+ F R ++ V R+L+ G + +DL
Sbjct: 625 VSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDLD--- 681
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIESSKK 705
+ + SI DP+VL+ D +I L +G+ + + +P +SS+
Sbjct: 682 ------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDKSSRY 735
Query: 706 PV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+SC P D +++ + ++ + + ++
Sbjct: 736 LAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLILVRP 789
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
G +EI+ +P F+ + D+Y AL + RK N
Sbjct: 790 QGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPRKSN- 837
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ V ++ + P+L L G I+ Y+A P + S+
Sbjct: 838 -DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------SIPP 884
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLS 929
S VS +++ ++ + + EET ++ I + S G F +
Sbjct: 885 SRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSGVFFT 944
Query: 930 GSRPCWCMVFRER-LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
G RP W + + +R++ + +FT F+ + +G + +P
Sbjct: 945 GDRPSWIVATNKGGIRIYSS-GHHIVHSFTSCSLWESKGDFLVYSDEGPSLLEWMPD-LC 1002
Query: 989 YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
D+ P + IP + Y L IV+ ++ + E G+ I
Sbjct: 1003 LDSVLPSRNIPRSRAYANVVYDPSAML---IVAASSMQ-----ANFASFDEDGNIIWEPE 1054
Query: 1049 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-N 1107
S+V L + + + ++ P+ W T +E + VTL +T+ +
Sbjct: 1055 ASNVSLPK---CDCSTLELIAPEA----WITMDGYEFAPNEYVNALECVTLETLSTETGS 1107
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNL-----VTEVYSKELKGAISALA 1161
+ +A+GT+ +GED+A +G LF D QNL + + + KG ++AL
Sbjct: 1108 KDFIAVGTSIDRGEDLAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDDAKGPVTALC 1167
Query: 1162 SLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
+ G+L+ + G KI + + E L G+AF D +YV SL +VKNF+L+GD KSI+F+
Sbjct: 1168 GINGYLVSSMGQKIFIRAFDMDERLVGVAFLDVG-VYVTSLRVVKNFLLIGDAVKSIWFV 1226
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
+++E +L +LAKD +F TLS+V D ++++ Y P ES GQ
Sbjct: 1227 AFQEDPYKLVVLAKDVHRTHVTNADFFFTDDTLSIVTEDGDGILRMYAYDPDDPESKNGQ 1286
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
LL R EFH + L ++A + P + + F+ DGS+ + P+D
Sbjct: 1287 HLLCRTEFHNHSECRSSL---VIARRTKEESVLPQAKILSAFS------DGSLSSLTPVD 1337
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
+ +F+RLQ LQ +L ++ HVAGLNPR++R N +P I+D +LLS +E LP+
Sbjct: 1338 DASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVR-NDFVSKPLSKDILDGQLLSAFESLPI 1396
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLAL 1426
Q E+ Q GT R+ +L + +LA+
Sbjct: 1397 SRQNEMTKQIGTERNIVLHDWMELAI 1422
>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
[Macaca mulatta]
Length = 1436
Score = 312 bits (799), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 184/525 (35%), Positives = 278/525 (52%), Gaps = 37/525 (7%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
R F++I G+ G F+ G P W +V R LR+HP DG + +F HNVNC GF+Y
Sbjct: 923 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLY 982
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLN 1029
QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S P +
Sbjct: 983 FNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR--- 1039
Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQS 1087
I + G + + + + + E + ++++ P W+ A I +Q
Sbjct: 1040 ------IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQE 1089
Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1146
E+ ++ V+L + T + +A GT +QGE+V RGR+L+ P +T
Sbjct: 1090 WEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLT 1149
Query: 1147 E-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1201
+ +Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ +
Sbjct: 1150 KNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQM 1208
Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1261
VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD
Sbjct: 1209 ISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRD 1268
Query: 1262 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-- 1319
+N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R + GA G K
Sbjct: 1269 RNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSV 1321
Query: 1320 ---NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
N+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H +
Sbjct: 1322 VWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDR 1381
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ + +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1382 RTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1426
Score = 286 bits (732), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 218/704 (30%), Positives = 344/704 (48%), Gaps = 116/704 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR---------------- 506
T QD V+ E+ +YGS A + T+ A T+SF VR
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVRLRQQGPHPSQCPQRPL 482
Query: 507 --------DSLVNIGPLKDFSYG--LRINADASATGISKQSNYELV-------------- 542
DS++NIGP + + G ++ + S + + E+V
Sbjct: 483 TFAVPQVCDSILNIGPCANAAMGEPAFLSEEVPRVVNSPEPDLEIVVCSGHGKNGALSVL 542
Query: 543 ------------ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
ELPGC +WTV + G ++ A DD H +LI
Sbjct: 543 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 602
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
+S E TM T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 603 LSREDSTM---TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 655
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 656 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 692
>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1428
Score = 311 bits (797), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 344/1470 (23%), Positives = 619/1470 (42%), Gaps = 190/1470 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
N+VV +NV+ I+ VR +E ++ E RR V MDG
Sbjct: 47 NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105
Query: 96 --------------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFE 138
+ + V + LHG V E++ I+S N D ++++F+
Sbjct: 106 GTVKSTGKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFK 161
Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
DAKI++LE+ D IH L S+H +E +P+ + L +S ++VDP RC + +
Sbjct: 162 DAKIALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLP 217
Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
+ IL Q + L V ++D S +++L D +++HV DF+F+
Sbjct: 218 KDAIAILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLP 274
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
G+ P + +L + E TW+GR+ T + ++ +P+I + LP D L+
Sbjct: 275 GFNNPTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVP 334
Query: 314 VPSPIGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAA 367
+ +GGV+++ +NTI Y S AL +N ++ S S +P +S ++ L+
Sbjct: 335 CVASLGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGC 393
Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLG 426
HA + + + K G + + +V DG+ V +L ++ + + S + I FLG
Sbjct: 394 HAVLVDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLG 453
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
S +G S+L++ L ++A + ++ + +
Sbjct: 454 SIVGASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH------ 507
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------- 532
++N + ++ DSL GP+ D ++ L D +ATG
Sbjct: 508 ---SNNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFT 564
Query: 533 -----ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR 586
+ ++ +L + G +GIW+ + R + R + + +IIS +A
Sbjct: 565 LFQRDLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDAN 624
Query: 587 TM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDL 643
V A ++ ++ + G TI AG+ F R ++ V R+L+ G + +DL
Sbjct: 625 PSPGVSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDL 684
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIE 701
+ + SI DP+VL+ D +I L +G+ + + +P +
Sbjct: 685 D---------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDK 735
Query: 702 SSKKPV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV 755
SS+ +SC P D +++ + ++ + + +
Sbjct: 736 SSRYLAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLI 789
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
+ G +EI+ +P F+ + D+Y AL + R
Sbjct: 790 LVRPQGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPR 838
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
K N + V ++ + P+L L G I+ Y+A P +
Sbjct: 839 KSN--DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------ 884
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQG 925
S+ S VS +++ ++ + + EET ++ I + S G
Sbjct: 885 SIPPSRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSG 944
Query: 926 FFLSGSRPCWCMVFRER-LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
F +G RP W + + +R++ + +FT F+ + +G + +P
Sbjct: 945 VFFTGDRPSWIVATNKGGIRIYSS-GHHIVHSFTSCSLWESKGDFLVYSDEGPSLLEWMP 1003
Query: 985 SGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1044
D+ P + IP + Y L IV+ ++ + E G+ I
Sbjct: 1004 D-LCLDSVLPSRNIPRSRAYANVVYDPSAML---IVAASSMQ-----ANFASFDEDGNII 1054
Query: 1045 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1104
S+V L + + + ++ P+ W T +E + VTL +T
Sbjct: 1055 WEPEASNVSLPK---CDCSTLELIAPEA----WITMDGYEFAPNEYVNALECVTLETLST 1107
Query: 1105 KE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNA-DNPQNL-----VTEVYSKELKGAI 1157
+ ++ +A+GT+ +GED+A +G LF D QNL + + + KG +
Sbjct: 1108 ETGSKDFIAVGTSIDRGEDLAVKGATYLFEIVEVVPDYSQNLKRWYKLKLLARDDAKGPV 1167
Query: 1158 SALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
+AL + G+L+ + G KI + + E L G+AF D +YV SL +VKNF+L+GD KS
Sbjct: 1168 TALCGINGYLVSSMGQKIFIRAFDMDERLVGVAFLDVG-VYVTSLRVVKNFLLIGDAVKS 1226
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
I+F++++E +L +LAKD +F TLS+V D ++++ Y P ES
Sbjct: 1227 IWFVAFQEDPYKLVVLAKDVHRTHVTNADFFFTDDTLSIVTEDGDGILRMYAYDPDDPES 1286
Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
GQ LL R EFH + L ++A + P + + F+ DGS+ +
Sbjct: 1287 KNGQHLLCRTEFHNHSECRSSL---VIARRTKEESVLPQAKILSAFS------DGSLSSL 1337
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
P+D+ +F+RLQ LQ +L ++ HVAGLNPR++R N +P I+D +LLS +E
Sbjct: 1338 TPVDDASFKRLQLLQGQLTRNIQHVAGLNPRAYRIVR-NDFVSKPLSKDILDGQLLSAFE 1396
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
LP+ Q E+ Q GT R+ +L + +LA+
Sbjct: 1397 SLPISRQNEMTKQIGTERNIVLHDWMELAI 1426
>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1223
Score = 309 bits (792), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 209/688 (30%), Positives = 338/688 (49%), Gaps = 53/688 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V C E+G LE++ +P+ F V F G +VD S S TG
Sbjct: 560 WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 604
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+++ +V E+ + + ++ A++ D I+ Y+A+ P NT + +
Sbjct: 605 DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 659
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPL-DAYTREETPHGAP---------CQRITIFKNISG 922
R + + + + R S+ P + T+ ET A R+ F N+
Sbjct: 660 V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKPETETEAESDTKTTSRRVNRLREFHNVQT 718
Query: 923 HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+SGS P W V R LR HP DG+I F HNVNC +GF+Y + L+IC
Sbjct: 719 YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 778
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
LPS +YD WPV+K+PL+ TPH + Y E Y ++ SV Q + + G
Sbjct: 779 VLPSHLSYDAPWPVRKVPLRCTPHFVAYHVETKTYAVVTSV-------QETKTHVWKVTG 831
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-F 1100
+I + D T + +++ P TR I +++EN ++VV L
Sbjct: 832 EEIGEEPVERDDRFVPTTKVVFSIQLFSPVSWDAIPNTR--IEYEAAENVTCLKVVNLSC 889
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1155
T + + + T +V ED+ RG V ++ P +N + +Y K KG
Sbjct: 890 EGTMTGKKGYVVVATTHVYSEDLQTRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKG 949
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
+SAL + G LL G K+ + ++ +L G+AF D +Y+ + VK FIL+ D+ K
Sbjct: 950 PVSALCEVMGFLLTCIGQKVYMWQFKDNDLIGLAFIDT-QIYIHNAVSVKQFILITDVMK 1008
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
YFL ++ Q L+L+++D L+ F EF++D ++ +VSD KN+ +F+Y P+ E
Sbjct: 1009 GAYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPE 1068
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
S G LL R + ++G+ V F+R++ T S+++ + P R + F TLDGS+
Sbjct: 1069 SHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGP---VLRRQVVFFATLDGSL 1125
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
G + P+ E T+RRL LQ L + +PHV GLNP+S+R S+ + +I+D +LL
Sbjct: 1126 GLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLL 1185
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
Y L + E+ E A + GT+ QI+S+L
Sbjct: 1186 KYCHLSVVERNEFAKKIGTSVDQIISDL 1213
Score = 182 bits (461), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 141/456 (30%), Positives = 221/456 (48%), Gaps = 60/456 (13%)
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
RV+ + TC I ALS++ K HP+IWS +LP+D ++ AVP PIGGVL++ N++ Y
Sbjct: 11 RVAVRQDTCSIVALSLNMAQKVHPIIWSQSSLPYDCMQVQAVPKPIGGVLILAVNSLLYL 70
Query: 333 SQSASCALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGD 385
+QS + Y VSL+S S P + + +D AT++ D LS K G+
Sbjct: 71 NQS------IPPYGVSLNSLTDWSTAFPLKTQEGVKLSMDCTQATFISYDRLALSLKDGE 124
Query: 386 LVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
+ +LT++ DG R V+ L K SVLT+ I +G+ FLGSRLG+SLL+++T +
Sbjct: 125 IYVLTLLVDGMRSVRGFHLDKAAASVLTTCICPMGDGFLFLGSRLGNSLLLKYTEKVSET 184
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----MVNGEELSLYGSASNNTESAQKT 500
+ K E + P+ K +SD + + + +EL +YG T + +
Sbjct: 185 SPTDASKTEEPKPGEEPPTKKMRSDDASDWMASDTKFLDDPDELEVYGKQVQKTGTQLTS 244
Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELVE---- 543
+SF + DSL+NIGP + G + N D + +G K +++
Sbjct: 245 YSFEICDSLLNIGPCGNMIMGEPAFLSEEFQGNVDPDLELVTTSGYGKNGALSVLQRTIR 304
Query: 544 --------LPGCKGIWTVYHKSSRGHNAD----SSRMAAYDDEYHAYLIISLEARTMVLE 591
LPGC +WTV KS + AD S + D + HA+LI+S + +MVL+
Sbjct: 305 PQVVTTFNLPGCLDMWTV--KSLKEAKADEKSEESEASPEDKDRHAFLILSKQDSSMVLQ 362
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
T +TEV + Q TI A N+ R ++QV + +++G Q +
Sbjct: 363 TGQEITEVAAG-GFSTQAPTIFASNMGDDRYIVQVMNKSICLMEGVEQIQHMVL------ 415
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
S + S+ADPY+LL +G L+ P
Sbjct: 416 ----DVGSPIKQCSLADPYLLLLTENGDPILMTLKP 447
>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
norvegicus]
gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 1386
Score = 309 bits (791), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 219/669 (32%), Positives = 345/669 (51%), Gaps = 75/669 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429
Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485
Query: 522 -------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH---------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 ENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+ AG
Sbjct: 546 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAG 604
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
N+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 NIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMS 654
Query: 676 SDGSIRLLV 684
++G + + +
Sbjct: 655 AEGHVTMFL 663
Score = 285 bits (730), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 778 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 833
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 834 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 883
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 884 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 943
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 944 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1003
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1004 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1056
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1057 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1107
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
V G LKG ++A L QG +
Sbjct: 1108 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1127
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1128 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1185
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+F
Sbjct: 1186 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1245
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
HVGAHV F R + GAA G K N+ F TLDG IG + P+ E T
Sbjct: 1246 HVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKT 1298
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+
Sbjct: 1299 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1358
Query: 1404 LEIAHQTGTTRSQILSNL 1421
E+A + GTT IL +L
Sbjct: 1359 SELAKKIGTTPDIILDDL 1376
>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
musculus]
Length = 1388
Score = 308 bits (789), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/671 (32%), Positives = 344/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
Score = 286 bits (732), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1006 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1058
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1059 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1109
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
V G LKG ++A L QG +
Sbjct: 1110 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1129
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1130 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1187
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+F
Sbjct: 1188 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1247
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
HVGAHV F R + GAA G K N+ F TLDG IG + P+ E T
Sbjct: 1248 HVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1300
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+
Sbjct: 1301 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1360
Query: 1404 LEIAHQTGTTRSQILSNL 1421
E+A + GTT IL +L
Sbjct: 1361 SELAKKIGTTPDIILDDL 1378
>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Nasonia vitripennis]
Length = 1415
Score = 308 bits (788), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 207/688 (30%), Positives = 348/688 (50%), Gaps = 61/688 (8%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
V ++G LE++ +P + + F G+ + D+ + ++ +G+ Q
Sbjct: 773 VYRDNGTLEVYSLPELRLSYLIKNFGFGQNILHDS------------MEFTTIQGSQQNE 820
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
N ++V E+A+ H +RP L L D + YQ Y + P+ K
Sbjct: 821 PVN-PEVQVREIAVVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGHLK--------- 867
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSR 932
L + + + + FSR +EE R+ + F NI+G+ G F+ G
Sbjct: 868 -LRFKKIDHNFI--VGFSRIG----PKEEDMPSMNDTRLCMMRYFSNIAGYNGVFIGGDY 920
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W + R LR HP DG + +F +NVNC GF+Y + L+IC LP+ +YD
Sbjct: 921 PHWIFLTGRGELRAHPMNIDGPVKSFAPFNNVNCPQGFLYFNRKDELRICVLPTHLSYDA 980
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLS 1050
WPV+K+PL+ TPH +TY E Y ++ S +PL D+E + N
Sbjct: 981 PWPVRKVPLRCTPHFVTYHLESKTYCVVTSTA--EPLKSYYRFNGEDKEFTEEERNERF- 1037
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1107
L+ T E++ + + P W T I + E+ ++ V+L T+
Sbjct: 1038 ---LYPTQ--EQFSIVLFSPVS----WDTIPNTKIDLDQWEHVTCLKNVSLAYEGTRSGL 1088
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
+ + IGT Y GED+ +RGR+ +F P +N ++Y+KE KG ++A+
Sbjct: 1089 KGYIVIGTNYNYGEDITSRGRIFIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPVTAITQ 1148
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
+ G L+ A G KI + + +L G+AF D +YV + +K+ IL+ D++KS+ L +
Sbjct: 1149 VSGFLVSAIGQKIYIWQLKDNDLVGVAFIDTQ-IYVCQMLSIKSLILVADVYKSVSLLRF 1207
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
+ + L+L+++DF + + +A E+ I + L +V+D + NI IF Y P+ S+S GQKL
Sbjct: 1208 QPEYKTLSLVSRDFRTTEIYAIEYFIQNNELGFIVADGESNISIFSYQPESSQSLGGQKL 1267
Query: 1283 LSRAEFHVGAHVTKFLRLQMLAT-SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
+ +A+ H+G + F R++ T S++ T G+DK R ++ TLDGS+G I P+ E
Sbjct: 1268 IRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADK--RHVTMYATLDGSLGYILPVPE 1325
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
T+RRL LQ LV + H+AGLNP++FR + S + I+D +L+ Y LP+
Sbjct: 1326 KTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGIIDGDLVRKYLDLPVN 1385
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTS 1429
E++EIA + GT +I+ +++++ TS
Sbjct: 1386 EKIEIAKKIGTGAQEIMDDMHEIYKQTS 1413
Score = 295 bits (754), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 210/667 (31%), Positives = 334/667 (50%), Gaps = 78/667 (11%)
Query: 58 LVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
LVV AN+I ++ ++ + G KE + LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDPGKKEKFTESRPPK---------MRLECLAQYTLHGNVMSM 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G+ RDS++L+F +AK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 81 QAVQLIGSP----RDSLLLSFREAKLSVVEYDPEIHSLRTVSLHYFEEEE---IKDGWTN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+GRC +L+YG ++++L + GD I SS++I
Sbjct: 134 HHHVPIVRVDPEGRCAVMLIYGRKLVVLPFRKDPILDEGDLIENPKSSSHKTPILSSYMI 193
Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L+ L+ M ++ D F+HGY EP ++IL+E T+AGR++ + TC + A+S++ K
Sbjct: 194 VLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQKV 253
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQ 353
HP+IWS NLP D Y+ +AV P+GG L++ N++ Y +QS ++LN+ + +
Sbjct: 254 HPIIWSVSNLPFDCYQAVAVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLTDNCTNFP 313
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
P+ + L+++ ++ D ++S KTG+L +L++ D R V+ K SVLT
Sbjct: 314 LKPQEGVKISLESSQVAFISPDRLVISLKTGELYVLSLFADSMRSVRGFHFDKAAASVLT 373
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEEFGDIEADAPSTKRLRRS- 470
S + ++ FLGSRLG+SLL++FT + S L+ + TK+++
Sbjct: 374 SCVCLCDDNYLFLGSRLGNSLLLRFTEKESEKINDISMLEMSLNSSNSQEQPTKKIKLDY 433
Query: 471 -----SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG---- 521
+SD L D+ + EEL +YGS + T ++ F V DSL+NIGP + S G
Sbjct: 434 LEDWMASDVL-DIKDPEELEVYGSET-QTSIQITSYIFEVCDSLLNIGPCGNISMGEPAF 491
Query: 522 ----LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTVYHKSSRG 560
N++ + +G K +++ LPG + IWTV +
Sbjct: 492 LSEEFSNNSEPDVELVTTSGYGKNGALCVLQRSIRPQVITTFDLPGYENIWTVIDSTVSD 551
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
+ A + H +LI++ + TMVL+T + EV + + QG TI AGNL
Sbjct: 552 NRAKTETEGT-----HGFLILTQDDSTMVLQTGQEINEVVDQSGFSTQGTTIFAGNLGSN 606
Query: 621 RRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
R +IQV + G R+L G +M DL ++ S ADPYV L
Sbjct: 607 RYIIQVTQMGVRLLQGLEQIQHMPMDLG--------------CPIVHASCADPYVSLLSE 652
Query: 677 DGSIRLL 683
DG + LL
Sbjct: 653 DGQVVLL 659
>gi|74212803|dbj|BAE33365.1| unnamed protein product [Mus musculus]
Length = 741
Score = 308 bits (788), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667
>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
musculus]
Length = 1417
Score = 307 bits (786), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 56 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 108
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 109 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 161
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 162 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 218
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 219 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 278
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 279 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 338
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 339 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 398
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 399 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 456
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 457 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 512
Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 513 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 572
Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
K+ S+ A D H +LI+S E TM+L+T + E+ S + QG T
Sbjct: 573 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 631
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
+ AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV
Sbjct: 632 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 681
Query: 672 LLGMSDGSIRLLV 684
++ ++G + + +
Sbjct: 682 VIMSAEGHVTMFL 694
Score = 285 bits (730), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 809 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 864
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 865 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 914
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 915 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 974
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 975 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1034
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1035 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1087
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1088 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1138
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
V G LKG ++A L QG +
Sbjct: 1139 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1158
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1159 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1216
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+F
Sbjct: 1217 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1276
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
HVGAHV F R + GAA G K N+ F TLDG IG + P+ E T
Sbjct: 1277 HVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1329
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+
Sbjct: 1330 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1389
Query: 1404 LEIAHQTGTTRSQILSNL 1421
E+A + GTT IL +L
Sbjct: 1390 SELAKKIGTTPDIILDDL 1407
>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Pan troglodytes]
Length = 1296
Score = 306 bits (783), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNEALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + S A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 195/579 (33%), Positives = 292/579 (50%), Gaps = 41/579 (7%)
Query: 860 GPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA-PCQRITIFK 918
G E + DD S S S S+ R S+ P D R+ P A P + +
Sbjct: 732 GSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPAD---RDPAPFRAEPTHWCLLVR 788
Query: 919 NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
F+ G P W +V R LR+HP DG + +F HNVNC GF+Y QG
Sbjct: 789 ENGTMXXXFICGPSPPWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGE 848
Query: 978 LKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLL 1035
L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S P +
Sbjct: 849 LRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCAR--------- 899
Query: 1036 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALT 1093
I + G + + + + + E + ++++ P W+ A I +Q E+
Sbjct: 900 IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTC 955
Query: 1094 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE----- 1147
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 956 MKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKV 1015
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKNF
Sbjct: 1016 LYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNF 1074
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
IL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++
Sbjct: 1075 ILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVY 1134
Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRF 1322
Y P+ ES+ G +LL RA+FHVGAHV F R + GA G K N+
Sbjct: 1135 MYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKH 1187
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1188 ITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNA 1247
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1248 VRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1286
>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
[Harpegnathos saltator]
Length = 1214
Score = 306 bits (783), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 204/697 (29%), Positives = 343/697 (49%), Gaps = 76/697 (10%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 570 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYMLHDSMESTTLQSAPINETLNPE------ 623
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
++V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 624 -------LQVREVLMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGHLK-------- 665
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
L + + SR P E+ P A RI + F NI+G+ G F+
Sbjct: 666 --LRFKKLDHGIIPG-HLSRKP----KEEDVPVNANETRICMMRYFSNIAGYNGVFICSD 718
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DGS+ +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 719 YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 778
Query: 991 NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
WPV+K+PL+ TPH +TY E Y +I S +PL + +
Sbjct: 779 APWPVRKVPLRCTPHFVTYHLESKTYCVITSTS--EPLKSY---------------YRFN 821
Query: 1051 SVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTLF 1100
D + +T E+ R L P + + W+T I + E+ ++ V+L
Sbjct: 822 GED--KEFTEEDRPERFLYPSQEQFCIVLFSPVSWETIPNTKIELDQWEHVTCLKNVSLA 879
Query: 1101 NTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1154
T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE K
Sbjct: 880 YEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQK 939
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
G I+A+ + G L+ A G KI + + +L GIAF D +Y+ + +K+ IL+ D++
Sbjct: 940 GPITAITQVSGFLVTAVGQKIYIWQLKDNDLVGIAFIDTQ-IYIHQMLSIKSLILIADVY 998
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
KSI L ++E+ L+L+++DF + + E+LID + L +++D + N+ +F Y P+
Sbjct: 999 KSISLLRFQEKCRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLIADGESNLALFMYQPESR 1058
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDGS 1332
ES GQKL+ +A+FH+G + F R++ T +SD+ SD + ++ +LDGS
Sbjct: 1059 ESLGGQKLIRKADFHLGQKINTFFRIKCRVTDVASDKKHF---SDADKKHVTMYASLDGS 1115
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+G + P+ E T+RRL LQ LV + H+AGLNP+++R + S + I+D +L+
Sbjct: 1116 LGYVLPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDLV 1175
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
Y LP E+ ++A + GT +I+ ++ ++ T+
Sbjct: 1176 WRYLSLPNNEKADVAKKIGTRVQEIIEDITEIDRQTA 1212
Score = 216 bits (549), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 152/475 (32%), Positives = 243/475 (51%), Gaps = 52/475 (10%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
M +V D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + HP+IWS
Sbjct: 1 MDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVHPIIWSVS 60
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQELPRSSFS 361
NLP D Y+ + V P+GG L++ N++ Y +QS ++LN+ A + + P+
Sbjct: 61 NLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTNFPLRPQDGVK 120
Query: 362 VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGN 420
+ L+ A +L D ++S K+G+L +L++ D R V+ K SVLTS + +
Sbjct: 121 ISLEGAQVAFLSADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASVLTSCVCMCED 180
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRRS------SSD 473
+ FLGSRLG+SLL++FT ++ S E D + + P K+ ++ +SD
Sbjct: 181 NYLFLGSRLGNSLLLRFTEKEPETIKSLDDGEINIEDNDNEEPPAKKAKQDFLGDWMASD 240
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----------R 523
L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S G
Sbjct: 241 VL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFLSEEFAHN 298
Query: 524 INAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
N D + +G K ++ ELPGC+ +WTV G + ++
Sbjct: 299 QNPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI-----GSLNNDEQV 353
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
+ + HA+LI+S E TMVL+T + EV +S + QG T+ AGNL R ++QV +
Sbjct: 354 KSETEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLGANRYIVQVTQ 412
Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
G R+L G Q + ++ S ADPYV+L DG + LL
Sbjct: 413 MGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVILLSEDGQVMLL 457
>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1428
Score = 305 bits (781), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 219/677 (32%), Positives = 347/677 (51%), Gaps = 86/677 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542
Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G ++ A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 543 RKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + + ++ ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668
Score = 265 bits (676), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 168/556 (30%), Positives = 274/556 (49%), Gaps = 35/556 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 783 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 839 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 889 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 949 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S I + G + + +
Sbjct: 1009 PWPVRKIPLRCTAHYVAYHVESKVYAVATS-------TNTPCARIPRMTGEEKEFETIER 1061
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1108
+ + E + ++++ P W+ A I +Q E+ ++ V+L + T +
Sbjct: 1062 DERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLK 1117
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL 1163
+A GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1118 GYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1177
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KSI L ++
Sbjct: 1178 NGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQ 1236
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL
Sbjct: 1237 EESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLL 1296
Query: 1284 SRAEFHVGAHVTKFLR 1299
RA+FHVGAHV F R
Sbjct: 1297 RRADFHVGAHVNTFWR 1312
>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1439
Score = 304 bits (779), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 205/688 (29%), Positives = 333/688 (48%), Gaps = 53/688 (7%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V C E+G LE++ +P+ F V F G +VD S S TG
Sbjct: 776 WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 820
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+++ +V E+ + + ++ A++ D I+ Y+A+ P NT + +
Sbjct: 821 DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 875
Query: 873 TSRSLSVSNVSASRLRNLRFSRTP----------LDAYTREETPHGAPCQRITIFKNISG 922
R + + + + R S+ P + R+ F N+
Sbjct: 876 V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKTETETEAESDTKTQTRRVNRLREFHNVQT 934
Query: 923 HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+SGS P W V R LR HP DG+I F HNVNC +GF+Y + L+IC
Sbjct: 935 YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 994
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
LPS +YD WPV+K+PL+ TPH + Y E Y ++ SV Q + + G
Sbjct: 995 VLPSHLSYDAPWPVRKVPLRCTPHFVAYHVETKTYAVVTSV-------QETKTHVWKVTG 1047
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-F 1100
+I + D T + +++ P TR I +++EN ++VV L
Sbjct: 1048 EEIGEEPVERDDRFVPTTKVVFSIQLFSPVSWDAIPNTR--IEYEAAENVTCLKVVNLSC 1105
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1155
T + + + T +V ED+ RG V ++ P +N + +Y K KG
Sbjct: 1106 EGTMTGKKGYVVVATTHVYSEDLQTRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKG 1165
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
+SAL + G LL G K+ + ++ +L G+AF D +Y+ + VK FIL+ D+ K
Sbjct: 1166 PVSALCEVMGFLLTCIGQKVYMWQFKDNDLIGLAFIDT-QIYIHNAVSVKQFILITDVMK 1224
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
YFL ++ Q L+L+++D L+ F EF++D ++ +VSD KN+ +F+Y P+ E
Sbjct: 1225 GAYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPE 1284
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLAT--SSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
S G LL R + ++G+ V F+R++ T S+++ + P R + F TLDGS+
Sbjct: 1285 SHGGAYLLRRGDMNIGSAVNTFVRVRCRLTDPSTEQVLSGP---VLRRQVVFFATLDGSL 1341
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
G + P+ E T+RRL LQ L + +PHV GLNP+S+R S+ + +I+D +LL
Sbjct: 1342 GLLLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLL 1401
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
Y L + E+ E A + GT+ QI+S+L
Sbjct: 1402 KYCHLSVVERNEFAKKIGTSVDQIISDL 1429
Score = 286 bits (732), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 219/726 (30%), Positives = 349/726 (48%), Gaps = 105/726 (14%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ +C H + P ++ NLVV
Sbjct: 2 YAFYREIHPPTGVEHC---VYCHFFS------------------PDQQ------NLVVAK 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+ + +Y + + + +K + + K + LE + + G V S+ Q
Sbjct: 35 GSELTVYSM-ITVDSNKPTDKESKPKNK----------LEEAATFHIFGKVMSM----QS 79
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
RD+++L+F +AK+S++E+D ++H L+ SMH FE E K G P+
Sbjct: 80 AQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHPV 136
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
VKVDP RC +L YG ++++L + GLV D D S + S+VI L ++D
Sbjct: 137 VKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEMD 194
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ +V D F+HGY EP ++IL+E TWAGRV+ + TC I ALS++ K HP+IWS
Sbjct: 195 DPICNVLDIQFLHGYYEPTLLILYEPLRTWAGRVAVRQDTCSIVALSLNMAQKVHPIIWS 254
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS----SQELP 356
+LP+D ++ AVP PIGGVL++ N++ Y +QS + Y VSL+S S P
Sbjct: 255 QSSLPYDCMQVQAVPKPIGGVLILAVNSLLYLNQS------IPPYGVSLNSLTDWSTAFP 308
Query: 357 ---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + +D AT++ D LS K G++ +LT++ DG R V+ L K SVLT
Sbjct: 309 LKTQEGVKLSMDCTQATFISYDRLALSLKDGEIYVLTLLVDGMRSVRGFHLDKAAASVLT 368
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+ I +G+ FLGSRLG+SLL+++T + S K E + PS K +S
Sbjct: 369 TCICPMGDGFLFLGSRLGNSLLLKYTEKVSETSPSDASKTEEPKPGEEPPSKKMRSDDAS 428
Query: 473 DALQD----MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
D + + + +EL +YG T + ++SF + DSL+NIGP + G
Sbjct: 429 DWMASDTKFLDDPDELEVYGKQVQKTGTQLTSYSFEICDSLLNIGPCGNMIMGEPAFLSE 488
Query: 522 -LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTV--YHKSSRGH 561
+ N D + +G K +++ LPGC +WTV K+
Sbjct: 489 EFQGNVDPDLELVTTSGYGKNGALSVLQRTIRPQVVTTFNLPGCLDMWTVKSLKKAKADE 548
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
++ S + D + HA+LI+S + +MVL+T +TEV + Q TI A N+ R
Sbjct: 549 KSEESETSPEDKDRHAFLILSKQDSSMVLQTGQEITEVAAG-GFSTQAPTIFASNMGDDR 607
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
++QV + +++G Q + S + S+ADPY+LL +G
Sbjct: 608 YIVQVMNKSICLMEGVEQIQHMVL----------DVGSPIKQCSLADPYLLLLTENGDPI 657
Query: 682 LLVGDP 687
L+ P
Sbjct: 658 LMTLKP 663
>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666 SS1]
Length = 1438
Score = 304 bits (778), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 369/1498 (24%), Positives = 636/1498 (42%), Gaps = 210/1498 (14%)
Query: 47 PSKRGIGPVPNLVVTAANVIEIYVVRV------------QEEGSKESKNSGETKRRVLMD 94
P + P+ NLVV +N++ I VR +E K + + V MD
Sbjct: 31 PDSQKALPLFNLVVARSNLLRILEVREVPTLRPIHLDDERERRGNVRKGTEPVEGEVEMD 90
Query: 95 ---------GISAAS----------LELVCHYRLHGNV---ESLAILSQGGADNSRRRDS 132
G S AS V YRLHG V E++ I+S D
Sbjct: 91 EQGEGYVNMGASTASNGAPRPTVLRFYFVRDYRLHGTVTGLETVRIMSS----LEDEMDR 146
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRC 191
++++F+DAKI++LE+ H L S+H +E +P+ L L +S ++ DP +C
Sbjct: 147 LLVSFKDAKIALLEWSTDTHSLSTVSIHTYERAPQLLSL----DSNMFTAQLRTDPLSQC 202
Query: 192 GGVLVYGLQMIILKASQGGSGL-VGDED-TFGSGGGFSARIESSHVINLR---DLDMKHV 246
+ + IL Q L V D+D T +S S +++L D +++V
Sbjct: 203 AALSLPKDAFAILPFYQTQVDLDVMDQDQTRARDVPYSP----SFILDLAAEVDERIRNV 258
Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
DF+F+ G+ P + +L + + TW GR+ T + +++ + +P+I S LP+
Sbjct: 259 VDFVFLPGFSHPTVAVLFQAQQTWTGRLKEYKDTMRLFIFTLNVVTRSYPIITSVEGLPY 318
Query: 307 DAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS---- 361
D ++ P+ +GGV+V+ +N+ IH S ALA+N + + ++P ++ +
Sbjct: 319 DCLSVVPCPAALGGVVVLTSNSVIHIDQASRRVALAVNGW---MPRVSDMPVTALAQGDQ 375
Query: 362 --VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSD 414
+EL+ + T++ + + K G + + DG+VV +L +S T PSV
Sbjct: 376 GRLELEGSRMTFVDDKTLFIVLKDGTIHPVEFFVDGKVVSKLSISPPLAQTTTPSV---- 431
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
I I N FF+GS G S L++ SG++E+ D + + K + D+
Sbjct: 432 IRKITNEHFFVGSTAGPSALLKV----------SGVEEDIQD-DVEEIDGKTAPAAVVDS 480
Query: 475 LQDMVNGEELSLYGSA--------------SNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ M ++ LYGS+ + +T + ++ DSL GP+ D ++
Sbjct: 481 VDGMDIDDDDDLYGSSKADPTPTANGNAVETTSTTRKRTVIHLSLCDSLPAHGPISDMTF 540
Query: 521 GLRINAD------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGH 561
+ N D +ATG + L + LP G +G+W++ + +
Sbjct: 541 SMTKNGDRAVPELVAATGSGLLGGFTLFQRDLPIRTKRKLHAIGGARGVWSLPVRQAVRV 600
Query: 562 NADSSRMAAYD-DEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQG-RTIAAGNL 617
N S + + +IIS +A + A ++ ++ + G T+ A
Sbjct: 601 NGVSYQTPQNPLRSDNDTIIISTDATPSPGISRIATRSSKTDLNITTRIPGVTTVGAAPF 660
Query: 618 FGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
F ++ V R+L DGS P G+ + + + + SI DPYV +
Sbjct: 661 FQGTAILHVLSNAIRVLEPDGSERQ------PIKDMDGN-NYRAKIKNCSICDPYVFVLR 713
Query: 676 SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKGP-----EPWLRKTSTD 728
D +I L +G+ + + +P ++S+ ++ C G L ++T
Sbjct: 714 EDETIGLFIGETERGKIRRKDMSPMGDKTSRY-IAGCFFSDTTGTFQAHVNSSLNGSNTT 772
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
+T +++ A Q + ++ G +EI+ +P VF+ + + +V
Sbjct: 773 KQNATSTLQSVMNAG-----QKTQWLLLVRPQGVMEIWTLPKLTLVFSTTALATLQPLLV 827
Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
D+ AL S Q RK + + ++ + RP LF +L G
Sbjct: 828 DSLDPPAL----------SSLPQDQPRKP--QELDIDQILVAPLGETSPRPHLFVLLRSG 875
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--P 906
+ Y+A FE P + DP SR S+ V ++ + F D +E++
Sbjct: 876 QLAIYEAVSFELP-----TGDPEPASRP-SILPVKLVKVLSRAFDIQHPDEQPQEKSVLA 929
Query: 907 HGAPCQRITI-FKNISGHQ----GFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
QR+ I F + G F +G RPCW + + +RVH + +FT
Sbjct: 930 ELKKIQRLFIPFVTSPAPEKTFTGVFFTGDRPCWILGTDKGGIRVHSS-GHAVVHSFTPC 988
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY--FAEKNLYPL 1018
+ F+ T +G + +P + ++P + P Y L
Sbjct: 989 SLWDSKGDFLLYTDEGPCLLEWMPDVQLH------TELPSRFMPRSRAYTNVVFDPFTCL 1042
Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1078
IV LK Q S D + D N+S T + + ++ PD W
Sbjct: 1043 IVGAASLK--AQFTSFDEDGNQTWEPDAPNISYP------TTDCSTLELITPDA----WL 1090
Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENE-TLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
T S+E V V L +T + + +A+GT +GED+A +G +F
Sbjct: 1091 TMDGYEFASNEIVNAVECVMLETQSTDSGQKSFIAVGTTINRGEDLAVKGATYIFEIVEV 1150
Query: 1138 ADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAF 1190
+P V + + KG ++AL + G+L+ + G KI + E L G+AF
Sbjct: 1151 VPDPSFGVKRWFKLRMRCRDDAKGPVTALCGMDGYLVSSMGQKIFVRALDLDERLVGVAF 1210
Query: 1191 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLID 1249
D +YV SL +KN +++ D KS++F++++E +L +LAKD + CF + +F
Sbjct: 1211 LDVG-VYVTSLRALKNLLIISDAVKSVWFVAFQEDPYKLTVLAKDAQQV-CFTSADFFFA 1268
Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
LS+V DE+ +++++Y P ES GQ+LL AEFH L + R
Sbjct: 1269 NQQLSIVTCDEEGILRMYHYNPHDPESKNGQRLLCHAEFHGQIEYRSSLTIA-------R 1321
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
P ++ + L+ G+ DGS+ + P++E F+RL LQ +L +V H A LNPR+F
Sbjct: 1322 RTKGPDTE-IPQAKLICGSPDGSLSALVPVEEAAFKRLHLLQGQLTRNVQHTAALNPRAF 1380
Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
R N + +D LL +E LP+ Q+EI Q GT R +L + +ALG
Sbjct: 1381 RAVR-NEYVSKTLHKGFLDGLLLRSFEDLPVSRQIEITRQIGTERRLVLKDW--VALG 1435
>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 984
Score = 304 bits (778), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 287/1106 (25%), Positives = 465/1106 (42%), Gaps = 238/1106 (21%)
Query: 423 FFLGSRLGDSLLVQFT--CGSGTSMLSSGLKE---EFGDIEADAPSTKRLRRSSSDALQD 477
FLGSRLG+SLL+++ T +++G K+ + + + P+ K+ +SD +
Sbjct: 6 LFLGSRLGNSLLLKYVEKAQESTDSVTNGAKKTEEDEETNKEEPPNKKKRTDDASDWIAS 65
Query: 478 MV-----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRI 524
V + +EL +YGS + + +++F V DS++NIGP G +
Sbjct: 66 DVALLAEDVDELEVYGSQTQ-AGTQLTSYTFEVCDSIMNIGPCTKAVMGEPVFLSEEFQT 124
Query: 525 NAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSR 567
N D + +G SK ++ ELPGC +WTV + N D +
Sbjct: 125 NPDPDMELVALSGYSKNGALSVLQRSIRPQVVTTFELPGCIDMWTVVGPPEK-ENKDQPK 183
Query: 568 MAAYDD---------EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
++ HA+LI+S + +M+L T + E+ S + QG T+ AGNL
Sbjct: 184 EKTEEEGDKKPDALTNGHAFLILSRDDSSMILSTGQEIMELDHS-GFSTQGPTVYAGNLG 242
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
++QV G R+L+G Q + S ++ S++DPY LL G
Sbjct: 243 NNAYILQVSPMGVRLLEGVNQLQHIPL----------DLGSPIVLCSVSDPYALLMSEKG 292
Query: 679 SI--------------RLLVGDP---------STCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ RL + P + C + SSK +S
Sbjct: 293 ELVLLTLKPDGFAGGHRLAISRPQIPQISRILTLCAYKDTSGMFTTSSKMESTSDETEEK 352
Query: 716 KGPEPWLRKTSTDAWLSTG-------VGEAIDGADGGPLDQGD----------------- 751
K +P + S + +S GE+ D + P + +
Sbjct: 353 KITKPSVADISMTSEISNVDDEDEMLYGES-DASLFSPTKKEEKSSFLQTREVLSETKPT 411
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
+ + E+G LEI+ +P+F F V F G +VD+Y ++ +S+ G+
Sbjct: 412 YWCAMSRENGVLEIYSLPDFKLAFLVKNFPMGFKVMVDSY----------QMTASAPGGS 461
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ +++ V EL + + + L A + D + Y+A+
Sbjct: 462 SKSDQQHDMMPIVKELLLIGLGHKNKKTHLLARV-DEDLYIYEAF--------------- 505
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
T S+ N LR LRF + F+ G
Sbjct: 506 -THDQSSLDN----HLR-LRFRKV-------------------------------FVCGP 528
Query: 932 RPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W M R LR HP DGS+ F HN+NC GF+Y G L+IC LP+ +YD
Sbjct: 529 YPHWLFMTSRGALRSHPMHIDGSVTCFAPFHNINCPKGFLYFNKHGELRICVLPTHLSYD 588
Query: 991 NYWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNHN 1048
WPV+K+PL+ TPH I+Y E Y ++ SV P L+ I + G + +
Sbjct: 589 ALWPVRKVPLRCTPHFISYHIESKTYAVVTSVSEPCLR---------ICKMTGDDKEFED 639
Query: 1049 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1106
+ D T+E++ +++ P W+ I + E+ ++ V L + T
Sbjct: 640 VERDDRFIFPTIEKFSLQLFSP----LSWEAIPNTKIDTEDWEHITGLKTVFLKSEGTVS 695
Query: 1107 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISAL 1160
+ +A+ T V GE+V RGR+L+F P +N + +Y KE KG ++ L
Sbjct: 696 GLKGFIAVSTTIVYGEEVTCRGRILIFDVIEVVPEPGQPLTKNKLKLLYDKEQKGPVTTL 755
Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
++G L A G KI L + +L G+AF D +++ +L +KNFIL DI KS+ L
Sbjct: 756 CDIEGLLAAAIGQKIFLWAFRNNDLIGVAFIDT-QIHIHTLCTIKNFILAADIRKSVSLL 814
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
+ ++ L+L+ + ES+ GQ
Sbjct: 815 RFSDEDRSLSLVTR----------------------------------------ESFGGQ 834
Query: 1281 KLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
+LL RA+F+ G+HV F R+ ++ ++++ P R +F TLDGSIG + P
Sbjct: 835 RLLRRADFNAGSHVCSFFRMRSKLSDPATEKLLTGPME---RRHVTMFATLDGSIGYLIP 891
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L H AGLNP+ FR K+ +I+D +LL Y L
Sbjct: 892 MTEKTYRRLLMLQNALTTQTLHTAGLNPKGFRMVKHQTKSLENTHKNILDGDLLWKYTFL 951
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ E+ E+A + GT+ QIL +L D+
Sbjct: 952 SVNERTELAKKIGTSVEQILDDLMDV 977
>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Apis mellifera]
Length = 1415
Score = 303 bits (776), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 204/697 (29%), Positives = 338/697 (48%), Gaps = 77/697 (11%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 772 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 826 -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868
Query: 875 RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
R + L P L R+E R + F NI+G+ G F+
Sbjct: 869 -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 918 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977
Query: 989 YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
YD WPV+K+PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 978 YDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1020
Query: 1049 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1098
+ D + +T EE R + P + + W+T I + E+ ++ V+
Sbjct: 1021 FNGED--KEFTEEERPDRFIFPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVS 1078
Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1152
L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE
Sbjct: 1079 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1138
Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D
Sbjct: 1139 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIAD 1197
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++KSI L ++E+ L+L+++DF + + E+LID + L +V+D + NI +F Y P+
Sbjct: 1198 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNIALFMYQPE 1257
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
ES GQKL+ +A+FH+G V F R++ S SD R ++ +LDG+
Sbjct: 1258 SRESLGGQKLIRKADFHLGQKVNTFFRIR-CRISDPANDKKHFSDADKRHVTMYASLDGN 1316
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+G I P+ E T+RRL LQ LV + H+AGLNP+++R + S+ + I+D +L+
Sbjct: 1317 LGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLV 1376
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
Y LP E++++A + GT +I+ +L ++ T+
Sbjct: 1377 WRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDRQTA 1413
Score = 293 bits (751), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 209/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV AN+I ++ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ + + E + K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
G N D + +G K ++ ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
G + ++ + HA+LI+S E TM+L+T + EV +S + QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
NL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650
Query: 676 SDGSIRLL 683
DG + LL
Sbjct: 651 EDGQVMLL 658
>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 isoform 2 [Cricetulus griseus]
Length = 1388
Score = 303 bits (775), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 77/671 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN G T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE E L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ ++
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429
Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485
Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------HK 556
L I + + + + K ++V ELPGC +WTV +
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545
Query: 557 SSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
+ R + + ++ A D H +LI+S E TM+L+T + E+ S + QG T+
Sbjct: 546 APRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AGN+ R ++QV G R+L+G L F P + + ++ ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654
Query: 674 GMSDGSIRLLV 684
++G + + +
Sbjct: 655 MSAEGHVTMFL 665
Score = 285 bits (730), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 780 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 836 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 886 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 946 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1006 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 1058
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1059 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1109
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
V G LKG ++A L QG +
Sbjct: 1110 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1129
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1130 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1187
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+F
Sbjct: 1188 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1247
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
HVGAHV F R + GAA G K N+ F TLDG IG + P+ E T
Sbjct: 1248 HVGAHVNTFWR-------TPCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKT 1300
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+
Sbjct: 1301 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1360
Query: 1404 LEIAHQTGTTRSQILSNL 1421
E+A + GTT IL +L
Sbjct: 1361 SELAKKIGTTPDIILDDL 1378
>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
Length = 1532
Score = 302 bits (774), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 207/680 (30%), Positives = 334/680 (49%), Gaps = 75/680 (11%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ T IN
Sbjct: 740 LVYRDSGTLEIYSLPDLRLSYLIRNFGFGQYVLHDSMESTTLQS--TPINEIPHP----- 792
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQ Y + P+ K
Sbjct: 793 ------DMQVREILMVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGYLK-------- 835
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
L + + R S P E+ P RI + F NI+G+ G F+
Sbjct: 836 --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNTSDTRICVMRYFSNIAGYNGVFICSD 888
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DGS+ +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 889 YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 948
Query: 991 NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
WPV+K+PL+ TPH +TY E Y +I S +PL + +
Sbjct: 949 APWPVRKVPLRCTPHFVTYHLESKTYCVITSTA--EPLKSY---------------YRFN 991
Query: 1051 SVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTLF 1100
D + +T EE R L P + + W+T I + E+ ++ V+L
Sbjct: 992 GED--KEFTEEERPDRFLYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVSLA 1049
Query: 1101 NTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1154
T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE K
Sbjct: 1050 YEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQK 1109
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
G I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D++
Sbjct: 1110 GPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIADVY 1168
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
KSI L ++E+ L+L+++DF + + E+LID + L +V+D + N+ +F Y P+
Sbjct: 1169 KSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFIVADGESNLALFMYQPESR 1228
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSI 1333
ES GQKL+ +A+FH+G V F R++ T ++ G+DK R ++ +LDGS+
Sbjct: 1229 ESLGGQKLIRKADFHLGQKVNTFFRIRCRVTDPANDKKQFSGADK--RHVTMYASLDGSL 1286
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELL 1392
G I P+ E T+RRL LQ LV + H+AGLNP+S+R + + ++ P I+D +L+
Sbjct: 1287 GYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDLV 1346
Query: 1393 SHYEMLPLEEQLEIAHQTGT 1412
Y LP E+ ++A + GT
Sbjct: 1347 WRYLFLPNNEKADLAKKIGT 1366
Score = 293 bits (749), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 206/621 (33%), Positives = 328/621 (52%), Gaps = 63/621 (10%)
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
LE + Y LHGN+ S+ + G+ +RDS++L+F DAK+SV+E+D IH LR S+
Sbjct: 32 KLECLAQYTLHGNIMSMQAVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSL 87
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-D 218
H FE E +K G + P+V+VDP+GRC +L++G ++++L + S GD D
Sbjct: 88 HYFEEEE---IKDGWTNHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLD 144
Query: 219 TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
T A I SS++I L+ L+ M +V D F+HGY EP ++IL+E T+AGR++
Sbjct: 145 TAKLTSSNKAPILSSYMIVLKSLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFAGRIAV 204
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS- 335
+ TC + A+S++ + HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS
Sbjct: 205 RQDTCAMVAISLNIQQRVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAFNSLIYLNQSI 264
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
++LN+ A + + P+ + L+ A ++ D ++S K+G+L +L++ D
Sbjct: 265 PPYGVSLNSLADTSTNFPLKPQEGVKMSLEGAQVAFISADRLVISLKSGELYVLSLFADS 324
Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEE 453
R V+ K SVLTS + ++ FLGSRLG+SLL++FT ++ + +G +
Sbjct: 325 MRSVRGFHFDKAAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNGGEIT 384
Query: 454 FGDIEADAPSTKRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
+ E++ K+ ++ +SD L D+ + EEL +YGS + +T ++ F V D
Sbjct: 385 IEENESEETPAKKAKQDFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCD 442
Query: 508 SLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV------------ 542
SL+NIGP + S G N D + +G K ++
Sbjct: 443 SLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTF 502
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
ELPGC+ +WTV N D + A + HA+LI+S E TM+L+T + EV +S
Sbjct: 503 ELPGCEDMWTVIGTL----NNDEIKTEA--EGSHAFLILSQEDSTMILQTGQEINEVDQS 556
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
+ QG T+ AGNL R ++QV + G R+L G Q + ++
Sbjct: 557 -GFSTQGSTVFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIV 605
Query: 663 SVSIADPYVLLGMSDGSIRLL 683
S ADPYV L DG + LL
Sbjct: 606 HASCADPYVTLLSEDGQVMLL 626
>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
Length = 1403
Score = 301 bits (770), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 358/1478 (24%), Positives = 627/1478 (42%), Gaps = 214/1478 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + +++++ + GS ++ +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++ +AK+S++E+D H + TS+H +E + +H+
Sbjct: 81 GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133
Query: 173 GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
+ A P + VDP RC VL +G + + IL Q G LV
Sbjct: 134 WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192
Query: 215 -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
DE +G F SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 193 HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T + + + S ++ + + S LP+D +K++A+P P+GG L++G N
Sbjct: 253 ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312
Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
+H + A+ +N +A S +S + L+ + L +N LL G
Sbjct: 313 ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372
Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ +L+ DGR V + L K PS +G F GS DS+
Sbjct: 373 KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
L+ G S S+ K+ D SS + +D + E LY
Sbjct: 429 LI------GWSRPSTRPKDPPVQGAGD---DNIAELSSDEEEEDDEDIYEDDLYATPVPT 479
Query: 488 GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
G+ + + S + T ++F + D L N+GP++D + G + D S +N E
Sbjct: 480 GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 539
Query: 541 LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
LV G G W+V+ K + + S
Sbjct: 540 LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 599
Query: 575 YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
Y YL++S + +++V + E T++ ++ + RTI G L G RV+QV +
Sbjct: 600 YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 659
Query: 629 RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
R D G + Q + SE V+ S ADPYVL+ D S+ LL D
Sbjct: 660 GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 714
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
S ++ I S+ S +LY DK +T+ LS V
Sbjct: 715 SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 760
Query: 748 DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
+ ++ ES L IF +PN VFT + D +I S+
Sbjct: 761 ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 798
Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
+E++ + V ++ + P+L ++ ++ Y+ Y +T+
Sbjct: 799 EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 848
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
S + S S++ + N F + + + + GA + + + ++ G++
Sbjct: 849 STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 901
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
F+ G+ PC+ + + L ++ + + + C GF+YV + ++++C+ P
Sbjct: 902 FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 961
Query: 987 STYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
+ +D W +KI L + Y + Y + S V +L D E+ + N
Sbjct: 962 THFDGSWATRKIGLGEQVDIVEYSSSSETYVIGTSQKV------DFNLPEDDEIHPEWRN 1015
Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1105
+S + +++ V++L P W + ++++E + V+ + L + T
Sbjct: 1016 EVISFLP-----QIDQGSVKLLSPRT----WSIIDSHTLRTAERIMCVKCLDLEVSEITH 1066
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISAL 1160
E ++A+GTA +GED+AARG + +F D P+ + + +E+KGA+++L
Sbjct: 1067 ERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSL 1126
Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
+ + QG L+ A G K I+ K G+ L +AF D YV L +K ++GD
Sbjct: 1127 SGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDAL 1184
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
K I+F + E+ +L+L +KD G+L A +FL DG L ++V+D+ NI + Y P+
Sbjct: 1185 KGIWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPEDP 1244
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTL---- 1329
S KG +LL R+ FH G + L + + S+ A P + + L+ L
Sbjct: 1245 GSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSE 1304
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
GSI I PL E +RRL +LQ +L++++ H GLNPR+FR S+G R +VD
Sbjct: 1305 TGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGGR----GMVDG 1360
Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
+LL + L + + EIAH+ G +I ++L + G
Sbjct: 1361 DLLHRWLDLGTQRKAEIAHRVGADIWEIRADLEAIGKG 1398
>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
Length = 1402
Score = 301 bits (770), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 359/1478 (24%), Positives = 627/1478 (42%), Gaps = 215/1478 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V + +++++ + GS ++ +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++ +AK+S++E+D H + TS+H +E + +H+
Sbjct: 81 GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133
Query: 173 GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
+ A P + VDP RC VL +G + + IL Q G LV
Sbjct: 134 WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192
Query: 215 -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
DE +G F SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 193 HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T + + + S ++ + + S LP+D +K++A+P P+GG L++G N
Sbjct: 253 ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312
Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
+H + A+ +N +A S +S + L+ + L +N LL G
Sbjct: 313 ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372
Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ +L+ DGR V + L K PS +G F GS DS+
Sbjct: 373 KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
L+ G S S+ K D + SSD +D + E LY
Sbjct: 429 LI------GWSRPSTRPK----DPPVQGAGDDNIAELSSDEEEDDEDIYEDDLYATPVPT 478
Query: 488 GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
G+ + + S + T ++F + D L N+GP++D + G + D S +N E
Sbjct: 479 GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 538
Query: 541 LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
LV G G W+V+ K + + S
Sbjct: 539 LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 598
Query: 575 YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
Y YL++S + +++V + E T++ ++ + RTI G L G RV+QV +
Sbjct: 599 YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 658
Query: 629 RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
R D G + Q + SE V+ S ADPYVL+ D S+ LL D
Sbjct: 659 GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 713
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
S ++ I S+ S +LY DK +T+ LS V
Sbjct: 714 SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 759
Query: 748 DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
+ ++ ES L IF +PN VFT + D +I S+
Sbjct: 760 ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 797
Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
+E++ + V ++ + P+L ++ ++ Y+ Y +T+
Sbjct: 798 EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 847
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
S + S S++ + N F + + + + GA + + + ++ G++
Sbjct: 848 STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 900
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
F+ G+ PC+ + + L ++ + + + C GF+YV + ++++C+ P
Sbjct: 901 FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 960
Query: 987 STYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
+ +D W +KI L + Y + Y + S V +L D E+ + N
Sbjct: 961 THFDGSWATRKIGLGEQVDIVEYSSSSETYVIGTSQKV------DFNLPEDDEIHPEWRN 1014
Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1105
+S + +++ V++L P W + ++++E + V+ + L + T
Sbjct: 1015 EVISFLP-----QIDQGSVKLLSPRT----WSIIDSHTLRTAERIMCVKCLDLEVSEITH 1065
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISAL 1160
E ++A+GTA +GED+AARG + +F D P+ + + +E+KGA+++L
Sbjct: 1066 ERRDMIAVGTAVTRGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSL 1125
Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
+ + QG L+ A G K I+ K G+ L +AF D YV L +K ++GD
Sbjct: 1126 SGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDAL 1183
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
K I+F + E+ +L+L +KD G+L A +FL DG L ++V+D+ NI + Y P+
Sbjct: 1184 KGIWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPEDP 1243
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTL---- 1329
S KG +LL R+ FH G + L + + S+ A P + + L+ L
Sbjct: 1244 GSSKGDRLLHRSTFHTGHFASTMTLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSE 1303
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
GSI I PL E +RRL +LQ +L++++ H GLNPR+FR S+G R +VD
Sbjct: 1304 TGSIALITPLSETAYRRLSALQSQLINTLEHPCGLNPRAFRAIESDGIGGR----GMVDG 1359
Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
+LL + L + + EIAH+ G +I ++L + G
Sbjct: 1360 DLLHRWLDLGTQRKAEIAHRVGADIWEIRADLEAIGKG 1397
>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
Length = 1441
Score = 300 bits (768), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 362/1491 (24%), Positives = 607/1491 (40%), Gaps = 232/1491 (15%)
Query: 57 NLVVTAANVIEIYVVRV--------QEEGSKESKNSGETKRRVLMD--GISAA------- 99
NLVV +++++I+ ++ Q+ ++N+G + R+ D G+ A+
Sbjct: 28 NLVVAKSSLLQIFRTKIVSTEIDASQQGSGARTRNAGRYESRLANDDDGLEASFLGGDSL 87
Query: 100 ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
L LV L G + LA + + N R D++++AF+DA++S++E+D
Sbjct: 88 AFKTDRTNNTKLVLVSEISLSGTITGLAKIK---SQNLRSGGDALLVAFKDARLSLVEWD 144
Query: 149 DSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII 203
H L S+H +E E W +F + DP GRC + G+ + I
Sbjct: 145 AERHDLSTVSIHYYEQDELQGSPWAPPLSNFTNF-----LAADPGGRCAALKFGGMNLAI 199
Query: 204 LKASQG---------------GSGLVGDEDTFGSGGGF--SARIESSHVINLRDLD--MK 244
L Q G V E +GG S V+ L +LD +
Sbjct: 200 LPFKQADEDIDMDDDWDEDLDGPRPVKQEAAVVNGGSSIKETPYSPSFVLRLSNLDPSLL 259
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
H F+H Y EP IL + + + K H + ++ + I S L
Sbjct: 260 HPVHLAFLHEYREPTFGILAST-VNASNSLGRKDHLAYM-VFTLDLQQRASTTILSVPGL 317
Query: 305 PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
P D +++ +P+P+GG L+VGAN IH +A+N S +S ++
Sbjct: 318 PQDLFRVQPLPAPVGGALLVGANELIHIDQSGKPNGVAVNPLTKQCTSFGLSDQSDLNLR 377
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT-- 417
L+ L + L+ G + L+T DGR V LD+ S+T S++ ++T
Sbjct: 378 LEECTIDVLSAEELLVILSDGRMALVTFRIDGRTVSGLDVKLLPSETGGSLIPGRVSTLS 437
Query: 418 -IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDAL 475
IG S+ F GS GDSL+ +T S ++ G DI+ D
Sbjct: 438 RIGKSVMFAGSEEGDSLVFGWTKKQNQSGRKKSRLQDVGLDIDMADEEDLDEDEDEDDLY 497
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ 529
+ ++ ++ +ASN E +F + D L++I P++ +YG ++A S
Sbjct: 498 AEEPTPKQQAV-ATASNVKEG---DLTFRIHDRLLSIAPIQSMTYGQPVDAPGSEEEQNS 553
Query: 530 -----------ATGISKQSNYELV------------ELPGCKGIWTVYHKSS--RGHNAD 564
G +K S ++ E P +G WTV K + D
Sbjct: 554 AGVRSELQLVCGVGRNKSSAMAIMNLAIPPKVIGRFEFPEARGFWTVCAKKPVPKSLQGD 613
Query: 565 SSRMAAYDD-----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIA 613
A +D +Y ++I++ E + TA +T + G TI
Sbjct: 614 KGPGAIGNDYGTSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLTGTEFDPAAGFTIE 673
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
AG + R+IQV + R DG + P E +T S SIADPY+L+
Sbjct: 674 AGTMGKDNRIIQVLKSEVRCYDGDLGLSQIV--PMMDEETGAEPRAT--SASIADPYLLI 729
Query: 674 GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
+D S+ + S+ +E +K DK +T WL+
Sbjct: 730 IRNDQSVFI---------ASIHDDNELEEVEK--------EDK-------TLATTKWLTG 765
Query: 734 GVGEAIDGADGGPLDQGD--------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
+ +G G + GD I + SGAL I+ +P+
Sbjct: 766 CLYTDTNGVFGE--ESGDKKAKLPESILMFLLSASGALYIYRLPDL-------------- 809
Query: 786 HIVDTYMREALKDSETEINS--SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
Y+ E L T +++ ++ +GT KE + + V +L P+L
Sbjct: 810 -CKPVYVAEGLSYIPTGLSADYAARKGTA---KETVSEILVADLG----DTTAKSPYLIL 861
Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
+ + Y+ Y ++ + ++L + S L +++P + E
Sbjct: 862 RHANDDLTMYEPYRYQLGAG-------LEFPKTLFFQKIPNSVL-----AKSPAEETDDE 909
Query: 904 ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
E H A C + NI G+ FL G P + + + + L ++ A + H
Sbjct: 910 EVTHQAKCLALRRCNNIGGYSTVFLPGPSPSFIIKSSKSMPKVLPLQGAAVTAISSFHTE 969
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
C HGFIY S I+++ QLP ++ + V+KIP+ + Y Y + +
Sbjct: 970 GCEHGFIYADSHNIVRVSQLPKDWSFAETGLAVKKIPIGEDIVAVAYHPPSQSYVVACNT 1029
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD-LHRTYTVEEYEVRILEPDRAGGPWQTRA 1081
P +P E+ D H + + L T+E ++++ P W
Sbjct: 1030 P--EPF----------ELPRDDDYHKEWAREVLPFKPTLERGTLKLIGPIT----WTVVD 1073
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
TI M+ EN L V + L + T E + L+ +GTA +GED+ RG V +++
Sbjct: 1074 TIVMEPCENVLCVETLNLEVSEATNERKLLIGVGTAITKGEDLPTRGAVYVYNVADVIPE 1133
Query: 1141 PQNLVT----EVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAF 1190
P T ++ +KE +GA++AL+ + QG +L+A GPK ++ K GT L +AF
Sbjct: 1134 PGKPETGKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGPKCMVRGLKEDGTLLP-VAF 1192
Query: 1191 YDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
D YV S + L+ D K ++F + E+ ++ L K L+ +FL
Sbjct: 1193 MDM-NCYVTSAKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSNTRLEVLNADFLP 1251
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQMLATSS 1307
+G LS+V D + +I I + P+ +S +G LL R F GA HVTK L L +
Sbjct: 1252 NGKELSIVACDAEGHIHILQFDPEHPKSLQGHLLLHRTSFSTGAHHVTKSLLLPSTLSPD 1311
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
++ + LL + G + + PL E +RRL SL +L +S+ H AGLNP+
Sbjct: 1312 NKEDNEENGATSRPHILLLASPTGVLAALRPLSETAYRRLSSLAAQLTNSLTHAAGLNPK 1371
Query: 1368 SFRQFHSNGKAHRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
+R + G D+ IVD +L+ + L ++ E+A + G T
Sbjct: 1372 GYRM--PSATCPPAGVDAGIGRHIVDGTILARFSELGRAKRGEVAGRAGYT 1420
>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
Length = 1393
Score = 300 bits (767), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 337/1463 (23%), Positives = 628/1463 (42%), Gaps = 198/1463 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD + S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
++ ES+ + V G TI+ G R+RV+QV + R D +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
TQ ++ ++ +S S+AD + + D ++ L D S V
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719
Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+ S K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
L ++ V + + ++ + G + ++ SSE G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
V +L + WSA P+L ++ Y+ ++ VST +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S+ N R P + + + + + I +ISG F+ G+ + +
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRT 905
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
+L + + + L C+ GFIY+ SQ ++ C+LP + +D W ++++
Sbjct: 906 SASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRVH 965
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTY 1058
L + Y +Y VL + L D E+ + N +S R
Sbjct: 966 LGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGS 1018
Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAY 1117
+ +++ P+ W + + + E + ++ ++L + T E + ++ +GTA+
Sbjct: 1019 FI-----KLVSPNT----WSIIDSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAF 1069
Query: 1118 VQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIA 1170
+GED+ +RG + +F + +P + T+ + + +KGA++AL+ + QG +L+A
Sbjct: 1070 ARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVA 1129
Query: 1171 SGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQG 1226
G K ++ K G+ L +AF D YV + +K +LGD K ++F + E+
Sbjct: 1130 QGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGVWFAGYSEEP 1187
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+++L AKD L+ A EFL DG L +VV+D NI + Y P+ +S G +LLSR+
Sbjct: 1188 YKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRS 1247
Query: 1287 EFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDEL 1342
+FH+G + L SS++ ++ G D N+ L L T +GS+G I + E
Sbjct: 1248 KFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEE 1307
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
++RRL +LQ +L +++ H GLNPR+FR S+G A R ++D LL + + +
Sbjct: 1308 SYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQR 1363
Query: 1403 QLEIAHQTGTTRSQILSNLNDLA 1425
+ EIA + G +I ++L ++
Sbjct: 1364 KTEIAGRVGAREWEIKADLEAIS 1386
>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
echinatior]
Length = 1412
Score = 300 bits (767), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 202/672 (30%), Positives = 332/672 (49%), Gaps = 63/672 (9%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ T IN
Sbjct: 771 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQ--STPINEIPHP----- 823
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 824 ------DMQVREILMVALGHHGNRPMLLVRL-DSDLQIYQAYRY--PKGYLK-------- 866
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
L + + R S P E+ P RI + F NI+G+ G F+
Sbjct: 867 --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNRNITRICVMRYFSNIAGYNGVFICSD 919
Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +YD
Sbjct: 920 YPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979
Query: 991 NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
WPV+K+PL+ TPH +TY E Y +I S +PL + +V
Sbjct: 980 APWPVRKVPLRCTPHFVTYHLESKTYCVITSTA--EPLKSYYRFNGEDKV---------- 1027
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1107
L + Y + ++ + + W+T I + E+ ++ V+L T+
Sbjct: 1028 ---LTKLYYLFQFSRIFMNLLFSPVSWETIPNTKIELDQWEHVTCLKNVSLAYEGTRSGL 1084
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
+ + +GT Y GED+ +RGR+L+F P +N ++Y+KE KG I+A+
Sbjct: 1085 KGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQ 1144
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
+ G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D++KSI L +
Sbjct: 1145 VSGFLVSAVGQKIYIWQLKDNDLVGVAFIDT-QIYIHQMLSIKSLILIADVYKSISLLRF 1203
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
+E+ L+L+++DF + + E+LID S L +V+D + N+ +F Y P+ ES GQKL
Sbjct: 1204 QEEYRTLSLVSRDFRPAEVYTIEYLIDNSNLGFIVADGESNLALFMYQPESRESLGGQKL 1263
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
+ +A+FH+G + F R++ T ++ G+DK R ++ +LDGS+G I P+ E
Sbjct: 1264 IRKADFHLGQKINTFFRIKCRITDPANDKKQFSGADK--RHVTMYASLDGSLGYILPVPE 1321
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLPL 1400
T+RRL LQ LV + H+AGLNP+++R + + ++ P I+D +L+ Y LP
Sbjct: 1322 KTYRRLLMLQNVLVTHICHIAGLNPKAYRHTYKSYVRNQGNPARGIIDGDLVWRYLFLPN 1381
Query: 1401 EEQLEIAHQTGT 1412
E+ ++A + GT
Sbjct: 1382 NEKADLAKKIGT 1393
Score = 290 bits (741), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 214/665 (32%), Positives = 342/665 (51%), Gaps = 76/665 (11%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV ANVI ++ + + ++ K + ET+ LE + Y LHGN+ S+
Sbjct: 30 LVVAGANVIRVFRLIPDVDMTRREKYT-ETRP-------PKMKLECLTQYTLHGNIMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ G+ +RDS++L+F DAK+SV+E+D IH LR S+H FE E +K G +
Sbjct: 82 AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
P+V+VDP+GRC +L++G ++++L + S + D D S S I SS+
Sbjct: 135 HHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPS--LDDGDLLDSAKLTSTNKTPILSSY 192
Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+I L+ L+ M +V D F+HGY EP ++IL+E T++GR++ + TC + A+S++
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
+ HP+IWS NLP D Y+ + V P+GG L++ N++ Y +QS ++LN+ A S +
Sbjct: 253 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADSSTN 312
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
P+ + L+ + ++ D ++S K+G+L +L++ D R V+ K SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM--LSSGLKEEFGDIEADAPSTKRLR 468
LTS + ++ FLGSRLG+SLL++FT ++ L+ + + P+ K +
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENENEETPAKKTKQ 432
Query: 469 R-----SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
+SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S G
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490
Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
N D + +G K ++ +LPGC+ +WTV
Sbjct: 491 AFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFQLPGCEDMWTVIGIV- 549
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
N D R ++ HA+LI+S E TMVL+T + EV +S + QG T+ AGNL
Sbjct: 550 ---NNDEIRT---EEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLG 602
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
R ++QV + G R+L G Q + ++ S ADPYV L DG
Sbjct: 603 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVALLSEDG 652
Query: 679 SIRLL 683
+ LL
Sbjct: 653 QVMLL 657
>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Apis florea]
Length = 1583
Score = 300 bits (767), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 210/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
LVV AN+I ++ + + +K+ K + ++ LE + Y LHGNV S+
Sbjct: 30 LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
++ G+ +RDS++L+F DAK+SV+E+D H LR S+H FE E ++ G +
Sbjct: 82 AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
P+V+VDP+GRC +L+YG ++++L + S GD I SS++I
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L+ L+ M ++ D F+HGY EP ++IL+E T++GR++ + TC + A+S++ + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P+IWS NLP D Y+ + V P+GG L++ N++ Y +QS + Y VSL+S E
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308
Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
P+ + L+ + ++ +D ++S K+G+L +L++ D R V+ K
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKR 466
SVLTS + ++ FLGSRLG+SLL++FT ++ ++ E + E + K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIVLEENETEETPAKK 428
Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+++ +SD L D+ + EEL +YGS + +T ++ F V DSL+NIGP + S
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486
Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
G N D + +G K ++ ELPGC+ +WTV
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
G + ++ + HA+LI+S E TM+L+T + EV +S + QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
NL R ++QV + G R+L G Q + ++ S ADPYV L
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650
Query: 676 SDGSIRLL 683
DG + LL
Sbjct: 651 EDGQVMLL 658
Score = 295 bits (754), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 199/679 (29%), Positives = 327/679 (48%), Gaps = 77/679 (11%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
+V +SG LEI+ +P+ + + F G+ + D+ L+ + + E
Sbjct: 772 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
M+V E+ M H +RP L L D + YQAY + P+ K
Sbjct: 826 -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868
Query: 875 RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
R + L P L R+E R + F NI+G+ G F+
Sbjct: 869 -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917
Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
P W + R LR HP DG + +F +N+NC GF+Y + L+IC LP+ +
Sbjct: 918 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977
Query: 989 YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
YD WPV+K+PL+ TPH +TY E Y +I S+ +PL +
Sbjct: 978 YDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1020
Query: 1049 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1098
+ D + +T EE R + P + + W+T I + E+ ++ V+
Sbjct: 1021 FNGED--KEFTEEERPDRFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVS 1078
Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKE 1152
L T+ + + +GT Y GED+ +RGR+L+F P +N ++Y+KE
Sbjct: 1079 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1138
Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
KG I+A+ + G L+ A G KI + + +L G+AF D +Y+ + +K+ IL+ D
Sbjct: 1139 QKGPITAITQVSGFLVSAVGQKIYIWQLKDNDLVGVAFIDTQ-IYIHQMLSIKSLILIAD 1197
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++KSI L ++E+ L+L+++DF + + E+LID + L +V+D + NI +F Y P+
Sbjct: 1198 VYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGESNIALFMYQPE 1257
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
ES GQKL+ +A+FH+G V F R++ S SD R ++ +LDG+
Sbjct: 1258 SRESLGGQKLIRKADFHLGQKVNTFFRIR-CRISDPANDKKHFSDADKRHVTMYASLDGN 1316
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+G I P+ E T+RRL LQ LV + H+AGLNP+++R + S+ + I+D +L+
Sbjct: 1317 LGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGNPARGIIDGDLV 1376
Query: 1393 SHYEMLPLEEQLEIAHQTG 1411
Y LP E++++A +
Sbjct: 1377 WRYLYLPNNEKIDVAKKIA 1395
>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1-like [Ailuropoda
melanoleuca]
Length = 1469
Score = 298 bits (763), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 210/631 (33%), Positives = 333/631 (52%), Gaps = 76/631 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 104 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 159
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 160 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 213
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 214 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 273
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 274 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 333
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG R
Sbjct: 334 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 393
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
V+ K SVLT+ + T+ FLGSRLG+SLL+++T +S ++E
Sbjct: 394 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA--- 449
Query: 457 IEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDS 508
+ + P +K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS
Sbjct: 450 ADKEEPPSKKKRVDSTVGWSGGKSMPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDS 505
Query: 509 LVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---E 543
++NIGP + + G L I + + + + K ++V E
Sbjct: 506 ILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFE 565
Query: 544 LPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLETA 593
LPGC +WTV ++ +G A+ S + A DD H +LI+S E TM+L+T
Sbjct: 566 LPGCYDMWTVIAPVRKEQEETPKGEGAEQEPSALEADDDGRRHGFLILSREDSTMILQTG 625
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 626 QEIMELDTS-GFATQGPTVFAGNIGDSRYIVQVSPLGIRLLEG---VNQLHFIPVDL--- 678
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
S ++ ++ADPYV++ ++G + + +
Sbjct: 679 ----GSPIVQCAVADPYVVIMSAEGHVTMFL 705
Score = 290 bits (743), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 214/707 (30%), Positives = 336/707 (47%), Gaps = 105/707 (14%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 820 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 875
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 876 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 918
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
L N+ +RF + P + RE E GA + R
Sbjct: 919 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 968
Query: 916 IFKNISGHQG-------FFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNH 967
F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC
Sbjct: 969 YFEDIYGYSGGGGACPQVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPR 1028
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
GF+Y QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S + P
Sbjct: 1029 GFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNM--P 1086
Query: 1028 LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPM 1085
+ I + G + + + D + E + ++++ P W+ A I +
Sbjct: 1087 CTR-----IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIEL 1137
Query: 1086 QSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1144
+ E+ ++ V+L + T + +A GT +QGE+V RGR+L+ P
Sbjct: 1138 EEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQP 1197
Query: 1145 VTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
+T+ +Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+
Sbjct: 1198 LTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIH 1256
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
+ VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD
Sbjct: 1257 QMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSD 1316
Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1319
+N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K
Sbjct: 1317 RDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGPSKK 1369
Query: 1320 -----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
N+ F TLDG IG + P+ E T RLQ S R H
Sbjct: 1370 SVVWENKHITWFATLDGGIGLLLPMQEKT-NRLQPAX----------------SPRMLHV 1412
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ + + +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1413 DRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1459
>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
Length = 1396
Score = 297 bits (761), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 342/1477 (23%), Positives = 615/1477 (41%), Gaps = 229/1477 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS------------------- 97
N+V N + IY VR + SK + K+ M+G+
Sbjct: 38 NVVTARGNTLSIYEVREETATSKSPTEAKSQKKDDAMEGVKEERQTPVVQVRSLSKKTYP 97
Query: 98 ---------AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFD 148
+ LV +RLHG V L + + D ++++F+DAKI++LE+
Sbjct: 98 DSDSHSQPLSTKFHLVREHRLHGVVTGLQAVKIISSLEDHL-DRLLVSFKDAKIALLEWS 156
Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
+ L S+H +E + + +F ++VDPQ RC + + IL Q
Sbjct: 157 TATQDLLTVSIHTYERAIQM-VATDISAFTSE--LRVDPQSRCAALSLPKDAFAILPPCQ 213
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHE 265
+ D S ++NL + +++V DF F+ G+ P + +L+E
Sbjct: 214 VSDSVCRD-----------VPYSPSFILNLPSEVESGIRNVIDFTFLPGFSNPTVAVLYE 262
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
TW GR++ + T ++ ++ +++P+I A LP D +LA PS GGV+VV
Sbjct: 263 TYQTWTGRLNEQKDTVKMAFFTLDIVNRRYPVIGLATGLPCDCLSVLACPS-TGGVMVVA 321
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELP------RSSFSVELDAAHATWLQNDVALL 379
+N+I Y QS + N + S LP + ++EL+ + + ++ + A +
Sbjct: 322 SNSIIYVDQSGRKVVLPVNAWIPRMSDIALPTNLTPEEQARTLELEGSRSIFVDDKTAFI 381
Query: 380 STKTGDLVLLTVVYDGRVVQRLDL-----SKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
K G + + +V GRVV +L L T PS+L I N +GS GDS
Sbjct: 382 ILKDGTIYPVELVTAGRVVSKLALGTPLAKTTIPSILRR----INNDYLLVGSASGDS-- 435
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL--QDM-VNGEELSLYGSAS 491
++LS+ EE D + D + +S AL QD+ ++ ++ +YG +
Sbjct: 436 ---------ALLSTSWVEEVIDDDVDMEAN-----TSVAALEQQDIEMDDDDDDIYGPSI 481
Query: 492 NNTESAQK------------TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGI 533
T ++QK + D+L GP+ D ++ + N D +ATG
Sbjct: 482 IKTGTSQKESAAPMSKKTRSVLRLSFCDALPAYGPIADLTFTVGKNGDRPVAELVTATGS 541
Query: 534 SKQSNYELVE--LP-----------GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYL 579
+ L + LP G +G+W++ +SS A+ H L
Sbjct: 542 GHLGGFTLFQKDLPLRKKKKLPIISGARGVWSLPIRRSSSAAVAE-----------HDTL 590
Query: 580 IISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
IIS +A R V T L+ V+ V G TI AG F R ++ V R
Sbjct: 591 IISTDANPSPGFSRLAVRATKGDLSVVSR-----VNGMTIGAGPFFQRTAILHVMTNAIR 645
Query: 633 IL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
+L DG+ + +D+ + + S SI DPYVL+ D +I L +G+ +
Sbjct: 646 VLEPDGNERQIIKDME---------GNVPRAKIKSCSICDPYVLIFREDDTIGLFIGETT 696
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGP 746
+ + + + ++ + D + + DA T + +D +
Sbjct: 697 RGKIRRKDMSPMGEKSSRYTAGGFFTDTASVFRVYHQNADANTETTIPMHSVVDASSKSQ 756
Query: 747 LDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
+ V+ G +EI+ +P VF+ + + + D+ AL +
Sbjct: 757 ------WLVLVRPQGVVEIWTLPKLTLVFSTTLLATLQNVLTDSQEPPALSPPQDPPRKP 810
Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
E + + ++ + +P L +L G + Y+A+ N +
Sbjct: 811 QE-------------LDIEQILLTNLGQSDPKPHLLVLLRSGHLAIYEAFA----TNQAP 853
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR------ITIFKNI 920
+P R+ S+ +++ ++ + + +ET G ++ F
Sbjct: 854 IVEPPLKPRA------SSLQIQFVKIASKAFEMQRTDETEKGILAEQKKALRTFVPFACA 907
Query: 921 SGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
G F +G RP W + + ++++P ++ AF+ + F+ + +G
Sbjct: 908 GAPAGVFFTGDRPHWIVATDKGGVQMYPS-GHAAVYAFSACTLWERSTEFLIYSEEG-QT 965
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
+C+ + P++ IP I Y + +IV+ L+ E
Sbjct: 966 LCEWITEYEIGRPLPMRHIPRGRAYSNIVYEPASS---MIVAAASLRARFASF-----DE 1017
Query: 1040 VGHQI---DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
G+QI D ++ TVE + ++ P+ W T ++E T+
Sbjct: 1018 DGNQIWAPDGPGITEP------TVECSTLELISPEV----WATVDGYEFATNEFVNTMEC 1067
Query: 1097 VTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-- 1153
V L +T+ + +A+GT+ V+GED+A +G +F + N Y +L
Sbjct: 1068 VPLETVSTEAGVKHFIAVGTSIVRGEDLAVKGATYIFEVVEVVPDQSNGPKRWYRLKLRC 1127
Query: 1154 ----KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFI 1208
KG ++AL + +L+ + G KI + + E L G+AF D +YV SL +KN +
Sbjct: 1128 RDDAKGPVTALCGINNYLVSSMGQKIFVRAFDLDERLVGVAFMDV-GVYVTSLRALKNLL 1186
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
L+GD+ + I F++++E +L L +D + +F L++V +DE + ++
Sbjct: 1187 LIGDVVRGIQFVAFQEDPYKLVTLGRDVSRMCATTVDFFFAEEALAIVTTDENGVMSMYN 1246
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
Y P+ +S G+ LL + EF++ T F ++A + P L+FG
Sbjct: 1247 YDPEAPDSHDGRLLLKQTEFNLH---TDFRTSTLIARRTKDDPIIPQG------ILIFGG 1297
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
DG++ C+ P+ + +RLQ LQ +L ++ HVAGLNP++ R N RP I+D
Sbjct: 1298 TDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPKALRIVR-NEHVSRPLSKGILD 1356
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
L++++E LP+ Q E+ Q GT R+ IL + L+
Sbjct: 1357 GNLIAYFEHLPITRQDEMTRQIGTERATILRDWMSLS 1393
>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
Length = 1450
Score = 297 bits (760), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 363/1458 (24%), Positives = 627/1458 (43%), Gaps = 207/1458 (14%)
Query: 76 EGSKESKNSGETKRRVLMDGISAASLEL--VCHYRLHGNV---ESLAILSQGGADNSRRR 130
EG E GE + +S + + + +RLHG V E + ILS
Sbjct: 89 EGEVEMDTQGEGFVNMASKPLSMTTYQFHFIREHRLHGIVTGLEPVKILSS----TEDSL 144
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQG 189
D ++++F+DAK+++LE+ +H L S+H +E +P+ L + + G L +VDP
Sbjct: 145 DRLLVSFKDAKLALLEWSPELHDLVTVSIHTYERAPQMTFLDPSKFT---GQL-RVDPLS 200
Query: 190 RCGGVLVYG--LQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
RC + + L ++ SQ LV + T +S + N D +++V
Sbjct: 201 RCAALSLPCDCLAILPFYHSQVDLDLVDADQTVSRDIPYSPSF-ILDLFNQVDHRIRNVI 259
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
DF F+ G+ P + +L + + TW GR+ TC + ++ +P+I S NLPHD
Sbjct: 260 DFAFLPGFNNPTLAVLFQTQHTWTGRLKEFKDTCNLFIFTLDLVTHMYPIITSVENLPHD 319
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYA--VSLDSSQELPRSSFS--V 362
+ +L S +GGV+++ N++ Y Q S L +N +A VS Q+L + +
Sbjct: 320 CFAMLPCDSSLGGVVIISCNSLIYVDQASRKTVLPVNGWAARVSDMPMQQLRPEEMNRDL 379
Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI 418
L+ AHAT++ + + T+ G ++ + +V DGR RL L S+T L ++
Sbjct: 380 HLEGAHATFVDSRTFFIITRDGLVLPVEIVMDGRTALRLALHPAMSQTTTPALVRNVAFR 439
Query: 419 GNS--------LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF------GDIEAD-APS 463
S + F+GS +G S+L++ T ++EE GDI A A +
Sbjct: 440 SASGDQAPRSQILFVGSTVGPSVLLRVTW----------VEEEIQKDKQQGDIPAAVADN 489
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS---FAVRDSLVNIGPLKDFSY 520
+ D + V E + +G + +++A +T S ++ DSL GP+ ++
Sbjct: 490 PMAVDFDDEDDIYGDVAKETQTTHGQPTAASQAAVETKSVIHLSLCDSLSAYGPINSMAF 549
Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTV-YHKSSRG 560
L N D +ATG ++ + L + + G +GIW + +S +
Sbjct: 550 ALTRNGDRPTAELVAATGYARLGGFTLFQRDVPTRSKRKLHAVGGARGIWCIPVRQSLKV 609
Query: 561 HNADSSR-MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIAAG 615
+ ++ SR + E +I+S +A T + D + R TI A
Sbjct: 610 NGSERSRNLLPGSSEVVDTVIVSTDANPSPGLTR--FAAKSSRNDIAITARRTETTIGAA 667
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
F R +I V R+L+ D S + ++ + I+DP++L+
Sbjct: 668 PFFQRTAIIHVTTDLIRVLE-----PDCSERQCIRDMDGSNKRPKIRFCCISDPFILVIR 722
Query: 676 SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKG------PEPWLRKTST 727
D S+ L VGD + + TP E + + C G E ++
Sbjct: 723 EDESLGLFVGDAERGRIRRKDMTPMG-EKVSRYSAGCFFLDQSGIFELHMSESSPTTGTS 781
Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
D G G D +G + V+C G +EI+ +P VF+ + +
Sbjct: 782 DDKQRMGTGSLESAVDA---QRGTQWLVLCRPQGVVEIWTLPKLALVFSTSSLKDLPSVV 838
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
D++ AL S E+ + ++ +I ++ ++ + P L +L
Sbjct: 839 SDSFDPPAL--------SLPEDPPRKPQEADIELLQFAQIG-----ELYPHPHLIVMLRC 885
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSV----------------------SNVSAS 885
G + YQA + K D P ST R+ ++ S+V A
Sbjct: 886 GQLAIYQAVAVD------KDDFPESTVRTSTLKIKFIKMGTRSFEPRQLEPAEKSSVIAE 939
Query: 886 RLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLR 944
+ R LR S P E K +S G F++G PCW + ++ L+
Sbjct: 940 QRRALR-SLVPFIVSPNSE-------------KRVS---GVFVTGDEPCWIVATDKDGLK 982
Query: 945 VHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQ--GILKICQLPSGSTYDNYWPVQKIPLK 1001
+H C V +FT + F+ T + G + +P + + P + + +
Sbjct: 983 IHS--CSFQTVNSFTSCSVWDSKCDFLMHTDEAFGPCLLGWIPEFNLGTDM-PSKTVTVG 1039
Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL-HRTYTV 1060
T +T+ A L ++ S V P + D+E G+++ + +++ H +
Sbjct: 1040 RTYTNVTFDAASGL--MVASSVVPNPFT-----IFDEE-GNKLWEPDAPNINYPHSVMSA 1091
Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQ 1119
E L G + Q +E + V L +T+ + + +GT +
Sbjct: 1092 LELFHSDLSCVMDGYEF--------QPNEFVTALDCVQLETQSTESGTKEFIVVGTTVNR 1143
Query: 1120 GEDVAARGRVLLFSTGRNADNPQNLVTEVY------SKELKGAISALASLQGHLLIASGP 1173
GED+A +G +F +P+ + + E KG ++AL + G+L+ + G
Sbjct: 1144 GEDLAVKGVTYVFEIVEIVPDPEGGLARQFKLRLLCKDEAKGPVTALCGMNGYLVSSMGQ 1203
Query: 1174 KIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
KI + E L G+AF D +YV SL +KN +++GD KS++ ++++E +L ++
Sbjct: 1204 KIFVRALDLDERLVGVAFLDV-GVYVTSLRTIKNLLIIGDAVKSVWLVAFQEDPFKLVIV 1262
Query: 1233 AKDFGSLDCFATEFLI--DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH- 1289
AK+ LD +FL DG + VSDE+ I++ Y ES GQ LL R E+H
Sbjct: 1263 AKEVQRLDVMTADFLFASDGD-FYIAVSDEEGIIRLLEYDTSDPESHSGQYLLRRTEYHA 1321
Query: 1290 -VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD-ELTFRRL 1347
V +H T ++A S G P + L+ +DGS+ + P+D + + +RL
Sbjct: 1322 QVESHTTV-----LIARRSQNDGLVPQA------RLISAAVDGSMYALTPVDADESAKRL 1370
Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
Q LQ +L ++ HVAGLNPR+FR S+G A RP I+D LL+ +E LP+ Q EIA
Sbjct: 1371 QLLQGQLTRNMQHVAGLNPRAFRAVRSDGVA-RPLTKGILDGNLLAGFEQLPIPRQNEIA 1429
Query: 1408 HQTGTTRSQILSNLNDLA 1425
GT R +L + +L+
Sbjct: 1430 RPIGTDRLAVLRDRRELS 1447
>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1486
Score = 294 bits (753), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 359/1428 (25%), Positives = 596/1428 (41%), Gaps = 208/1428 (14%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 95 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVDRNT 154
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 155 LETISIHYYEKEELVGSPWVAPLHHYPTLLLADPASRCAALKFSERNLAILPFKQPDEDM 214
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 215 DMDNWDEELDGPRPKKDLSGAIANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 274
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 275 YRDPTIGVLSSTKTASNSLGHRDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 332
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 333 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 392
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---IGNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T+ +G S F
Sbjct: 393 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRVGRSTVF 452
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + K DI+ D
Sbjct: 453 VGSEEGDSVLLGWTRRQGQTQKR---KSRIQDIDLDLDLDDEDLEDDD-----------D 498
Query: 485 SLYGSASNNTESA--------QKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------- 529
LYG S + E A +F + D L++I P++ +YG + S
Sbjct: 499 DLYGEESTSPEQAISAAKAVKSGELNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSE 558
Query: 530 ----------ATGISKQSNYELV------------ELPGCKGIWTVYHKSS--RGHNADS 565
A G K S ++ E P +G WTV K + D
Sbjct: 559 GVRSDLQLVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDK 618
Query: 566 SRMAA-YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
M+ YD ++H ++I++ E + TA +T + G T+ AG
Sbjct: 619 GPMSNDYDTSGQHHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGT 678
Query: 617 LFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
+ R++QV + R DG ++Q + + E+G+ V + SIADP++LL
Sbjct: 679 MGKDCRILQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIR 733
Query: 676 SDGSIRLLVGDPSTCTV-SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
D S+ + P + V+ + + K ++ C LY D +TG
Sbjct: 734 DDFSVFVAEMSPKLLELDEVEKEDQMLTGTKWLAGC-LYTD----------------TTG 776
Query: 735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMRE 794
V A + A G D +I + SG L I+ +P+ V + +S Y+
Sbjct: 777 V-FADEAAGKGTKD--NILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPP 825
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
L + ++ +GT KE++ + V +L H P+L + + YQ
Sbjct: 826 GLS-----ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQ 873
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----P 910
Y + + + P S S + ++ N F++ P + ++ H A P
Sbjct: 874 PYRVK-----ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPVEDDELHNAQRFLP 921
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
+R T NISG+ FL GS P + + + L + A + H C HGFI
Sbjct: 922 MRRCT---NISGYSTVFLPGSSPSFILKTAKSSPRVLGLQGSGVQAMSSFHTEGCEHGFI 978
Query: 971 YVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
Y + GI ++ Q+P+ S++ V+KIP+ + Y Y +V +P
Sbjct: 979 YADTNGIARVTQIPTDSSFAELGLSVKKIPVGVDTQSVVYHPPTQAY--VVGCNNAEPFE 1036
Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
L D + + N++ + V+ +++L +G W T+ M+ E
Sbjct: 1037 ----LPKDDDYHKEWARENITFKPM-----VDRGMLKLL----SGITWTVIDTVEMEPCE 1083
Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEV 1148
L V + L + +T E + L+A+GTA +GED+ RGRV +F P T
Sbjct: 1084 TVLCVETLNLEVSESTNERKQLIAVGTALTKGEDLPTRGRVYVFDIADVIPEPGKPET-- 1141
Query: 1149 YSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPP 1195
SK+LK GA++AL+ + QG +L+A G K ++ K GT L +AF D
Sbjct: 1142 -SKKLKLVAKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN- 1198
Query: 1196 LYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
YV S+ + L+ D K ++F + E+ ++ L K ++ +FL DG L
Sbjct: 1199 CYVTSVKELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKEL 1258
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQML--ATSSDRT 1310
+V SD +I I + P+ +S +G LL R F+ GA H T L L + T+S +
Sbjct: 1259 YIVASDADGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTSSLLLPAVYPTTTSPNS 1318
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
+ G + + LL + G + + PL E +RRL SL +L +++PH AGLNP+ +R
Sbjct: 1319 NSEVGENPPH--ILLLASPTGLLATLRPLQENAYRRLSSLAIQLTNALPHPAGLNPKGYR 1376
Query: 1371 QFHSNGKA--HRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTG 1411
+ A PG D+ IVD ++L + L ++ EIA + G
Sbjct: 1377 LPSPSASASMQLPGVDAGIGRNIVDGKILERFMELGTGKRQEIAGRAG 1424
>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
protein 1
Length = 1456
Score = 294 bits (752), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 345/1418 (24%), Positives = 594/1418 (41%), Gaps = 189/1418 (13%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + ++ + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ ++ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+ +
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSVFI 742
Query: 683 LVGDPSTCTVSVQTPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG 741
P + I +S K ++ C LY D ++ + VG+
Sbjct: 743 AEMSPKLLELEEVEKEDQILTSTKWLAGC-LYTD----------TSGVFADETVGKGT-- 789
Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
+ +I + SG L I+ +P+ V + +S Y+ L
Sbjct: 790 -------KDNILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPPGLS---- 830
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
+ ++ +GT KE++ + V +L H P+L + + YQ Y +
Sbjct: 831 -ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-- 880
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIF 917
+ + P S S + ++ N F++ P + ++ PH A P +R +
Sbjct: 881 ---ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS-- 928
Query: 918 KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
NISG+ FL GS P + + + L + A + H C HGFIY + GI
Sbjct: 929 -NISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGI 987
Query: 978 LKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
++ Q+P+ S+Y V+KIP+ + Y Y +V ++P L
Sbjct: 988 ARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPK 1041
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
D + + N++ + V+ +++L +G W T+ M+ E L V
Sbjct: 1042 DDDYHKEWARENITFKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVET 1092
Query: 1097 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK- 1154
+ L + +T E + L+A+GTA ++GED+ RGRV +F P T SK+LK
Sbjct: 1093 LNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKL 1149
Query: 1155 --------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLN 1202
GA++AL+ + QG +L+A G K ++ K GT L +AF D YV S+
Sbjct: 1150 VAKEDIPRGAVTALSEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVK 1207
Query: 1203 IVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
+ L+ D K ++F + E+ ++ L K ++ +FL DG L +V SD
Sbjct: 1208 ELPGTGLCLMADAFKGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDA 1267
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
+I I + P+ +S +G LL R F+ GAH L + A + + + S++ +
Sbjct: 1268 DGHIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENS 1326
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA-- 1378
LL + G + + PL E +RRL SL +L + +PH AGLNP+ +R + A
Sbjct: 1327 PHILLLASPTGVLATLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASM 1386
Query: 1379 HRPGPDS-----IVDCELLSHYEMLPLEEQLEIAHQTG 1411
PG D+ IVD ++L + L ++ E+A + G
Sbjct: 1387 QLPGVDAGIGRNIVDGKILERFLELGTGKRQEMAGRAG 1424
>gi|301628217|ref|XP_002943254.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Xenopus (Silurana) tropicalis]
Length = 628
Score = 292 bits (748), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 204/626 (32%), Positives = 318/626 (50%), Gaps = 74/626 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E S + + E K LEL+ + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNPNCESSSKGEKGSEVKGH-------KEKLELMASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F++AK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG Q+++L ++ GLVG+ G +
Sbjct: 135 NVHNPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHDGLVGE--------GQKSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
K HP+IWS NLP+D + LAVP PIGGV++ N++ Y +QS ++LN+
Sbjct: 247 IMQKVHPVIWSLTNLPYDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVSLNSLTNG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
S P+ V LD + AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTSFPLKPQEGLRVTLDCSQATFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ +T + FLGSRLG+SLL+++T S + + D P K+
Sbjct: 367 ASVLTTSMTPMEPGYLFLGSRLGNSLLLRYTEKVQDSPAGPSKDPD----KQDEPPNKKK 422
Query: 468 RRSSSDALQ-----DMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
R SS A +MV+ +E+ +YGS + + T+SF V DS++NIGP S G
Sbjct: 423 RVDSSLARPGGSKGNMVDEIDEIEVYGSEM-QSGTQLSTYSFEVCDSILNIGPCATASMG 481
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 482 EPAFLSEEFQESPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVISN 541
Query: 557 SSRGHNADSSRM------AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
+ A D H +LI+S + TM+L+T + E+ S + Q
Sbjct: 542 HKKEEQEGEKEGETPPVEAEEDTNRHGFLILSRDDSTMILQTGQEIMELDTS-GFATQDP 600
Query: 611 TIAAGNLFGRRRVIQVFERGARILDG 636
T+ AGN+ + ++QV RG R+L+G
Sbjct: 601 TVYAGNIGDNKYIVQVSPRGIRLLEG 626
>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
24927]
Length = 1407
Score = 291 bits (746), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 350/1524 (22%), Positives = 618/1524 (40%), Gaps = 296/1524 (19%)
Query: 57 NLVVTAANVIEIYVVRVQEEGS-----KESKNSGETKRRVLM-----DGISAAS------ 100
NLVV ++++I+ + E+ E+K+ G + RRV D + S
Sbjct: 31 NLVVAKTSLLQIFRLVEYEDAEGEFALDEAKDEGGSDRRVFEGRDHEDSFTVESGMHLQR 90
Query: 101 --------LELVCHYRLHGNVESLAIL----SQGGADNSRRRDSIILAFEDAKISVLEFD 148
L+LV Y L+G+V S+ + S+ G D ++++F+ AKIS+LE+D
Sbjct: 91 ETIEKTTKLDLVAQYHLYGSVTSMVKIRIPTSKSGGD------CLLVSFDSAKISLLEWD 144
Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGV-LVYGL 199
+ H + S+H +E E+ R PL DP+ RC + L
Sbjct: 145 PAAHSISTISLHYYEGDEF-----------RSPLTPEFPINYLISDPKSRCAAFKFNHDL 193
Query: 200 QMII-LKASQGGSGLVGDEDTF-----------------------GSGGGFSARIESSHV 235
I+ + ++ + D D+F G G S V
Sbjct: 194 VAILPFRQTEDEDLEIPDNDSFTYDLEDDDDAEKPKKDVEMKDNTGEGKPSDTPYHPSFV 253
Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
++ LD ++ + D +F+H Y EP I+++ + G + + +++ +
Sbjct: 254 LSASQLDESVERIIDIVFLHEYREPTFGIVYQPQQGSVGMLERRKDPTHFIVVTLDLDQR 313
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSS 352
I SA NLP D +K +A+P PIGG L++G + I H A+A+N+YA +
Sbjct: 314 ASTSIMSAKNLPFDIWKAVALPPPIGGTLLLGEHEIVHVDQAGKMSAVAVNSYAQQYSAF 373
Query: 353 QELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-- 408
+S + L++ A L N+ L+ T GD +L+ +GR + L + +
Sbjct: 374 NMTDQSDLELNLESCSAISLPNENGDVLIVTIAGDFAILSFKAEGRSISSLSVRRIQSKD 433
Query: 409 -----SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
S + +GN FFLGS D++L + + LS S
Sbjct: 434 GYPFTSAPCETLVEVGNRRFFLGSLDSDAMLWGYKRKGEKTSLSQK-------------S 480
Query: 464 TKRLRRSSSDALQDMVNGEELSLYG------------SASNNTESAQKTFSFAVRDSLVN 511
+L R ++ + +E LYG S+ N + + F D L N
Sbjct: 481 EVKLERDDAEDNV-EDDDDEDDLYGESTVTPITPRKASSGNIGRGSSGEYVFRRHDRLQN 539
Query: 512 IGPLKDFSYG--------------------LRINADASATGISKQSNYELV-------EL 544
+GP + ++G L G N + +
Sbjct: 540 VGPCRQMAFGRPAMLPEKLKLHQGVLPELELMATTGRGVEGAVTVFNTSICPRVSATFDF 599
Query: 545 PGCKGIWTVYHKSSRGHNA--DSSRMAAYDDE------YHAYLIISLEARTMV------- 589
C+ +W V+ K + + SS Y+++ Y YL S + T+V
Sbjct: 600 KDCQRLWAVHSKQVKKGQSMIPSSVSKGYEEQIGATEDYSTYLFASNTSETLVYKVGTKF 659
Query: 590 --LETADL-LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
LE D+ TEV ++++ G R+ QV E ++ D Q +
Sbjct: 660 EPLEGTDIETTEVCPTLEF---------GTFQDGLRIAQVCETNVKVYDSEL--QLIQII 708
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKK 705
+N E G + ++S S ADPY+LL D SI T + ++ PA I+ +K
Sbjct: 709 STNDEDPDGGPH--IVSASFADPYMLLICGDSSILACQCHERTLELDRIELPATIKDTK- 765
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY---ESGA 762
T+ L T E G V+C+ E G
Sbjct: 766 --------------------YTNGCLYTSSSEV--------FGLGTKSQVLCFLLTEEGT 797
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG-QGRKENIHS 821
L++F +PNF T++ F D ++ S E ++ I
Sbjct: 798 LQVFTLPNFELKATLEHF-----------------DMSLQLVSPDETALRFHTARDEIEE 840
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L A P+L I+ Y+ ++ G
Sbjct: 841 IIVADLGDNISKA----PYLIVKTKRDDIIIYEPFISNG--------------------- 875
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
+ ++ N + P + + +++P G P +I ++ G+ F++G P + +
Sbjct: 876 ICFKKIYN---TVLPTVSLSEQKSPSG-PLVKI---DDLGGYSVAFMAGDTPTFITKSSK 928
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
L +L G + + + + GF+Y+ S+G ++C P S ++ W Q+IPL+
Sbjct: 929 TLPKLYKLQGGMVRSLSPFNTKETERGFLYIDSKGTARVCHFPEVSM-EHTWLSQRIPLE 987
Query: 1002 ATPHQITYFAEKNLYPLIV---SVPVLKPLN-QVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
TP +TY+ KN+Y + V S P + + Q+ L+D+ + +++ +L +
Sbjct: 988 RTPTSLTYYDPKNVYVVSVLSTSKPEVDDEDFQMEEGLVDETLLPELETGHLVMISPVTW 1047
Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 1117
T + YE + E P+ +A + ++ SE TKE + L+A+GT
Sbjct: 1048 TTTDRYEFPVHEV-----PFVVKA-VELEISE-------------VTKERKVLIAVGTGL 1088
Query: 1118 VQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
++GE+ ARG V +F G+ + + + +E+KG +S LA + G+LLI
Sbjct: 1089 LRGENSPARGAVYVFDVIDVVPEIGKPETGKKFKL--ISREEVKGVVSTLAGMDGYLLIT 1146
Query: 1171 SGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
G K ++ K G+ L +AF D V+ + K ++ GD+ K + F+ + E+ +
Sbjct: 1147 HGQKCMIRGLKEDGSLLP-VAFMDMNTHTTVAKTLEK-MVMFGDVLKGVSFVGFSEEPYK 1204
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
+ L KD L A +FL G+ VV+D Q NI + Y P+ +S G +LL + E
Sbjct: 1205 MILFGKDPRQLSITAGDFLPAGTACYFVVADAQSNIHVLQYDPENPKSIHGNRLLPKGEI 1264
Query: 1289 HVGAHVTKFLRL-QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
+ G V L + + ++ D+ F +F T+ G G ++ + E +RRL
Sbjct: 1265 YCGHEVKSICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGVFGTVSSITESMYRRL 1324
Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+Q ++ ++ H+AGLNPR++R + P +I+D +LL + ML + E+A
Sbjct: 1325 NVIQGQITNTGEHIAGLNPRAYRAAKFRNTSSEPM-RAILDGKLLVRWLMLGAGRRKELA 1383
Query: 1408 HQTGTTRSQILSNLNDLALGTSFL 1431
+ GT+ + +L L T+F
Sbjct: 1384 GRAGTSEEMLREDLWFLQDATAFF 1407
>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
972h-]
gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
Length = 1441
Score = 291 bits (745), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 336/1443 (23%), Positives = 598/1443 (41%), Gaps = 205/1443 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV ++ G + ++ L G++ D +I+ + AK+S LE+D S+H
Sbjct: 92 LRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTNSLH 148
Query: 161 CFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
+E +K + P + VDP C +L + M+ + L +E
Sbjct: 149 YYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDMEEAA 202
Query: 220 F-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
S S + S V+ LD + + D F++GY EP + IL+ E T +
Sbjct: 203 IENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVTLPL 262
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-HSQS 335
+ T + S +++ + +I + +LP+D Y +++P+P+GG L++G N + Y S
Sbjct: 263 RKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSAG 322
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDLVLLT 390
+ + +N+Y +S F++EL+ A L + +L +G L
Sbjct: 323 RTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFFYLD 382
Query: 391 VVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DG+ V+ L L + N L S IT G +L FLGS+ DS L++++
Sbjct: 383 FLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWS----- 437
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
EE E D L ++ + DM++ E +
Sbjct: 438 ---RRTTNEEVRLDEGD----DTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLRL 490
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY---ELV------------------ 542
+ D L NIGP+ DF+ G A + Q N+ ELV
Sbjct: 491 EICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFRRNI 545
Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETA 593
+ GC+ +WTV S + N S A Y + E YL++S E + +
Sbjct: 546 FPLIAGEFQFDGCEALWTV-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAG 604
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSES 652
+ EV S D+ +T+ G+L R++Q+ R+ D + +TQ +F
Sbjct: 605 ETFDEVQHS-DFSKDSKTLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNF------- 656
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSI----------RLLVGDPSTCTVSVQTPAAIES 702
S+ V+S SI DP +++ G I RL+ D V+T A++ S
Sbjct: 657 ---SKKQIVVSTSICDPCIIVVFLGGGIALYKMDLKSQRLIKTDLQNRLSDVKT-ASLVS 712
Query: 703 SKKPVSSCTLY----------------HDKGPEPWL-----RKTSTDAWLSTGVGEAIDG 741
L+ +D E L KTS + + G +++
Sbjct: 713 PDSSALFAKLFTYNETLNAKGQIANGMNDSASETDLDIQPNHKTSNNDQM--GYDQSV-S 769
Query: 742 ADGGP--------------LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
AD P LDQ + + G L+++++ +F+ + D F
Sbjct: 770 ADDVPEVDNTIVTEKNVSNLDQESLEKHPILFALTDEGKLKVYNLADFSLLMECDVFDLP 829
Query: 784 RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
T + ++ T N S S ++VEL + P LF
Sbjct: 830 PT------LFNGMESERTYFNKES-------------SQELVELLVADLGDDFKEPHLFL 870
Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
I Y+A+L+ NT K + ++ ++ V + +R TP DA +
Sbjct: 871 RSRLNEITVYKAFLYS---NTDKHKNLLAFAK---VPQETMTREFQANVG-TPRDAESTM 923
Query: 904 ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
E + ++T + + H F++G +P + + P + I++
Sbjct: 924 EKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSVAPF 983
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIV 1020
H + G+IYV ++IC+ YDN WP +K+ L + I Y K +Y +
Sbjct: 984 HAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKVSLGKQINGIAYHPTKMVYAVGS 1043
Query: 1021 SVPVLKPL-----NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1075
+VP+ + N+ ++ D + + N S+DL T
Sbjct: 1044 AVPIEFKVTDEDGNEPYAITDDNDY---LPMANTGSLDLVSPLT---------------- 1084
Query: 1076 PWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W + Q E L+V +V L + TTK + +A+GT+ +GED+A RG LF
Sbjct: 1085 -WTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTYLFEI 1143
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGI 1188
P ++ + V +E+KG ++ + + G+LL G K+I+ + L G+
Sbjct: 1144 IDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQKVIVRALEDEDHLVGV 1203
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
+F D Y +S ++N +L GD+ +++ F+ + E+ ++ L +K +L+ A +FL+
Sbjct: 1204 SFIDLGS-YTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYRMTLFSKGQEALNVSAADFLV 1262
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
G L VV+D N+++ Y P+ ES G++L++R +FH+G +T + +L
Sbjct: 1263 QGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIGNVITA---MTILPKEKK 1319
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
A G D + F+ + DG + + P+ + +RRL +Q L + V + GLNP+S
Sbjct: 1320 HQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNPKS 1379
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
+R S P I+D L+ ++ + + + E+AH+ G S I+++L +L
Sbjct: 1380 YRLITSPSNLTNPT-RRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEAL 1438
Query: 1429 SFL 1431
S++
Sbjct: 1439 SYM 1441
>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
bisporus H97]
Length = 1413
Score = 291 bits (745), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 342/1447 (23%), Positives = 611/1447 (42%), Gaps = 166/1447 (11%)
Query: 57 NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD-------GIS 97
N+VV +N++ I+ VR + + E + G+T+R V MD I+
Sbjct: 49 NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108
Query: 98 AASLE-----------LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
++++ + +RLHG V E + I+ A + D ++++F+DAKI+
Sbjct: 109 KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D+IH L S+H +E +P+ + L R L +VDP RC + + +
Sbjct: 165 LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V ++D S S +++L + ++++V DF+F+ G+ P
Sbjct: 221 ILPFYQTQAELDVMEQDQSQSK---DVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ IL + + TW GR+ T + ++ + +I S LP+DA+ LL + I
Sbjct: 278 TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337
Query: 319 GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
GGV+V+ N++ Y QS+ +L +N +A + P ++ + L+ + + +
Sbjct: 338 GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDAALKLHLEGCRSAMVDD 397
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
L K G + + ++ DG+ V +L ++ P++ + I T+ + F+GS +G
Sbjct: 398 KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ G K + D + D D + G+
Sbjct: 455 PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505
Query: 491 SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
+N +KT ++RD L GP+ ++ L +N + +ATG + L
Sbjct: 506 ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565
Query: 542 VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
+ LP K +Y SRG + R + H + LI+S + +
Sbjct: 566 FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625
Query: 596 LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
+ + S++ G TI A F R ++ V R+L DG+ + +
Sbjct: 626 VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
+ SIADP+VL+ D SI + V D +P +SS+ ++ C
Sbjct: 679 KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737
Query: 711 TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
G L + + D + +T + GA + ++ G LEI+ +
Sbjct: 738 FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786
Query: 769 PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
P + F+ S ++ + DT+ A Q + + ++
Sbjct: 787 PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833
Query: 829 MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
+ P L L G + Y+A + +N D P +TS + ++A
Sbjct: 834 LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
R + + +T + + G F +G RP W + R ++V+P
Sbjct: 892 IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQI 1007
+ AFT F+ T G + + +P +D P++ IP +
Sbjct: 952 S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPDFQ-FDGPLPMRSIPRGRAYSNV 1009
Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1067
+ +L IV+ L+ + S D + D N+SS +V+ + +
Sbjct: 1010 LFDPSTSL---IVAASSLQ--STFTSFDEDGNNIWEPDAPNISSP------SVDCSALEL 1058
Query: 1068 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAAR 1126
+ PD W T ++E + +VTL T+ + +A+GT +GED+A +
Sbjct: 1059 IAPDI----WATMDGFEFATNEYINDMTIVTLETAATETGTKDFIAVGTTIDRGEDLAVK 1114
Query: 1127 GRVLLFSTGRNADNPQNLVTEVYSKEL--------KGAISALASLQGHLLIASGPKIILH 1178
G +F P V++ +L KG ++A+ L +L+ + G KI +
Sbjct: 1115 GATYIFEIAEVV--PDQAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQKIFVR 1172
Query: 1179 KWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
+ E L G+AF D +YV SL +KN +L+GD KS+ F++++E +L LL+KD
Sbjct: 1173 AFDSDERLVGVAFMDVG-VYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLSKDIQ 1231
Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
S+ +FL + L LV DE+ I+I+ Y P+ +S +G+ LL EFH G ++
Sbjct: 1232 SVCVTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFH-GQR--EY 1288
Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
++A + P S LL G+ DGS+ + ++E F+RL LQ +L+ +
Sbjct: 1289 RTSVLVAHRIKEDQSIPNS------RLLTGSADGSLASLTIVEEEAFKRLGLLQGQLMRN 1342
Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+ H+A LNP++FR N +P I+D LL YE LP+ Q E Q G R +
Sbjct: 1343 IQHMAALNPKAFR-IVKNEYVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGADRVNV 1401
Query: 1418 LSNLNDL 1424
L + +L
Sbjct: 1402 LRDWIEL 1408
>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1413
Score = 291 bits (744), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 341/1447 (23%), Positives = 610/1447 (42%), Gaps = 166/1447 (11%)
Query: 57 NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD-------GIS 97
N+VV +N++ I+ VR + + E + G+T+R V MD I+
Sbjct: 49 NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108
Query: 98 AASLE-----------LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
++++ + +RLHG V E + I+ A + D ++++F+DAKI+
Sbjct: 109 KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D+IH L S+H +E +P+ + L R L +VDP RC + + +
Sbjct: 165 LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V ++D S +++L + ++++V DF+F+ G+ P
Sbjct: 221 ILPFYQTQAELDVMEQD---QSQAKDVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ IL + + TW GR+ T + ++ + +I S LP+DA+ LL + I
Sbjct: 278 TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337
Query: 319 GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
GGV+V+ N++ Y QS+ +L +N +A + P ++ + L+ + + +
Sbjct: 338 GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDATLKLHLEGCRSAMVDD 397
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
L K G + + ++ DG+ V +L ++ P++ + I T+ + F+GS +G
Sbjct: 398 KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
S+L++ G K + D + D D + G+
Sbjct: 455 PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505
Query: 491 SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
+N +KT ++RD L GP+ ++ L +N + +ATG + L
Sbjct: 506 ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565
Query: 542 VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
+ LP K +Y SRG + R + H + LI+S + +
Sbjct: 566 FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625
Query: 596 LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
+ + S++ G TI A F R ++ V R+L DG+ + +
Sbjct: 626 VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
+ SIADP+VL+ D SI + V D +P +SS+ ++ C
Sbjct: 679 KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737
Query: 711 TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
G L + + D + +T + GA + ++ G LEI+ +
Sbjct: 738 FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786
Query: 769 PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
P + F+ S ++ + DT+ A Q + + ++
Sbjct: 787 PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833
Query: 829 MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
+ P L L G + Y+A + +N D P +TS + ++A
Sbjct: 834 LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
R + + +T + + G F +G RP W + R ++V+P
Sbjct: 892 IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQI 1007
+ AFT F+ T G + + +P +D P++ IP +
Sbjct: 952 S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPDFQ-FDGPLPMRSIPRGRAYSNV 1009
Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1067
+ +L IV+ L+ + S D + D N+SS +V+ + +
Sbjct: 1010 LFDPSTSL---IVAASSLQ--STFTSFDEDGNNIWEPDAPNISSP------SVDCSALEL 1058
Query: 1068 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAAR 1126
+ PD W T ++E + +VTL T+ + +A+GT +GED+A +
Sbjct: 1059 IAPDI----WATMDGFEFATNEYINDMTIVTLETAATETGTKDFIAVGTTIDRGEDLAVK 1114
Query: 1127 GRVLLFSTGRNADNPQNLVTEVYSKEL--------KGAISALASLQGHLLIASGPKIILH 1178
G +F P V++ +L KG ++A+ L +L+ + G KI +
Sbjct: 1115 GATYIFEIAEVV--PDQAVSQRRWYKLRLRCRDDAKGPVTAVCGLSDYLVSSMGQKIFVR 1172
Query: 1179 KWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
+ E L G+AF D +YV SL +KN +L+GD KS+ F++++E +L LL+KD
Sbjct: 1173 AFDSDERLVGVAFMDVG-VYVTSLQTLKNLLLIGDAVKSVQFVAFQEDPYKLVLLSKDIQ 1231
Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
S+ +FL + L LV DE+ I+I+ Y P+ +S +G+ LL EFH G ++
Sbjct: 1232 SVCVTRADFLFSENDLRLVTGDEEGIIRIYEYNPQDPDSREGRHLLLETEFH-GQR--EY 1288
Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
++A + P S LL G+ DGS+ + ++E F+RL LQ +L+ +
Sbjct: 1289 RTSVLVAHRIKEDQSIPNS------RLLTGSADGSLASLTIVEEEAFKRLGLLQGQLMRN 1342
Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+ H+A LNP++FR N +P I+D LL YE LP+ Q E Q G R +
Sbjct: 1343 IQHMAALNPKAFR-IVKNEYVSKPLTRGILDGNLLGQYESLPINRQSEATQQIGADRVNV 1401
Query: 1418 LSNLNDL 1424
L + +L
Sbjct: 1402 LRDWIEL 1408
>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
SS1]
Length = 1431
Score = 290 bits (741), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 346/1404 (24%), Positives = 607/1404 (43%), Gaps = 181/1404 (12%)
Query: 103 LVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC 161
LV +RLHG V L A+ + ++ + D ++++F+DAKI++LE+ D+IH + S+H
Sbjct: 123 LVREHRLHGTVTGLEAVRTVHSLED--KLDRLLVSFKDAKIALLEWSDAIHDVMTVSIHT 180
Query: 162 FE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK--ASQGGSGLVGDED 218
+E +P+ + L RG L +VDP RC + + + IL SQ L+ E
Sbjct: 181 YERAPQLMALD---SPLFRGEL-RVDPLSRCAALSLPKDSLAILPFYQSQAELDLMEQES 236
Query: 219 TFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ +S S V++L D+D +++V DF F+ G+ P + +L + + TW GR+
Sbjct: 237 SQARDVPYSP----SFVLDLANDVDQRIRNVIDFAFLPGFNNPTVAVLCQYQQTWTGRLK 292
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ- 334
T + ++ +PLI + LP+D L + IGGV ++ +N I + Q
Sbjct: 293 EYKDTVGLFIFTLDLVTNNYPLITAVDGLPYDCLSLTPCSTAIGGVFILASNAIIFVDQA 352
Query: 335 SASCALALNNY---AVSLDSSQELPRSSF-SVELDAAHATWLQNDVALLSTKTGDLVLLT 390
S L +N + L P+ +++L+ A T++ + + K G + +
Sbjct: 353 SRRVILPVNGWPPRTSDLTMPSLTPQEQLRNLQLEGARFTFVDDKTLFVILKDGTVHPVE 412
Query: 391 VVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+V DG+ V RL ++ T P+V + + + F+GS +G S+L++ T+
Sbjct: 413 LVLDGKTVSRLSMADALARTTIPAV----VARVRDDYLFVGSMVGPSVLLR------TAH 462
Query: 446 LSSGLKEEFGDIEAD-----APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK- 499
+ +KEE D++A AP+ D NGE+ S G+ + +S +K
Sbjct: 463 VEEVIKEEDVDMDAGPATVVAPADTMDLDDDDDLYGPSGNGEQPSANGATNGTVDSVKKR 522
Query: 500 -TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------------ISKQSNY 539
++ D+L G + D ++GL N D +ATG + +S
Sbjct: 523 TVVRLSLCDALPAHGAISDMAFGLARNGDRVVPELIAATGSGELGGFHLFQRDMPTRSKR 582
Query: 540 ELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLL 596
+L + G +G+W++ ++ + R ++ +D +IIS +A + A
Sbjct: 583 KLHAIGGARGMWSLAVRQAMKVSGGTLERPSSQNDS----VIISTDANPSPGLSRIATRS 638
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVF---ERGARIL--DGS--YMTQDLSFGPSN 649
++ + G T+ A F ++ + R+L DG+ + +DL
Sbjct: 639 AHSDIAITTRIPGTTLGAAPFFQGTAILHILFNVTNAIRVLEPDGTERQIIKDLE----- 693
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
+ + S SI DP+VL+ D +I L +G+ + + + + +
Sbjct: 694 ----GTAPRPKIKSCSICDPFVLIIREDDTIGLFIGELERGKIRRKDMSPMGDKTSRYVA 749
Query: 710 CTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI-------YSVVCYESGA 762
+ D T L T V E + QG + + ++ G
Sbjct: 750 GGFFTD-----------TSGLLQTFVNEQAPAENVTSTLQGAMNAGNKSQWLILVRPQGV 798
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
+E++ +P F+ + + D+Y AL S ++ + ++ +I +
Sbjct: 799 VELWTLPKLTLAFSTTLLATLDPILTDSYDGPAL--------SLPQDPPRKPQELDIDQI 850
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
+ L R RP L +L G + Y+A P S + S +L V V
Sbjct: 851 VIAPLGESR-----PRPHLIVLLRSGQLAVYEAVAIPPPPEPLPS----TRSSTLLVKFV 901
Query: 883 S-ASRLRNLRFSRTPLDAYTREE----------TPHGAPCQRITIFKNISGHQGFFLSGS 931
AS+ +++ + E+ AP Q + G F +G
Sbjct: 902 KVASKAFDIQHPEEEQKSVLAEQKRISRLLVPFVTSPAPGQTFS---------GVFFTGD 952
Query: 932 RPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
RP W + + ++V P + AFT F+ + +G + +P D
Sbjct: 953 RPSWILSTDKGGVKVFPS-GHSVVQAFTTSSLWESRGDFLLYSEEGPSLVEWMPD-VQLD 1010
Query: 991 NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
+ P + +P ++ P+ F LIV+ + N+ S D V + D+ N+S
Sbjct: 1011 GHLPARSVP-RSRPYSNVVFDAST--SLIVAASSFQ--NRFASYDEDGNVVWEPDSPNIS 1065
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ET 1109
S L T+E ++ PD W T +E + + L +T+ +
Sbjct: 1066 S-PLCECSTLE-----LISPDG----WITMDGYEFAPNEFVNCIVSIPLETMSTESGMKD 1115
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASL 1163
+A+GT +GED+A +G V +F +P V + +L KG +S L +
Sbjct: 1116 FIAVGTTINRGEDLAVKGAVYIFEIVEVVPDPSTHVKRWWRLKLLCRDDAKGPVSFLCGI 1175
Query: 1164 QGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
G+L+ + G KI + + E L G+AF D +YV SL VKN +++GD KS++F+++
Sbjct: 1176 NGYLVSSMGQKIFVRAFDLDERLVGVAFLDV-GVYVTSLRAVKNLLVIGDAVKSVWFVAF 1234
Query: 1223 KEQGAQLNLLAKDFGSLDCF--ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
+E +L +L KD L C A F DG LS+V DE+ ++++ Y P ES GQ
Sbjct: 1235 QEDPYKLVVLGKD-PQLCCITRADLFFADGQ-LSIVTCDEEGIVRLYAYDPHDPESKSGQ 1292
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
LL R EFH + R ML + G + + L+ G++DGS+ + +D
Sbjct: 1293 HLLRRTEFHGQSE----YRSSMLVARRPKN----GDPEIPQARLVCGSVDGSLSTLTYVD 1344
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
E +RL LQ +L+ +V HVA LNP++FR N RP I+D LL+ +E LP+
Sbjct: 1345 EAASKRLHLLQGQLIRTVQHVAALNPKAFRMVR-NEYVSRPLSKGILDGNLLATFEDLPI 1403
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDL 1424
Q E+ Q GT R+ +L + L
Sbjct: 1404 ARQNEVTRQIGTDRATVLKDWASL 1427
>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1393
Score = 290 bits (741), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 337/1401 (24%), Positives = 590/1401 (42%), Gaps = 208/1401 (14%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
P S +G+ FLGS DS+L+ G S+ SSG K+ +
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
E D+ +S D +D + E+ + G + ++F D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
GPLKD ++G + S ELV +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560
Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
C +WT +S H ++ + A + E H Y+++S +A + E +++ +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613
Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
+ F + TI G L G+ RV+Q+ R DG DL P E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668
Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
S E +S S+ DPYV + D ++ LL D S V+ I +SK +SC
Sbjct: 669 DTS--EERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
LY DK TG+ +I A L Q + + + L I+ +P+
Sbjct: 725 LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767
Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
+ + G + E K S T +E + + V +L
Sbjct: 768 KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808
Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
WS S P+L + Y+ ++ T +P + L +N+ R+
Sbjct: 809 WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
+ D + EE P + I NISG F G P + + L
Sbjct: 858 -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGI------LKICQLPSGSTYDNYWP--VQKIPLKAT 1003
G + + C GFI + S+ + L C L + +Y+P +Q+IP+
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKVLCFILLCLTYCILSFHTGCHSYYPWTIQQIPIGEQ 973
Query: 1004 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1063
+ Y + +Y + S L D E+ + N S V+
Sbjct: 974 VDHLAYSSSSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRS 1022
Query: 1064 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGED 1122
++++ P W T+ +E+ + V+ ++L + T E + ++ +GTA+ +GED
Sbjct: 1023 SLKVVSPKT----W----TVIDSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGED 1074
Query: 1123 VAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKI 1175
+A+RG V +F + +P+ + V + +KGA++AL+ + QG L++A G K
Sbjct: 1075 IASRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKC 1134
Query: 1176 ILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQ 1228
I+ K G+ L +AF D +++VK ++ D K ++F + E+ +
Sbjct: 1135 IVRGLKEDGSLLP-VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYK 1189
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
++L AKD L+ A +FL DG+ L ++V+D N+ + Y P+ +S G +LLSR++F
Sbjct: 1190 MSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKF 1249
Query: 1289 HVGAHVTKFLRLQMLATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
H G ++ L + SS++ A K R +L + +GS+G + + E ++
Sbjct: 1250 HTGNFISTLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESY 1309
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
RRL +LQ +L +++ H GLNPR+FR S+G A R ++D +LL + + + ++
Sbjct: 1310 RRLSALQSQLTNTIEHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKV 1365
Query: 1405 EIAHQTGTTRSQILSNLNDLA 1425
EIA + G +I ++ ++
Sbjct: 1366 EIASRVGANEWEIKADFEAIS 1386
>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1406
Score = 288 bits (738), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 337/1469 (22%), Positives = 606/1469 (41%), Gaps = 226/1469 (15%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSGETKR-------RVLMD---------- 94
N+VV +N++ I+ VR +Q + E + + +R V MD
Sbjct: 51 NVVVARSNLLRIFEVREEPCPIQNQADDERERRSKVRRGTEAVEGEVAMDEQGDGFINIA 110
Query: 95 --------GISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFEDAKIS 143
+ V + LHG +E + I+S R D ++++F+DAKI+
Sbjct: 111 KSQKCPTHTPTVTRFYFVREHHLHGIVTGIEGVKIMSS----LEDRLDRLLISFKDAKIA 166
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ D++H L S+H +E +P+ + + S R L + DP RC + + +
Sbjct: 167 LLEWSDAVHDLITVSIHTYERAPQLMSID---SSLFRTEL-RTDPISRCAALSLPRHALA 222
Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
IL Q + L V D+D S +++L D ++++V DF F+ G+ P
Sbjct: 223 ILPFYQSQAELEVMDQD---QSQAKDVPYSPSFILDLPAQVDQNIRNVIDFAFLPGFNNP 279
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+ +L + + TW GR+ T + ++ + +P+I S LPH+ LL + +
Sbjct: 280 TIAVLFQTQQTWTGRLREFKDTVRLVIFTLDIVTQNYPIITSVEGLPHECLALLPCGTSL 339
Query: 319 GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFSVE-------LDAAHAT 370
GGV+++ +N I Y QS+ L +N + + ++P S + E L+ + A
Sbjct: 340 GGVVIITSNAIIYTDQSSKRVVLPVNGWVSRI---SDIPLPSLTPEEQLRNICLEGSRAV 396
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLG 426
++ + + K G + L +V DG+ V +L +S P + + I ++ L F +G
Sbjct: 397 FVDDRNLFVILKDGTVYPLEIVVDGKTVSKLTMS---PPLAQTSIPSVLRKLDDDHFLVG 453
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFG---DIEADAPSTKRLRRSSSDALQDMVNGEE 483
S +G S+L++ ++ ++EE D+EA AP+T + D N
Sbjct: 454 SSVGPSVLLK----------AAHIEEEVAEDHDMEA-APATVVYDADDMEFDDDDGNLPR 502
Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQS 537
++ + ++RDSL GP+ D ++ L N D +ATG
Sbjct: 503 VA-------QPMAKPTVIHLSLRDSLPAYGPISDMTFSLAKNGDRPVPELVAATGSGFLG 555
Query: 538 NYELVE--LP-----------GCKGIWTV------------YHKSSRGHNADSSRMAAYD 572
+ L + LP G +G+W++ Y K+ A++ +
Sbjct: 556 GFTLFQRDLPVRTKRKLHVIGGARGLWSLPIRQPVKASGISYEKAVNPFQAENDSLIIST 615
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
D + +S + V+ TA + G TI A F R V+ V R
Sbjct: 616 D-INPSPGLSRAGKNDVMITAR------------IPGTTIGAAPFFQRTTVLHVMTNALR 662
Query: 633 ILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
+L+ G + +D+ + + SI+DP+VL+ D SI L +G+
Sbjct: 663 VLEPGMQIIKDMD---------GNMPRPRIRACSISDPFVLILREDDSIGLFIGETERGK 713
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
+ + + + SC G E ++T +++ + A++ G
Sbjct: 714 IRRKDMSPMGDK----VSCFYTDTTGLLESNFENSTTPVGVTSTLSAAVNAGSKGQ---- 765
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ ++ G +E++ +P F+ D S + +VD++ A
Sbjct: 766 --WLILVRPQGIVELWTLPKLTLGFSADGLTSLQNVLVDSHDPPA-------------PS 810
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
Q V ++ + RP L L G + Y+
Sbjct: 811 LPQDPPRKPQEFDVEQILVAPIGESSPRPHLCVFLRSGQLTIYEVLPLG----------- 859
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGH------- 923
T+ +L + +++ ++ S + EE G ++ I++
Sbjct: 860 -RTTEALPKVRPAHVKIKFVKISSMAFEIQRPEEGEKGIIAEQKRIYRMFVPFVTSASPG 918
Query: 924 ---QGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
G F +G RP W + ++++P + AFT F+ T +
Sbjct: 919 VTFSGVFFTGDRPNWIFGTDKGGVQIYPS-GHAVVNAFTPCSLFESKGDFLMYTEEA--S 975
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
+ + YD P++ +P + + +L +V+ L+ + S D
Sbjct: 976 VSKWLPDFHYDGPLPLRSVPRGRAYSSLVFDPSTSL---LVAASSLQ--AKFASYDDDDN 1030
Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
+ + N+ + + T T+E ++ PD W T ++E V VTL
Sbjct: 1031 KIWEPETPNIGN-PMCDTSTLE-----LISPDM----WITMDGFEFATNEYINDVACVTL 1080
Query: 1100 FNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL----- 1153
T+ ++ +A+GT +GED+AARG ++ +P Y L
Sbjct: 1081 ETAGTEVGSKDFIAVGTTIDRGEDLAARGATYIYEIVEVVPDPAISPKRWYKLRLRCRDD 1140
Query: 1154 -KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
KG ++A+ G+L+ + G KI + + E L G+AF D +YV SL +KN +L+G
Sbjct: 1141 AKGPVTAVCGFHGYLVSSMGQKIFVRAFDSDERLVGVAFMDVG-VYVTSLRTLKNLLLVG 1199
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D KS+ F++++E +L LL KD + +F LSLV DE+ ++++ Y P
Sbjct: 1200 DAVKSLSFIAFQEDPYKLVLLGKDTQHVCVTNADFFFTDGELSLVTGDEEGIMRMYEYNP 1259
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
+ +S G+ LL R EFH + + T + R P + L+ G DG
Sbjct: 1260 QDPDSKDGRYLLLRTEFHGQSE------YRTSTTIARRLKDDPSIPQAK---LIIGGTDG 1310
Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
+ + P++E F+RLQ LQ +L ++ HVAGLNP++FR N +P I+D L
Sbjct: 1311 CLSSLTPVEEHAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVR-NDFVSKPLSKGILDGNL 1369
Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
L+HYE LP+ Q E+ Q GT R +L +
Sbjct: 1370 LAHYESLPIIRQNEMTRQIGTDRVTLLRD 1398
>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
B]
Length = 1440
Score = 288 bits (736), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 358/1487 (24%), Positives = 625/1487 (42%), Gaps = 226/1487 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDGISAASLE- 102
N+VV ++++ I+ VR +E S+ E +RR V MDG L
Sbjct: 49 NVVVARSSLLRIFEVR-EEPAPISSQKEDERERRASVRKGTEAVEGEVEMDGSGEGFLNM 107
Query: 103 ---------------------LVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFE 138
L+ +RLHG +E + I++ R D ++++F+
Sbjct: 108 GSVKSTAQNGSVQPPTINRFYLIREHRLHGIVTGIEGVRIVTS----LEDRLDRLLVSFK 163
Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
DAKI++LE+ D++H L S+H +E +P+ + L +S P ++ DP RC +L+
Sbjct: 164 DAKIALLEWSDAVHDLVTVSIHTYERAPQLMAL----DSSLFRPTLRADPLSRCAALLLP 219
Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
+ IL Q + L V ++DT S +++L D +++V DF+F+
Sbjct: 220 RDSIAILPFYQSQAELDVVEQDT---SQLRDVPYSPSFIVDLSAEVDDRIRNVIDFVFLP 276
Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
G+ P + +L +++ TW GR+ T + ++ + +P+I S LP+D + + A
Sbjct: 277 GFNNPTIAVLFQKQQTWTGRLREYKDTVSLYIFTLDLVTRNYPVITSTEGLPYDCFAVAA 336
Query: 314 VPSPIGGVLVVGANTIHYHSQSA-SCALALNNY-------AVSLDSSQELPRSSFSVELD 365
+ +GGV+++ +N I Y QS+ AL +N + V S+QE R + L+
Sbjct: 337 CSTALGGVVILASNAIIYVDQSSRRVALPVNGWPPRVSDMPVQALSAQEQLR---DLRLE 393
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGN 420
+H ++ + + K G + + +V DG+ V +L +S T P+V + +
Sbjct: 394 GSHFVFVDDRTLFIILKDGTVYPVELVLDGKSVSKLTMSSAVARTTIPTV----VRRVQT 449
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
F+GS +G S+L++ T+ + + +E D+E T + D+ M
Sbjct: 450 DHLFIGSTVGPSVLLK------TARVEEDIADE--DVEMSVAPTAVV-----DSTDTMDL 496
Query: 481 GEELSLYGSASNNTE------------SAQKT-FSFAVRDSLVNIGPLKDFSYGLRINAD 527
+E LYGS T S ++T ++ DSL GP+ D ++ L N D
Sbjct: 497 DDEDDLYGSTKETTHRVDGLVNGAADASKKRTVVHLSLCDSLPAHGPIADMTFALAKNGD 556
Query: 528 ------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRM 568
+ATG + L + LP G +G+W++ + + N +
Sbjct: 557 RAVPELVAATGSGTLGGFTLFQRDLPTRVKRKLHAIGGGRGMWSLPVRQAVKVNGSTYEK 616
Query: 569 AAYDDEYHAY---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
A + +H+ +IIS +A + A ++ + G TI A F +
Sbjct: 617 PA--NPFHSVNDSVIISTDANPSPGLSRIASRNQNGDITITTRIPGTTIGAAPFFQGTAI 674
Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+ V ++ + D S + + + SI DP+VL+ D +I L
Sbjct: 675 LHVMYNVTNVI--RVLEPDGSERQIIKDVDGNVARPKIRACSICDPFVLIIREDDTIGLF 732
Query: 684 VGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDG 741
+G+P + + + + + + + ++ C G + L + +T ++
Sbjct: 733 IGEPERGKIRRKDMSPMGDKTSRYLTGCFFTDTTGTFQTHLNPLAAGTEAATSTLQS--A 790
Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
+ G Q + ++C G LEI+ + F+ S + +VDTY L
Sbjct: 791 INAGSRSQ---WLILCRPQGTLEIWTLSKLTLAFSTTLIPSLESVVVDTYDVPHL----- 842
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
S ++ + ++ +I + V L RP+L L G + Y+ P
Sbjct: 843 ---SLPQDPPRKPQELDIEQIVVAPLG-----ESSPRPYLTVFLRSGQLAVYETIPVAPP 894
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
DP+ SRS ++ +RF + A+ ++ + K IS
Sbjct: 895 A------DPLPNSRSCTIL---------VRFRKVLSKAFDIQQQNEEVEKSVLAEQKRIS 939
Query: 922 ------------GH--QGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIV-AFTVLHNVNC 965
G G F +G RPCW + + ++V P S+V AFT
Sbjct: 940 RLLIPFVTSPNPGQTLSGVFFTGDRPCWILSTDKGGVKVFPS--GHSVVHAFTASSVWES 997
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
F+ + +G + +P G D + P + +P + Y +L V
Sbjct: 998 KSDFLLYSEEGPSLLEWIP-GVQLDGHLPSRTVPRNKAYSNVVYDPSTSLI-----VAAS 1051
Query: 1026 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1085
++ S D + + D N+ S+ T T+E +L PD W T
Sbjct: 1052 SSQSRFASYDEDGNIVWEPDASNI-SLPFCETSTLE-----LLSPDG----WVTLDGYEF 1101
Query: 1086 QSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1144
+E + VTL ++T+ + + +GT +GED+A +G +F +P
Sbjct: 1102 APNEFVNCLDCVTLETSSTESGTKDYIVVGTTINRGEDLAVKGAAYVFEIIEVVPDPTAQ 1161
Query: 1145 VTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLY 1197
+ + +L KG ++A+ + G+L+ + G KI + + E L G+AF D +Y
Sbjct: 1162 MKRWHRLKLHCRDDAKGPVTAMCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLDV-GVY 1220
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
V SL VKN +++ D KS++F++++E +L +L KD L +F +S++
Sbjct: 1221 VTSLCAVKNLLVISDAVKSVWFVAFQEDPYKLVILGKDPYPLYVTKADFFFAEGRVSIIS 1280
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH--VGAHVTKFL--RLQMLATSSDRTGAA 1313
DE ++I Y P ES GQ LL R EFH V + L RL+ + R
Sbjct: 1281 CDEDGVMRILEYDPHDPESKNGQHLLRRTEFHGQVEYRTSAILARRLKGVDIPQSR---- 1336
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
L+ G DGS+ + ++E +RL LQ +L +V HVAGLNPR FR
Sbjct: 1337 ----------LICGLTDGSLITMTYVEEAASKRLHLLQGQLTRNVQHVAGLNPRGFRIVR 1386
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
N RP I+D LL YE LP+ Q E+ Q GT R+ IL +
Sbjct: 1387 -NDYVSRPLTRGILDGNLLMAYEDLPIVRQDEVTRQIGTDRTTILKD 1432
>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1418
Score = 287 bits (735), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 343/1434 (23%), Positives = 584/1434 (40%), Gaps = 216/1434 (15%)
Query: 99 ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
+ L LV Y L G V SLA + D D++++AF DAK+S++E+D + H L S
Sbjct: 81 SKLVLVAEYPLEGTVLSLARIK--ALDTKSGGDALLIAFRDAKMSLVEWDPANHALSTIS 138
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-GDE 217
+H +E E + + DP RC + + IL Q G LV GD+
Sbjct: 139 IHYYEGEELHGAPWDADLGHYHNFLAADPSSRCAALKFGARHLAILPFRQLGDDLVEGDD 198
Query: 218 ---------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+ +G +SS ++L +D + H F+H Y EP
Sbjct: 199 YDPDFDEPMDAPAAKEKATNGDVAQTPYKSSFALSLPQIDPALTHPVHLDFLHEYREPTF 258
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
I+ + A + + + ++ K + S LP+D +K++ +P P+GG
Sbjct: 259 GIISANKAAAASLLYERRDLLTYTVFTLDLEEKASTALLSVAGLPYDTHKVIPLPLPVGG 318
Query: 321 VLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVA 377
L++GAN IH + A+A+N++A S +S ++ L+ A L +N
Sbjct: 319 ALLLGANQFIHVDQAGKTSAVAVNDFAKQCSSFPMSDQSELAMRLEGASIELLSPENGDL 378
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLG 430
L+ K G L +++ DGR V L + K + S T++G + F+GS G
Sbjct: 379 LVVLKDGSLAVISFKLDGRSVSGLSIRKISEEKGGHVVPTAASCTTSLGRNRMFIGSEDG 438
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
DS+L+ +T + LS K ++ AD D +G + SA
Sbjct: 439 DSVLLGWT--KKAAQLSR--KRSHAEMLADDAELSFDEEDLEDDDDLYGDGPSTAKTASA 494
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN-YELV------- 542
S+ S ++F + D ++++ P+KD + D + + + ++ +LV
Sbjct: 495 SSEA-SDPSNYTFRIHDIMLSLAPIKDVALASHKVTDTAIGTLERAADQLDLVVSTGRGA 553
Query: 543 -------------------ELPGCKGIWTVYHK--SSRGHNADSSRMA----AYDDEYHA 577
E + +W+V+ K + +G A S+ A A D +Y
Sbjct: 554 AGGLALMRREIDPVILRKGEFSNARAVWSVHAKKPAPKGMVAAGSQDAEAKLAADVDYDQ 613
Query: 578 YLIISL------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+LI+S E + TA E + TI G + G R++QV +
Sbjct: 614 FLIVSRSNGDGGEESAIFNITATGFEETNKGDFEREDAATINVGTIAGGTRIVQVLKAEI 673
Query: 632 RILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC 690
R D + Q L P E+GS ++S S ADPY+L+ D S+ +L D +
Sbjct: 674 RSYDSELGLDQIL---PMEDENGS---ELRIISASFADPYILVIRDDSSVIVLQADANGE 727
Query: 691 TVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
+ + S+K WLS + ++ +
Sbjct: 728 MEEIDRGDTLLSTK-------------------------WLSGCIHQSQSTGEKA----- 757
Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ + G L IF++P+ + V + ++ L T SSS
Sbjct: 758 --LAYLLSAEGGLHIFELPDLSKPVYVAASLG--------FLPPTLTADFTPRRSSS--- 804
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
K + + V EL + + P+L + ++ YQ Y F E
Sbjct: 805 -----KAALTEVIVAELG----DSTYKTPYLIVRTSSNDLVIYQPYHFPAHEVVKP---- 851
Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
+L + RL FS P A E+T G TI N+ G+ F++G
Sbjct: 852 --FFENLRWLKIPQPRLPE--FSEEP--ALESEDTGIGKESILTTI-ANVGGYSAVFMAG 904
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
+ P + + L ++ S+ + H C+ GF Y+ + G L++CQLP G Y
Sbjct: 905 TSPSFILKESSSLPRVIKMRTKSVKNLSSFHRAECDRGFAYINADGNLRVCQLPRGYRYG 964
Query: 990 DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
D W V+KI + + Y K++ L++ V KP L + E H+ N+
Sbjct: 965 DAGWAVKKISINQDVQAMCYHPPKDV--LVLGVGDKKPFT-----LPEDEHHHEWLEENI 1017
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTL-FNTTTK 1105
+ + VE+ +++L+ Q+ A I +++ E LT++V+ L + T
Sbjct: 1018 TFKPM-----VEQGMIKVLDT-------QSLAVIDTYELEAFEVVLTIKVLNLEVSENTH 1065
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1160
E + L+A+GT +++GED+ +RG + +F P T + +E++G+++A+
Sbjct: 1066 ERKQLVAVGTGFIRGEDLPSRGCIYVFEVINVVPEPGRPETNRRLKLIAKEEVRGSVTAI 1125
Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIHK 1215
+ QG LL+A G K ++ K GT L +AF D V+ + +L+GD K
Sbjct: 1126 TDVGSQGFLLMAQGQKCMVRGLKEDGTLLP-VAFMDMQCYVTVAKELNGSGMLLMGDAAK 1184
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
+F+ + E ++ L K ++ A +FL L L+V+D N+ Y P +
Sbjct: 1185 GAWFVGYTEDPYKMILFGKSRSKMEVMAADFLPHDKQLYLMVADGDCNLHALQYDPDHPK 1244
Query: 1276 SWKGQKLLSR---------------------------AEFHVGAHVTKFLRLQMLATSSD 1308
S GQ+LL + A+ H HV+ + A D
Sbjct: 1245 SLSGQRLLHKSTFHTGHFTTTMTLLPSSLSPTVSPSSADEHANGHVSPSPSPENDAMDID 1304
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
AP + +L T GS+ + PL E +RRL +LQ L+ ++ H GLNPR+
Sbjct: 1305 ---PAPAGTVQH---ILLTTQTGSLALLTPLSEQQYRRLGALQTYLIGALEHWCGLNPRA 1358
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
+R S G R IVD LL+ + L + + E A + G + S+L
Sbjct: 1359 YRAVESEGFGSR----GIVDGALLARWCELGSQRRAEGAAKVGVEEWVVRSDLE 1408
>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
JAM81]
Length = 1673
Score = 286 bits (732), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 219/774 (28%), Positives = 362/774 (46%), Gaps = 125/774 (16%)
Query: 753 YSVVCYESGALEIFDVPNFN--CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
+ V ++G L ++ +P+F C F + F + +D + + T N++ +E
Sbjct: 930 WCFVYTDTGHLLVYTLPDFKECCAFPL--FSTLPVLAMDVPLWRSRSIDSTFANTTGDE- 986
Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
+ VV L S P+L + +G + Y+ +F P TS +DD
Sbjct: 987 --------FEEILVVNLGN---SKDRQTPYLVCLAANGDLAVYK--IFVCP--TSSNDDD 1031
Query: 871 VS--------TSRSLSVSNVSASRLRN---LRFSRTPLDAYTRE---------------E 904
S SR+ + + A L+ +R R P D TR+ +
Sbjct: 1032 TSFVNSGTFKQSRTPAELELDAQNLKKRLAIRLVRIPHDQITRDLQFYTDNEGDKIDLVQ 1091
Query: 905 TPHGAPC----QRITIFKNI--SG---HQGFFLSGSRPCWCMVFRER------------- 942
P P Q + F I SG + G ++GSRPCW MV +
Sbjct: 1092 EPQHQPTFLKRQHLKPFDAIGWSGGNMYSGVVVTGSRPCWIMVALQSRQQDLDVISFDNS 1151
Query: 943 -----------------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
LR HP DG + F LHNVN HGF+Y+ +G+ +ICQLP
Sbjct: 1152 VACSTKLPPVPLLGTNMLRFHPMPVDGPMKCFAPLHNVNVAHGFLYINWKGLFRICQLPP 1211
Query: 986 GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS------VPVLKPLNQVLSLLIDQE 1039
+D+ WPV K+P+ T H++ Y Y + S +P + + V + +ID+
Sbjct: 1212 QFNFDHDWPVCKVPIHKTVHKVAYHYSSQTYAIATSTPERFDIPHAQYASAVAAAVIDE- 1270
Query: 1040 VGHQIDNHNLSSVDLHR---------TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1090
G ++ + + TV+ Y++ ++ + W+T +I + +E
Sbjct: 1271 -GDEMPDAERKVTGIRELSEIKPGMYEATVDRYKIELV----SSVTWETVDSIELSEAET 1325
Query: 1091 ALTVRVVTLFNTTTKENETL-LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
+ + V L + T + L LAIGT Y +GED+++RG++ L+ +P N T
Sbjct: 1326 VMALEAVDLSSKETISGKKLYLAIGTGYSRGEDLSSRGKLHLYDVIEVVPDPNNPQTNRK 1385
Query: 1148 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
V S++ + SA+ ++ +LL A GPKII+++ E+ G+AF D ++V SL+ V
Sbjct: 1386 FKHVDSEDDRSPFSAICTVNDYLLAAIGPKIIMYQLEDGEITGVAFLDVN-VFVTSLSSV 1444
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
KN I + DI KS++F++++E+ A+L +L +D L +A LID + L+L+V+D KN+
Sbjct: 1445 KNLIQICDIQKSVWFVAFQEEPAKLAVLGRDVHPLQGYAANMLIDDNQLALLVADGDKNL 1504
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
YAP +S G++L+ + E H+G HV+KF+R++ R A S + A
Sbjct: 1505 HTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKFIRMRRKPLL--RNDAIVFSKQYLNVA- 1561
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS-------NGK 1377
TLDG++ I P+ E F+RL L ++V S+ H+AGLNPR FRQ +G
Sbjct: 1562 --ATLDGALEIITPVSERIFKRLYGLYSRMVTSIEHIAGLNPRGFRQAQHRVRPITLSGF 1619
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
PGP I+D +LL Y L +Q +A G+ +++ +L ++ G F
Sbjct: 1620 IGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAIGSKDDRLMDDLLEVLTGLDFF 1673
Score = 197 bits (502), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 181/744 (24%), Positives = 330/744 (44%), Gaps = 152/744 (20%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
AA LEL +R+HGN+ SL ++ + S + D+++L+F++AK+S++E+ L
Sbjct: 87 AACLELAAQFRVHGNITSLGVVPM---NYSGKADALLLSFKEAKMSLVEYSQFTQKLVTV 143
Query: 158 SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
SMH FE E+ L S R P +KVDPQG C + +YG ++ IL Q G+ L+ D
Sbjct: 144 SMHYFEREEFKKLG----SIDRPPPEIKVDPQGYCAAMRIYGDRLAILPFKQDGADLLND 199
Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
+ S F I V+ DLD ++++ DF F+ GY P + I+++ E TW R+
Sbjct: 200 LNDANSKYPFRPSI----VLPFLDLDKSIRNIIDFTFLFGYAVPTIAIMYQTEQTWTARL 255
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--- 331
+ T I+ +S+ T + +P+++ LP++ L++VP+PIGG++V+ N I +
Sbjct: 256 GIRKDTVSIAVISLDTAEESYPVLYKIEKLPYNCTMLVSVPTPIGGLIVLSHNAIIFTDQ 315
Query: 332 -HSQSASCA----------LALNNYAVSLDSSQELP---------------RSSFSVELD 365
H+ +C + L Y + LD Q P ++ LD
Sbjct: 316 IHAPGIACIVNAYFDSETNIMLTPYELQLDMVQPRPPRPPSVFFAQNKYTDYKELAISLD 375
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRV----------VQRLDLSK---------- 405
+ ++ D+ LL + G+++ + ++ + V V+ L++
Sbjct: 376 GSRGMFISPDIFLLVLRDGEMIQVDLIGEEGVGRSWKRRKGGVKSFQLTRLGIRMTAPVH 435
Query: 406 -------TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE 458
+NP L+ +++ FLGSR G L + S + + L +F ++E
Sbjct: 436 LFPLADASNPLSLSGRNSSVPLGGSFLGSR-GSKLRYNYLFASSRTTDACLL--QFVEVE 492
Query: 459 ADAPSTKRLRRSSSDALQDMVNGE----ELSLYGSASNNTES------------------ 496
A S+ + +++ + + NGE + LYG ++ ++
Sbjct: 493 EFAKSSVSMNGAAN--MNNTDNGEDDELDKDLYGDSTTAKQTDTDMSALLSSDEHGHGEI 550
Query: 497 -AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD---------------ASATG-------- 532
+++T F + DS+ + PL+DF+ GL +ATG
Sbjct: 551 VSEQTLRFRLCDSVTVVSPLRDFAVGLPAETSEHRFSPKIGGCDLEIVAATGHGPHGHLA 610
Query: 533 -ISKQSNYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
+++ ++V ELP + +WT+ + D ++ D +H Y+I+S + T
Sbjct: 611 ILNRSVRPQIVTTFELPQIEEMWTI---RCAKFDKDYRLVSEPTDAFHKYVILSHSSGTS 667
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF--- 645
+L+ + TE+ ++ ++ G T+ G L ++QV G + D + D +
Sbjct: 668 ILKAGEAFTEMDDTT-FYQAGPTVGVGALLDETIIVQVHPNGVILFD--FSKYDFTIIDR 724
Query: 646 --------------GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
G E G ++ V+S S DPY +L M+ G I LL D +T
Sbjct: 725 LNTNRMHALYIFVEGTKLQEMRVGDDDIWVISCSFMDPYAMLLMNTGHIVLLSLDETTHQ 784
Query: 692 VSVQTPAAIESSKKPVSSCTLYHD 715
++ + E K+ VS+ +LY D
Sbjct: 785 ITQIS----EYKKRLVSTFSLYCD 804
>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
musculus]
Length = 1311
Score = 285 bits (729), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 202/678 (29%), Positives = 317/678 (46%), Gaps = 88/678 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 703 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 758
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 759 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 808
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 809 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 868
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 869 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 928
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 929 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIER 981
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 982 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1032
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
V G LKG ++A L QG +
Sbjct: 1033 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1052
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1053 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1110
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+F
Sbjct: 1111 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1170
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
HVGAHV F R + GAA G K N+ F TLDG IG + P+ E T
Sbjct: 1171 HVGAHVNTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1223
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+
Sbjct: 1224 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1283
Query: 1404 LEIAHQTGTTRSQILSNL 1421
E+A + GTT IL +L
Sbjct: 1284 SELAKKIGTTPDIILDDL 1301
Score = 283 bits (724), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 196/601 (32%), Positives = 310/601 (51%), Gaps = 68/601 (11%)
Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
+RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G P V+VDP
Sbjct: 11 KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPRVRVDPD 67
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HV 246
GRC +L+YG ++++L + + +E G G + S++I++R LD K ++
Sbjct: 68 GRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNI 124
Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K HP+IWS +LP
Sbjct: 125 IDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPF 184
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELD 365
D + LAVP PIGGV++ N++ Y +QS +ALN+ + + + LD
Sbjct: 185 DCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLD 244
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFF 424
A A ++ D ++S K G++ +LT++ DG R V+ K SVLT+ + T+ F
Sbjct: 245 CAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 304
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-----SSDALQDMV 479
LGSRLG+SLL+++T SS E D E KR+ + QD V
Sbjct: 305 LGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVGWTGGKTVPQDEV 362
Query: 480 NGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------L 522
+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G L
Sbjct: 363 D--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDL 418
Query: 523 RI------NADASATGISKQSNYELV---ELPGCKGIWTVYH----------KSSRGHNA 563
I + + + + K ++V ELPGC +WTV K+
Sbjct: 419 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQE 478
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
S+ A D H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R +
Sbjct: 479 PSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYI 537
Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
+QV G R+L+G L F P + + ++ ++ADPYV++ ++G + +
Sbjct: 538 VQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMF 587
Query: 684 V 684
+
Sbjct: 588 L 588
>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
Length = 1408
Score = 285 bits (729), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 339/1431 (23%), Positives = 604/1431 (42%), Gaps = 196/1431 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ + Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
++++C+ P + +D W +KI L + Y + Y + + V +L
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEYSSSSETYVIGTNQKV------DFNL 1008
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
D E+ + N +S + +++ V++L P W + ++++E + V
Sbjct: 1009 PEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMCV 1059
Query: 1095 RVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEV 1148
+ + L + T E + + +GTA +GED+AARG + +F + D P+ + +
Sbjct: 1060 KCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIKVVPEVDRPETNRKLKLI 1119
Query: 1149 YSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1204
+E+KGA+++L+ + QG L+ A G K I+ K G+ L +AF D YV L +
Sbjct: 1120 AKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKEL 1177
Query: 1205 KN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
K ++GD K ++F + E+ +L+L +KD G+L A +FL DG+ L ++V+D+
Sbjct: 1178 KGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDDC 1237
Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
NI + Y P+ S KG +LL R+ F G + L ATSS + G D +
Sbjct: 1238 NIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDLD 1296
Query: 1323 A------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
+ +L + GSI I P+ E ++RRL +LQ +L +++ H GLNPR+FR S+G
Sbjct: 1297 SSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDG 1356
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
R +VD +L+ + L + + EIA++ G +I ++L + G
Sbjct: 1357 IGGR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1403
>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pongo abelii]
Length = 1388
Score = 285 bits (728), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 207/702 (29%), Positives = 332/702 (47%), Gaps = 87/702 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 731 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++
Sbjct: 787 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
L N+ +RF + P + RE T GA + R
Sbjct: 830 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFR 879
Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
F++I G+ G F+ G P W +V R LR+HP DG + +F HNVNC GF+Y
Sbjct: 880 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNR 939
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVL 1032
Q ++ PS + P + L H +Y + S P +
Sbjct: 940 QEPQRLSGSPSRTXXXXPTPPGLLGLPG--HWCVTPTNPQVYAVATSTNTPCAR------ 991
Query: 1033 SLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSEN 1090
I + G + + + + + E + ++++ P W+ A I +Q E+
Sbjct: 992 ---IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEH 1044
Query: 1091 ALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 1045 VTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNK 1104
Query: 1148 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + V
Sbjct: 1105 FKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISV 1163
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
KNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+
Sbjct: 1164 KNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNL 1223
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT----- 1319
++ Y P+ ES+ G +LL RA+FHVGAHV F R + GAA G K
Sbjct: 1224 MVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGAAEGLSKKSVVWE 1276
Query: 1320 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
N+ ++ G IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + +
Sbjct: 1277 NKHITWLVSVRGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTL 1336
Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1337 QNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1378
Score = 173 bits (438), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 102/269 (37%), Positives = 152/269 (56%), Gaps = 29/269 (10%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
T K HP+IWS +LP D + LAVP PI
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPI 275
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 167/347 (48%), Gaps = 56/347 (16%)
Query: 379 LSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+S K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL+++
Sbjct: 285 ISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY 344
Query: 438 TCG----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASN 492
T +++ + KEE + +T + QD V+ E+ +YGS A +
Sbjct: 345 TEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVD--EIEVYGSEAQS 402
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASA 530
T+ A T+SF V DS++NIGP + + G L I + +
Sbjct: 403 GTQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGAL 460
Query: 531 TGISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHA 577
+ + K ++V ELPGC +WTV + G + S A DD H
Sbjct: 461 SVLQKSIRPQVVTTFELPGCYDMWTVIAPLRKEEEDNPKGEGTEQEPSTPEADDDGRRHG 520
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L+G
Sbjct: 521 FLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG- 578
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 579 --VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616
>gi|431908147|gb|ELK11750.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 671
Score = 285 bits (728), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 212/661 (32%), Positives = 328/661 (49%), Gaps = 114/661 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN + + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSAEGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 309
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT+V DG R V+ K
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLVTDGLRSVRAFHFDKAA 369
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC----GSGTSMLSSGLKEEFGDIEADAPS 463
SVLTS + T+ FLGSRLG+SLL+++T +++ + KEE P
Sbjct: 370 ASVLTSSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPASTVREAADKEE--------PP 421
Query: 464 TKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
+K+ R S+ QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP
Sbjct: 422 SKKKRVDSTVGWSGGKSVAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPC 477
Query: 516 K-------------------------DFSYGLRINADASATGISKQSNYELV-------- 542
+ + GL + + S + + E+V
Sbjct: 478 ANAAMGEPAFLSEEVPVWEVQGGGGVECTVGLWPHPSLAQFQNSPEPDLEIVMCSGYGKN 537
Query: 543 ------------------ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDDE 574
ELPGC +WTV ++ +G + S+ A D
Sbjct: 538 GALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEQEETPKGEAVEPEPSAPDADDDGR 597
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+L
Sbjct: 598 RHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLL 656
Query: 635 D 635
+
Sbjct: 657 E 657
>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 1408
Score = 284 bits (727), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 339/1431 (23%), Positives = 604/1431 (42%), Gaps = 196/1431 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQICTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ ++ Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLILYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTTLSKP---L 894
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 895 RVLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
++++C+ P + +D W +KI L + Y + Y + + V +L
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEYSSSSETYVIGTNQKV------DFNL 1008
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
D E+ + N +S + +++ V++L P W + ++++E + V
Sbjct: 1009 PEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMCV 1059
Query: 1095 RVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEV 1148
+ + L + T E + + +GTA +GED+AARG + +F D P+ + +
Sbjct: 1060 KCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLI 1119
Query: 1149 YSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1204
+E+KGA+++L+ + QG L+ A G K I+ K G+ L +AF D YV L +
Sbjct: 1120 AKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKEL 1177
Query: 1205 KN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
K ++GD K ++F + E+ +L+L +KD G+L A +FL DG+ L ++V+D+
Sbjct: 1178 KGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDDC 1237
Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
NI + Y P+ S KG +LL R+ F G + L ATSS + G D +
Sbjct: 1238 NIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDLD 1296
Query: 1323 A------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
+ +L + GSI I P+ E ++RRL +LQ +L +++ H GLNPR+FR S+G
Sbjct: 1297 SSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDG 1356
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
R +VD +L+ + L + + EIA++ G +I ++L + G
Sbjct: 1357 IGGR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1403
>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
Length = 1468
Score = 284 bits (727), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 204/693 (29%), Positives = 334/693 (48%), Gaps = 70/693 (10%)
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
+G L I+ +P+F F +F IVD DS G K I
Sbjct: 811 TGILRIYSLPDFKEHFACPQFSIAPDLIVD--------DS--------------GVKSRI 848
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ + E+ M P L I+ Y+A+ + + + S + V
Sbjct: 849 PTNNIQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYV 908
Query: 880 SNVSASRLRNLRFSRTPLDAYTREET--------------PHGAPCQR--ITIFKNISGH 923
S S+S + R +D + +T QR + F +++G+
Sbjct: 909 SRKSSSHESKPKKKRGIIDEFEIPDTDLNEEEEDLKLSTKKMDKKIQRKLLIPFTDVAGY 968
Query: 924 QGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQ 982
G F++G++P W M + +RVHP + IV FT HNVNC HGFI V S+ +++ +
Sbjct: 969 AGVFVAGAQPAWLMCSCKSFVRVHPMKTEHEIVGFTQFHNVNCQHGFITVDSKSTIQLSR 1028
Query: 983 LPS-GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
L + G YD W +QK+ L T H+I Y +Y ++VS V + + ID G
Sbjct: 1029 LRTEGINYDLDWVIQKVLLGQTVHKIQYHPVMRVYAVLVSSSVPTRMKNDDNQYID---G 1085
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-F 1100
+ D +E++ + ++ P W+ + + E ++ L
Sbjct: 1086 KETDERGPGEF----LPEMEQFSMILVSP----VTWEIVDKVEFEEFEQCFSLECALLDS 1137
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQ--NLVTEVYSKELKG 1155
T+ + + IGT ++GED +G + ++ DNPQ + V ++++KG
Sbjct: 1138 KQTSTGRKYYMIIGTGTLKGEDTTMKGSIRMYDIIEVVPEPDNPQTNHKFKPVLTEDVKG 1197
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
A++A+ ++ GHL G K+I+ E L G+AF D +YV S++ +KNFIL+GD
Sbjct: 1198 AVTAMCTVSGHLAACIGSKVIVWSLEDDERLVGVAFIDVQ-IYVTSMSSIKNFILIGDAQ 1256
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
KSI+FL ++ + A+L LL KD+ S D +F+ID +L L+V D +NI ++ YAP
Sbjct: 1257 KSIWFLGFQLEPAKLTLLGKDYQSFDVGCVDFIIDDKSLYLIVGDTNENIDLYQYAPFNL 1316
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+S+ GQKL+ R +FHVG+ V +RL + + G + + R L GT +GSI
Sbjct: 1317 QSFGGQKLMRRGDFHVGSQVQTMVRLPQIEKTEK------GFEYSRRHFCLCGTFNGSIA 1370
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD---SIVDCEL 1391
I+ + E TF+RL +L LV+++ HVAGLNPR+FR G R + +++D +L
Sbjct: 1371 VISSISEKTFKRLNTLYGHLVNNLQHVAGLNPRAFRLI--KGPKQRMSTNRTKAVLDGDL 1428
Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ + L +EEQ E Q GTT ++I+ +L D+
Sbjct: 1429 IFEFAGLSIEEQKETTKQIGTTVTRIMEDLVDI 1461
Score = 179 bits (453), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 163/650 (25%), Positives = 277/650 (42%), Gaps = 97/650 (14%)
Query: 88 KRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
K+ ++ + LELV ++++G + ++ + DS++L F DAK+S+LE+
Sbjct: 87 KKGGMISDTTLGRLELVAQFKMNGIITTMGTVRTNSPRGREGCDSLLLGFSDAKMSLLEW 146
Query: 148 DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL---VKVDPQGRCGGVLVYGLQMIIL 204
S + + S+H +E E+ ++ F P + +DPQ RC Y ++ +L
Sbjct: 147 SSSTNSIITVSIHYYERDEF------KKEFLTNPYPSAIHIDPQQRCAVFNFYDNKLAVL 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
Q S + + G S +I+L LD +K+V D F+ Y EP + I
Sbjct: 201 PFRQ--SDKLDERQGEGEEDEEKWPYYPSFIIDLATLDSRIKNVIDMTFLSDYYEPTLAI 258
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
L + E TW GR+ T + +S+ T K +P+I+S LP+D +KL+A+P P+ G+L
Sbjct: 259 LFQPEQTWTGRLGNNKDTVSLVVISLDITAKIYPIIYSIDKLPYDCFKLVAMPKPVTGML 318
Query: 323 VVGANTIHYHSQ-SASCALALNNYAVSLDSSQELPRSSFS-------VELDAAHATWLQN 374
V+ AN+I + SQ S +A+N Y + + P + + L+ A A
Sbjct: 319 VIAANSILHVSQGSPGMGVAVNGYT---KKTTDFPGMIYEPSLIELGLSLEGAKALAFGG 375
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN---------------PSVLTSDITTIG 419
D L+ + G L+ V DG V + +S+ P +L S + +
Sbjct: 376 DRCLIFMQNGHWALVEVRRDGNKVVGMAISEIKHDLPVMEKKPPRFDTPPLLASVPSCVT 435
Query: 420 N----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
N FFLGSR+GDSLL+++ + D + + D +
Sbjct: 436 NVKAGEYFFLGSRVGDSLLIKYDANRVNHQSVAPPVFRVCDTMLNTGPIVDMAVGDVDTV 495
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
+ + +L L S+ + A F +I P F++ D+ A
Sbjct: 496 EQQEDWPQLELVSSSGHGKNGALCVFQ-------RHIYPQTSFAFH---QFDSQA----- 540
Query: 536 QSNYELVELPGCKGIWTVY-HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
IW++ K+ + N DD++ L IS T+VL D
Sbjct: 541 --------------IWSIKCRKNDQQQNE--------DDDFDKLLFISKSKSTLVLSAGD 578
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSES 652
L EV ++ +G TIA LF R++QV+ G +L +G + ++
Sbjct: 579 ELQEV--KTGFYTRGSTIAVSTLFDATRIVQVYATGVMVLTPEGKRI-----------QT 625
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-TVSVQTPAAIE 701
+ ++ SI DPY+LL + + I L GD ST + +Q P I+
Sbjct: 626 VPIPRGAKIVEASIHDPYILLTLDNNKILALQGDASTKDIIHIQLPNHIK 675
>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Amphimedon queenslandica]
Length = 1408
Score = 284 bits (727), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 176/529 (33%), Positives = 273/529 (51%), Gaps = 42/529 (7%)
Query: 917 FKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
F NI+G+ G F+ G P W M R L +HP DG + +F NVNC GF+Y +
Sbjct: 894 FSNIAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGPVQSFAPFDNVNCPSGFLYFNKE 953
Query: 976 GILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
L+I LP+ +YD+YWPV+K+PLKATPH + Y E ++ +I S P Q ++++
Sbjct: 954 SELRISVLPTQLSYDSYWPVRKVPLKATPHFVGYHMESKVHVIIASTP------QPVTVI 1007
Query: 1036 IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
D G D D Y+ EE Y +++L P W+ TIP E
Sbjct: 1008 PDPN-GETEDALETVERDGRFVYSQEETYYLQLLSPTS----WE---TIPHSKYEMEAHY 1059
Query: 1095 RVVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QN 1143
V + + ETL + +GT GE+++A+G+VL+F P Q
Sbjct: 1060 HVTDMKVMRLRSQETLSGRKEYIVVGTMATFGEELSAKGKVLIFDVSVVIPEPGKPFSQY 1119
Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLN 1202
+ +Y +E K ++ L + G +L A G KI + ++ +L +AF DA Y+ +
Sbjct: 1120 RLKNLYDQEQKWPVTGLECVNGLILTAMGQKIFMWQFKDNKDLLAVAFIDA-ETYIHTAQ 1178
Query: 1203 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
+K FIL GD+ +SI L + E L+L+++D ++ F+T F+IDG L +VSD +
Sbjct: 1179 SIKGFILTGDVTRSIQLLHYNEDRRSLSLISQDPNPMEVFSTTFMIDGKALGFLVSDSDR 1238
Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
NI +F Y P+ S G L+ + HVG+ V FL ++ +T A G+ + +
Sbjct: 1239 NITLFQYQPENPASSGGANLVRCGDIHVGSLVNVFLNIRC------KTSAGLGASREMKI 1292
Query: 1323 AL-------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
AL FGTLDG IGC+ P+ E +RRL LQ K+ + H+AGLNP++FR F +
Sbjct: 1293 ALADKRQCTFFGTLDGGIGCLLPIPEKVYRRLSMLQVKMTQGMRHMAGLNPKAFRTFQTR 1352
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ +I+D LL Y L +E+ + + Q GTT +QI+ +L ++
Sbjct: 1353 HQYLHNAQRNILDGTLLYQYLSLTAKEKFDFSKQIGTTVAQIMEDLKEI 1401
Score = 224 bits (571), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 235/880 (26%), Positives = 376/880 (42%), Gaps = 207/880 (23%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTG+ +C S HS + V V +
Sbjct: 2 YAVYREVHPPTGVEHCTSCHFVHSEKEQV---------------------------AVAS 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
+++ I+ V ++ +N G+ K L + HGN++SL +
Sbjct: 35 TSLLRIFDV------AQLQRNDGKAK------------LVQCLEFSFHGNIQSLDKVRLR 76
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
+D RD ++L+F DAK+S++E++ +GL+ SMH FE E ++ G P+
Sbjct: 77 HSD----RDCLLLSFNDAKLSIVEYNPETNGLKTVSMHQFEDEE---IRGGILHNDSRPV 129
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
VKVDP+GRC +L++G + + Q L D S + I ++ I+LRDL
Sbjct: 130 VKVDPEGRCAVMLLFGSHLAVCPFQQD---LSIDTPLSPSPSLDTHDILPTYTISLRDLP 186
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ +KD F+ GY P ++ L E TWAGR+S + + M+ LS++T+ K H +IW+
Sbjct: 187 EPLPVIKDMTFIEGYTSPTLLFLSEVSPTWAGRISLRQDSMMLLGLSLNTSDKSHTVIWT 246
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALN---NYAVSLDSSQELP 356
NLP D+ L VP P+GGVLV GANT+ Y +QS+ L+LN +Y E
Sbjct: 247 LKNLPFDSSYLHPVPKPLGGVLVFGANTLIYLNQSSPPYGLSLNSITDYTTRFLLKNE-- 304
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSD 414
S + LD + + ++ N+ L+S ++GD+ ++T+ D R V+R+ K S+L+S
Sbjct: 305 -GSLGIRLDCSQSVFISNEQLLVSLQSGDIYIVTLFPDSGMRGVKRITFDKAASSILSSC 363
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
I +I FLGSRL +SLL+++ ++ + + E G A
Sbjct: 364 ICSIKPHFLFLGSRLANSLLLRY-----STTVKQNIVEPIG-----------------GA 401
Query: 475 LQDMVNGEELSLYGSASNNTESAQK----TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ D+ +++ +YG ++ + ++ +S V DSL+ IGP+ + G A S
Sbjct: 402 ILDL---DDIEVYGESAVSQSTSSSSLLTNYSLEVCDSLLCIGPVVKATIGE--PAFLSE 456
Query: 531 TGISKQS-NYELV--------------------------ELPGCKGIWTV---------- 553
+ K + ELV ELPGC +WTV
Sbjct: 457 EFVDKSDLDLELVLCSGHGKNGALSVLQRTIRPQVVTTFELPGCIDMWTVKSEGEEEEKG 516
Query: 554 ----YHKSSRGHNADSSRMAAYDDE-YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+ G D SR H YLI+S TMVL+T +TE+ +S + Q
Sbjct: 517 EETKEEGQNEGGEKDQSREKEEKGSGQHDYLILSRSDSTMVLQTGQEITELDQS-GFATQ 575
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSV 664
T+ AGN+ ++Q R+L G Y+ D+ G V V
Sbjct: 576 SATVFAGNV--GSFIVQATRTDIRLLKGIKQLCYVALDMGGG--------------VKCV 619
Query: 665 SIADPYVLLGMSDGSIRLL--------VGDPS--------------------TCTVSVQ- 695
+ PYV++ + +G I LL + PS T S+Q
Sbjct: 620 DVCSPYVIVLLMEGEIGLLKLVDESLVLSWPSLGNNTPVNHISAYTDTSGLFDVTSSLQF 679
Query: 696 ----------TPAAIESSKKPVSSCTLYHDK-----GPEPWLRKTSTDAWLSTGVGEAID 740
P A K+P S +L +D+ GP K + + +
Sbjct: 680 EGDGSEKEEEVPIAPPPVKRPHLSSSLLYDEDELLYGPVKTEVKEENASPMEASLAAE-- 737
Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
+ P + ++C E GALEI+ VP F VF V F
Sbjct: 738 -PEAPPPITPTHWCLLCKEDGALEIYSVPEFQFVFAVRNF 776
>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
owczarzaki ATCC 30864]
Length = 1725
Score = 283 bits (725), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 177/519 (34%), Positives = 284/519 (54%), Gaps = 38/519 (7%)
Query: 920 ISGHQ---GFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ GHQ G F+ G RP W ++ R+ LR H L DGS+ AF+ +N C GF+Y T+
Sbjct: 1218 LGGHQLCSGVFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFVYFTT 1277
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
QG L+ CQL + +DN WPV+++PL+AT H I Y Y L+ S P KP + L
Sbjct: 1278 QGTLRFCQLAPTTNHDNPWPVRRVPLRATAHYIGYHEVFRTYVLVTSHP--KPYFNLPRL 1335
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
D+ ++ T + + ++++ P W++ + + + E +V
Sbjct: 1336 TNDETYTPVPYTPKPRAI----PATFDTFSLQLISPVT----WESIHSFDLPAFERVTSV 1387
Query: 1095 RVVTLFNTTTKENETLL----AIGTAYVQGEDVAARGRVLLFS---TGRNADNPQN--LV 1145
+ + T++E T L IGT ++GEDV GR+++F + PQ +
Sbjct: 1388 DIAAI---TSQETVTGLKDYVVIGTTVIEGEDVTCHGRIIVFEIIDVVPEVNRPQTNRKL 1444
Query: 1146 TEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIV 1204
+ +E KGAI+AL+ + GHL+ G KII+ ++ + ++G+AF D +VVS++ +
Sbjct: 1445 KYLMEREQKGAITALSHVCGHLVSCIGQKIIIWQFASDDTMDGVAFIDTQ-TFVVSVSAI 1503
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
KNFIL+GD++ S++ L + E L +A+DF + +T+FL+DGS+L + +D +N+
Sbjct: 1504 KNFILVGDLNNSVFLLRFNETTKHLGFIARDFDHMSVASTQFLVDGSSLGFLATDSHQNL 1563
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRF 1322
+F Y P ES GQ+LL + +FHVG+HV + LR+ + L S DR GA+ R
Sbjct: 1564 VVFAYNPLNRESNNGQRLLRQLDFHVGSHVQQVLRMVPRSLPVSVDR-GAS-----VKRH 1617
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
L TL+GS+ +AP+ E TFRRL+ LQ++LV + AGLNP +R + K
Sbjct: 1618 IDLLATLEGSLNALAPIGETTFRRLEWLQRQLV-GLQQRAGLNPIGYRAYRFPRKMTTTR 1676
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+++D ELLS + L L EQ E+A Q T ++ ++
Sbjct: 1677 AGNVIDGELLSRFLYLGLAEQRELARQRRNTPEDLIDDI 1715
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 135/261 (51%), Gaps = 23/261 (8%)
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKHHTCMISA 285
R+ S+ I L +L + HV D F+ GY EP + +L E +W GR + TC + A
Sbjct: 294 RLRPSYEIKLTELQRHIHHVIDIEFLTGYFEPTLALLFEPNAPSWTGRTVQRKDTCSMVA 353
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNN 344
LSI+T+ HP++WS LP ++ +++AVP P+ G ++V + I + SQS+ + ++LN
Sbjct: 354 LSINTSSHSHPVVWSVDKLPFNSMRVMAVPRPVCGTVIVTPDAILHLSQSSPTVGVSLNE 413
Query: 345 Y-AVSLDSSQELPR------SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV 397
++S + +P SS + +L + L T+ G++ + T++ +GR
Sbjct: 414 LSSMSTELRLGIPENKHPDGSSVVYNMQEGRCCFLTPETLLAVTEGGEMFVATLLTEGRT 473
Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLF-FLGSRLGDSLLVQF----TCGSGTSMLSSGLKE 452
V R+ + SVL +T++ N + F+GSR DS+L++ T + L+S +
Sbjct: 474 VVRIRIEPAGASVLPCCMTSLYNGQYCFIGSRASDSVLLRVMNNATAAADKRRLASAALD 533
Query: 453 EFGDIEADAPSTKRLRRSSSD 473
+F + KR R S ++
Sbjct: 534 DFS-------ANKRSRSSDTN 547
Score = 82.4 bits (202), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 120/253 (47%), Gaps = 30/253 (11%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHS--RADYVPQIPLIQTEELDSELPSKRGIGPVPNL-- 58
FA ++ H PT + +C T++ R V + L++ +D+ S G G L
Sbjct: 2 FAYFRQQHPPTAVEHCVEASFTNAAERQLVVARANLLEVYRIDAATAS--GSGWRSELSS 59
Query: 59 --VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAA---------SLELVCHY 107
+TA +++ R G + S + + + +A LELV +
Sbjct: 60 GSALTAQTAGAMHLGRAAGYGGNDGGRSDDAATEINTRSLHSAPATPPALQHKLELVASF 119
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
L GNVES+ + +RDS++LAF++AK++V+++D + L+ S+H +E
Sbjct: 120 NLSGNVESIGVARLAHC----KRDSLLLAFKEAKVAVVDYDPATLDLKTISLHMYED--- 172
Query: 168 LHLKRGRESFARG----PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
+ ++ GR++ A P+++VDP +C LVYG ++IIL Q + ++D + S
Sbjct: 173 IEMRGGRDATALQAVWPPVIRVDPMRQCAAFLVYGTKLIILPFRQESH--LDEDDDYQSA 230
Query: 224 GGFSARIESSHVI 236
+A + S I
Sbjct: 231 QAPAASVPPSAQI 243
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 100/198 (50%), Gaps = 27/198 (13%)
Query: 537 SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET-ADL 595
S++E EL G +G+W+V+ S + + +++ D H+ L+ S + T+V T +
Sbjct: 735 SSFE--ELTGGRGLWSVF---STALDPSLAALSSLDGASHSLLVASRDDSTLVFTTTGEE 789
Query: 596 LTEVTESVDYFVQGRTIAAGNLF---GRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
L ++ ES +F G TIA GN+F G+ ++ VF G R++DG + Q+L +S
Sbjct: 790 LEQIAES-GFFTAGATIAIGNVFAANGKILIVDVFAHGIRLVDGVNLRQELLLAQLSSV- 847
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSC 710
S ++ SIA+ VL +DG++ + GD S T AA +PV +
Sbjct: 848 ------SEIIHASIAESSVLALHADGAVSFVQFTGDTQELVASTATVAA----GQPVVAV 897
Query: 711 TLYHDKG----PEPWLRK 724
+LY D+ PE L++
Sbjct: 898 SLYADRSGLFVPEAVLQR 915
>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Anolis carolinensis]
Length = 1294
Score = 283 bits (725), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 213/680 (31%), Positives = 336/680 (49%), Gaps = 89/680 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV + + +Y + E + +S S E K LELV + GNV S+
Sbjct: 29 NLVVAGTSQLYVYRLNHDSESTTKSDRSSEGKSH-------KEKLELVAAFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + DE G G + S++I
Sbjct: 135 NVHIPKVRVDPDGRCAVMLIYGTRLVVLPFRRD---TLTDEHEGVVGEGQKSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R+LD K ++ D F++GY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 192 DIRELDEKLLNIIDMQFLYGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
HP+IWS NLP D + LAVP PIGGV++ N++ Y +QS Y VSL+S
Sbjct: 252 HPVIWSLSNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSLNSLTN 305
Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ P + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 GTTVFPLRIQEGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKA 365
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
SVLT+ + T+ FLGSRLG+SLL+++T +++ K+ E KR
Sbjct: 366 AASVLTTCMITMDPGYLFLGSRLGNSLLLRYTEKLQEPPVNAA-KDATEKTEEPPVKKKR 424
Query: 467 LRRSSS-----DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ + ++ A QD V+ E+ +YGS + + + T+SF V DS++NIGP + + G
Sbjct: 425 VEQQANWAGGKSAPQDEVD--EIEVYGSEAQSG-TQLSTYSFEVCDSILNIGPCANAAMG 481
Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
L I + + + + K ++V ELPGC +WTV
Sbjct: 482 EPAFLSEEFQNSLEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 541
Query: 557 SSRGHNADSSRMAAY----------DDEYHAYLIISLEARTMVLETADLLTEVTESVDY- 605
D+ +A D + H +LI+S E TMV + +D
Sbjct: 542 QKAEQEEDAQGESAEKEPSPPEPPDDGKRHGFLILSREDSTMVNPANGPTGQEIMELDTS 601
Query: 606 -FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
T AGN+ R ++QV G R+L+G L F P + S ++
Sbjct: 602 GLAPRSTQDAGNIGENRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQC 651
Query: 665 SIADPYVLLGMSDGSIRLLV 684
++ADPYV++ S+G + + V
Sbjct: 652 AVADPYVVIMSSEGQVTMFV 671
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 139/559 (24%), Positives = 243/559 (43%), Gaps = 89/559 (15%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ V+ E+G +EI+ +P + VF V F G+ +VD+ + +E + EE
Sbjct: 786 WCVLVRENGTMEIYQLPEWRLVFLVKNFPMGQRVLVDSSFGQPASQAE----AKKEEVIR 841
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
Q + + +V L ++ SRP+L + D +L Y+A+ + S+
Sbjct: 842 QTEMPLVKEVLLVALGNRQ-----SRPYLL-VHVDQELLIYEAF-----NHDSQLGQTNL 890
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREE--TPHGAPCQRITIFKNISGHQGFFLSG 930
R V + R + R S+ ++ EE P G R F++I G+ G F+ G
Sbjct: 891 KVRFKKVPHNINFREKKPRPSKKKTESAGGEEASVPRGR-VARFRYFEDIYGYSGVFICG 949
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
P W + VTS+G L++ + +
Sbjct: 950 PSPHWLL----------------------------------VTSRGALRLHPMTIDGPIE 975
Query: 991 NYWPVQKIPLKATPHQITYFAEKN----LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
++ P + P YF + ++ +P + + + I++ V +
Sbjct: 976 SFAPFHNV---NCPKGFLYFNRQGTGGGIHNACSRIPRMTGEDDMEFETIERGVLKCVPG 1032
Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1106
DL ++ ++ ++ E+ ++ V+L + T
Sbjct: 1033 EGFGHPDLILSFKID-----------------------LEEWEHVTCMKTVSLKSEETVS 1069
Query: 1107 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1160
+ +A+GT +QGE+V RGR+L+ P +T+ +Y KE KG ++AL
Sbjct: 1070 GLKGYIAVGTCLMQGEEVTCRGRILIMDIIEVVPEPGQPLTKNKFKVLYEKEQKGPVTAL 1129
Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
G+L+ A G KI L +L G+AF D LY+ + VKNFIL D+ KSI L
Sbjct: 1130 CHCNGYLVSAIGQKIFLWSLKDNDLTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLL 1188
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
++E+ L+L+++D L+ + +F++D L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 1189 RYQEESKTLSLVSRDAKPLEVYCVDFMVDSCQLGFLVSDRDRNLLVYMYLPEAKESFGGM 1248
Query: 1281 KLLSRAEFHVGAHVTKFLR 1299
+LL RA+FHVGAHV F R
Sbjct: 1249 RLLRRADFHVGAHVNAFWR 1267
>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
Length = 1460
Score = 282 bits (722), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 371/1518 (24%), Positives = 614/1518 (40%), Gaps = 271/1518 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
NLVV +++++++ +V + S +SG R DG+ A+
Sbjct: 28 NLVVAKSSLLQVFRTKVVSTELEASPDSGHRSRNAARYESRLANDDDGLEASFLGGDSLA 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDS 150
L LV L G V LA + A + DS+++A +DA++S++E+D
Sbjct: 88 LRTDRANVTKLVLVAETPLAGTVTGLARIKTPHARHGC--DSLLIALKDARLSLVEWDAE 145
Query: 151 IHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMIIL 204
H L S+H +E E + S PL ++ DP RC + + IL
Sbjct: 146 RHALATVSIHYYEQEEL------QGSPWAAPLSHYVNFLEADPGSRCAALKFGARNLAIL 199
Query: 205 KASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN-----------------LRDLD--MK 244
Q + +GD D G A+ +SS VI+ L +LD +
Sbjct: 200 PFRQADEDIDMGDWDG-ELDGPRPAKDQSSAVIDGASNIEDTPYSPSFVLRLSNLDPSLL 258
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
H F+H Y EP IL H T M+ L + K I S L
Sbjct: 259 HPVHLAFLHEYREPTFGILASTASASNSLGRKDHFTYMVFTLDLQQ--KASTTILSVGGL 316
Query: 305 PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
P D ++++ +P+P+GG L+VG+N IH + +A+N S + +S ++
Sbjct: 317 PQDLFRVVPLPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTRQCTSFGLVDQSELNLR 376
Query: 364 LDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT 417
L+ L D+ L+ G + L+T DGR V L+L + + S++ +TT
Sbjct: 377 LEGCVVDVLTADLGELLVILNDGRMALVTFRIDGRTVSGLELRMLPASSGGSIIPGRVTT 436
Query: 418 ---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+G + F G GDS+L FG + + + +R R+
Sbjct: 437 LSRVGRNAMFAGLEEGDSVL-------------------FGWAKKQSQAGRRRPRAKDAV 477
Query: 475 LQ------------DMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGPLK 516
LQ + + ++L A+ S+ + +F + D LV+I P++
Sbjct: 478 LQMDEEAGEEEEEEEDEDEDDLYGEEPAARQQPSSTASSLMTGDLTFRIHDRLVSIAPIQ 537
Query: 517 DFSYGLRINADAS-----------------ATGISKQSNYELV------------ELPGC 547
+YG + S A G K ++ + E P
Sbjct: 538 AMTYGQPVWLPGSEEERNSAGVHSDLQLVCAVGRDKSASLATINLAIAPKVIGRFEFPEA 597
Query: 548 KGIWTV-----YHKSSRGHNADSSRMAAYD--DEYHAYLII------SLEARTMVLETAD 594
+G WT+ KS +G A +S YD +Y ++I+ E + TA
Sbjct: 598 RGFWTMCAKKPIPKSLQGDKAGASLGNGYDTSGQYDKFMIVGKVDLDGYEKSDVYALTAA 657
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
+ + G TI AG + R+IQV + R DG + ++Q L P E
Sbjct: 658 GFESLGGTEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIL---PMQDEE- 713
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
+G+E V S S+ADP++L+ D S+ + D S + + S+ K ++ C LY
Sbjct: 714 TGAEPRAV-SASVADPFLLIIRDDSSVFIARIDSSNELEELDKDDPVLSTTKWLTGC-LY 771
Query: 714 HDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
D S + +G+ A + + SGAL I+ +P+
Sbjct: 772 AD----------SAGVFAEESMGKPASTAQC-------VLMFLLSASGALYIYRLPDLAR 814
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
V + +S Y+ L + + +GT KE + + V +L
Sbjct: 815 PIYVAEGLS--------YIPPGLS-----ADYAGRKGTA---KETLAEILVADLG----D 854
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
+ H P+L + + YQ + S+ + S +L V N +
Sbjct: 855 STHKSPYLILRHANDDLTLYQPF-------RSRKATEQAFSETLFFQKVP-----NTALA 902
Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDG 952
++P +A +E H + N+ G+ F+ G+ P + + + + RV P L
Sbjct: 903 KSPQEA-DEDEASHQPRFLSMRRCDNVGGYSTVFVPGASPSFIIASSKSMPRVMP-LQGS 960
Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFA 1011
++A + H C HGFIY S+ I ++CQ P G Y + V+KIP+ + Y
Sbjct: 961 GVIAMSPFHTEGCEHGFIYADSRRIARVCQFPDGCIYAETGVAVRKIPIGEDIAAVAY-- 1018
Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1071
+P + S V ++ L D + + NLS TV+ +++L P
Sbjct: 1019 ----HPPMQSYVVGCNTSEPFELPKDDDYHKEWARENLSF-----KPTVDRGILKLLSPI 1069
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
W + M+ E L V + L + T E + L+A+GTA +GED+ RGRV
Sbjct: 1070 T----WTVVDAVQMEPCETILCVETLNLEVSEFTNERKQLIAVGTALTKGEDLPTRGRVY 1125
Query: 1131 LFSTGRNADNPQNLVT----EVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KW 1180
++ P T ++ +KE +GA++AL+ + QG +L+A G K ++ K
Sbjct: 1126 VYDIADVIPEPGRPETGKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKE 1185
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
G+ L +AF D YV + + LL D K ++F + E+ ++ L K
Sbjct: 1186 DGSLLP-VAFMDM-NCYVTAAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKSSTK 1243
Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKF 1297
L+ +FL DG LS VVSD I I + P+ +S +G LL R F+ GA H TK
Sbjct: 1244 LEVLNADFLPDGKELSFVVSDADGYIHILQFDPEHPKSLQGHLLLHRTTFNTGAHHATKS 1303
Query: 1298 LRL-------------------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
L L S ++ P + + + LL + G + + P
Sbjct: 1304 LLLPASTPADKEKNDGNAANAQAKAKASDNKQPREPAAQRPH--VLLLASPTGVLAALRP 1361
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLS 1393
L E +RRL SL +L +S+PH AGLNPR +R + + G D SIVD +L
Sbjct: 1362 LSESAYRRLSSLAAQLTNSLPHPAGLNPRGYRA--AGAECPPAGVDAGLGRSIVDGTVLE 1419
Query: 1394 HYEMLPLEEQLEIAHQTG 1411
+ L + ++E+A + G
Sbjct: 1420 RFAELGMARRVELAGRAG 1437
>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
Length = 1382
Score = 282 bits (722), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 359/1466 (24%), Positives = 608/1466 (41%), Gaps = 223/1466 (15%)
Query: 57 NLVVTAANVIEIYVV-------RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
NLVV ++++IY + V E G + + N KR L+L Y L
Sbjct: 28 NLVVIKTSLLQIYNLVTETVTPSVLENGQRANDNE---KRN------ETTKLQLFAEYDL 78
Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
HG V + S+ NSR D+++L+F +AK+S++E++ I + S+H +E +
Sbjct: 79 HGTVTDI---SRINILNSRSGGDALLLSFRNAKLSLIEWNPEIQNISTVSIHYYEKEDIT 135
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-------DTF 220
+ + VDP RC VL +G++ + IL Q G LV DE D F
Sbjct: 136 LSPWAPDLSQCDSHLTVDPSSRCA-VLNFGVRNLAILPFHQAGDDLVMDEYDPDLDMDDF 194
Query: 221 GSGGGFSARIES----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
++ +S S V+ L LD + H F+H Y EP I
Sbjct: 195 TGQDKNTSHTDSKKGTEKDHTHQTPYAASFVLPLTALDPTLIHPIGLTFLHEYREPTFGI 254
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
L+ T A + + + S ++ + + S LP D ++A+P+P+GG L
Sbjct: 255 LYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALPAPVGGAL 314
Query: 323 VVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
++G+N IH + A+A+N +A + S + +S + L+ + + + LL
Sbjct: 315 LIGSNELIHVDQSGKASAVAVNEFAKQVSSFPMIDQSDLGLRLENSVVEVINKECGDILL 374
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT--------TIGNSLFFLGSRLGD 431
+ TG+LVL+ DGR V + P+ D+ ++G+ F+GS D
Sbjct: 375 TLSTGELVLVHFKIDGRSVSGPVVCPV-PTNSGGDVVGATASCSISLGSGKVFIGSEDTD 433
Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGEELSLYGS 489
SLL+ S S S E+ D + + + S A ++ VN +
Sbjct: 434 SLLLDCYVSSAVSKKSKDHGEDQFDEDMNDEDDDDMYEDDLYSSAPKEAVNK-------A 486
Query: 490 ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
SN SA + +SF V D L ++ L+ + G + D+ A +S QS +EL EL G
Sbjct: 487 VSNG--SASEDYSFRVLDKLPSLASLRSVTVGKPASRDSDAGNVS-QSVHEL-ELAAAYG 542
Query: 550 ---------IWTVYH----KSSRGHNADS------SRMAAYDDEYHAYLIISLEARTMVL 590
+ H + G ADS S + +D E+ + V
Sbjct: 543 SGRNGGVALLQRALHLDGISTMNGETADSVWNINTSTKSGRNDPSEG------ESPSYVF 596
Query: 591 ETADLLTEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARIL 634
T T+ E++ Y V G T+ G L G RV+QV R+
Sbjct: 597 LTKSNSTDNEETLVYAVNGSNLEPFSAPDVNPNGDPTVDIGTLAGNSRVVQVLTGEVRVY 656
Query: 635 DGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D + M Q P E G E V S S ADPY+L+ D S+ LL D S
Sbjct: 657 DTNLGMAQ---IYPVWDED-EGDERFAV-STSFADPYLLIIRDDSSVLLLHSDESGDLDE 711
Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
+ P I SS+ + C LY DK V E D A G+ Y
Sbjct: 712 LSKPETI-SSQSWLCGC-LYTDK----------------HNVFE--DNA------TGNTY 745
Query: 754 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 813
+ + L +F +P V + D + I SS +
Sbjct: 746 MFLLNQECKLFMFRLPTRELVSVTEGV-----------------DYVSSILSSDQPAKRL 788
Query: 814 GRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 873
+E I + V +L + P+L ++ Y+ PV
Sbjct: 789 NSRETIAELLVADLG----EISTASPYLIIRSATDDLIIYK---------------PVRE 829
Query: 874 SRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
+ + V+ ++ N + P++A + +R+ +I G+ +SG+
Sbjct: 830 NSKDEKTGVTLKYIKESNHFLPKVPIEAAATDTQQRMPGLRRLA---DIGGYAAVLMSGA 886
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + L + SI + + C G IYV ++ +++ C+L + D
Sbjct: 887 SPSLVVRTSKSLPRVFSIQSDSIRGISGFDSAGCEKGLIYVDNEHVVRTCRLHDNTQLDF 946
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WP++KIPL ++ Y A Y + V+ ++ L D + H ++
Sbjct: 947 SWPIRKIPLN---EEVDYLA----YSTVSGTYVVGTTHEQDFKLPDNDELHP----EWAN 995
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETL 1110
D+ V + +++L P W+ + ++E + + L + T E + +
Sbjct: 996 EDISLRPKVAQGSIKLLNPKT----WKVIDSYTFNAAERITAIENINLEISEKTSERKDM 1051
Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL-- 1163
+ +GT + +GED+AARG V +F +P NL ++ +E ++GA++A++ +
Sbjct: 1052 IVVGTTFAKGEDIAARGNVYVFDVINVVPDPDEPGTNLKLKLIGEESVRGALTAVSGIGG 1111
Query: 1164 QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYF 1219
QG L++A G K ++ K G+ L +AF D YV + +K L+GD K ++F
Sbjct: 1112 QGFLIVAQGQKCMVRGLKDDGSLL-PVAFIDVQ-CYVSVIKELKGTGMCLIGDALKGLWF 1169
Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
+ E+ ++ L KD L+ +FL DG L ++V+D N+ + Y P+ +S G
Sbjct: 1170 TGYSEEPYKMTLFGKDLDELEVVTADFLPDGKKLYILVADSDCNLHVLQYDPEDPKSSNG 1229
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF----ALLFGTLDGSIGC 1335
+LL+R +FH+G + L A SS+ S + + L T G +
Sbjct: 1230 DRLLNRCKFHMGHFASTITLLPRTAVSSELAVMNSDSMDIDSYIPLHQALITTQSGLMAL 1289
Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ L E ++RRL +LQ +L +++ H GLNPR++R S+G R ++D +LL +
Sbjct: 1290 VTSLSEESYRRLSALQSQLSNTLEHPCGLNPRAYRAVESDGVVGR----GMIDGKLLMRW 1345
Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNL 1421
L +LEIA + G +I ++L
Sbjct: 1346 LDLSRPRKLEIAGRVGADEWEIRADL 1371
>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 2 [Cavia porcellus]
Length = 1387
Score = 282 bits (721), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 201/678 (29%), Positives = 316/678 (46%), Gaps = 88/678 (12%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + E EE T
Sbjct: 779 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 834
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 835 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 884
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E + R F++I G+ G F+ G
Sbjct: 885 VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 944
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 945 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1004
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1051
WPV+KIPL+ T H + Y E +Y + S P + I + G + + +
Sbjct: 1005 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIER 1057
Query: 1052 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKENET 1109
D + E + ++++ P W+ A I ++ E+ ++ V+L + T
Sbjct: 1058 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEET----- 1108
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
V G LKG ++A L QG +
Sbjct: 1109 --------VSG--------------------------------LKGYVAAGTCLMQGEEV 1128
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
G +I L +EL G+AF D LY+ + VKNFIL D+ KSI L ++E+
Sbjct: 1129 TCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKT 1186
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+F
Sbjct: 1187 LSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADF 1246
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELT 1343
HVGAHV F R + GA G K N+ F TLDG IG + P+ E T
Sbjct: 1247 HVGAHVNTFWR-------TPCRGATEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKT 1299
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+RRL LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+
Sbjct: 1300 YRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMER 1359
Query: 1404 LEIAHQTGTTRSQILSNL 1421
E+A + GTT IL +L
Sbjct: 1360 GELAKKIGTTPDIILDDL 1377
Score = 271 bits (693), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 209/676 (30%), Positives = 332/676 (49%), Gaps = 88/676 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + + A + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + I ++SV+E+D H L+ S+H FE PE L+ G
Sbjct: 83 ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +L+YG ++++L ++ GLVG+ G +
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423
Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479
Query: 522 --------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH--- 555
L I + + + + K ++V ELPGC +WTV
Sbjct: 480 EPAFLSEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVR 539
Query: 556 ------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+ G + S A DD H +LI+S E TM+L+T + E+ S + Q
Sbjct: 540 KEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQ 598
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
G T+ AGN+ R ++QV G R+L+G L F P + + ++ ++AD
Sbjct: 599 GPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVAD 648
Query: 669 PYVLLGMSDGSIRLLV 684
PYV++ ++G + + +
Sbjct: 649 PYVVIMSAEGHVTMFL 664
>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1439
Score = 282 bits (721), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 338/1407 (24%), Positives = 600/1407 (42%), Gaps = 176/1407 (12%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ A L LV +RLHG V L + + N D ++++FEDAKI+VLE+ + H L
Sbjct: 118 TVARLRLVREHRLHGMVTGLGRIKILSSLNDGL-DRLLISFEDAKIAVLEWSEEQHDLLT 176
Query: 157 TSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
S+H +E +P+ + L S G L +VDP RC + + I+ Q
Sbjct: 177 VSIHTYERAPQLMSLN---ASLFHGWL-RVDPISRCAALALPCDAFAIIPFHQTLE---- 228
Query: 216 DEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
A S +++L D + +V D F+ G+ P + +L + TW G
Sbjct: 229 -----------EAPYAPSFILDLTSEVDQRIHNVVDMSFLPGFNNPTVAVLFQPTQTWTG 277
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
R++ T + ++ + +P+I S NLP+D + A + +GGV+V+ +N+I +
Sbjct: 278 RLTEYKDTMKLLVFTLDAVTRNYPVITSVDNLPYDCLSVHACSAAVGGVIVITSNSIIHV 337
Query: 333 SQSAS-CALALNNYAVSLDSSQELP----RSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
SQS+ AL++N +A + P ++ ++ L+ + ++ + L K G +
Sbjct: 338 SQSSRRVALSVNGWASRVTDMSLAPVQAEYATRNLALEGSRLAFVDDRTFFLFLKDGTVY 397
Query: 388 LLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
+ + DG VV + + S + + +T + F+GS G S+L++ T
Sbjct: 398 PVELSLDGAVVSTISMGHALAQSAIPAVVTPVTQEHIFVGSTAGTSVLLKIT-------- 449
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDM-------VNGEELSLYGSASNNTESAQK 499
++EE D +DA + + + S + D + + SL +N T + K
Sbjct: 450 --SVEEEVEDNASDAVAAAVVDTADSMVMDDDDDIYGVSMKTDAQSLSNGHANGTHLSVK 507
Query: 500 TFS---FAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP--- 545
S ++ DSL G + D S+ L N + +ATG + L + LP
Sbjct: 508 KRSVTHLSLSDSLPGYGSISDMSFSLAKNGEKVVPELVAATGSGSMGGFTLFQRDLPART 567
Query: 546 --------GCKGIW------------TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
G +G+W T Y ++ AD+ + D A +
Sbjct: 568 KRKLHAIGGGRGMWSLSLRPTVKVNGTSYERAVNPFQADNDTVVVSTDANPAPGLSRFSH 627
Query: 586 RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQ 641
RT E S+ V G+TI A F R ++ V R+L DGS + +
Sbjct: 628 RTPRTEI---------SITTRVPGQTIGAAPFFQRTAILHVMSNAIRVLEPDGSERQVIK 678
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI- 700
DL + SI DP+VL+ D +I L +G+ + + + +
Sbjct: 679 DLD---------GNMARPKIRHCSICDPFVLIVREDDTIGLFIGESERGKIRRKDMSPMG 729
Query: 701 ESSKKPVSSCTLYHDKGPEPWLRKTSTDAW---LSTGVGEAIDGADG-----------GP 746
+ + + ++ C + G + + ++ +T + + AD G
Sbjct: 730 DKTSRYLTGCFFTDNAGVFDLRSQANGNSGADKTATSTLQGVVNADSRSQWLLLVRPQGV 789
Query: 747 LDQGDIYSVV-CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
L+ D+ + C +I+ +P + VF+V + + D+ AL
Sbjct: 790 LEASDLSPIPGCRRLNEKQIWTLPKLSIVFSVRLASTLDWVLADSGDGPAL--------- 840
Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
S G R + + V + + +P L L G + YQA P S
Sbjct: 841 -SMPGESPRRPQE---LDVEQAVIAPLGETAPQPHLLLFLRSGQLAIYQAI----PMQAS 892
Query: 866 KSDDPVS-TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
D+ +S S + + V+ R + ++ +T +
Sbjct: 893 SVDESLSRPSLGVRFAKVATRVFEIQRQDDSEKSILAEQKKISRVLIPFLTSPSPTTTFS 952
Query: 925 GFFLSGSRPCWCMVF-RERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQ 982
G F +G PCW + R +R+HP S+V AFT F+ + +G +
Sbjct: 953 GVFFTGDHPCWILKPDRSGIRIHPS--GHSVVHAFTSCSLWESKGDFLLYSDEGPSLLEW 1010
Query: 983 LPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGH 1042
+P + + P + IP + ++T+ A L IV+ L+ + + D +
Sbjct: 1011 MPD-TDVETELPSRSIPQPRSYSKVTFDASTGL---IVAAAHLE--AEFATYDEDNNIVW 1064
Query: 1043 QIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNT 1102
+ D+ N+S T+E ++ PD W T ++E +V V L +
Sbjct: 1065 EPDSANVS-FPRSSCSTLE-----LISPDE----WITMDGFEFANNEFVTSVESVPLETS 1114
Query: 1103 TTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-------ELK 1154
+T+ ++ +A+GT +GED+A RG +F P+N + K + K
Sbjct: 1115 STESGSKDFIAVGTTIDRGEDLAVRGTTYVFEIVEVVP-PENSSLSRWWKLRLRCRDDAK 1173
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
G ++AL ++ G+L+ + G KI + + E L G+AF D +YV +L VKN +++GD
Sbjct: 1174 GPVTALCAMDGYLVSSMGQKIFVRAFDMDERLVGVAFLDVG-VYVTTLRAVKNLLVIGDA 1232
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
KS++F+ ++E +L +LAKDF ++ +F+ ++S++ +DE ++++ Y P+
Sbjct: 1233 AKSVWFVGFQEDPYKLVILAKDFQTVCVTTADFIFTEDSMSILTNDENGVMRLYQYDPQD 1292
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
+S GQ+L+ R EF H T Q + R G + + ++ G++DGS+
Sbjct: 1293 PDSRNGQQLMCRTEFD--THTT----CQTSIVFARRVGEGEEA-ALPQAKVVAGSIDGSL 1345
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
+ +DE F+RLQ LQ +L ++ HVAGLNP++FR N +P I+D LLS
Sbjct: 1346 AALTCMDEPAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVR-NDYVSKPLSKGILDGNLLS 1404
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSN 1420
Y LP+ Q EI Q T R+ +L +
Sbjct: 1405 SYLELPIPRQEEITKQIATERAAVLRD 1431
>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
Length = 1406
Score = 282 bits (721), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 333/1478 (22%), Positives = 628/1478 (42%), Gaps = 215/1478 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD + S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNPEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSYMT 640
++ ES+ + V G TI+ G R+RV+QV + R D T
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISIGTQESRKRVVQVLKNEVRSYDFGKFT 664
Query: 641 -----QDLSFGPSNSES-------GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
++ + G S + ++ +S S+AD + + D ++ L D S
Sbjct: 665 PSRCRRNFADGTDLSLTQIYPIWDDDTNDERMAVSASLADSCLAILRDDSTLLFLQADDS 724
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
V + S K SC LY DK TG+ +ID P+
Sbjct: 725 GDLDEVVFGEDVASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV- 765
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
+ D++ + L + C+ + G ++ + S+ I++
Sbjct: 766 KNDMFLFLLSHDCKLFV------KCLLWSSFALRGWHLMLSKSSGLSRPRSKAAIDN--- 816
Query: 809 EGTGQGRKENIHSMKVVELAM----QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
+G + + S+ ++E + + WSA P+L I+C+ EG
Sbjct: 817 ----RGDRRFVASVNLIEAIVADLGETWSAS---PYL--------IVCHH---IEG---- 854
Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
+ ++ S+ N R P + + + + + I +ISG
Sbjct: 855 --------------IHSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLS 900
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
F+ G+ + + +L + + + L C+ GFIY+ SQ ++ C+LP
Sbjct: 901 AVFMPGASAGFIIRTSASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLP 960
Query: 985 SGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQ 1043
+ +D W ++++ L + Y +Y VL + L D E+ +
Sbjct: 961 PMTRFDYQWTLKRVHLGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPEDDELHPE 1013
Query: 1044 IDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNT 1102
N + ++ + ++++ P+ W + + + E + ++ ++L +
Sbjct: 1014 WRNEDCLAISFFPS--ARGSFIKLVSPNT----WSIIDSFSLGADEYVMAIKNISLEVSE 1067
Query: 1103 TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAI 1157
T E + ++ +GTA+ +GED+ +RG + +F + +P + T+ + + +KGA+
Sbjct: 1068 NTHERKDMIVVGTAFARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAV 1127
Query: 1158 SALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLG 1211
+AL+ + QG +L+A G K ++ K G+ L +AF D YV + +K +LG
Sbjct: 1128 TALSEIGGQGFVLVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILG 1185
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D K ++F + E+ +++L AKD L+ A EFL DG L +VV+D NI + Y P
Sbjct: 1186 DAVKGVWFAGYSEEPYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDP 1245
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFG 1327
+ +S G +LLSR++FH+G + L SS++ ++ G D N+ L L
Sbjct: 1246 EDPKSSNGDRLLSRSKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMT 1305
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T +GS+G I + E ++RRL +LQ +L +++ H GLNPR+FR S+G A R ++
Sbjct: 1306 TQNGSLGLITCIPEESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GML 1361
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
D LL + + + + EIA + G +I ++L ++
Sbjct: 1362 DGNLLFKWIDMSKQRKTEIAGRVGAREWEIKADLEAIS 1399
>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
SS1]
Length = 1433
Score = 281 bits (718), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 358/1491 (24%), Positives = 628/1491 (42%), Gaps = 234/1491 (15%)
Query: 57 NLVVTAANVIEIYVVR-------VQEEGSKES-----KNSGETKRRVLMD---------- 94
N+VV ++++ I+ VR Q+E KE K + + V MD
Sbjct: 42 NVVVARSSLLRIFEVREEPAPVSTQKEVEKERRAAVRKGTEAVEGEVEMDTSGEGFVNMG 101
Query: 95 ---GISAAS-------LELVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKIS 143
G++ A+ LV +RLHG V L A+ + D+ + D ++++F+DAKI+
Sbjct: 102 TSAGLNGAAHPPTVNRFYLVREHRLHGTVTGLEAVRTVHSLDD--KLDRLLVSFKDAKIA 159
Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
+LE+ S+H + S+H +E +P+ + + R L + DP RC + + +
Sbjct: 160 LLEWSLSLHDVITVSIHTYERAPQLIAID---SPLFRSEL-RADPLSRCAALSLPKDSLA 215
Query: 203 ILK--ASQGGSGLVGDEDTFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIE 257
IL SQ ++ E + +S S +++L D+D +++V DF F+ G+
Sbjct: 216 ILPFYQSQAELDILEQEASQARDVPYSP----SFILDLANDVDKRIRNVIDFTFLPGFHN 271
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + +L + + TW GR+ T + ++ +P+I + LP+D + L +
Sbjct: 272 PTVAVLCQYQQTWTGRLKEYKDTVGLYIFTLDFVTNNYPVITAVDGLPYDCFALTPCSTA 331
Query: 318 IGGVLVVGANTIHYHSQSA-SCALALNNYAVSLD-------SSQELPRSSFSVELDAAHA 369
IGGV+++ +N + + QS L +N + + ++QE R ++L+ A
Sbjct: 332 IGGVVILASNAVLFVDQSGRRVILPVNGWPPRVSDLPMPPLTAQEQTR---DLQLEGARF 388
Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFF 424
++ + L K G + + ++ DGR V +L +S T P+V + IG+ F
Sbjct: 389 VFVDDKKLFLILKDGTVYPIELIQDGRTVSKLTMSDALARTTIPAV----VKRIGDDHIF 444
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE---ADAPST--------------KRL 467
+GS +G S+L++ ++ ++EE D + A+ P+T L
Sbjct: 445 IGSIVGPSVLLK----------TARVEEEIHDEDVAMAEGPATVVDTSKTVDMMDDDDDL 494
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S+ A Q NG A +N + + ++ D++ GP+ D ++GL N D
Sbjct: 495 YGPSTIADQPAANGTA----NGAVDNVRT-RTVVHLSLCDAIPAHGPISDMTFGLSRNGD 549
Query: 528 ------ASATGISKQSNYELVE--LP-----------GCKGIW------------TVYHK 556
+ATG ++ L + +P G +G+W T + +
Sbjct: 550 RLVPELVAATGSGHLGSFSLFQRDMPTRFKRKLHAIGGGRGMWSLPVRQQVKTGGTTFER 609
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEART--MVLETADLLTEVTESVDYFVQGRTIAA 614
S +AD+ + D + + + R+ + + VT F QG I
Sbjct: 610 PSNPFHADNDTVIISTDANPSPGLSRIATRSSHSDITITTRIPGVTLGAAPFFQGTAILH 669
Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
+F VI+V E DG+ S + + + S SI DP++L+
Sbjct: 670 -VMFNVTNVIRVLEP-----DGTERQ-------SIKDLDGNAARPRIKSCSICDPFILII 716
Query: 675 MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
D +I L +G+ + + + + + Y D L +T +A
Sbjct: 717 REDDTIGLFIGEIERGKIRRKDMSPMGEKTSKYLAGYFYTDTS---GLFQTFLNA---EA 770
Query: 735 VGEAIDGADGGPLDQGDI--YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
GEA G ++ G+ + + G +EI+ +P F+ + I D+
Sbjct: 771 PGEAATSTLQGAMNAGNKTHWLTLVRPQGVVEIWTLPKLTLAFSTTTLATLDPVISDSLE 830
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
AL Q + V +L + H RP L +L G +
Sbjct: 831 PPALS-------------LPQDPPRKPQELDVDQLVIAPLGESHPRPHLIVLLRSGQLAI 877
Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL-RFSRTPLDAYTREETPHGAPC 911
Y+A P DP+ +RSL++ L NL + D EE
Sbjct: 878 YEAVAASPPA------DPLPPTRSLTL-------LVNLVKVKSKAFDIQHTEEEQKSVLA 924
Query: 912 QRITIFKNI----------SGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
++ I + + + G F +G RP W + + +RV P + AFT
Sbjct: 925 EQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIVSTDKGGVRVFPS-GHNVVHAFTTC 983
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIV 1020
F+ + +G + +P D + P + +P ++ P+ F + LIV
Sbjct: 984 SLWESRGDFLLYSEEGPSLVEWMPD-IILDAHLPARSVP-RSRPYSHVVFDASS--SLIV 1039
Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
+ +N+ S D + + D+ N+S T T+E ++ PD W T
Sbjct: 1040 AASSF--MNRFASYDEDGNIVWEPDSPNISFPHCE-TSTLE-----LISPDG----WITM 1087
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
++E V V L +T+ + +A+GT +GED+A +G V +F
Sbjct: 1088 DGYEFAANEFVSCVVSVPLETVSTESGMKDFIAVGTTINRGEDLAVKGAVYIFEIVEVVP 1147
Query: 1140 NPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYD 1192
+ + + +L KG +S L + G+L+ + G KI + + E L G+AF D
Sbjct: 1148 DASLNIKRWWRLKLLCRDDAKGPVSFLCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLD 1207
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE---FLID 1249
+YV SL VKN +++GD KS++F++++E +L +L KD C T F D
Sbjct: 1208 V-GVYVTSLRAVKNLLVIGDAVKSVWFVAFQEDPYKLVILGKD--PHHCCVTRADLFFAD 1264
Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
G LS+V DE+ ++++ Y P ES GQ LL R EFH R +L +
Sbjct: 1265 GH-LSIVTCDEEGVVRLYAYDPHDPESKGGQHLLRRTEFHGQTE----YRSSLLVARRPK 1319
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
G + + L+ G++DGS+ + +DE F+RL LQ +L+ +V HVA LNP++F
Sbjct: 1320 A----GDPEIPQARLICGSVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKAF 1375
Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
R N RP ++D LL+ +E LP+ Q E+ Q GT R+ +L +
Sbjct: 1376 RMVR-NEYVSRPLSKGVLDGNLLATFEDLPIGRQNEVTRQIGTDRATVLKD 1425
>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Nomascus leucogenys]
Length = 1177
Score = 280 bits (717), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 202/651 (31%), Positives = 321/651 (49%), Gaps = 115/651 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
SVLT+ + T+ FLGSRLG+SLL+++ T++L
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKY--------------------------TEKL 400
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
+ + A+++ + EE S +K R++A
Sbjct: 401 QEPPASAVREAADKEE----------PPSKKK-----------------------RVDAT 427
Query: 528 ASATGISKQSNYELV----ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD- 573
+G ++S V ELPGC +WTV + G + S A DD
Sbjct: 428 VGWSGEGQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTPEADDDC 487
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
H +LI+S E TM+L+T + E+ S + QG T+ AGN+ R ++QV G R+
Sbjct: 488 RRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRL 546
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
L+G L F P + + ++ ++ADPYV++ ++G + + +
Sbjct: 547 LEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 587
Score = 165 bits (418), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 94/250 (37%), Positives = 137/250 (54%), Gaps = 26/250 (10%)
Query: 1191 YDAP------PL--------YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
YDAP PL Y V + NFIL D+ KSI L ++E+ L+L+++D
Sbjct: 925 YDAPWPVXKIPLRCTAHYVAYHVESKVCPNFILAADVMKSISLLRYQEESKTLSLVSRDA 984
Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+FHVGAHV
Sbjct: 985 KPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNT 1044
Query: 1297 FLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
F R + GA G K N+ F TLDG IG + P+ E T+RRL LQ
Sbjct: 1045 FWR-------TPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1097
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+ E+A + G
Sbjct: 1098 NALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIG 1157
Query: 1412 TTRSQILSNL 1421
TT IL +L
Sbjct: 1158 TTPDIILDDL 1167
Score = 126 bits (317), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 86/289 (29%), Positives = 136/289 (47%), Gaps = 20/289 (6%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 702 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 757
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + SRP+L + D +L Y+A+ P ++ +
Sbjct: 758 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 807
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E R F++I G+ G F+ G
Sbjct: 808 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 867
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD
Sbjct: 868 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 927
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYP-LIVSVPVLKPLNQVLSLLIDQE 1039
WPV KIPL+ T H + Y E + P I++ V+K +SLL QE
Sbjct: 928 PWPVXKIPLRCTAHYVAYHVESKVCPNFILAADVMKS----ISLLRYQE 972
>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
[Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
nidulans FGSC A4]
Length = 1339
Score = 276 bits (706), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 352/1456 (24%), Positives = 603/1456 (41%), Gaps = 234/1456 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++I+ +R S ++ +T+ R L L Y+L G V +
Sbjct: 28 NLIVARTSLLQIFSLR------DVSLSALDTEVRPAQHRQETCKLVLEREYQLPGTVTDI 81
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ ++ G D ++++AF DAK+S++E+D +GL S+H +E +
Sbjct: 82 CRVKILKTKSGGD------AVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPW 135
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE- 231
+ G ++ DP RC + I+ Q G LV D+ FGS + R+E
Sbjct: 136 ASDLSTCGSILSADPGSRCAIFQFGARSLAIIPFHQPGDDLVMDD--FGSEPDYENRVEG 193
Query: 232 --------------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
SS V+ L LD + H F++ Y EP IL+ + T
Sbjct: 194 NSRSHEAKDKDAAEYQTPYASSFVLPLTALDPSVIHPISLAFLYEYREPTFGILYSQVAT 253
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
+ + + +++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 254 SHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVALPPPVGGSLLIGSNEL 313
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
+H + A+ +N ++ S +S ++ L+ +D LL+ TG
Sbjct: 314 VHIDQAGKTNAVGVNEFSRQASSFSMTDQSDLALRLENCVVERFSDDNGDLLLALSTGVF 373
Query: 387 VLLTVVYDGRVVQRLD---LSKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCG 440
L++ DGR V + LS + L S ++ +GN F GS DS+L+
Sbjct: 374 ALVSFKLDGRSVSGISVRPLSGPSKEFLASTASSSAFLGNGKVFFGSESADSVLL----- 428
Query: 441 SGTSMLSSGLKEEF-GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
G S SS K+ F G D S DA +D + + N S
Sbjct: 429 -GWSSASSATKKSFSGSTSND--------ESEDDAYEDDLYSSAPAAMTDNPQNQPSNSS 479
Query: 500 TFSFA---VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVY 554
+F + D L + GP++D G A + T K ELV G G +
Sbjct: 480 VAAFGDLRIHDRLSSPGPIRDIVLGRSSEASSRDT---KDGVLELVAAQGSDEGGTMVIM 536
Query: 555 HK--------SSRGHNADS----SRMAAYDDEYHAYLIISL-------EARTMVLETADL 595
+ S A+S S + +D+ Y+I+S E+ VLE D
Sbjct: 537 KREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVILSKQEKPDKEESEVFVLE--DK 594
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
L +T T+ G L + RVIQV R D + D
Sbjct: 595 LRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVRSYDAVWDEDD------------- 641
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S+ ++ ++ DPY+ + D ++ LL D S + TL D
Sbjct: 642 SDERVAVNATLVDPYLAIIRDDSTLLLLQADDS----------------GDLDEVTLSED 685
Query: 716 KGPEPWLRKT--STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
+ WL S +A T +I + + L ++ +P+F
Sbjct: 686 VVSQKWLSACFYSDNAGFFTAPFASI--------------LFLLNQDHQLYVYRLPDF-A 730
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
V +V + V I+ T E K S T +EN+ + VVEL
Sbjct: 731 VISVIEGVGCLPPILST---EPPKRSTT--------------RENVLQIAVVELG----D 769
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
++ S PFL + ++ Y+ + E T R L +N + + N
Sbjct: 770 SYSSLPFLILRTENDDLVVYKPFFTNSKELTGL--------RFLKEANHTLPKTPNTT-- 819
Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLC 950
D E P + I NI+G F+ G P +FR P +L
Sbjct: 820 ----DELQSEMKP-------LRILPNIAGCSSIFMPG--PSAGFIFRAS-TTSPHFIRLR 865
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
G I + + GF Y+ S G L + +LP G+ W ++ +P+ ++TY
Sbjct: 866 GGFIKGLGCFDS--PDKGFAYLDSHG-LHLAKLPEGTQLGYPWIMRTVPIGQQIDKLTYV 922
Query: 1011 AEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
+ + Y VL + L D E+ + N +S + V + ++++
Sbjct: 923 SASDTY-------VLGTCQRCEFRLPEDDELHPEWRNEEISFLP-----EVNQSSLKVVS 970
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
P W + P++ +E+ + ++ ++L + T E ++ +GT+ +GED+ +RG
Sbjct: 971 PKT----WSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGC 1026
Query: 1129 VLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--K 1179
+ +F +P+ N ++ KE +KGA++AL+ + QG L+ A G K ++ K
Sbjct: 1027 IYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLK 1086
Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
G+ L +AF D +V + +K + GD K ++F + E+ +++L AKD
Sbjct: 1087 EDGSLLP-VAFMDMQ-CFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDLD 1144
Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
L+ A +FL DG+ L +VV+D N+ + Y P+ S G KLL+R++FH G +
Sbjct: 1145 YLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTV 1204
Query: 1298 LRLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
L SS+R A GSDK + +L + +GSIG + + E ++RRL +LQ
Sbjct: 1205 TLLPRTLVSSER--AMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQ 1262
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
+L +++ H GLNPR++R S+ A R ++D LL Y + + + EIA + G
Sbjct: 1263 SQLTNTLEHPCGLNPRAYRAVESDASAGR----GMLDSNLLLQYLDMSKQRKAEIAGRVG 1318
Query: 1412 TTRSQILSNLNDLALG 1427
T +I ++L ++ G
Sbjct: 1319 ATEWEIRADLEAISGG 1334
>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
[Aspergillus kawachii IFO 4308]
Length = 1372
Score = 276 bits (705), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 335/1464 (22%), Positives = 618/1464 (42%), Gaps = 221/1464 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVTSNAEGADAQQELTKLLLEK----------EYSLSGTVTGL 76
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NSR +++++AF +AK+S++E+D + S+H +E + +
Sbjct: 77 CRVK---VLNSRSGGEAVLVAFRNAKLSLIEWDPERRSISTISIHYYERDDLTRSPWVPD 133
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS------------ 222
G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 134 LKNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGMSTDHDL 190
Query: 223 GGG---------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
GGG + S V+ L LD + H F++ Y EP IL+ + T +
Sbjct: 191 GGGPDKAKEGIAYQTPYAPSFVLPLTALDPSILHPISLAFLYEYREPTFGILYSQVATSS 250
Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
+ + + ++ + ++ S LP D ++++A+P P+GG L++G+N +H
Sbjct: 251 ALLPERKDVVFYTVFTLDLEQQASTILLSVSRLPSDLFRVVALPPPVGGALLIGSNELVH 310
Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVL 388
+ A+ +N ++ + S +S ++ L+ L + LL TG++ +
Sbjct: 311 IDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLSTGEMAI 370
Query: 389 LTVVYDGRVVQRLD---------LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
+ DGR V + L+ N + S T IG+ FLGS GDS+L+ ++C
Sbjct: 371 MKFKLDGRSVSGISVHLLPAHAGLTSMNSAAAAS--TFIGDGKIFLGSEDGDSVLLGYSC 428
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNTESA 497
S +S ++ D AD + S D +D + + +L G + SA
Sbjct: 429 SSSSSKKHRLQAKQAIDDSADMSEEDQ---SEDDVYEDDLYSTSPDTTLTGRRPSGESSA 485
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WT 552
+ F + D L+NIGPL+D + G ++ + G S +++ +G
Sbjct: 486 FGLYDFRMHDKLINIGPLRDITIGRKLPTNQEKGGDRTNSTSPELQIVASQGSHKSGGLV 545
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLLTEV 599
V + H S + + D + A L EA R V+ T ++
Sbjct: 546 VMAREIDPHVVASISLESVDSIWTASLTWEEEAVSRTSENIGQRSQRCYVIATEAKASDR 605
Query: 600 TESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-MTQD 642
ES+ + V G TI G R+RV+QV + R D +TQ
Sbjct: 606 EESLIFVVDGHDLKPFRAPDFNPNEDVTINIGTQESRKRVVQVLKNEVRSYDIDLGLTQI 665
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
++ ++ +S S+AD + + D ++ L D S V + S
Sbjct: 666 YPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVLGEDVAS 720
Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 721 GK--WISCCLYSDK----------------TGLFSSIDRTLSEPV-KNDMFLFLLSHDSK 761
Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
L ++ V + + ++ + + G + ++ SSE G +EN+
Sbjct: 762 LFVYRVRD-QKLLSIIEGLDGLSPLL-----------------SSEPPKRSGTRENLVEA 803
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
V +L + WSA P+L + ++ Y+ ++ + T + + +
Sbjct: 804 IVADLG-ETWSAS---PYLILRSENDDLIIYKPFV-------------IPTGPTGEIHTL 846
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
S+ N D + + + + + I +ISG F+ G+
Sbjct: 847 KFSKENNSVLPMISPDVDSTQPSGSDYRVRPLRILPDISGLSAVFMPGAS---------- 896
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG----ILKICQLPSGSTYDNYWPVQKI 998
F + + + H F+ + + ++ CQLP + +D W ++K+
Sbjct: 897 ------------AGFVLRTSASAPH-FLRLRGESPRCSTVRFCQLPPMTRFDYQWTLKKV 943
Query: 999 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRT 1057
L + Y +Y VL + L D E+ + N +S R
Sbjct: 944 HLGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPDDDELHPEWRNEAISFFPSARG 996
Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
+ +++ P+ W + + + E + ++ ++L + T E + L+ +GTA
Sbjct: 997 SFI-----KLVSPNT----WSIIDSYSLGTDEYVMAIKNISLEISENTHERKDLIVVGTA 1047
Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1169
+ +GED+ +RG + +F + +P + T+ + + +KGA++AL+ + QG +L+
Sbjct: 1048 FARGEDIPSRGCIYVFEVVQVVPDPDDPETDRKLKLIGKESVKGAVTALSEIGGQGFVLV 1107
Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
A G K ++ K G+ L +AF D YV + +K +LGD K I+F + E+
Sbjct: 1108 AQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGIWFAGYSEE 1165
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
+++L AKD L+ A EFL DG L +VV+D NI + Y P+ +S G KLLSR
Sbjct: 1166 PYKMSLFAKDLDYLEVSAAEFLPDGRRLFIVVADSDCNIHVLQYDPEDPKSSNGDKLLSR 1225
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRT-GAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDE 1341
++FH G + L SS++ + D N+ AL L T +GS+G I + E
Sbjct: 1226 SKFHTGNFASTLTLLPRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMPE 1285
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
++RRL +LQ +L +++ H GLNPR+FR S+G A R ++D LL + + +
Sbjct: 1286 ESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQ 1341
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLA 1425
+ EIA + G +I ++L ++
Sbjct: 1342 RKTEIAGRVGAREWEIKADLEAIS 1365
>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
Length = 2830
Score = 273 bits (699), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 252/983 (25%), Positives = 419/983 (42%), Gaps = 181/983 (18%)
Query: 575 YHAYLIISL-EARTMVLETADLLTEVTES--VDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
+HAYL+I++ RTMVL D L +VT S ++ V T+AAGNLF ++Q G
Sbjct: 1889 FHAYLLITMGRVRTMVLRCTDGLDDVTNSPECEFLVNQPTLAAGNLFHNAVIVQACPMGL 1948
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-------------IADPYVLLGMSDG 678
R+L+G + Q+L + ++ S ADPYVL+G+SDG
Sbjct: 1949 RVLEGMTLVQELRVSDFQASRPKTAQYSFCCRTKHPIAHRAMGPIPQAADPYVLVGLSDG 2008
Query: 679 SIRLLVGDPSTCTVSVQTPAA-------IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
+ LL GDP + T+ V T AA S ++ +++ L+ D+ +W+
Sbjct: 2009 TAVLLEGDPLSLTLGVATAAAEQLMAVPARSRQQRLAAACLHRDE-----------TSWM 2057
Query: 732 STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS-------GR 784
++ + I+ +C SG LE + +P+ VF + G
Sbjct: 2058 ASATAAEAASS----GSSFSIFLWICRLSGRLECYSLPSMRLVFHSSGLAAAEEVLRMGP 2113
Query: 785 THIVDTY--MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS----- 837
+ D Y +E E++ G G G E+ VVEL ++ + S
Sbjct: 2114 AVMYDVYDLFGGGGGGAEAELDG----GGGSGIMED----PVVELRVESFLGGGSPAVPD 2165
Query: 838 --RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA----------- 884
RP L + G ++ YQ L P ++ + P + + S
Sbjct: 2166 CERPVLLVMAASGNLVAYQIALRRLPLDSLSHEAPAAMGAAAGSSGGGGGIGGGAALGPR 2225
Query: 885 -SRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
+R +L ++ +++R + ++ + + + G F++GSRP W + R L
Sbjct: 2226 MARFDHLAYTDPSSKSHSRTDI------RKYPVASQGTSYSGVFVAGSRPLWLVASRGGL 2279
Query: 944 RVHPQLCDGSIVAFTVLHNVNCNHGFI-YVTSQGILKICQLPSGSTYDNYWPVQKIPLKA 1002
HP +G++ A T HN NC GFI +S+G+LK+CQLP + D W +++PL+
Sbjct: 2280 VPHPMFAEGAVAAMTPFHNANCPLGFISACSSRGLLKVCQLPPHTRLDTPWVTRRVPLRV 2339
Query: 1003 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE 1062
TPH++ +F + L I S V+ D ++ R E
Sbjct: 2340 TPHKLAWFRDAGLMAAITSRVVVSRPRPPEEPGGDAHAAAAYAAAAAAAAGRGRE---EA 2396
Query: 1063 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA------ 1116
+E+R+LEP+ G W + P E AL ++V+ L N TT + + LLA+GT
Sbjct: 2397 WELRLLEPNGCGRLWLSPLLPP---GEQALCLKVIYLQNATTGDTDALLAVGTGSPMGQL 2453
Query: 1117 --------------------------------------YVQGEDVAARGRVLLFST---- 1134
GED GR+LL++
Sbjct: 2454 GGGNWRFRLPRGRVAGSGGLVVHRQCEREGAGRGCRGERPPGEDYPCLGRILLYTISAEV 2513
Query: 1135 ----GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW---------- 1180
G N + V V ++++ A++++ + LL+ G +I +++W
Sbjct: 2514 VDLGGGNLTRRWSAVL-VATRDMASAVTSVQEFKSQLLVTCGSRIEMYEWRGPAAGASGG 2572
Query: 1181 ----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
G L AF+D P L V SL VK+++L D + +YF+ + + L ++KDF
Sbjct: 2573 GGGGPGGRLEKRAFFDLPSL-VTSLVAVKDYLLAADASQGLYFVRYSDSARVLEFMSKDF 2631
Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQI--FYYAPKMS-ESWKGQKLLSRAEFHVGAH 1293
D +I+ L+ + +D N+ + FY + + E W GQ+L HV
Sbjct: 2632 DHRDVLTAGVVINEPKLAFLAADAAGNLALSEFYGSRNTNPEFWAGQRLAPLGLMHVARR 2691
Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQK 1352
++ + ++M P SD NR ALL G +G + IAP+ D +RL +LQ
Sbjct: 2692 LSCCVSIKM-----------PTSDGKNRHALLCGAAEGGLSYIAPVPDAEMTQRLLALQN 2740
Query: 1353 KLVDSVPHVAGLNPRSFRQFH-------SNGKAHR----PGPDSIVDCELLSHYEMLPLE 1401
+ +PHVAGLNPR+FR G++H P + ++D +LL + +L +
Sbjct: 2741 HMSRRLPHVAGLNPRAFRHRFCRIPKSLGGGQSHHAPPAPASNGLLDGQLLLGFPLLSRQ 2800
Query: 1402 EQLEIAHQTGTTRSQILSNLNDL 1424
Q + A G T QI+S+L +
Sbjct: 2801 HQGQAAEALGVTVRQIMSDLRAI 2823
Score = 104 bits (260), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 120/215 (55%), Gaps = 25/215 (11%)
Query: 228 ARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
A + + +++NL + + ++ V+D +F+HGY EPV+++LHE + TW G + + TC ++A+
Sbjct: 1305 ATLGNGYLLNLNKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWVGMLRERKDTCCLAAI 1364
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA 346
SIS LK+H ++W +LP+D +KLLAVP VLV+ N + SQ++ A ALN+ A
Sbjct: 1365 SISLRLKRHTILWKLASLPYDCFKLLAVPY-RPAVLVISPNLLLLCSQASQHAAALNSNA 1423
Query: 347 VS--------LDSSQELP---------RSSFSVELDAA-----HATWLQN-DVALLSTKT 383
+ LD S+E P + + +V D A +AT + + +V ++
Sbjct: 1424 LPGEVPPPLILDPSREPPAATAARLAAQYALNVHPDCAPAAGRNATLMADLEVVAAGLQS 1483
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
G L+ + + ++G QR+ + +T + S + I
Sbjct: 1484 GTLLAVHLQFEGPADQRITVVRTGGGPIASAMVGI 1518
Score = 95.9 bits (237), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 8/188 (4%)
Query: 56 PNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD-GISAASLELVCHYRLHGNVE 114
PNL+V N +E++ +R + + + G A LELV Y LHG VE
Sbjct: 1078 PNLIVVRTNRLEVHSLRSSAVATNAAAATATAAATASAAVGSGGARLELVVSYHLHGVVE 1137
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
SLA+LS G +S RRD+++LAF + K+SV+E++ H LR +S+H FE + + GR
Sbjct: 1138 SLAVLSGG---SSSRRDALLLAFREGKLSVVEWNPRTHSLRTSSLHYFEGDPGVQ-REGR 1193
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG---LVGDEDTFGSGGGFSARIE 231
+ P V DP GRC + Q+ +L A + +G V D G G G RI
Sbjct: 1194 IAVPLPPRVVTDPAGRCAAMSFCFSQLALLPALEVKAGAWQCVDDGGVMGVGRGERERIG 1253
Query: 232 SSHVINLR 239
H+ R
Sbjct: 1254 GVHINERR 1261
>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
Length = 1397
Score = 273 bits (698), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 350/1483 (23%), Positives = 618/1483 (41%), Gaps = 221/1483 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + G+ ++ D + A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSAPA---GQPDQKGRHDRLQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E +
Sbjct: 85 ERVR---ISNSKSGGDAILVSSRNAKLSLIEWDPQKHGITTISIHYYEGEESHMSPWVPD 141
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-DTFGSGGGFSARIES- 232
+ + VDP G C + +G+ + IL Q G LV D+ D +G + +
Sbjct: 142 LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDAIPNGDDTTDAVNDA 200
Query: 233 ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKPAPGNAVHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSMSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ K + + LP D +K++ +P PIGG L++G N +H
Sbjct: 261 IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDIFKVVPLPPPIGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + +S + L+ L + LL G + +LT
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCMVEQLGSGAGDVLLILSDGRMAILTF 380
Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS--- 441
DGR V + L ++ S++ S + ++G + F GS GDS+L+ ++ S
Sbjct: 381 KVDGRSVAGISLHFVAEQSGGSIIKSRPSCSASLGRNKLFYGSEEGDSILLGWSKHSSAT 440
Query: 442 -------------GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
GT+ LS +++ D + +++ + +VNG+ G
Sbjct: 441 KKPSKAAGGGNEDGTANLSDEEEQDDDDDDMYEDDLYSANPTTTQQEKQVVNGD-----G 495
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
+A+ F+ D L ++GP +D + G + + S +EL +
Sbjct: 496 AAN---------FTLRAHDRLWSLGPYRDITLGRPPKSKSKDRQDSVPEISAPLELVAAR 546
Query: 549 GI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA------------RTMVLE 591
G TV + DS +M DD Y + I ++ R ++L
Sbjct: 547 GFGKAGGLTVLKREIDPFTIDSLKM---DDVYGVWSIRVIDPKSKDAGLSRSYDRYLLLA 603
Query: 592 TADLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILD 635
A + ESV Y V + TI G L RV+QV R D
Sbjct: 604 KAKG-DDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGSRVVQVLRTEIRSYD 662
Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVS 693
+ + P E S E TV+ S A+PY+L D S+ +L D + V
Sbjct: 663 CNLGLAQIY--PVWDEDTS--EERTVIQASFAEPYLLTIRDDNSLLILQADKNGDLDEVE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
+Q AA S K VS C LY DK + S+D D +I
Sbjct: 719 IQGSAA---SAKWVSGC-LYEDK-----TKIFSSDL-------------DTEHAATPNIL 756
Query: 754 SVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ G L IF +PN + VD L S SSS
Sbjct: 757 LFLLDSDGNLSIFRLPNITEPLCRVDNL--------------NLLPSNLPYESSSRRPV- 801
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+E + + V +L A H P++ ++ Y+ Y G SK
Sbjct: 802 --NRETLTELLVADLG----DAIHKSPYMILRTKHDDLVLYEPYRITGENGRSKLQ--FI 853
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
+ + V ++ N +R+P +P + + ++ G++ F+SG
Sbjct: 854 KAVNHVVMGPRTNQPMNKDINRSP------------SPSKLLRALSDVCGYKTVFMSGQN 901
Query: 933 PCWCM---VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
PC+ + + R + +L ++ + T H C GF YV ++++ +LPS + +
Sbjct: 902 PCFILKSAIARPNVL---RLRGKAVQSLTGFHIAACERGFAYVDEDNVIRMSRLPSNTRF 958
Query: 990 DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
D+ W +KIPL I Y + Y + S + L D E + N +
Sbjct: 959 DSAWATRKIPLGEQVDCIVYSSASESYVIGTST------KEDFKLPEDDESHTEWRNEFI 1012
Query: 1050 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENE 1108
+ + ++ V++LEP W ++ +E ++++ L + TT E +
Sbjct: 1013 TFLP-----QLDRGTVKLLEPKN----WSAIDIYEVEPAERITCIKIIRLEISETTHERK 1063
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQ-NLVTEVYSKE-LKGAISALASL 1163
++ +G+A +GED+ +G + +F + D+P+ N +++++E +KGA++A++ +
Sbjct: 1064 DMVVVGSAVAKGEDIVPKGCIRVFEIIDVVPDPDHPEKNKKLKLFAREEVKGAVTAVSGI 1123
Query: 1164 --QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSI 1217
QG L++A G K ++ K G+ L IAF D YV L +K ++GD K +
Sbjct: 1124 GGQGFLIVAQGQKCMVRGLKEDGSLLP-IAFKDTQ-CYVNVLKELKGTGMCIIGDAFKGL 1181
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
+F + E+ +L+L K+ +L +FL DG+ L ++V+D+ N+ + Y P+ S
Sbjct: 1182 WFTGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSS 1241
Query: 1278 KGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT---------NRFALLFGT 1328
KG +LL R+ FH G F L T ++P + +++ +L
Sbjct: 1242 KGDRLLHRSVFHTG----HFASTMTLLPHGSHTLSSPVDEDAMDTDLPPPPSKYQVLITF 1297
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
GSIG I+PL+E ++RRL +LQ +LV+++ H GLNPR +R S+G + G ++D
Sbjct: 1298 QTGSIGVISPLNEDSYRRLLALQSQLVNALEHPCGLNPRGYRAVESDGMGGQRG---MID 1354
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
LL + + + + EIA + G I +L L G ++L
Sbjct: 1355 GNLLLRWLDMGAQRKAEIAGRVGADVGAIRIDLEKLHGGLAYL 1397
>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
Length = 1383
Score = 271 bits (693), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 331/1464 (22%), Positives = 618/1464 (42%), Gaps = 210/1464 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+L+V ++++IY + + E ++ + ++L++ Y L G V L
Sbjct: 28 DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ S+ G + ++++AF +AK+S++E+D G+ S+H +E +
Sbjct: 77 CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
+ G ++ VDP RC + +G++ + I+ Q G LV D+ +GS
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187
Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
GGG + S V+ L LD + H F++ Y EP IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + + ++ + ++ S LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + S +S ++ L+ L + LL T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
G++ ++ DGR V + + + I T IG+ FLGS GDS+L+
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
++ S ++ ++ D AD + S D +D + + +L G +
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEEDQ---SDDDVYEDDLYSTSPDTTLTGRRPSGE 484
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
SA + F + D L+NIGPL+D + G R++ + TG S +++ +G
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
V + H S + + D + A L EA R V+ T
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604
Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
++ ES+ + V G TI+ G R+RV+QV + R D +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
TQ ++ ++ +S S+AD + + D ++ L D S V
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719
Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
+ S K SC LY DK TG+ +ID P+ + D++ +
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760
Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
L ++ V + + ++ + G + ++ SSE G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
V +L + WSA P+L ++ Y+ ++ VST +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S+ N R P + + + + + I +ISG F+ G+ + +
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFII-- 903
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
S F L N + ++ C+LP + +D W ++++
Sbjct: 904 ---------RTSASAPHFLRLRGEN--------SRSSTVRFCKLPPMTRFDYQWTLKRVH 946
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHR-T 1057
L + Y +Y VL + L D E+ + N +S R +
Sbjct: 947 LGEQVDHLAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGS 999
Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
+ ++ + D + + + + E + ++ ++L + T E + ++ +GTA
Sbjct: 1000 FIKLVWDHHLQRQDSVILIFHLH-SFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTA 1058
Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1169
+ +GED+ +RG + +F + +P + T+ + + +KGA++AL+ + QG +L+
Sbjct: 1059 FARGEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLV 1118
Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
A G K ++ K G+ L +AF D YV + +K +LGD K ++F + E+
Sbjct: 1119 AQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVSVVKELKGTGMCILGDAVKGVWFAGYSEE 1176
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
+++L AKD L+ A EFL DG L +VV+D NI + Y P+ +S G +LLSR
Sbjct: 1177 PYKMSLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSR 1236
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDR-TGAAPGSDKTNRFAL---LFGTLDGSIGCIAPLDE 1341
++FH+G + L SS++ ++ G D N+ L L T +GS+G I + E
Sbjct: 1237 SKFHMGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPE 1296
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
++RRL +LQ +L +++ H GLNPR+FR S+G A R ++D LL + + +
Sbjct: 1297 ESYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGTAGR----GMLDGNLLFKWIDMSKQ 1352
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLA 1425
+ EIA + G +I ++L ++
Sbjct: 1353 RKTEIAGRVGAREWEIKADLEAIS 1376
>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1417
Score = 271 bits (692), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 324/1388 (23%), Positives = 594/1388 (42%), Gaps = 160/1388 (11%)
Query: 103 LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
V +RLHG V ES+ I+S D ++++F+DAKI++LE+ D+++ L S+
Sbjct: 120 FVREHRLHGTVTGMESIRIVSS----QEDGLDRLLVSFKDAKIALLEWSDAVNDLLTVSI 175
Query: 160 HCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGVLVYGLQMIILKASQGGSGL- 213
H +E +P+ + L+ PL ++ DP RC +++ + IL Q + L
Sbjct: 176 HTYERAPQMMALE--------APLFHSQLRTDPLSRCAALMLPKDSLAILPFYQSQADLD 227
Query: 214 VGDEDTFGSGGGFSARIESSHVINLR-DLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
+ ++DT S S V+++ D+D +KHV D +F+ G+ P + +L + TW
Sbjct: 228 IMEQDTQTSCRDIP--YSPSFVLDMTTDVDERIKHVIDLVFLPGFNSPTIAVLFQNTQTW 285
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
R+ T + ++ + P++ + NLP+D L+ + +GGV++V AN +
Sbjct: 286 TSRLREYKDTVGLIIFTLDLVTRNCPVLTAVDNLPYDCLYLVPCSAQLGGVVIVSANALI 345
Query: 331 YHSQ-SASCALALNNYAVSLDSSQELPR-----SSFSVELDAAHATWLQNDVALLSTKTG 384
Y +Q S L +N + + S LP+ S +++L+ ++A ++ ++ + G
Sbjct: 346 YVAQTSRRVILPVNGWQARV-SDHPLPQLTEEEKSRNLKLEGSYAVFVDDNKLFVLLSDG 404
Query: 385 DLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ + V DGR V RL + S + + + + + + F+GS G S+L++ T
Sbjct: 405 TVYPMEVHADGRTVSRLTMGSALAQTTIPAIVRRVTDENLFIGSTAGPSVLLK------T 458
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
S + +KEE +++ AP+ + D D +GE A T
Sbjct: 459 SHVEEDVKEEDVEMDT-APAAVVDEANEMDLDDD--DGELCHWVHFAKKRT-----VVHL 510
Query: 504 AVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP---------- 545
++ DS+ GP+ D ++ L D +ATG + L + LP
Sbjct: 511 SLCDSIPAYGPVSDMTFSLTRVGDRPVAELVAATGSGGLGGFTLFQRDLPSRVKRKLHAV 570
Query: 546 -GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTE 601
G +G+W++ ++ R + + R + + +IIS +A + A ++
Sbjct: 571 GGGRGMWSLAVRQAVRVNGSTYERPSNPHHGGNDAVIISTDANPSPGLSRIASRSSKSDI 630
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQDLSFGPSNSESGSGSE 657
+ + G T+ A + F ++ V R+L DG+ + +DL
Sbjct: 631 QITTRIPGTTVGAASFFQGTAILHVMSNAIRVLEPDGTERQIIKDLD---------GSVP 681
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDK 716
+ S+ DP++++ D S+ L +G+P + + + + E + K ++ C + D
Sbjct: 682 RPKIRYCSMCDPFIMVIREDDSLGLFIGEPERGKIRRKDMSPMGEKTSKYIAGC-FFMDT 740
Query: 717 GPEPWLRKTSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
R + A V + + G Q + ++ G LE++ +P VF
Sbjct: 741 TGIFQSRVNAAAAAADKNVTSTLQTVMNAGTRTQ---WLLLVRPQGVLEVWSLPKLALVF 797
Query: 776 TVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
+ + + +VD+ AL Q + V ++A+
Sbjct: 798 STSHVSALESVLVDSGDSPAL-------------SLPQDPPRKPQDLDVEQIAIAPLGES 844
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
S+ +L L G Y+A P S P + + +L V V +
Sbjct: 845 SSKLYLLVFLRCGLFAVYEAL----PAPASTDPPPPTRTSTLCVKFV--------KVVTR 892
Query: 896 PLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLSGSRPCWCM-VFRERLR 944
D EE ++ I + + G FL+G RPCW + + ++
Sbjct: 893 AFDIQQSEEVEKSVLAEQKRISRQLIPFVTSPTPGRAFSGVFLTGDRPCWILSTDKGGVK 952
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATP 1004
+ P + AFT F+ + +G I +P ++ + P + IP P
Sbjct: 953 IMPS-GHQVVHAFTACSLWESKGDFLLYSDEGPSLIEWVPE-IQFEGHLPSRSIP---RP 1007
Query: 1005 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1064
++ + L+V+ L+ + S D+ V + D N+S E
Sbjct: 1008 RPYSHVVFEPTTTLLVAASSLQ--STFTSYDEDRNVVWEPDEPNMS------LPVCETSA 1059
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDV 1123
+ ++ PD W T +E + +TL +T+ + +A+ T +GED+
Sbjct: 1060 LELISPDT----WTTMDGYEFAQNEFVTCMECITLETLSTETGTKDFVAVSTTINRGEDL 1115
Query: 1124 AARGRVLLFSTGRNADNPQNLVTEVYS------KELKGAISALASLQGHLLIASGPKIIL 1177
A +G V +F +P Y E KG ++AL + +L+ + G KI +
Sbjct: 1116 AVKGAVYIFEVVEVVPDPAMGQKRWYRLKLHCRDEAKGPVTALCGMDNYLVSSMGQKIFV 1175
Query: 1178 HKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF 1236
E L G+AF D +YV SL VKN +++GD K ++ ++++E +L +LAKD+
Sbjct: 1176 RALDLDERLVGVAFLDVS-VYVTSLRAVKNLLVIGDALKGVWLVAFQEDPYKLVVLAKDY 1234
Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
+ + SL+ DE+ +++ Y P ES GQ+LL R EFH T+
Sbjct: 1235 YPIPVACADLFFADGKASLISCDEEGVLRLSEYDPHDPESRHGQRLLCRTEFH---GQTE 1291
Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
+ ++A R G ++ + L+ G DGS+ + +D+ +RL LQ +L
Sbjct: 1292 YRTSHLIA----RRGKGLDAE-IPQAKLICGHTDGSLTSLTYVDDAVSKRLHLLQGQLAR 1346
Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
+V HVAGLNP++FR N + RP I+D LL+ +E LP+ Q+E+ Q T R+
Sbjct: 1347 NVQHVAGLNPKAFRVVR-NDRVARPLTKGILDGNLLAAFEDLPVPRQVEVTRQIATERTT 1405
Query: 1417 ILSNLNDL 1424
+L + DL
Sbjct: 1406 VLKDWLDL 1413
>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
Length = 1453
Score = 270 bits (691), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 181/621 (29%), Positives = 329/621 (52%), Gaps = 68/621 (10%)
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
++++VE++++ ++S+P+L G ++ Y+++ E + K + R LS
Sbjct: 876 ENLEIVEISLE--ILNNSQPYLLLKNRIGDLIVYKSFKKENGDLRFKKYNHNFILRDLSN 933
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ S + D Y + + I K S + G F+ G +P W
Sbjct: 934 NSKSINS-----------DGYRK---------KSIVNIKLSSKNNGVFIGGQKPVWIFNE 973
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKI 998
+ +R+H DG+IV+ HN +C +GF+Y T + +KI L ++N + ++++
Sbjct: 974 KGYIRLHSMDFDGAIVSLKPFHNADCPNGFLYYTEDKQHIKIGYLNGLMNFENEYAIRRV 1033
Query: 999 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1058
P+K + H+I Y E Y ++VS P +V +++ + +
Sbjct: 1034 PIKLSAHKIAYHNELKCYVVVVSFP---------------QVTQELEEDSKKPI-----L 1073
Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT--TKENETLLAIGT 1115
T E+++++I++P W+ + +Q E L +++V+L F + T +++ L IGT
Sbjct: 1074 TDEKFQIKIIDP-TIDWSWRFIDSFSLQDRETVLAMKIVSLKFKESDETIKSKPFLVIGT 1132
Query: 1116 AYVQGEDVAARGRVLLFS--TGRNADNPQNLVTE----VYSKELKGAISALASLQGHLLI 1169
A+ GED +GRVL+F + + +L T+ +Y KE KG ++AL+S+ G LL+
Sbjct: 1133 AFTFGEDTQCKGRVLVFEIVSHKTQFESDDLGTKRLNLLYEKEQKGPVTALSSVSGLLLM 1192
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
GPK+ ++++ +L ++F+DA +Y+ S++ +K +I++GD++KS+YFL W G QL
Sbjct: 1193 TIGPKLTVNQFLTGQLVTLSFHDAQ-IYICSISTIKTYIVIGDMYKSVYFLQW--NGKQL 1249
Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
L+KD+ SL+ F+TEF+++ TLS++VSD KNI +F + P S +GQ LL +A+FH
Sbjct: 1250 VPLSKDYQSLNIFSTEFIVNQQTLSILVSDLDKNILLFSFDPADPTSRQGQMLLCKADFH 1309
Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
+G+++ KF+R M + +D+ + FGTLDGS+ + PLDE ++
Sbjct: 1310 IGSNIEKFVRTPMKFNIQSSSNGNNNNDQ----LVFFGTLDGSLNVLRPLDERMYQLFYH 1365
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS-------IVDCELLSHYEMLPLEE 1402
LQ KL +P AGLN + +R F S + P + I+D +LLS + L +E
Sbjct: 1366 LQSKLY-YLPQPAGLNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGDLLSKFVKLNQKE 1424
Query: 1403 QLEIAHQTGTTRSQILSNLND 1423
+ +A G+ +IL+ L +
Sbjct: 1425 RRLLASSVGSNTDEILTALKN 1445
Score = 248 bits (633), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 198/738 (26%), Positives = 331/738 (44%), Gaps = 119/738 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS-AASLELVCHYRLHGNVES 115
NLV++ N +++Y + K KN T ++ + + SLEL+ +L G +ES
Sbjct: 31 NLVLSKNNTLQVYKI-------KYVKNENTTTQQKQIKKVEIKPSLELLIELKLFGTIES 83
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+A + G + +DS++L F DAKISVL+++ I I S+H +E+ E+ K GR
Sbjct: 84 MASVRYPGEN----KDSLLLTFRDAKISVLDYNIDIMDFEIRSLHFYENDEF---KNGRI 136
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
F P++K+D Q RC +L+Y +++L Q S L +++ ++
Sbjct: 137 HFKHPPILKIDTQQRCATMLLYDRNIVVLPFKQISSILDDEDEEEKDEEDEKENDNANQD 196
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
D +F F++GY EP ++ LHE TW R++ K T ++A+SI+ + K
Sbjct: 197 YTEEFDDDDDDNNFCFLYGYYEPTILFLHEPSQTWTSRIAVKRLTSQLTAISINFSTKLA 256
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA-VSLDSSQE 354
+IW N+P++ +L++VP P+ G LV+ N + + +Q++ LA+N YA + + E
Sbjct: 257 SIIWHTSNMPYNCDQLVSVPEPLSGALVITPNIMFHVNQTSKYGLAVNEYANIDIGDKFE 316
Query: 355 LPRS---SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
P + LD ++ +L+ D + S K G+L++ ++ DGR VQR+ +SK SVL
Sbjct: 317 FPLDETLNLVFTLDRSNFVFLEADKFIGSLKGGELLIFHLISDGRTVQRIHVSKAGGSVL 376
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
+ + + ++L FLGSRLGDSLL+Q+T S T +E + E + K+ + S
Sbjct: 377 ATCMCVVSDNLLFLGSRLGDSLLLQYTEKSIT--------DESLEHENFSNPYKKQKTSE 428
Query: 472 SDAL-----------QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ L D V EE L+ N +S Q + D ++N+GP+ D
Sbjct: 429 QEKLLNQQQQQQKDEMDEVLDEEDELFKEKKNQLKSYQ----LGICDQILNVGPVGDMVI 484
Query: 521 GLRINADASATGISKQSNY--ELVELPGCKG----------------------------- 549
G +N + Y +EL C G
Sbjct: 485 GQALNPTYDLNTLPSDPAYMPRFLELVTCSGYGKNGSISILQNSVKPEIVGAFDSEGVVN 544
Query: 550 -IWTVYHKSSRGHNADSSR------------------------MAAYDDEYHAYLIISLE 584
WTVY+K+S D +++Y YL IS+
Sbjct: 545 SFWTVYNKASSSIKEDEEEKLIGKKRTINEIIKEEQQYEQQQQKQPIEEDYLDYLYISMS 604
Query: 585 ARTMVLETADLLTEVTESVDYFVQG----RTIAAGNLFGRRRVIQVFERGARIL-DGSYM 639
T T L T +E +G RT+ GNLF +RR++ + E ++L D + +
Sbjct: 605 NGT----TNILDTTSSEEGKLTFKGEFEYRTLDMGNLFNKRRIVLINENSIKLLNDYNNI 660
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
Q++ + S I DPYVL+ SD SI+L D ++ +
Sbjct: 661 VQEIKLS------------KPIKSTFIQDPYVLVHYSDNSIQLFKCDYKLLKLNQFNFSL 708
Query: 700 IESSKKPVSSCTLYHDKG 717
+ V + +L+ DK
Sbjct: 709 NHGDEGKVLTSSLFFDKN 726
>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
CBS 112818]
Length = 1398
Score = 269 bits (688), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 358/1474 (24%), Positives = 606/1474 (41%), Gaps = 202/1474 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ + + D+I+++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVRISNSKSGG--DAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141
Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
P + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 142 LGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSDA 200
Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ + + + LP D +K++ +P P+GG L++G N +H
Sbjct: 261 IDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380
Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSRP 436
Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
S T+ S K G E+ A D D + ++L AS E
Sbjct: 437 SSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLHSVNPASIRQEKQVVN 494
Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
+ F+F D L ++GP +D + G + + S + +EL +G
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKSG 554
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
TV + + DS +M DD Y + I ++ + T + + D LL +
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTRLSRSYDKYLLLAKAKGDD 611
Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
ESV Y V + T+ G L RV+QV R D Y
Sbjct: 612 KEESVVYSVGSSGLDSIDAPEFNPNEDCTVDIGTLATGTRVVQVLRTEIRSHD--YNLGL 669
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
P E S E TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 670 AQIYPVWDEDTS--EERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726
Query: 701 ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYES 760
S K VS C LY DK + + S E + GP +I +
Sbjct: 727 --SGKWVSGC-LYEDK----------MNIFFSDFDIEN----EAGP----NILLFLLDVD 765
Query: 761 GALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
G L IF +PN + + VD L S SSS +E +
Sbjct: 766 GNLSIFRLPNISEPLCRVDNL--------------NLLPSNLPYESSSRRPV---NRETL 808
Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
+ + +L A H P++ ++ Y+ Y G S R L
Sbjct: 809 TELLIADLG----DAIHKSPYMILRTKHDDLVLYEPYRIAGESGHSGL-------RFLKA 857
Query: 880 SN--VSASRLR---NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
N V R N +R+P + C+ + ++ G++ F+SG PC
Sbjct: 858 VNHVVMGPRTDQGVNHDINRSP------------SSCKLLRALPDVCGYKTVFMSGHNPC 905
Query: 935 WCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
+ ++ R H G V + + H C GF YV ++++ +LPS + +D+ W
Sbjct: 906 F-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGW 964
Query: 994 PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
+KI L I Y + Y + S + L D E + N ++ +
Sbjct: 965 ATRKIALGEQVDSIVYSSASECYVIGTSA------KEDFKLPEDDESHTEWRNEFITFLP 1018
Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLA 1112
+E V++LEP W T + ++ +E + V+ L + T E + ++
Sbjct: 1019 -----QLERGTVKLLEPKN----WSTIDSHELKPAERITCIEVIRLEISELTHERKDMVV 1069
Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-LKGAISALASL--QG 1165
+G++ V+GED+ +G + +F P ++ ++++KE +KGA++AL+ + QG
Sbjct: 1070 VGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQG 1129
Query: 1166 HLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLS 1221
L++A G K ++ K G+ L +AF D YV L +K ++GD K ++F+
Sbjct: 1130 FLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVNVLKELKGTGMCIIGDAFKGLWFIG 1187
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+ E+ +L+L K+ +L +FL DG+ L ++V+D+ N+ + Y P+ S KG +
Sbjct: 1188 YSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDR 1247
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSS----DRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
LL R+ FH G + L A + D S +++ +L GSI I
Sbjct: 1248 LLHRSVFHTGHFASTMTLLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQTGSIAVIT 1307
Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
PL E ++RRL +LQ +LV+++ H LNPR +R S+G + G ++D LL +
Sbjct: 1308 PLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGGQRG---MIDGNLLLRWLD 1364
Query: 1398 LPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
+ + + EIA + G I ++L L G ++L
Sbjct: 1365 MGAQRKAEIAGRVGADVGAIRTDLEKLHGGLAYL 1398
>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1460
Score = 268 bits (684), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 365/1506 (24%), Positives = 603/1506 (40%), Gaps = 247/1506 (16%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD--GISAA------- 99
NLVV +++++++ + +Q G+ + +++ + R+ D G+ A+
Sbjct: 28 NLVVAKSSLLQVFRTKTVTTEIDTLQTNGASKGRSAARYENRLANDDDGLEASFLGGDSL 87
Query: 100 ----------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
L LV L G V L+ + + + +S++LAF DAK+S++E+D
Sbjct: 88 GFRADRTTNTKLVLVYETPLAGTVIGLSKIKTSTSRSGC--ESLLLAFRDAKLSLVEWDA 145
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
+ L S+H +E E + S PL + DP RC + + I
Sbjct: 146 ERNALGTVSIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALKFAARNLAI 199
Query: 204 LKASQGGSGL-VGDEDTFGSG------------GGFSARIES-----SHVINLRDLD--M 243
L Q + +GD D G ++ IE S V+ L +LD +
Sbjct: 200 LPFRQVDEDIDMGDWDEELDGPRPQKDVSNAAVSNGASNIEDTPYSPSFVLRLSNLDPSL 259
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
H F+H Y EP IL H+T M+ L + K I S
Sbjct: 260 LHPVHLAFLHEYREPTFGILASTSSASNALGRKDHYTYMVFTLDLQQ--KASTTILSVSG 317
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
LP D Y+++ +P+P+GG L+VG N IH +A+N S +S ++
Sbjct: 318 LPQDLYRVVPLPAPVGGALLVGCNELIHIDQSGKPNGVAVNPMTKQCTSFGLADQSDLNI 377
Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDIT 416
L+ L D+ L+ G +VL+T DGR V L+L P +++ I+
Sbjct: 378 RLEGCIIDVLTPDLGEFLMILNDGRMVLITFRIDGRTVSGLELRLVPPASGGTIIPGRIS 437
Query: 417 T---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
T IG ++ F GS GDSL+ +T + + + + D
Sbjct: 438 TLSRIGKNVMFAGSEEGDSLVFGWT-----KKQTQAGRRKSKPRDDDFYMDDYEEEEEEV 492
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------------ 521
D+ E S + S + SF + D L++I P++ +YG
Sbjct: 493 DEDDLYGEETTSHHQPVSAASSLLSGDLSFRIHDRLISIAPIQSMTYGQPVWMPGSEEER 552
Query: 522 --LRINAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNAD 564
+ ++AD A G K S + E +G WT+ K +
Sbjct: 553 NSIGVHADLQLVCAVGRDKSSCLATMNLAIQPKVIGQFEFSEARGFWTMCAKKPIPKSLQ 612
Query: 565 SSRMAA------YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGR 610
S + + YD +Y ++I++ E + TA + + G
Sbjct: 613 SDKGVSVLGGNDYDTGGQYDRFMIVAKVDLDGYEKSDVYALTAAGFEGLCGTEFDPAAGI 672
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
TI AG + R++Q+ + R DG + + P E +G+E V + SIADPY
Sbjct: 673 TIEAGTMGKGSRIVQILKSEVRSYDGDFGLSQIV--PMMDEE-TGAEPRAV-TASIADPY 728
Query: 671 VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
+L+ D S + D S ++ + S K +S C LY+D
Sbjct: 729 LLIIRDDSSAFIAGIDSSNELEELRKEDKVLVSSKWLSGC-LYND--------------- 772
Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVD 789
ST + P I + SGAL I+ +P+ + ++ D
Sbjct: 773 -STAIFAEETAKSSKPTQS--ILLFLLSSSGALYIYRLPDLSKPIYVTDGLA-------- 821
Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
Y+ AL T +GT KE I + V +L H P+L ++
Sbjct: 822 -YIPPALSSDFT-----VRKGT---PKEAITEIMVADLG----DTTHKSPYLILRHSNDD 868
Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
+ YQ Y ++ + T + S + +L N F+R P + +++ P
Sbjct: 869 LTIYQPYRYK-----------LGTGQVFS-KTLFFQKLPNPSFARAP-EETEQDDVPPQP 915
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNVNCNHG 968
+ NI+G+ FL G P + + + + RV P L ++A + H C+HG
Sbjct: 916 RLLSMRRCNNIAGYSTVFLPGHSPSFILKSAKSMPRVVP-LQGAGVIAMSPFHTEGCDHG 974
Query: 969 FIYVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
FIY S I ++ Q+P +Y V+K+P+ + Y + Y +V +P
Sbjct: 975 FIYADSHNIARVTQIPEDWSYAELGLAVKKVPIGEDIAAVAYHPPQQCY--VVGCNASEP 1032
Query: 1028 LNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1086
E+ D H + +L T++ ++++ P W T+ ++
Sbjct: 1033 F----------ELPKDDDYHKEWARENLVFKPTLDRGLLKLISPIT----WTVIDTVQLE 1078
Query: 1087 SSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1145
E L V + L + +T E L+A+GTA +GED+ RGRV ++ P
Sbjct: 1079 PCETVLCVETLNLEVSESTNERRQLIAVGTALTKGEDLPTRGRVHVYDIADVIPEPGKPE 1138
Query: 1146 TEVYSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYD 1192
T SK+LK GA++AL+ + QG +L+A G K ++ K GT L +AF D
Sbjct: 1139 T---SKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMD 1194
Query: 1193 APPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1250
YV + + L+ D K ++F+ + E+ ++ L K L+ +FL DG
Sbjct: 1195 MS-CYVTAAKELPGTGLCLMADAFKGVWFVGYTEEPYKMMLFGKSSTKLEVLTADFLPDG 1253
Query: 1251 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRL-QMLATSSD 1308
L +V D +I I + P+ +S +G LL R F+ GAH TK L L L T +
Sbjct: 1254 KELFIVACDADGHIHILQFDPEHPKSLQGHLLLHRTSFNTGAHNPTKSLLLPSTLPTDTP 1313
Query: 1309 RT------------------GAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1350
T AP LL + G I + PL E ++RRL SL
Sbjct: 1314 STIDGSNPNTNNTNGTPNASNLAPYDATERPHILLLCSPTGLIAALRPLSESSYRRLSSL 1373
Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS-----IVDCELLSHYEMLPLEEQLE 1405
+LV+S+PH AGLNP+ +R + G D+ IVD +L + L + + E
Sbjct: 1374 AAQLVNSLPHAAGLNPKGYRM--PSADCPPAGVDASVGRNIVDGTVLERFTELGMARRAE 1431
Query: 1406 IAHQTG 1411
+A + G
Sbjct: 1432 LAGRAG 1437
>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 813
Score = 264 bits (674), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 224/823 (27%), Positives = 373/823 (45%), Gaps = 89/823 (10%)
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD--- 715
S V+ VSIADPYV L + +G + L + T + SS V + + Y D
Sbjct: 10 SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKDLSG 69
Query: 716 ----KG----------------------PEPWLRKTSTDAWLSTGVGEAI------DGAD 743
KG EP ++ + L G A D A
Sbjct: 70 LFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMADLAK 129
Query: 744 GGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
D + VV +SG LEI+ +P+ V+ V+ +G + D
Sbjct: 130 QSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM 189
Query: 792 MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
E + S T +S ++ +S +EL++ + RP L + T +L
Sbjct: 190 --EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVELL 246
Query: 852 CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP- 910
YQ +F P+ K R L N+ + ++ D E+ P
Sbjct: 247 IYQ--VFRYPKGHLKI-----RFRKLDQLNLLDQQPTHIELDEN--DEQEEIESYQMQPK 297
Query: 911 -CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN +G
Sbjct: 298 YVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNG 357
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
F+Y + LKI LPS +YD+ WPV+K+PL+ TP Q+ Y E +Y LI +P+
Sbjct: 358 FLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRENRVYCLITQTE--EPM 415
Query: 1029 NQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RATIP 1084
+ D+E+ + S D Y + ++E+ ++ P+ W+ A+I
Sbjct: 416 TKYYRFNGEDKELSEE-------SRDERFIYPIGSQFEMVLISPET----WEIVPDASIT 464
Query: 1085 MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1143
+ E+ ++V L + T + L IGT + ED+ +RG + ++ P
Sbjct: 465 FEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEPGK 524
Query: 1144 LVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
+T+ KE+ KG +SA++ + G L+ G KI + + +L G+AF D +YV
Sbjct: 525 PMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN-IYV 583
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
+ VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L +V+
Sbjct: 584 HQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGFLVT 643
Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
D ++NI ++ Y P+ ES GQKLL +A++H+G V R+Q +
Sbjct: 644 DAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFL---Y 700
Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
N+ +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S+ K
Sbjct: 701 ENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQ 760
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
I+D +L+ Y ++ E+ E+A + GT +IL +L
Sbjct: 761 GINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDL 803
>gi|55725165|emb|CAH89449.1| hypothetical protein [Pongo abelii]
Length = 565
Score = 264 bits (674), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 182/536 (33%), Positives = 282/536 (52%), Gaps = 65/536 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
SVLT+ + T+ FLGSRLG+SLL+++T +++ + KEE + +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
T + QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
L I + + + + K ++V ELPGC +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 538
>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
Length = 1458
Score = 263 bits (673), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 359/1511 (23%), Positives = 624/1511 (41%), Gaps = 247/1511 (16%)
Query: 57 NLVVTAANVIEIYVVRVQE-------EGSKESKN---------SGETKRRVLMDGISAAS 100
NLVV +N++ I+ VR + E +E K GE DG S
Sbjct: 40 NLVVARSNLLRIFEVREEPCAVPHGVEDERERKGGIRRGTEAVEGELAMDAQGDGFINVS 99
Query: 101 ----------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
L LV ++LHG V L+ + + A + D ++++F+DAKI++
Sbjct: 100 KGMAMKSDVEHPKTTRLYLVREHKLHGMVTGLSGV-RIIASLEDKLDRLLVSFKDAKIAL 158
Query: 145 LEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVK----VDPQGRCGGVLVYGL 199
LE+ D++H L S+H +E +P+ L PL K VDPQ RC + +
Sbjct: 159 LEWSDAVHDLVPVSIHTYERAPQLTSLT--------APLFKSQLRVDPQSRCAALGLPNH 210
Query: 200 QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYI 256
+ IL F S +++L + ++++V DF F+ G+
Sbjct: 211 ALAILP--------------FLDDAVSDVPYSPSFILDLAVSVNPNIRNVADFCFLPGFN 256
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+P + ++ E TW GR+ T + ++ +P+I S LP D+ L VP+
Sbjct: 257 KPTLAVMFEPLQTWMGRIGEYKDTVKLVIFTLDIKTSSYPIITSVDGLPMDSLGL--VPA 314
Query: 317 PIGGVLVVGANTIHYHSQSAS--CALALNNYA--VSLDSSQELPRSSFSVELDAAHATWL 372
GGV++ N++ Y QS+S A+ +N +A ++ LP ++ L+ + +
Sbjct: 315 -FGGVVITTPNSLIYIDQSSSRQIAVPVNGWASRITDLPLLPLPSPDLNLTLEGSKTVVV 373
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGS 427
+ G + + V+ DG+ V +L + K T PSV+ S
Sbjct: 374 DEKTLFVILANGIIYPIEVMADGKTVTKLQVGKPLAQATIPSVVES-------------- 419
Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-----GE 482
LGD L + +L + EE D E + + K + L D + +
Sbjct: 420 -LGDGHLFVGSTVGVGVVLKTAWVEEEVDDEEEGTNAKVVEDDIDMDLYDDDDDLYGDSK 478
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG---- 532
+ + +T+ + ++RD+L GP+ ++ L D +ATG
Sbjct: 479 NKTQVTAEVKDTKKYRSVLHLSLRDTLPAYGPISSLTFSLATEGDKPVPELVTATGSGIL 538
Query: 533 ---------ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----- 578
+ ++ +++ + G +G+W++ + S SS A + HA
Sbjct: 539 GGFTLFQRDLPTRTKKKILAVGGTRGLWSLPIRQSVKKGGSSSSTTAIE---HAKTERDT 595
Query: 579 LIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
LI+S +A R TEV ++ V G T+ A F R ++ V
Sbjct: 596 LILSTDATPSPGVSRIATRAPPGGKTEV--NITTRVPGTTVGAAPFFQRTAILVVMTNSI 653
Query: 632 RILDGSYMTQDLSFGPSNSESGSGSE------NSTVLSVSIADPYVLLGMSDGSIRLLVG 685
++L+ P +E + + + S SI DP+VL+ D S+ L +G
Sbjct: 654 KVLE-----------PDGTERQTIQDMDGKLLRPKIRSCSICDPFVLIIREDDSLGLFIG 702
Query: 686 DPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGA 742
+ + + + + E + K ++ C G +TS +T + + G+
Sbjct: 703 ETERGKIRRKDMSPMGEKTSKYLAGCFFTDTSGLFGQQFETSVPVEGATATLQNVVSGGS 762
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
G Q + ++ G +EI+ +P F+V S +VD++ + AL S
Sbjct: 763 TSGGKPQHTQWLLLVRPQGVMEIWTLPKLTLAFSVSAVPSLFNVLVDSHDKPAL--SVPN 820
Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
+ G+ E + +V E R LF L +G + Y+A P
Sbjct: 821 PGDPPQRKPGEFDVEQVCVSRVGE-------DGRGRVCLFVFLRNGQLTIYEAL----PL 869
Query: 863 NTSKSDDPVSTSRSLSV--------SNVSASRLR----NLRFSRTPLDAYTREETPHGA- 909
+T+ S S ++ V V A + R N++F + A+ + GA
Sbjct: 870 STTASQPAASVDGAMDVDASSTEPQQQVEAEKKRSQTLNIKFVKISSIAFEIQRHEDGAE 929
Query: 910 ------------------PCQRITIFKNIS----------GHQGFFLSGSRPCWCM-VFR 940
QR+ + + + G F +G +P W + +
Sbjct: 930 GGERGSGERASGILAEHKKIQRLFVPFTVKPKTSDGTPSPTYSGVFFTGDKPNWIIGTDK 989
Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL 1000
++++P + +F+ F+ T G I LP TY + P + IP
Sbjct: 990 GGVQIYPS-GHNVVHSFSACSLWEERGEFLVYTEDGPCLIEWLPD-FTYSHPLPARSIP- 1046
Query: 1001 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1060
+ + F LIV+ + Q D++ G ++ + VD T T
Sbjct: 1047 RGRGYSNVVFDPSTC--LIVAASSM----QARFASYDED-GVRVWEKDGPGVDDPITDT- 1098
Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQ 1119
+ ++ P+ W T ++E + +VTL T+ ++ +A+GT +
Sbjct: 1099 --SALELISPNS----WITMDGFEFATNEYINDISIVTLETAATETGSKDFIAVGTTIDR 1152
Query: 1120 GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGP 1173
GED+AA+G +F +P T Y L KG ++A+ QG+L+ + G
Sbjct: 1153 GEDLAAKGAAYIFEIVEVVPDPAISPTRWYKLRLRCRDDAKGPVTAVCGFQGYLVSSMGQ 1212
Query: 1174 KIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1232
KI + + E L G+AF D +YV SL ++KN +L+GD KS+ F++++E +L LL
Sbjct: 1213 KIFVRAFDSDERLVGVAFMDVG-IYVTSLRVLKNLLLIGDAVKSVMFVAFQEDPYKLVLL 1271
Query: 1233 AKDFGSLDCFATEFLIDGS-TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH-- 1289
AKD +F + L+L+V DE+ ++I+ Y P +S G+ LL R E+H
Sbjct: 1272 AKDVHLHSVTRADFFFNADGDLALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYHGQ 1331
Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
V H + T + R P +++ LL G+ DGS+ + P+DE F+RLQ
Sbjct: 1332 VPYHTS--------TTIARRDKEDPSIPQSH---LLIGSADGSLSSLVPVDEYAFKRLQL 1380
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
LQ +L ++ HVAGLNP++FR N +P I+D +LL+ YE LP+ Q E+ Q
Sbjct: 1381 LQGQLTRNIQHVAGLNPKAFR-IVKNDYVSKPLSKGILDGQLLAQYESLPIPRQNEMTKQ 1439
Query: 1410 TGTTRSQILSN 1420
GT R +L +
Sbjct: 1440 IGTERGVVLRD 1450
>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit A
[Leptosphaeria maculans JN3]
gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit A
[Leptosphaeria maculans JN3]
Length = 1402
Score = 263 bits (672), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 339/1426 (23%), Positives = 584/1426 (40%), Gaps = 230/1426 (16%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V EG E N+ E + A L LV
Sbjct: 28 NLVVAKNSLLQIFEIKSTTTEVTPEGGDEVDNAAANLDTEAADVQFQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+ L G V SLA + A N++ R +++++AF DAK+S++E+D + L S+H +E+P
Sbjct: 88 FPLAGTVISLARIK---ALNTKSRGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144
Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGG-------SG 212
+ W + +F + DP RC + + IL Q S
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQSDLVEDDYDSD 199
Query: 213 LVGDEDT------FGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
L G DT SGG + + SS V+ L +LD + H F+H Y EP I
Sbjct: 200 LDGPRDTKPDQAEAPSGGETTHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHQYREPTFGI 259
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
+ ++ + S ++ K + S LP+D +++ +P PIGG L
Sbjct: 260 IAASRAAAPSLLANRKDILTYSVFTLDLEQKASTTLLSVTGLPYDISRVVPLPHPIGGAL 319
Query: 323 VVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
++G N IH + +A+N +A + S +S ++ L+ + L D +L
Sbjct: 320 LLGNNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALHLEGCNVELLSQDTGDVVL 379
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLTSDITTIGNSL---FFLGSRLGDS 432
G L+++T +GR V + + VL + + N + F+GS G+S
Sbjct: 380 VLNNGRLLIMTFTLEGRTVSGMTIQTVAADHGGHVLKAGSSCTSNLVRGRLFIGSEDGES 439
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG--SA 490
+L+ G S ++ L+ ++ D T D L D + + +A
Sbjct: 440 VLL------GWSSATASLRRRHSNVGLDGDGTSEEEEEDIDDLDDDLYNDTAPAVQKITA 493
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATG 532
+ + + T+SF + D+L +I P++D S G A + TG
Sbjct: 494 AASEPTPPGTYSFRIHDTLPSIAPIRDAVLHPGKVTDSLNRGEIMLSTGR--GAAGAITG 551
Query: 533 ISKQ---SNYELVELPGCKGIWTVYHKSS------RGHNADSSRMAAYDDEYHAYLIISL 583
+ ++ + ELP GIW V+ + D+ + D +Y YL++S
Sbjct: 552 LDRELHPVSLAASELPSTHGIWAVHARKQAPGGVVTAFGEDTEANMSTDVDYDQYLVVSK 611
Query: 584 EAR-----TMVLET-ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+ T+V E + L+E + +G T+ G L +V+QV R D S
Sbjct: 612 TSEDGSESTVVYEVHGNELSETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEVRTYD-S 670
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP 697
+ D + E+G+ V++ S ADPY+L+ L D S +
Sbjct: 671 ELNMDQILPMEDEETGN---ELRVINASFADPYLLV---------LREDSSVKILKASGD 718
Query: 698 AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVC 757
+E + R S+ WLS + ++ + I++ +
Sbjct: 719 GELEDLEA-----------------RGLSSTKWLSASLFKSATFTE--------IFAFLL 753
Query: 758 YESGALEIFDV--PNFNC-VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
G L IF + P C V F+ + R A K + TEI
Sbjct: 754 TPEGGLHIFAMSEPEKPCYVAEALGFLPPLLTVDFVPRRSAAKATITEI----------- 802
Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
LA P L + ++ Y+A+ F
Sbjct: 803 ------------LAADLGDVTTRSPHLIIRTSSDDLVIYKAFHFP--------------- 835
Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRP 933
S S +++ LR ++ ++ + Y + A + + ++ G+ F G+ P
Sbjct: 836 -SRSAADLWTKNLRWIKLAQQHVPRYVEDAGSEDAGVESTLLALDDVCGYSTVFQRGASP 894
Query: 934 CWCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
+ +F+E P+ L + T H +C GF YV S L+I QLPS + +
Sbjct: 895 SF--IFKEA-SSSPRVIGLSGKPVKGLTTFHTSSCERGFAYVDSTDTLRISQLPSRTHFG 951
Query: 991 NY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQIDN 1046
+ W +++P+ A + + Y LY + P VL P E H
Sbjct: 952 HLGWATRRLPMDAEVYALAYHP-AGLYVVGTGQPEDFVLDP----------SETYH---- 996
Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1105
+ L D+ +VE +++++ G W T E L ++ + L + TT
Sbjct: 997 YELPKEDISFKPSVERGVIKLIDE----GTWSIIDTHVFDPQEVVLCIKALNLEVSETTH 1052
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISAL 1160
+ + L+A+GT+ V GED+A +G + +F P T + E+KGA+SA+
Sbjct: 1053 QRKDLIAVGTSIVHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAI 1112
Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
+ L QG L++A G K ++ K GT L +AF D YV +L + N +L+GD +
Sbjct: 1113 SELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CYVTTLKTLPNTGMLLMGDAY 1170
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ ++F + E+ +++L + +L+ A EFL L ++V+D NIQ+ + P+
Sbjct: 1171 RGVWFTGYTEEPYKMSLFGRSKHNLEAMAVEFLPFNGELHIIVADADMNIQVLQFDPENP 1230
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGA----APGSDKTNRFAL--- 1324
+S +G +LL +A FH G T LQ + S+ G AP S + L
Sbjct: 1231 KS-EGSRLLHKATFHTGHFPTTTHLLQSHLQMPESASTFGTTDTFAPDSTPSAPLPLHQV 1289
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
L + G++ I PL E ++RRL +L L++++ GLNP +FR
Sbjct: 1290 LITSQSGTLALITPLSESSYRRLSNLAAYLINTLESPCGLNPVAFR 1335
>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
Length = 1405
Score = 261 bits (667), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 333/1424 (23%), Positives = 587/1424 (41%), Gaps = 223/1424 (15%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++++ ++ V G E++N+ E L S A L LV
Sbjct: 28 NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ-------GGSGL 213
W + +F + DP RC + + IL Q S
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200
Query: 214 VGDEDTF----GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
G +++ + G + SS V+ L +LD + H F+H Y EP I+
Sbjct: 201 DGPKESKLEQQAASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAASR 260
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
T ++ + S ++ K + S LP+D +++ +PSPIGG L+VG+N
Sbjct: 261 DTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVGSN 320
Query: 328 -TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTG 384
IH + +A+N +A + S +S ++ L+ L ++ L+ G
Sbjct: 321 EIIHVDQGGKTSGVAVNEFAKTCTSFPLSDQSDMALRLEGCSVELLSHEAGDVLIVLNNG 380
Query: 385 DLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L++LT DGR V + + S + +G F+GS GDS+++ +
Sbjct: 381 RLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVMLGW 440
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTES 496
T +S L+ + + D D D+ N ++ +A+ + +
Sbjct: 441 TS------TASHLRRKQSNANIDTDEDMSDEEDMDDMEDDLYNDTAPAVQKITAAASEPT 494
Query: 497 AQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL--- 541
A T++F + D L +I P+K+ + G + ++ A N EL
Sbjct: 495 APGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEIMLSTGRGAAAAITALNRELHPV 554
Query: 542 ----VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR----- 586
+LP +G W V+ + + D A + +Y YL++S
Sbjct: 555 TAATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGEDGTES 614
Query: 587 TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
T+V E + LTE + +G T+ G L +V+QV R D S + D
Sbjct: 615 TVVYEVNGNELTETDKGDFEREEGSTLFVGILAAGTKVVQVMRTEIRTYD-SELNMDQIL 673
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ ESG+ V++ S ADPY+L+ D S+++ A + +
Sbjct: 674 PMEDEESGN---ELNVINASFADPYLLVLREDSSVKIFR-------------ATGDGELE 717
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
V + L S WLS + ++ + +++ + G L +
Sbjct: 718 DVEATGL-------------SNSQWLSASLFKSASFTE--------VFAFLLTPEGGLRV 756
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKENIHSM 822
F V + V + +S ++ Y+ R A+K + TEI
Sbjct: 757 FAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI------------------- 797
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
LA A P L + I+ Y+A+ S S S +++
Sbjct: 798 ----LAADLGDATTKSPHLIIRTSSDNIVIYKAF----------------HSPSRSAADL 837
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRE 941
LR ++ S+ + YT + + + + +I G+ F G+ P + +F+E
Sbjct: 838 WTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLALSDIGGYSTVFQRGTTPAF--IFKE 895
Query: 942 RLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQK 997
P+ L + + T H +C GF Y+ S L+I QLP + Y + W ++
Sbjct: 896 SSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRR 954
Query: 998 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
+P+ A H + Y + LY + P L+ E H + L D+
Sbjct: 955 MPMDAEIHALAYHS-SGLYIIGAGQPEEYQLDP-------SETYH----YELPKEDMSFK 1002
Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
T+E +++L+ W T + E L+++ + L + T + + L+A+GTA
Sbjct: 1003 PTIERGIIQLLDEKT----WAIIDTHVLDPQEVVLSIKTLNLEVSENTHQRKDLIAVGTA 1058
Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QGHLLI 1169
+ GED+A +G + +F P T + E+KGA+SA++ L QG +++
Sbjct: 1059 ILHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQGFMIM 1118
Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQ 1225
A G K ++ K GT L +AF D YV L + + + D ++ ++F + E+
Sbjct: 1119 AQGQKCMVRGLKEDGTLLP-VAFMDMQ-CYVSDLKNLPGTGMLAMSDAYRGVWFTGYTEE 1176
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
+++L A+ SL+ A +F+ L L+V+D N+Q+ + P +S G +LL +
Sbjct: 1177 PYRMSLFARSKHSLEAIAVDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSRLLHK 1236
Query: 1286 AEFHVGAHVTKFL-----RLQMLATSSDRTGA-------------APGSDKTNRF-ALLF 1326
+ FH G H L RL+M ++SD GA +P T +L
Sbjct: 1237 STFHTG-HFPATLHVVHSRLKM-PSASDFAGANNTENGDFEMDTSSPDDKATQPLHQILC 1294
Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
T G++ + PL E T+RRL +L L +++ AGLNPR+FR
Sbjct: 1295 TTQSGTLALVTPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFR 1338
>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus heterostrophus
C5]
Length = 1385
Score = 261 bits (667), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 330/1414 (23%), Positives = 585/1414 (41%), Gaps = 223/1414 (15%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++++ ++ V G E++N+ E L S A L LV
Sbjct: 28 NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYSLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ------------ 208
W + +F + DP RC + + IL Q
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200
Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
G ++ T + G + SS V+ L +LD + H F+H Y EP I+
Sbjct: 201 DGPKESKPEQQT--ASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 258
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T ++ + S ++ K + S LP+D +++ +PSPIGG L+VG
Sbjct: 259 SRDTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVG 318
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
+N IH + +A+N +A + S +S ++ L+ L ++ L+
Sbjct: 319 SNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALRLEGCSVELLSHEAGDVLVVLN 378
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
G L++LT DGR V + + S + +G F+GS GDS+++
Sbjct: 379 NGRLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVML 438
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
+T +S L+ + + D D D+ N ++ +A+ +
Sbjct: 439 GWTS------TASHLRRKQSNANIDTDEDMSDEEDMEDMEDDLYNDTAPAVQKITAAASE 492
Query: 495 ESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATGISKQ 536
+A T++F + D L +I P+K+ S G A AS T ++++
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEVMLSTGR--GAAASITALNRE 550
Query: 537 SNYELV---ELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR- 586
+ V +LP +G W V+ + + D A + +Y YL++S
Sbjct: 551 LHPVTVATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGED 610
Query: 587 ----TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
T+V E + LTE + +G T+ G L +V+QV R D S +
Sbjct: 611 GTESTVVYEVNGNELTETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEIRTYD-SELNM 669
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
D + ESG+ V++ S ADPY+L+ D S+++ A +
Sbjct: 670 DQILPMEDEESGN---EVNVINASFADPYLLVLREDSSVKIFR-------------ATGD 713
Query: 702 SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
+ V + L S WLS + ++ ++++ + G
Sbjct: 714 GELEDVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEG 752
Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKEN 818
L +F V + V + +S ++ Y+ R A+K + TEI
Sbjct: 753 GLRVFAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI--------------- 797
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
LA A P L + ++ Y+A+ S S S
Sbjct: 798 --------LAADLGDATTKSPHLIVRTSSDNLVIYKAF----------------HSPSRS 833
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCM 937
+++ LR ++ S+ + YT + + + + +I G+ F G+ P +
Sbjct: 834 AADLWTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLTLSDIGGYSTVFQRGTTPAF-- 891
Query: 938 VFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-W 993
+F+E P+ L + + T H +C GF Y+ S L+I QLP + Y + W
Sbjct: 892 IFKESSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGW 950
Query: 994 PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
+++P+ A H + Y + LY + P L+ E H + L D
Sbjct: 951 ATRRMPMDAEIHALAYHS-SGLYIVGTGQPEEYQLDP-------SETYH----YELPKED 998
Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLA 1112
+ T+E +++L+ W T + E L+++ + L + T + + L+A
Sbjct: 999 MSFKPTIERGIIKLLDEKT----WTIIDTHVLDPQEVVLSIKTLNLEVSENTHQRKDLVA 1054
Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASL--QG 1165
+GTA + GED+A +G + +F P T + E+KGA+SA++ L QG
Sbjct: 1055 VGTAILHGEDLATKGCIRIFEVITVVPEPDRPETNKRLKLIVKDEVKGAVSAISELGTQG 1114
Query: 1166 HLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLS 1221
+++A G K ++ K GT L +AF D YV L + + + D ++ ++F
Sbjct: 1115 FMIMAQGQKCMVRGLKEDGTLLP-VAFMDM-QCYVSDLKNLPGTGMLAMSDAYRGVWFTG 1172
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+ E+ +++L A+ SL+ A +F+ L L+V+D N+Q+ + P +S G +
Sbjct: 1173 YTEEPYRMSLFARSKHSLEAIAIDFIPFEEQLHLLVADADMNLQVLQFDPDNPKSEAGSR 1232
Query: 1282 LLSRAEFHVGAHVTKFL-----RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
LL ++ FH G H L RL+M ++SD P +L + G++ +
Sbjct: 1233 LLHKSTFHTG-HFPATLHVVHSRLKM-PSASDFAATQP------LHQILCTSQSGTLALV 1284
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
PL E T+RRL +L L +++ AGLNPR+FR
Sbjct: 1285 TPLSEDTYRRLSNLSAYLSNTLDATAGLNPRAFR 1318
>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
Length = 1495
Score = 261 bits (666), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 350/1450 (24%), Positives = 594/1450 (40%), Gaps = 216/1450 (14%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S L LV Y L G V SLA + +++ +S++L+F+DAK+S++E+D HGL
Sbjct: 147 STTKLVLVGEYALAGTVTSLARIKI--SESKSGGESLLLSFKDAKLSLVEWDPERHGLST 204
Query: 157 TSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS 211
S+H +E E W ++ + DP+GRC + + IL QG
Sbjct: 205 VSIHYYEQEEIGGSPWDPYLSNCFNY-----LTADPRGRCAALKFGARNLAILPFRQGDE 259
Query: 212 GLVGDE--------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
D+ T + G S V+ L LD + H F++ Y
Sbjct: 260 DTTMDDWDEELDGPRPTTAIITSENKGHEDTPYAPSFVLRLSSLDPTLIHTVHLAFLYEY 319
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL + + + ++ K I LP+D +K++ +P
Sbjct: 320 REPTFGILSSTLSPSSSLLDERKDQLSYMVFTLDLNQKASTTILVVTGLPYDLFKVIPLP 379
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
SPIGG L+VG N IH + +A+N A S S + +SS + L+ L
Sbjct: 380 SPIGGALLVGGNELIHIDQSGKANGVAVNALAKSCTSFGLVDQSSLQMRLEGCAVEQLSA 439
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFL 425
D L+ TG+L +L+ DGR V L+L + PS S + I ++ F+
Sbjct: 440 DNGEMLIILNTGELAVLSFRMDGRSVSGLNLRRV-PSESGICMGAQASCTSLINHNSMFI 498
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE-- 483
GS DS+++ ++ S + + + + D +D + GE
Sbjct: 499 GSEDTDSIVLGWSRKSKQAGRRRS-QPTIDAGDDADVDGTDEDQEDEDEDEDDLYGESTA 557
Query: 484 -LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----RINADASATGISKQSN 538
+ L G + + S ++F + DSLVNI PL+D + R + D AT IS +SN
Sbjct: 558 AIPLKGEVAADANSKAGDYAFRIHDSLVNIAPLRDVTLSKPETPREDEDEEAT-ISTRSN 616
Query: 539 YELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
+ELV E P +GIWT+ K + + A
Sbjct: 617 FELVGVTGRNTSGSLAFLRREIEPNVIGRFEFPEARGIWTLCAKRPLIKGLEPEKSEAIL 676
Query: 573 D-------EYHAYLIISL-------EARTMVLETADLLTEVTESVDYF-VQGRTIAAGNL 617
D ++ +I+S E+ VL +A E ++ G TI G +
Sbjct: 677 DPESELGAQFDRLMIVSKSTEDTPEESSVYVLTSAGF--EALADTEFEPAAGATIKCGTV 734
Query: 618 FGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
RV+Q+ + R DG + Q L + E+G+ +++ SI DPYVLL
Sbjct: 735 GNGMRVVQILKSEVRSYDGDLGLAQILPM--FDDETGA---EPKIVAASIVDPYVLLIRD 789
Query: 677 DGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG 736
D SI + D ++ + K +S C LY+D STG+
Sbjct: 790 DASIFVASCDSDNDLEEIERGDDSLLTNKWLSGC-LYND----------------STGMF 832
Query: 737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--R 793
++G + I S++ E GAL ++ +P+ + ++ + T I Y R
Sbjct: 833 AETALSNGTVSKKSVIMSLLNSE-GALFMYALPDLSKPIYQANGVSFIPTTISPDYATRR 891
Query: 794 EALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCY 853
+ ++ TE+ L A P+L ++ + Y
Sbjct: 892 STVAETLTEV-----------------------LLADLGDATSKSPYLIFRASNDDLTIY 928
Query: 854 QAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR 913
+ F+ P S+ P S+SL + + T + A E G+P +
Sbjct: 929 EP--FQVP-----SEAPRPLSKSLHFQKIHNPHVAKTANPETEV-AADAESAKRGSPMRA 980
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
I N+ G FL G P + + + L + + + H C+ GFIYV
Sbjct: 981 IA---NVGGLSSVFLPGDSPSFVVKSSKSTPRVVGLRGHGVRSLSGFHTEGCDRGFIYVD 1037
Query: 974 SQGILKICQL-PSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPL--IVSVPVLKPLNQ 1030
S+GI ++ QL P + D ++K+ + +TY K++Y + +V P P +
Sbjct: 1038 SKGIARVSQLEPETNVTDIGLTLRKVKIGEEVQAVTYHPPKDVYVIGTVVKEPFELPKDD 1097
Query: 1031 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---------VRILEPDRAGGPWQTRA 1081
D HR + E+ +++L P W
Sbjct: 1098 ----------------------DYHREWAKEDITFKPLTGRGFLKLLNPSN----WSVID 1131
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---STGRN 1137
+ + S E + ++ + L + T E + L+ +GTA +GED+A RGRV ++ +
Sbjct: 1132 KVELDSHEIIMCIKTLNLEVSENTHERKQLITVGTAISKGEDLAIRGRVYVYEVITVVPF 1191
Query: 1138 ADNPQ-NLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAF 1190
D P+ N ++ +KE +GAI+ ++ + QG +++A G K ++ K GT L +AF
Sbjct: 1192 PDRPETNKKLKLIAKEEIPRGAITGISEIGTQGFMIVAQGQKSMVRGLKEDGTLLP-VAF 1250
Query: 1191 YDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
D YV ++ + L D K ++F + E+ ++ + K ++ + L
Sbjct: 1251 IDMN-TYVTTVKSLPGTGMCLFADAIKGVWFAGYSEEPYKMTIFGKQSQGMEVITADLLP 1309
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
G L ++V+D N+ + + P+ +S GQ LL R F +G H+ + L L T++
Sbjct: 1310 IGDELYIIVADSDCNLHVLQFDPEHPKSLHGQLLLQRTTFSLGGHMPTTMTLLPLTTTTQ 1369
Query: 1309 RTGAA---PGSDKTNRFALLFGTL-DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
A S+ TN + L TL G + + PL E +RRL +L L + + H GL
Sbjct: 1370 TPTPAVTSTASEPTNPASGLLMTLSSGVVAILTPLSEQQYRRLNALSNHLSNLLYHPGGL 1429
Query: 1365 NPRSFRQFHSNGKA---HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
NP++ R ++ +A RP IVD +L + L +++ E+A + G I +L
Sbjct: 1430 NPKAHRISNTAPEAVIGGRP----IVDGSVLWRWLELGSQKRAEVAGRVGVDGETIREDL 1485
Query: 1422 NDLALGTSFL 1431
++A G +L
Sbjct: 1486 QEIAAGLGYL 1495
>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 1443
Score = 260 bits (665), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 343/1512 (22%), Positives = 616/1512 (40%), Gaps = 233/1512 (15%)
Query: 57 NLVVTAANVIEIYVVRVQ------EEGSKESKNSGETKRRVLMDGISAA----------- 99
NL+V ++++++ ++ EEG+ SK + + + DG+ A+
Sbjct: 28 NLIVAKTSLLQVFTTKITSIELGIEEGA--SKQNDKWDPSLDNDGLDASFIGADSLLRPD 85
Query: 100 -----SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
L LV Y L G + SLA + + + +++++ F DAK+S++E+D + G+
Sbjct: 86 RARRTKLVLVAEYTLSGTITSLARIKTLSSKSGG--EALLVGFRDAKLSLVEWDPARPGI 143
Query: 155 RITSMHCFESPEWLHLKRG------RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
S+H +E E L+R +ES + DP RC + G + I+ Q
Sbjct: 144 STISIHYYEQDE---LQRSPWAPNLKESVN---YLIADPGSRCAALKFGGRNLGIIPFKQ 197
Query: 209 GGSGLVGDE----------------DTFGSGGGFSARIESSHVINLRDLDMKHVKD--FI 250
+ D+ S S V+ L LD +
Sbjct: 198 DDEDVNMDDWDEEIDGPRPADKVITKATNSSNDKETPYGPSFVLRLATLDPNLINPIHLA 257
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F++ Y EP IL ++ + + + H T M+ L + + I S LP+D
Sbjct: 258 FLYEYREPTFGILSSSQMPASSLLFERRDHLTYMVFTLDLQQ--RASTTIMSVTGLPYDL 315
Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
++++ + +P+GG L++G N IH + +A+N +A S + +S + L+ +
Sbjct: 316 FEVVPLDAPVGGALLIGTNELIHIDQAGKANGVAVNVFAKQCTSFGLVDQSGLDMRLEGS 375
Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---I 418
L Q+ ++ +TG++ +L+ DGR V L + + + SV+ + ++T I
Sbjct: 376 KIEQLSIQSGEMIIFLQTGEIAILSFHMDGRSVSSLSVRRVSAEAGGSVIPARVSTLSHI 435
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
G + F+GS DS+++ + S S S K IE ++ D D
Sbjct: 436 GQNTLFVGSACADSMVLGW---SRKSNQVSRRKPRVEVIEDADDASLDELDDEDDDADDD 492
Query: 479 VNGEELSLYGSASN------NTESAQKTFSFAVRDSLVNIGPLKDFSYG---LRINADAS 529
+ GE S+ A+N S + F V DSLVNI P+ + ++G L N D
Sbjct: 493 LYGEGPSIIQDATNGVAKSDTVNSKAGDYVFQVHDSLVNIAPIVNITFGNASLSQNEDEK 552
Query: 530 ATGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGH 561
+ + ELV E P +GIWT+ K + +G
Sbjct: 553 LDSVGVRGYLELVASVGKQRAGALAVIHQNIQPKVIGRFEFPEARGIWTMSAKRPAEKGL 612
Query: 562 NADSSRMA-----AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQG 609
A + + A D +Y +I+S +A + ET+D+ + + + G
Sbjct: 613 EAKKEKSSTSGDYAIDAQYDRLMIVS-KALSDGTETSDVYALTSANFEALTGTEFEPAAG 671
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
TI AG L RVIQV + R DG+ + Q L + +G+E ++S S AD
Sbjct: 672 STIEAGTLGNGNRVIQVLKSEVRSYDGNLGLAQILPM----YDDDTGAE-PKIVSASFAD 726
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
PY+LL D SI + D + ++ + K ++ C LY D
Sbjct: 727 PYLLLFRDDSSIFVAQSDENNELEEIEREDDALLATKWLTGC-LYAD------------- 772
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHI 787
S GV + G +++ ++ + GAL I+ +P+ N V+ +
Sbjct: 773 ---SRGVFAPVQSDKGQKVEE-NVMMFLLSAGGALHIYALPDLSNAVYVAEGLC------ 822
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-PFLFAILT 846
++ L + S++ E + EL + +R P+L +
Sbjct: 823 ---FVPPVLSAAYAARRSAARE-------------TITELVVADLGDETARSPYLILRPS 866
Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
+ Y+ P +TS S S S + ++ N +R P + ET
Sbjct: 867 TDDLTIYE------PFHTS------SESSGGLASTLQFLKIHNPHLARNP--DVSAAETA 912
Query: 907 HGAPCQR---ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
G R + + N+ G+ FL G P + M + L + + H
Sbjct: 913 DGIQETRDEPMRVISNLGGYCTVFLPGGSPSFIMKSAKSTPKVISLQGLGVRGMSSFHTE 972
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
C+ GFIY G+ ++ QLP +T+ +QKI L H + Y Y S
Sbjct: 973 GCDRGFIYTDVDGLARVSQLPKDTTFAELGVSLQKIELGQEIHGVAYHPPTECYVAATST 1032
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY--TVEEYEVRILEPDRAGGPWQTR 1080
+ E+ + DNH+ T+ T+E+ +R++ P W
Sbjct: 1033 EA------------EFELPKEDDNHHPQWAKEQITFKPTMEQGRLRLINPVN----WTVV 1076
Query: 1081 ATIPMQSSENALTVRVVTLFNT-TTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
+ + E + ++ + L + T E + L+A+GT +GED+A +GR+ ++
Sbjct: 1077 DEVELDPFEVIMCIKTLILETSEITNERKQLIAVGTGISKGEDLAIKGRIHVYDVINVVP 1136
Query: 1140 NPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIA 1189
P T K + +GAI+ ++ + QG ++++ G K ++ K GT L +A
Sbjct: 1137 EPDRPETNKRLKLIATEDIARGAITCISEIGTQGFMIVSQGQKCMVRGLKEDGTLLP-VA 1195
Query: 1190 FYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
F D Y+ S+ +K + D K ++ + E+ ++ L K +++ + L
Sbjct: 1196 FMDMN-CYITSIKELKGTGICVFSDAVKGVWVAGYTEEPYKMMLFGKSAKNMEIMQADLL 1254
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
DG L +V +D N+ I + P+ +S +G LL R+ F +G H+ + L L +
Sbjct: 1255 PDGKELYIVAADSDCNLHIMQFDPEHPKSLQGHLLLHRSTFALGGHLPTSMTL--LPRTK 1312
Query: 1308 DRTGAAPGSDKTNRFA--------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1359
T P D + A +L + G I + PL E +RRL +L L++++
Sbjct: 1313 SATLLPPSPDAMDTAADATIPEHEILITSSTGCISLLTPLSEAQYRRLSTLTSHLINTLY 1372
Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
H GLNPR++R + G +++D +L + L + + E+A + G ++
Sbjct: 1373 HACGLNPRAYR-VDKDAPEGMVGSRTVIDGNILMRWMELGSQRRAEVAGRVGVDVLEVRE 1431
Query: 1420 NLNDLALGTSFL 1431
+L L G +L
Sbjct: 1432 DLASLMGGLGYL 1443
>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
Length = 1437
Score = 260 bits (664), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 334/1468 (22%), Positives = 615/1468 (41%), Gaps = 187/1468 (12%)
Query: 57 NLVVTAANVIEIYVVRVQE------------------------EGSKESKNSGE------ 86
N+VV +N++ I+ VR + EG E SGE
Sbjct: 45 NVVVARSNLLRIFEVREEPAPFSTQKEDERDRRASMRKGTEAVEGEVEMDASGEGFVNMG 104
Query: 87 ----TKRRVLMDGISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFED 139
T + ++ + L+ +RLHG +E + I++ ++S D ++++F+D
Sbjct: 105 SVKSTGQNGILHQPTVNRFYLIREHRLHGIVTGIEGVRIIT--SIEDSF--DRLLVSFKD 160
Query: 140 AKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGV 194
AKI++LE+ +++H L S+H +E +P+ + + PL ++ DP RC +
Sbjct: 161 AKIALLEWSEAMHDLITVSIHTYERAPQLMAID--------APLFRSQLRADPLSRCAAL 212
Query: 195 LVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIF 251
+ + IL Q + L D + S +++L D +++V DF+F
Sbjct: 213 SLPKDSIAILPFYQSQAEL--DIMEHETSQARDVPYSPSFILDLSADVDTRIRNVIDFVF 270
Query: 252 VHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKL 311
+ G+ P + +L + + TW GR+ T + ++ + +P+I + LPHD + +
Sbjct: 271 LPGFNSPTIAVLFQYQQTWTGRLKEYKDTVGLILFTLDLVTRHYPVITAIDGLPHDCFAM 330
Query: 312 LAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFS-------VE 363
+ +GGV+V+ +N+I Y Q+ L ++ + L +LP S S ++
Sbjct: 331 APCSTALGGVVVLASNSIIYVDQATRRVILPVSGW---LPRISDLPIPSLSHQDQQRDLQ 387
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDITTIGNSL 422
L+ + ++ + + K G + + ++ DG+ V RL ++ + + S + + +
Sbjct: 388 LEGSQFVFVDDRTLYVVLKDGTVYPVEIIVDGKTVSRLSMAPPVARTTMPSLVRKMQDDY 447
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD--APSTKRLRRSSSDALQDMVN 480
F+GS +G S+L++ T G E + A AP+ D
Sbjct: 448 LFVGSIIGPSVLLKTT---RVEEDIEGDDVEMASVPATVVAPNNAMDLDDDDDLYGGSAV 504
Query: 481 GEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASAT 531
E+ + G N + + K + DSL GP+ D ++ L N D +AT
Sbjct: 505 IEQPHMNGITQNGSTAISKKRTVVQLSFCDSLPAYGPIADMTFTLAKNGDRAVPELVAAT 564
Query: 532 GISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
G + L++ LP G +GIW++ + + N + A + YHA
Sbjct: 565 GSGMLGGFTLLQRDLPTRTKRKMHAIGGGRGIWSLLVRQAVKVNGSTYERPA--NPYHAE 622
Query: 579 ---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
++IS +A + A + + + G TI A F ++ V +
Sbjct: 623 NDSIVISTDANPSPGLSRIASRNAQGDIQITTRIPGTTIGAAPFFQGTAILHVMINVTNV 682
Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
+ + D + + + SI DP+VL+ D SI L +G+ +
Sbjct: 683 I--RVLEPDGTERQVIKDWDGNIPRPKIRFCSICDPFVLIIRDDDSIGLFIGESERGKIR 740
Query: 694 VQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
+ + + E + + ++ C + D + + + A + + G Q
Sbjct: 741 RKDMSPMGEKTSRYLAGC-FFTDTSGIFQVHQNAQAAGIEGATSTLQSVMNAGNRTQ--- 796
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++C G +EI+ +P F+ + + D Y AL S ++
Sbjct: 797 WLILCRPQGVIEIWTLPKLGLAFSTTHAAGLESVLTDLYDPPAL--------SVPQDPPR 848
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+ ++ +I + V L RP L L G + Y+ + P P +
Sbjct: 849 KPQELDIEQLLVAPLG-----ESSPRPHLMLFLRSGQLAVYEVHSTPVPAEPL----PAA 899
Query: 873 TSRSLSVSNVSA-SRLRNLRFSRTPLDAYTREETPHGAPCQRIT-----IFKNISGHQ-- 924
S +L V V SR N++ S + E+ +RI+ + S Q
Sbjct: 900 RSSTLLVKFVKVLSRAFNIQHSDEVEKSVLAEQ-------KRISHLLIPFATSPSPGQTF 952
Query: 925 -GFFLSGSRPCWCMVF-RERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKIC 981
G FL+G RP W + + ++V P S+V AFT + F+ + +G +
Sbjct: 953 SGVFLTGDRPSWLLCTDKGGVKVLPS--GHSVVHAFTASSVWESKNDFLLYSEEGPSLME 1010
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
LP D + P + +P + + Y +L IV+ S +
Sbjct: 1011 WLPD-VQLDGHLPSRSVPRPRSYSNVVYDPSTSL---IVAA----------SSQQSKFAS 1056
Query: 1042 HQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1100
+ D + + D + ++ + E + ++ P+ W T +E + +TL
Sbjct: 1057 YDEDGNIVWEPDTNISFPSCECSALELISPEG----WVTMDGYEFAQNEFVNCLDCITLE 1112
Query: 1101 NTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL------ 1153
+T+ + +A+GT +GED+A +G V +F + + + +Y +L
Sbjct: 1113 TMSTETGTKDFIAVGTTINRGEDLAVKGAVYIFEIVEVVPDTNSGLKRLYRLKLQCRDDA 1172
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
KG ++AL + +L+ + G KI + + E L G+AF D ++V SL VKN +++GD
Sbjct: 1173 KGPVTALCGMDNYLVSSMGQKIFVRAFDLDERLVGVAFLDVG-VFVTSLRSVKNLLVIGD 1231
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
KS++F++++E +L +L KD + + +SL+V DE I++ Y P
Sbjct: 1232 AVKSVWFVAFQEDPYKLVILGKDPYHTCVTCADLFFAENRVSLLVCDEDGVIRLLEYDPH 1291
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
ES GQ LL R EFH T++ ++A D+ P + L+ G+ DGS
Sbjct: 1292 DPESRGGQHLLRRTEFH---GQTEYRTSVLIARRKDKDIDIPQA------KLVCGSTDGS 1342
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+ ++E F+ L LQ +L +V HVAGLNPR+FR N RP I+D LL
Sbjct: 1343 LVSFTFVEEAAFKGLHLLQGQLTRNVQHVAGLNPRAFRIVR-NDYVSRPLSKGILDGNLL 1401
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
+ +E LP+ Q E+ Q GT R+ +L +
Sbjct: 1402 TTFEELPIARQNEMTRQIGTERATVLKD 1429
>gi|414587797|tpg|DAA38368.1| TPA: hypothetical protein ZEAMMB73_143443 [Zea mays]
Length = 153
Score = 259 bits (663), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 127/159 (79%), Positives = 139/159 (87%), Gaps = 6/159 (3%)
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
M ESWKGQKLLSRAEFHVGAHV+KFLRLQML T G A S+KTNRFAL+FGTLDG
Sbjct: 1 MVESWKGQKLLSRAEFHVGAHVSKFLRLQMLPTQ----GLA--SEKTNRFALVFGTLDGG 54
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
IGCIAP+DELTFRRLQSLQ+KLVD++PHV GLNPRSFR F SNGKAHRPGPD+I+D ELL
Sbjct: 55 IGCIAPVDELTFRRLQSLQRKLVDAIPHVCGLNPRSFRHFKSNGKAHRPGPDNIIDFELL 114
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
SHYEM+ LEEQLEIA Q GTTRSQILSN +D +LGTSFL
Sbjct: 115 SHYEMMSLEEQLEIAQQIGTTRSQILSNFSDFSLGTSFL 153
>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
C-169]
Length = 1137
Score = 259 bits (661), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 198/627 (31%), Positives = 293/627 (46%), Gaps = 80/627 (12%)
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
PFL +L DGT L Y+A F P V RL + P
Sbjct: 553 PFLLLLLADGTFLAYRA--FHTPRG-----------------RVCFKRLSLPAHAHCPPQ 593
Query: 899 AYTREETPHGAPCQRITIFKNISGHQ-----GFFLSGSRPCWCMVFRERLRVHPQLCDGS 953
+ T AP +T F + + G F+SG RP W + R L H +G
Sbjct: 594 DRRSKTT---APSSSMTRFDGLGESKEHVNSGMFVSGERPLWLVASRGTLVAHAMDVEGR 650
Query: 954 IVAFTVLHNVNCNHGFIYV----TSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
+ T HN+NC GFI LKICQLP + D WP+QKI ++ATPH++ Y
Sbjct: 651 VSGMTPFHNINCPLGFITACMAENDGETLKICQLPMRTRLDTPWPLQKIAVRATPHRLAY 710
Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS---VDLHRTYTVEEY--E 1064
+AE LY L+VS PV P + QE D H S D + E E
Sbjct: 711 YAEARLYVLLVSRPV--PYRE------HQEEASDGDPHASYSYICADAAAKASGTELGGE 762
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
VR+LEP R +QT A + E +V L N T E + +GTA GED
Sbjct: 763 VRLLEPGR----YQTVARHALDPGEEPCSVAADWLRNAQTGALEPYITVGTALNYGEDYP 818
Query: 1125 ARGRVLLFSTGRNADNPQN--------LVTEVYSKELKGAISALASLQGHLLIASGPKII 1176
GR+LLF R + + +T V++ + LA + G L+ A G +
Sbjct: 819 CSGRILLFKATRTSTSGAEQADPTISWQLTLVHASGFSRPVQGLAVMDGRLVAAVGNNMQ 878
Query: 1177 LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL--SWKEQGAQLNLLAK 1234
+ + G+ L+ I+F+ A L++ S+ +K FILLGD+HK + F+ K L L+K
Sbjct: 879 VMELRGSSLHMISFFHA-QLFITSVATIKTFILLGDVHKGLTFVYADKKANYTALTQLSK 937
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY--APKMSESWKGQKLLSRAEFHVGA 1292
D+ +D A EFL++G L L+ D +N+++F Y + +W+G+KLL HVG
Sbjct: 938 DYNDVDVEAAEFLVNGKKLFLLACDAAQNLRLFAYDGGKEQQATWQGKKLLPLGAIHVGQ 997
Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQ 1351
++ L ++ T + TG A +FG+ GSI +AP D L L +LQ
Sbjct: 998 NICSSLSHRI--TPASATG-------VQLRAAVFGSAAGSIASLAPTWDGLPAEELLALQ 1048
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG---------PDSIVDCELLSHYEMLPLEE 1402
+++V +VP VAGLNP SFR+ + +G G D ++D + L+ ++ LPL E
Sbjct: 1049 REMVLAVPQVAGLNPVSFRRRYKHGVKALAGGQSFEAPVSDDRVLDLDQLNRFQWLPLTE 1108
Query: 1403 QLEIAHQTGTTRSQILSNLNDLALGTS 1429
Q+ +A + +R Q+L L ++ + S
Sbjct: 1109 QVALAAKCNLSRQQVLHALREMVMAIS 1135
Score = 171 bits (433), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 148/477 (31%), Positives = 227/477 (47%), Gaps = 68/477 (14%)
Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
S + +S+++ L L + V+D +F+H Y EPV+++LHE + +W G++ T ++
Sbjct: 18 ALSTTVGNSYMLKLAKLGISEVRDAVFLHRYSEPVLLVLHETKPSWGGQLRNSKDTMEVT 77
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
A S++ K+H +WS NLP DA+KL+ VP GG LV+ N + Y SQ A+ A A
Sbjct: 78 AFSLNVAHKRHTRLWSIGNLPSDAFKLIEVPG--GGGLVICQNLLIYVSQEAAAAAASGA 135
Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
F ++L WL ++ LL +G L+L+ V +G +RL +S
Sbjct: 136 PRA----------EGFELDLTDCSGAWLADNSLLLGLASGQLILVNVQLEGS--KRLKVS 183
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
K + S + +G L FLGS + +SLL++ G ++L G +E+ EADA
Sbjct: 184 KAQGAPPPSCMCRLGPELLFLGSWVANSLLIR-AVPEGQTLLLGGPEEQAS--EADATHA 240
Query: 465 KRLRRSSSDALQDMVN--GEELSLY-----GSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ R DA D+ N +E+SL A +T A K +S V DSLV+IG ++D
Sbjct: 241 SKRPRLDPDA-ADLGNEDEDEVSLIYRTDAQPALPSTTGASK-YSLQVVDSLVSIGIVQD 298
Query: 518 FSYGLRINADASATGISKQSN-------------------------YEL---VELPGCKG 549
G + A ++K EL V LPG
Sbjct: 299 LVTG-EASTSAPQEWVAKTERGPPKLLAAVGSDKFGAVAVLRSSLVPELVTEVPLPGVDQ 357
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES-VDYFVQ 608
+W V H G D S + YHA+L ++ ++ T VL T + L E S VD+ +
Sbjct: 358 MWAV-HFQPEGLPVDDSLL------YHAFLFLNEKSGTKVLRTGEELDETDSSQVDFILS 410
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
RT+ AGNL G R++QV RG +L GS QDL + G N+T+++ S
Sbjct: 411 SRTVFAGNLLGNSRIVQVHARGVVLLSGSSRVQDLPV-----QDLIGVSNTTIVAAS 462
>gi|281205270|gb|EFA79463.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
Length = 1395
Score = 255 bits (652), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 209/734 (28%), Positives = 340/734 (46%), Gaps = 109/734 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV+ +++++Y +R ++ + +++ D + LEL +L +ESL
Sbjct: 31 NLVIAKTSLLQVYTIRYDRIEQQQQQQQQTNEQQSQQDTLKPW-LELNLELQLFSIIESL 89
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G D DS+IL+F DAK+S+++++ + L I S+H FE LK GR++
Sbjct: 90 NCVRLPGDD----IDSLILSFRDAKVSIVKYNKATEKLDIRSLHYFEGNS--ELKGGRKT 143
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG-------------------DE 217
F PL++VD Q RC +L+Y + +L + S L DE
Sbjct: 144 FRTPPLIRVDYQQRCAVMLLYDRHLAVLPFPRSFSILDDEEEEEEEEAAVVADQQQQHDE 203
Query: 218 D-----------TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
+ S + S+VI+L L +++VKDF F+H Y EP ++ LHE
Sbjct: 204 NEQQQPQDDQQQQQTSEKNKKKKQSESYVISLNSLGIENVKDFCFLHTYYEPTLLFLHEP 263
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
TW R+S K T +++A+S++ +Q P+IWS +LP++ +L+ VP P+GG +V+
Sbjct: 264 SQTWTSRISSKKFTNVLTAVSLNIAQRQQPVIWSIEHLPYNCERLVPVPDPLGGAMVLTP 323
Query: 327 NTIHYHSQSASCALALNNYA-VSLDSSQELPRSSFSVE----LDAAHATWLQNDVALLST 381
N + Y +QS+ L N YA + + P S S LD A+ +L D L S
Sbjct: 324 NILFYFNQSSRYGLECNEYAQIDTGDQFQFPIDSSSTNLVFTLDCANFIFL-GDRLLGSL 382
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--- 438
K G+L++ ++ DGR VQR+ ++K SVL+S + ++L FLGSRLGDSLL+Q+T
Sbjct: 383 KGGELLIFHLISDGRNVQRISITKAGASVLSSTSCVLTDNLLFLGSRLGDSLLLQYTEKI 442
Query: 439 ----CGSGTSMLSSGLKE----EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
LS+ K+ E D+ D + S +D + +E ++
Sbjct: 443 IDVDSSDNVENLSNPYKKKKTSEVFDLFDDEERNSKTGASDADGNGQSLFDDEDDIF--- 499
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN-ADASATGISKQSNYELV------- 542
N+ ++ K++ + D + NIGP+ D G+ + A S +Q + ELV
Sbjct: 500 -NDKKNQLKSYRLNICDHITNIGPVSDLITGVSYDHASVSNDESFEQRSLELVACSGHGK 558
Query: 543 -------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
ELPG + WT+Y+ + S + +
Sbjct: 559 NGALTILQYGVRPELNTSFELPGVRQSWTLYYDDPLAASQSGSSASNAAASAASKKRQHE 618
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
E T+V +T L EV + TI N+FGRRR+ V + G ++L G S +TQ+
Sbjct: 619 EDSTLVFQTGGQLKEVAK-----FDHATITVANMFGRRRIALVHQNGIKLLSGHSNITQE 673
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS-TCTVSVQTPAAIE 701
+ +V I DPYVL+ DG+I L G+ T + + P
Sbjct: 674 IKL-------------KSVKMAYIVDPYVLILHKDGTISLYQGNTGITQLLEYELPQP-- 718
Query: 702 SSKKPVSSCTLYHD 715
K V SC+++HD
Sbjct: 719 --KDGVMSCSMFHD 730
Score = 198 bits (504), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 135/445 (30%), Positives = 224/445 (50%), Gaps = 58/445 (13%)
Query: 823 KVVELAMQRW-SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
K+VE+ + ++ HS P+L + G IL Y+A K D + ++ L
Sbjct: 872 KIVEIVIHYLHNSPHSSPYLMILNEFGDILIYKAI---------KYKDSMDNTKEL---- 918
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPC 934
+R ++ + L + RE + P ++I F NI GH+G F+ G R
Sbjct: 919 -----IRFIKHTDQNLHSKQREYSYGIDPSSESSFYIRKIVAFDNIGGHKGVFMCGKRSL 973
Query: 935 WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
W + LR HP + +FT HN+NC++GFIY T +G+L+I QL + ++N W
Sbjct: 974 WFFCEKNYLRAHPMNFKDPVTSFTCFHNINCSYGFIYFTEKGVLRINQLSNMMNFENEWA 1033
Query: 995 VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1054
++KIPL+ T H+I++ E Y L++S P Q D
Sbjct: 1034 IRKIPLRMTCHKISFHQEFKCYVLVISYP----------------QAPQSDEEEEEKEKS 1077
Query: 1055 HRTYTVEE-YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL--FNTTTKENETLL 1111
+ +EE ++V++++P W + M E L ++V L + + + L
Sbjct: 1078 KKPLILEEKFQVKLIDPSMN---WSIVDSFSMSEKETVLCAKIVHLKYADVDGIKLKPYL 1134
Query: 1112 AIGTAYVQGEDVAARGRVLLF------STGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
+GTAY GED +GR+L+F + + + +Y K+ KG ++ALA L G
Sbjct: 1135 CVGTAYTHGEDTVCKGRILVFEIISHREVQDDTGEEKKRLNLLYEKDQKGPVTALAGLNG 1194
Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
LL++ GPK+I++ ++ L GIAFYD +++VSL+ VKN+IL+GD++KS+ F K+Q
Sbjct: 1195 LLLMSIGPKLIVNNFSSGSLVGIAFYDT-QIFIVSLSTVKNYILVGDMYKSVSFFKLKDQ 1253
Query: 1226 GAQLNLLAKDFGSLDCFATE--FLI 1248
QL LL KD+ ++ F+ + FL+
Sbjct: 1254 -KQLILLGKDYEEMNTFSNQVHFLV 1277
>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1439
Score = 254 bits (649), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 353/1471 (23%), Positives = 585/1471 (39%), Gaps = 248/1471 (16%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
Y R+ ++ ES G V D L LV Y + G V LA + S+ G
Sbjct: 66 YDRRLNDDDGLESSFLGGDGMLVRADRAVNTKLVLVAEYPIFGVVTGLARIKIQHSKSGG 125
Query: 125 DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
+ ++++A A++S+++++ H L S+H +E E + S GPL
Sbjct: 126 E------ALLIATRVARLSLVQWNSEKHALEDISIHYYEKEEL------QGSPFDGPLAN 173
Query: 183 ----VKVDPQGRCGGVLVYGLQMI-ILKASQG--------------GSGLVGDEDTFGSG 223
+ DP RC L +G + I L Q G + T +
Sbjct: 174 YRTHLAADPGSRCAA-LSFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPPTTAAT 232
Query: 224 GGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G S +S+V+ L LD + H F+H Y EP I+ +
Sbjct: 233 NGTSNIADVPYSTSYVLPLPQLDPSLLHPVYLAFLHEYREPTFGIISSTQRRSNTLPRKD 292
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
H + + L + + I S NLP D +K++A+P P+GG L+VG N IH
Sbjct: 293 HFSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVVALPGPVGGALLVGTNELIHIDQSGK 350
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYD 394
+A+N + + +S + L+ H + +N L+ G L ++T D
Sbjct: 351 PNGVAVNAFTKETTNFPLADQSDLDLRLEHCHIELMSAENGELLMVLSDGRLAIITFKID 410
Query: 395 GRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
GR V + + V S I+ + ++FF+GS DSL++ +T
Sbjct: 411 GRTVSGVSVKPVAAEVGGNIVQCSVSTISKLSRNVFFVGSTGSDSLVLGWT--------- 461
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---------------LYGSASN 492
A + +R R D+ + + E++ + A+
Sbjct: 462 ----------RKQAQNARRKTRLVDDSFEYDLEDEDMDDGDDDDLYGETTTTMIQPGATA 511
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLR-INAD------------------------ 527
N S +F V DSL++I P+KD + G + N D
Sbjct: 512 NGVSKGGDLTFRVHDSLLSIAPVKDMTSGKQAFNPDSEEANNSVGVVADLQLACVVGRGN 571
Query: 528 ASATGISKQSNYELV----ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHAYLII 581
A A I Q+ V E P +G WT V + D AA E+ A
Sbjct: 572 AGAVAILNQNIQPKVIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVGSEFDAS--- 628
Query: 582 SLEARTMVLETADLLTEVTESVDYFV-----------------QGRTIAAGNLFGRRRVI 624
S+ + M++ DL + E+ D + G T+ AG + R+I
Sbjct: 629 SIYDKFMIVSKVDL--DGYETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRII 686
Query: 625 QVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
QV + R DG ++Q L + E+G+ V+S SIADPY+LL D SI +
Sbjct: 687 QVLKSEVRCYDGDLGLSQILPM--LDEETGA---EPRVVSASIADPYLLLVRDDSSIMVA 741
Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGAD 743
D + V+ S K ++ C LY D +TG +
Sbjct: 742 QIDNNCELEEVEKQDDAILSTKWLAGC-LYAD----------------TTGRFAPVQTDK 784
Query: 744 GGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEI 803
G P Q +I+ + +GAL I+ +P+ + V +G T++
Sbjct: 785 GTPEGQ-NIFMFLLSAAGALYIYALPDLSKPVYV---AAGLTYVPPLL----------SA 830
Query: 804 NSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPEN 863
+ + GT Q E + + V +L + P+L + + Y+ E +
Sbjct: 831 DYAVRRGTVQ---ETLTELLVADLG----DTTTTSPYLILRHANDDLTIYEPIRLESQDK 883
Query: 864 TSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNIS 921
T V S++L ++ N +++P++ E E P P + NI+
Sbjct: 884 T------VGLSKTLHFQKIT-----NPALAKSPVEVADDEANEQPRFVPLRPC---PNIN 929
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G+ FL G+ P + + + L + + H C GFIY S+G ++
Sbjct: 930 GYSTVFLPGASPSFIIKSSKSSPKVIGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVT 989
Query: 982 QLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
QLP+ + + V+KIP+ I Y Y + SV L E+
Sbjct: 990 QLPADTNFTELGVAVRKIPIGDNVGLIAYHPPMETYAVACSV------------LERFEL 1037
Query: 1041 GHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
D H + + +Y E ++++ P W T+ ++ E A+ ++ + L
Sbjct: 1038 PKDDDYHKEWAKEATTSYPQTERGIIKLMSPTT----WSVIDTVELEPHEVAMCMKTLHL 1093
Query: 1100 -FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-- 1152
+ TKE L+ IGTA +GED+ RGR+L++ P N ++ +KE
Sbjct: 1094 EVSEETKERRMLITIGTAINRGEDLPIRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEI 1153
Query: 1153 LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--N 1206
+GA++ L + QG +L+A G K ++ K GT L +AF D YV ++ V+
Sbjct: 1154 PRGAVTGLCEVGSQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDM-NCYVTAVREVRGTG 1211
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
+ L+ D K ++F+ + E+ ++ L K G + +F+I G L +VV D+ I +
Sbjct: 1212 YCLMTDAFKGVWFVGYAEEPYKMMLFGKSTGKFEVLTADFIIAGDELHIVVCDKDGVIHV 1271
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
+ P+ +S +G LL+RA F H T L L S+ T A T LL
Sbjct: 1272 MQFDPEHPKSLQGHLLLNRASFSAAPNHPTTTLSLPRTPASTATTSATKNPPTT----LL 1327
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD- 1384
+ G++ + PL E +RRL SL + ++PH A NP++ R + A PG D
Sbjct: 1328 LASPTGALASLTPLSEQAYRRLTSLANSIAGALPHAAATNPKAHRLQPLD--ARTPGVDT 1385
Query: 1385 ----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
SIVD LL+ + L + E+A + G
Sbjct: 1386 SAGRSIVDGALLARWNELGAGRRSEVAGKGG 1416
>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
P131]
Length = 1475
Score = 248 bits (634), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 342/1516 (22%), Positives = 607/1516 (40%), Gaps = 235/1516 (15%)
Query: 57 NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
NLVV +++++I+ R+ + +G+ +S + L D G+ A+
Sbjct: 51 NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 110
Query: 100 ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
L LV + L G + LA + S D +++AF+DAK+S++E
Sbjct: 111 RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 170
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
+D L S+H +E E + S PL + DP RC L +G +
Sbjct: 171 WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 223
Query: 201 MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
+ + + G +G +D T + G +E S V+ L
Sbjct: 224 SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 283
Query: 240 DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+LD + H F++ Y EP IL +T + ++ K H + ++ K
Sbjct: 284 NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 341
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
I S LP D +++A+P+P+GG L+VG+N IH + +A+N S S
Sbjct: 342 ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 401
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNPSV---- 410
+S + L+ L + D L T+V+ DGR V L + P
Sbjct: 402 QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 461
Query: 411 ---LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S +T +G + F GS GDS++ + S K + D + D
Sbjct: 462 LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQV---SKRKPKIQDPDLDLDIDYDD 518
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
D D+ E + ++++ E+ F V D +V+I P++D ++G
Sbjct: 519 LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDRMVSIAPIRDVTFGKPPPPT 578
Query: 523 ---RINADASAT----------GISKQSNYELV------------ELPGCKGIWTV---- 553
R D +A G K S+ ++ E P +G+WT+
Sbjct: 579 DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 638
Query: 554 -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
K + N + AA + +Y Y+I++ E ET+D+ +
Sbjct: 639 PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 697
Query: 607 -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
G TI AG + ++IQV + R DG +TQ + + E+G
Sbjct: 698 TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 752
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
S SIADPY+L+ D S + + + ++ I SS K + C LY D
Sbjct: 753 ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 806
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
S G A P I + +GAL I+ +P+ +
Sbjct: 807 -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 847
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L +++ G R E I + V +L + + H
Sbjct: 848 -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 895
Query: 841 LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
IL ++ + Y+ Y ++D S ++ L + +L N ++ P +
Sbjct: 896 --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRLR-----KLPNPAVAKAP-E 939
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
A E+ P + + NI+G+ F+ G P + + + + P+ L +
Sbjct: 940 ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 996
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKN 1014
A + H C GFIY S G+ ++ Q+P +++ V+K+PL I Y +
Sbjct: 997 AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGIAYHSPTG 1056
Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
+Y L S + L D + + N+S + VE ++++ P
Sbjct: 1057 VYVLTCS------YWEPFELPKDDDYHCEWAKENISFKPM-----VERSVLKVINPIN-- 1103
Query: 1075 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
W T + E A+ +R + L + +T E L+ +GTA +GED+ RG + +F
Sbjct: 1104 --WSDIWTEEFEQHEVAMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFD 1161
Query: 1133 ------STGRNADNPQNLVTEVYSKEL-KGAISALASL--QGHLLIASGPKIILHKWT-G 1182
GR + + + +V +E+ +GA+++L+ + QG +++A G K ++
Sbjct: 1162 LASVVPQKGRPETDKK--LKQVAKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQED 1219
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
+L +AF D YV + + ++ D K ++F + E ++ L K +L+
Sbjct: 1220 GKLPPVAFMDMN-CYVTCVKELAGTGLCVMADAFKGVWFCGYTEGPYKMMLFGKSSTNLE 1278
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
C + L DG L +V +D N+ + + P+ +S +G LL+R F GAH + +
Sbjct: 1279 CMNVDLLPDGKDLLIVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ--KS 1336
Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
+L T+ R S R +L + G + + PL + T+ RL +L L+ SVPH
Sbjct: 1337 LLLPTTDPRPSTNQPSSDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPH 1396
Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
A LNP+++R ++ + D ++VD LL+ + L + E+A + G + +
Sbjct: 1397 HAALNPKAYRLPPTSTRNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAGFSGA 1456
Query: 1416 QILSNLNDLALGTSFL 1431
+ + + LG S +
Sbjct: 1457 AEVRSELEAVLGWSIM 1472
>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
Length = 1452
Score = 248 bits (633), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 342/1516 (22%), Positives = 607/1516 (40%), Gaps = 235/1516 (15%)
Query: 57 NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
NLVV +++++I+ R+ + +G+ +S + L D G+ A+
Sbjct: 28 NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 87
Query: 100 ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
L LV + L G + LA + S D +++AF+DAK+S++E
Sbjct: 88 RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 147
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
+D L S+H +E E + S PL + DP RC L +G +
Sbjct: 148 WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 200
Query: 201 MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
+ + + G +G +D T + G +E S V+ L
Sbjct: 201 SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 260
Query: 240 DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+LD + H F++ Y EP IL +T + ++ K H + ++ K
Sbjct: 261 NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 318
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
I S LP D +++A+P+P+GG L+VG+N IH + +A+N S S
Sbjct: 319 ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 378
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNPSV---- 410
+S + L+ L + D L T+V+ DGR V L + P
Sbjct: 379 QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 438
Query: 411 ---LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S +T +G + F GS GDS++ + S K + D + D
Sbjct: 439 LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQV---SKRKPKIQDPDLDLDIDYDD 495
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
D D+ E + ++++ E+ F V D +V+I P++D ++G
Sbjct: 496 LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDLMVSIAPIRDVTFGKPPPPT 555
Query: 523 ---RINADASAT----------GISKQSNYELV------------ELPGCKGIWTV---- 553
R D +A G K S+ ++ E P +G+WT+
Sbjct: 556 DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 615
Query: 554 -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
K + N + AA + +Y Y+I++ E ET+D+ +
Sbjct: 616 PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 674
Query: 607 -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
G TI AG + ++IQV + R DG +TQ + + E+G
Sbjct: 675 TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 729
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
S SIADPY+L+ D S + + + ++ I SS K + C LY D
Sbjct: 730 ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 783
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
S G A P I + +GAL I+ +P+ +
Sbjct: 784 -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 824
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L +++ G R E I + V +L + + H
Sbjct: 825 -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 872
Query: 841 LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
IL ++ + Y+ Y ++D S ++ L + +L N ++ P +
Sbjct: 873 --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRLR-----KLPNPAVAKAP-E 916
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
A E+ P + + NI+G+ F+ G P + + + + P+ L +
Sbjct: 917 ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 973
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKN 1014
A + H C GFIY S G+ ++ Q+P +++ V+K+PL I Y +
Sbjct: 974 AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGIAYHSPTG 1033
Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
+Y L S + L D + + N+S + VE ++++ P
Sbjct: 1034 VYVLTCS------YWEPFELPKDDDYHCEWAKENISFKPM-----VERSVLKVINPIN-- 1080
Query: 1075 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
W T + E A+ +R + L + +T E L+ +GTA +GED+ RG + +F
Sbjct: 1081 --WSDIWTEEFEQHEVAMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVFD 1138
Query: 1133 ------STGRNADNPQNLVTEVYSKEL-KGAISALASL--QGHLLIASGPKIILHKWT-G 1182
GR + + + +V +E+ +GA+++L+ + QG +++A G K ++
Sbjct: 1139 LASVVPQKGRPETDKK--LKQVAKEEIPRGAVTSLSEIGTQGLMMVAQGQKTLVRGLQED 1196
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
+L +AF D YV + + ++ D K ++F + E ++ L K +L+
Sbjct: 1197 GKLPPVAFMDMN-CYVTCVKELAGTGLCVMADAFKGVWFCGYTEGPYKMMLFGKSSTNLE 1255
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
C + L DG L +V +D N+ + + P+ +S +G LL+R F GAH + +
Sbjct: 1256 CMNVDLLPDGKDLLIVAADSDGNLHVLQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ--KS 1313
Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
+L T+ R S R +L + G + + PL + T+ RL +L L+ SVPH
Sbjct: 1314 LLLPTTDPRPSTNQPSSDAERQHILMASPTGVLAAVQPLSQSTYTRLSALASNLMASVPH 1373
Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
A LNP+++R ++ + D ++VD LL+ + L + E+A + G + +
Sbjct: 1374 HAALNPKAYRLPPTSTRNQVAAVDISVGRAVVDGSLLARWAELASGRRAEVAGRAGFSGA 1433
Query: 1416 QILSNLNDLALGTSFL 1431
+ + + LG S +
Sbjct: 1434 AEVRSELEAVLGWSIM 1449
>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
Length = 2290
Score = 248 bits (633), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 168/543 (30%), Positives = 263/543 (48%), Gaps = 68/543 (12%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD--GSIVAFTVLHNVNCNHGFI 970
R+ G +G ++ +P + R R+HP D + + +N+ C G +
Sbjct: 1068 RLMPLGGAGGLEGVLIAARQPAVVLFGRGLPRIHPWKLDRGEGVRSAARFNNLQCKDGIV 1127
Query: 971 YVT------SQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPV 1024
+ ++G+LKIC +P G + D WP++ + T H + + A + L+VS
Sbjct: 1128 CIADKGRDRAKGVLKICNIPEGISGDTPWPLRTKHVGMTVHHVAFHAATGCHVLVVSSQ- 1186
Query: 1025 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT---YTVEEYEVRILEPDRAGGPWQTRA 1081
+I++ L T E+YEV++ P
Sbjct: 1187 -----------------QEIEDERKPEGTLEGAIPPLTEEKYEVQLRAP--YSMELLDSY 1227
Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENE-TLLAIGTAYVQGEDVAAR--GRVLLFST---- 1134
+ E AL ++VV L NT K++ +A+GT + GE +R GR+ +F
Sbjct: 1228 EFDFANGEKALCLQVVHLKNTRVKDSLLPFVAVGTGFQNGESETSRATGRIYVFEVTTVV 1287
Query: 1135 ------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP--------KIILHKW 1180
GR + + + T +++K +SAL L+G+LL+A GP K+ +++W
Sbjct: 1288 GEEGYEGRTSFKIKKIFTSADIQDIKAPVSALCQLEGYLLVAQGPNPGMIGGSKLYVYEW 1347
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
+L G AF+DA LY+ +L VK FI+ GDI S++ L W+E L LLAKD L
Sbjct: 1348 VDEKLVGRAFFDAH-LYITTLKTVKFFIVFGDIRHSVHLLRWREDIRMLQLLAKDALPLS 1406
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR- 1299
+A EF++ GS L+ SDEQKN+Q+F + P E ++ Q+L+ RA+ HVG+H+ KF+R
Sbjct: 1407 VYAAEFVVMGSNFGLLASDEQKNVQVFVFNPNSPE-YRRQQLICRADLHVGSHINKFIRW 1465
Query: 1300 -LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1358
L T RT A + TLDG IG I P+ E ++RRL +LQ LV ++
Sbjct: 1466 PLPFRPTLGVRTAAH------------YTTLDGGIGAIIPIPEQSYRRLLALQNLLVTAM 1513
Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
PH AGLNPRS+R + R + +D LL Y L L Q++++ TR IL
Sbjct: 1514 PHYAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLLGRYLHLDLALQMQLSSALNQTREAIL 1573
Query: 1419 SNL 1421
+L
Sbjct: 1574 GDL 1576
Score = 239 bits (609), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 142/412 (34%), Positives = 222/412 (53%), Gaps = 38/412 (9%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---GISAASLELVCHYRLHGNV 113
NL V +E+YV++ +E+ ++ N + ++ D G A+L+ V Y L+GNV
Sbjct: 31 NLAVVKGTQLELYVLKEEEKKHSKTCNGKQNGQKAAGDSGHGHGGATLQCVGRYDLNGNV 90
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
ES+A + G R RD + L F DAK+S+LE+D+SI + S+H FE E +++G
Sbjct: 91 ESMAFVRLPG----RNRDHLFLVFRDAKLSILEYDNSIDDIVNVSLHLFEDDE---IRKG 143
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------------- 220
R SF R PL++VDP RC +LVY +M+++ GS L D++
Sbjct: 144 RVSFGRAPLLRVDPLQRCAALLVYESKMVVIPFKHKGSDLEEDDEILTQPNKKFKSESAS 203
Query: 221 -------GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
G+ I ++V++L + +KHV DF F+ GY EP + LHE TWAGR
Sbjct: 204 SNTVTRLGAPSDNKLGILPTYVVDLDEAGIKHVVDFTFLDGYYEPTISFLHENSRTWAGR 263
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
++ + T MI+ +S++ + ++ P+IWSA LPH++ ++A+P+P GGV+VV +N + Y +
Sbjct: 264 LAVSNFTGMITTVSLNISQRRQPIIWSASKLPHNSRHIVALPAPAGGVVVVSSNALIYRN 323
Query: 334 QSASCALALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
CAL LN YA++ D + + D H L+ L S TG+ ++ V
Sbjct: 324 HEQKCALKLNEYAIAAGDGGNRFDTAGDIICFDTVHPVRLEGYQMLFSLVTGESYIMGVQ 383
Query: 393 Y--DGRVVQRLDLS----KTNPS-VLTSDITTIGNSLFFLGSRLGDSLLVQF 437
DG ++ L L K +PS S + +G+S FLGSRLGDS LV+
Sbjct: 384 LDTDGNTIKALTLDLVDVKLSPSGGFASIMCRVGDSYLFLGSRLGDSSLVKM 435
Score = 76.6 bits (187), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 135/314 (42%), Gaps = 77/314 (24%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------------ 542
+ F + D+L NIGP+ G R++A G K+ + ELV
Sbjct: 587 YRFELCDTLTNIGPI-----GSRLDA-----GAVKKDSVELVTASGGLQYGKLGVLQRSL 636
Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE----YHAYLIISL--EARTM 588
LP + +WTV+ +++ + D ++E HAY++IS + T+
Sbjct: 637 NPVVMTAVPLPDAQAVWTVFGPTAKAADEDMEEDGNEEEEQSAGMHAYMVISQGNDKGTI 696
Query: 589 VLETADLLT-EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
VL+ +L + E VD+ V +T+ GN+FG +R++QV +L+G Q+L
Sbjct: 697 VLKGRELEEFDEDEQVDFEVDAKTVCVGNIFGNQRIVQVTPWNVYVLNGPRKEQELPV-- 754
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
+G+G + +++ I DPY+ L + DG + LLVGD S+ V+ + +
Sbjct: 755 ---VAGNGLQ---IVAAYIRDPYIALILQDGRLNLLVGDASSMQVNY-----VSHEIHNI 803
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
++ + D P+ GEA D Q D+ +G +++
Sbjct: 804 TAACFFLDPIPD----------------GEANDDP-----QQRDVMLAAAPRNGHFQLYT 842
Query: 768 VPNFNCVFTVDKFV 781
+P+ V+ FV
Sbjct: 843 LPSLELVYDAADFV 856
>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 1434
Score = 244 bits (622), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 201/699 (28%), Positives = 341/699 (48%), Gaps = 63/699 (9%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREA--LKDSE---TEINSSSEE 809
V+ E+G L I+ +P V+ V K +H+ D + E L D ++I S++
Sbjct: 773 VMARENGNLYIYSIPEMQLVYMVKKL----SHLPDVAIDEMNYLGDESVVASDIASNTLN 828
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
+ E I +VE+ + + RP LF ++ D + Y+ ++++ N
Sbjct: 829 EALVAKPEEI----IVEVLLTGMGMNQGRPMLFVVV-DDMVSVYEMFMYD---NGVVEHL 880
Query: 870 PVSTSRSLSVSNVSASRLRNLRFS----RTPLDAYTREETPHGA---PCQRITIFKNISG 922
V R L + V+ R+ RF R P++A R+ + P +RI N
Sbjct: 881 AVRFKR-LPYTTVT----RSCRFQGNDGRAPVEA-ARDTVRYRTALHPFERIGNILN--- 931
Query: 923 HQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKIC 981
G F+ S PC ++ LR+HP +G I++FT +NV C +GFIY+T + ++I
Sbjct: 932 --GVFICSSYPCVFLMDSGILRMHPLNLEGPILSFTAFNNVLCPNGFIYLTEREWAMRIA 989
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
+LP+ D+ PV+KI T H I Y + N Y ++ S K N L +L++++
Sbjct: 990 KLPTDVELDSSLPVRKIRTGRTIHNIVYLLQSNTYAVVGSE---KKPNNRLCVLVNED-- 1044
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL 1099
D H D +E Y+V++ P+ W+ A I M+ E V L
Sbjct: 1045 KSFDEH--EKADSFVLPELEVYDVKLYSPED----WKPVPNAEIKMEDFEVLTCCEEVVL 1098
Query: 1100 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKEL 1153
+ T + LA+GTA GE+V RGR+++ P ++ + +Y KE
Sbjct: 1099 RSEGTVSGVQNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQ 1158
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
KG +++L S G+LL G K+ + + L GI+F D Y+ L V+N L DI
Sbjct: 1159 KGPVTSLCSCNGYLLAGMGQKVFIWLFRDNNLQGISFLDMH-FYIHQLVGVRNLALACDI 1217
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSL--DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
++S+ L ++E+ L+L ++D ++ A +FLID ++ ++SDE NI +F Y P
Sbjct: 1218 YRSVALLRYQEEYKALSLASRDMRAVVQPPMAAQFLIDNRQMAFIMSDEAANIAVFNYLP 1277
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGTLD 1330
+ ES G++L+ R+E ++G +V F+R++ +S G + NR ++LF +LD
Sbjct: 1278 EALESSGGERLILRSEINIGTNVNSFMRVKGHISS----GFVENEHYSLNRQSVLFCSLD 1333
Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
GS G + PL E FRRL LQ+ + V AGLN + R H ++VD +
Sbjct: 1334 GSFGFVRPLSEKVFRRLHMLQQLMSSLVAQAAGLNVKGSRAARPQRPNHYLNTRNMVDGD 1393
Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
++ Y L L ++ ++A + GT+R I+ +L +++ T+
Sbjct: 1394 VVFQYLHLSLADKNDLARKLGTSRYHIIDDLTEISRLTT 1432
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 172/656 (26%), Positives = 281/656 (42%), Gaps = 114/656 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + H RL V+SLA+ + S++L F+ AK+SV+ F + L+ S+H
Sbjct: 107 LECIIHVRLLAPVKSLAV---ARIPQNPSCSSLLLGFDTAKLSVVGFSAAERSLKTISLH 163
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G + P+++VDP RC +L+YG + +L L
Sbjct: 164 CFEEE---MLKDGYVTDLPSPVIRVDPAQRCAVMLIYGRYLAVLPFDDTSPHL------- 213
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
++ + L +D + ++ D F+ GY EP ++ L+E T AGR ++
Sbjct: 214 -----------HTYTVALSSIDPRLVNIIDIAFLDGYYEPTLLFLYEPAQTTAGRACVRY 262
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
T + +S++T + H +W NLP D ++L +P PIGG L++GAN + Y +QS
Sbjct: 263 DTVCMLGVSLNTKEQVHASVWQLNNLPMDCNQVLMIPRPIGGALIIGANELIYLNQSVPP 322
Query: 338 CALALNNYAVSLDSSQELPRSS---FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
C LN+ +D + P S ++ LD A + + ++ ++G L +LT+V D
Sbjct: 323 CGSLLNS---CMDGFTKFPLKSEKEMALTLDGCAACVISTNKVVVCARSGALFILTLVVD 379
Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
V+ ++ + +T F+GSR+GDSL +++ S L
Sbjct: 380 STNSVKSIEFKHEFDVSIPHTVTACSPGYLFVGSRVGDSLFIEYV---------SEL--- 427
Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ---KTFSFAVRDSLV 510
+ D P K+L+ + QD + E+L LYG A + S + F V D ++
Sbjct: 428 ---VPVDDPIEKKLK---VEVPQDDLEDEDLELYGKALPSVISQDVSVEKMRFRVLDRML 481
Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS---- 566
N+ P K + +G S+ N L E P ++ + GH DSS
Sbjct: 482 NVAPCKKMT-----------SGCSEGLNSYLQEQPRLDPVFD--RVCACGHGKDSSICIF 528
Query: 567 ---------------------RMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+ +D+ H Y+I S E ++ LET + L E+ V +
Sbjct: 529 QQSIRPDIITSSSIEGVIQYWAVGRREDDTHMYIIASKELGSLALETDNDLVELEAPV-F 587
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ----DLSFGPSNSESGSGSENSTV 661
TIAAG L +QV ++ Q L+F V
Sbjct: 588 ITSESTIAAGELADGGLSVQVTTSSIVVVAEGQQIQLIPLQLTF--------------PV 633
Query: 662 LSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
LS SI DP+V + +G + L L P +V P I +K P+++ +Y D
Sbjct: 634 LSASIVDPFVAICTQNGRLLLYELDNTPHVHLKAVDLPGNIIHNKSPITALCIYRD 689
>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
Length = 1445
Score = 244 bits (622), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 351/1504 (23%), Positives = 611/1504 (40%), Gaps = 258/1504 (17%)
Query: 57 NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
NL+V+ ++++I+ V+ + +K + N+GET R + D G+ +A
Sbjct: 28 NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSTSNAGETYDRRINDDDGLESAFLGGDGML 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
L LV Y +HG + LA + +SR +++++ A++S+L++D
Sbjct: 88 MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
HG+ S+H +E EW + S GPL ++ DPQ RC L +GL+ I
Sbjct: 145 EKHGVEDISIHFYEKEEW------QGSPMDGPLRQHATILQADPQSRCAA-LKFGLRKIA 197
Query: 204 L-------------KASQGGSGLVGDEDTFGSGGGFSARIES----------SHVINLRD 240
+ G E+ + + S S V+ L
Sbjct: 198 FLPFRQIDGDIDMDDWDEEVDGPRPQEEPPAAAAVHGSSSNSSSLAPVPYTPSFVLALPQ 257
Query: 241 LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
LD + H F F+H Y EP + I+ H T + + + L
Sbjct: 258 LDPEILHPVHFAFLHEYREPTLGIISSTNRRLKMEPQKDHFTFKVFTVDL--------LQ 309
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPR 357
+++N K++A+P P+GG L++G N IH + +A+N YA + +
Sbjct: 310 KASLN------KVIALPKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKMTKFPLADQ 363
Query: 358 SSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVL 411
S + L+ + +N LL T+ G++ ++T DGR V + + ++ VL
Sbjct: 364 SELKLRLEHCEVELMSPENGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVATENGGDVL 423
Query: 412 ---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ +T + + F G+ GDS ++ G + + K+ E+
Sbjct: 424 PFRAACLTKVTKNSMFYGTIGGDSKVI----GWSRQHVQTARKKARLLDESLDYDLDDDE 479
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
D D + GE ++ + F V DSL+++ P+ D +YG
Sbjct: 480 ADDDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMTYGKTAFFPG 539
Query: 522 ---------LRINAD-ASATGISKQSNYELV------------ELPGCKGIWTV-----Y 554
+R D A G + + L+ + P +G WT
Sbjct: 540 SEDAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFDFPEARGFWTTRVQKTI 599
Query: 555 HKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTESVDYF--- 606
KS +G + +A +D +Y ++I++ + ET+D+ +
Sbjct: 600 AKSLQGDKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGAGFEALSGT 656
Query: 607 ----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
G TI AG + R+IQV R DG ++Q L + E+G+ V
Sbjct: 657 EFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETGA---EPRV 711
Query: 662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK----G 717
+S SI DPY+LL D SI + + S K +S C LY D
Sbjct: 712 ISASIVDPYLLLLREDSSILVAQITNHNELEELDKEDETIVSTKWLSGC-LYKDSRGLFA 770
Query: 718 PEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
P + TST + + AI G+++ +C ++ +PN + V
Sbjct: 771 PVQTDKGTSTSESVFLFLLNAI----------GELHVRIC-------VYALPNLSKSIYV 813
Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
+G ++I S + ++ GT E + + V +L ++ H
Sbjct: 814 ---AAGLSYI----------PSLLSADYTARRGTS---PETLTEILVADLGDSTSASAH- 856
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
L + + Y+ + G E K D + SL VS S L +++P+
Sbjct: 857 ---LILRHANDDMTIYEPFRIGGQEE--KED----LANSLFFKKVSNSHL-----AKSPV 902
Query: 898 DAYTREETPHGAPCQRITIFK---NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
+A E R+ + NI G+ FL G+ P + + + L +
Sbjct: 903 EAAEDEAVQE----NRVIPLRACDNIGGYSTVFLPGASPSFILKSSKSTPKVIGLQGLGV 958
Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEK 1013
+ H C GFIY S+G ++ Q P + V+K+P+ + +
Sbjct: 959 NGMSSFHTEGCERGFIYADSKGCARVTQFPDAANVAELGVSVRKVPIDTAVSHVAWHPNM 1018
Query: 1014 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY-EVRILEPDR 1072
+Y V+ L+P E+ D H + + ++E+ +++ P
Sbjct: 1019 EVY--AVASSKLEPF----------ELPKDDDYHKEWAKEECPMPPMKEHGSIKLYSPIT 1066
Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1131
W ++ E A+ ++ + L + TKE L A+GTA ++GED+ RGR+L+
Sbjct: 1067 ----WNVIDEFELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGRILV 1122
Query: 1132 FSTGRNADNPQNLVTE-----VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWT 1181
F P T+ + +E+ +GA+++L + QG +L+A G K ++ K
Sbjct: 1123 FDVVHVIPQPDRPETDRKLKLIAKEEIPRGAVTSLCEVGTQGLMLVAQGQKCMVRGLKED 1182
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1239
GT L +AF D YVV+++ ++N + L+ D + ++F+ + E+ ++ L K L
Sbjct: 1183 GTLLP-VAFLDMS-TYVVAVHELRNTGYCLMADANMGVWFVGYSEEPYRMTLFGKSGTQL 1240
Query: 1240 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFL 1298
C +FL+ G+ LS+V SDE + I + P+ S +G LL+RA F V H L
Sbjct: 1241 KCLTADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNRASFSVAPNHAWATL 1300
Query: 1299 RL------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
L L S TGAA ++T LL + G+I + P+ E +RRL SL
Sbjct: 1301 VLPRTTTRPYLPQSEPATGAAGSQNRTQ--TLLLASASGAIASLNPITEHAYRRLTSLTT 1358
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIA 1407
L +++PH AG+NP++ R +G A P D +IVD LL+ + L ++ E A
Sbjct: 1359 SLANALPHAAGMNPKAHRLPPQDGAARPPAVDVSAGRTIVDGALLARWNELGARQRAEAA 1418
Query: 1408 HQTG 1411
+ G
Sbjct: 1419 GKGG 1422
>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
10762]
Length = 1437
Score = 243 bits (620), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 343/1491 (23%), Positives = 585/1491 (39%), Gaps = 263/1491 (17%)
Query: 52 IGP-VPNLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
IGP NLVV ++++++ V R+ + + + + R L L+ Y L
Sbjct: 22 IGPQADNLVVAKTSLLQVFEVKRISQAKDNGHHDHADAQSR----------LSLIGEYTL 71
Query: 110 HGNVESLAILS-----QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
G V +L+ ++ GGA +++ AF+DAK+S++E+D + + S+H +E
Sbjct: 72 SGTVTALSPITLPSSRTGGA-------ALVCAFKDAKLSLIEWDPEHYRISTISIHYYEG 124
Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------- 211
L G ++ VDP RC + Q+ IL Q G
Sbjct: 125 DNVLLPPFGAALSECESILTVDPGSRCAALKFGERQLAILPFRQQGDELADEAAEDADMA 184
Query: 212 --------GLVGDEDTFGSGGGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIE 257
G V + T + S +SS V+ L LD + H F+H Y E
Sbjct: 185 EAESEEQPGNVTLKRTSTTQALDSKDDITPYKSSFVLPLITLDPSLTHPVHLAFLHEYRE 244
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P IL + + + + ++ + + S LP D +K++A+P P
Sbjct: 245 PTFGILSAPQQPSLALLDERKDCLSYTVFTLDLEQRASTNLMSVSKLPSDLWKVIALPPP 304
Query: 318 IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
+GG L+VG N IH + A+A+N +A + S +++L+ L +
Sbjct: 305 VGGALLVGTNELIHIDQSGKTTAVAVNEFAKVASNFSMADHSDLNMKLEGCEIEMLDSST 364
Query: 377 --ALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSD----ITTIGNSLFFLGS 427
AL+ G L+ GR V L +S+ TN + + + ++ F+GS
Sbjct: 365 GNALIVLNDGSFATLSFKMLGRTVGGLTVSRVADTNGGNVNASAPSCVASMQQQKLFVGS 424
Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD---------------APSTKRLRRSSS 472
G S LV++ + T + G APS ++R++S
Sbjct: 425 EDGSSSLVRWAKDTPTLSRKRSHAQMLGQDAPMDDADDAEELDEDDLYAPSAVAVKRAAS 484
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RINAD--- 527
A+ A T++F + DSL ++ P+ + G R +
Sbjct: 485 ----------------VANAAAVDASTTYTFELEDSLNSLAPMNNVCLGRSPRTGKEKLE 528
Query: 528 -ASATGISKQS-----NYELV-------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
+ G K S N E++ ++ G K IW+V +S G S+ D
Sbjct: 529 LVAGIGRGKASSLAFMNREIIPNEIRSRDVAGAKDIWSVCARSREGDKVSSA------DT 582
Query: 575 YHAYLIISLEARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
Y L + T + AD + E+ E+ D+ G T+ G L ++Q
Sbjct: 583 YDNLLFVFDGESTKTYKYADSAEGSIIELDET-DFEGDGETVCVGTLANGSCIVQCRRTE 641
Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENST---VLSVSIADPYVLLGMSDGSIRLLVGDP 687
R T D G S S E +++ S DPY+L+ D S+++L D
Sbjct: 642 IR-------TYDHQLGLSQIIPMSDDETDAELKIVATSFCDPYLLVIQDDSSVQILQVDK 694
Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
+ +P+ + E LR+ WL+ + G L
Sbjct: 695 -------------QGDVEPLDAA--------ESDLREGK---WLTGSL-------YAGEL 723
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
G + + + G L++F +P V++ ++ L S+
Sbjct: 724 SDGQSAAFLLGQEGGLQVFSLPETKLVYSAPTL---------PFLPPVL---------SA 765
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
+ +G K + + VV+L + + RP+L ++ Y+ + +
Sbjct: 766 DAPQRRGGKVTLTEVLVVDLGAEGVT----RPYLIVRTAMDDLILYEPFHY--------- 812
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREE----TPHGAPCQRITIFKNISGH 923
S + + A+ +LRF + P + + T G P Q I G
Sbjct: 813 --------SATTLDARATGFTDLRFRKVPFTYLPKYDEGLDTADGRPAQLQPAV--IGGR 862
Query: 924 QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
+L G P + + L L + +F+ LH C GF V G LK QL
Sbjct: 863 NALYLPGGTPSFLVKEATSLPKVLGLRARGVRSFSPLHRAGCQQGFALVDGDGKLKEYQL 922
Query: 984 PSGSTYDNYWPVQKIPLKATPH---QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
P ++ W V+ + L P Q+ Y ++ +Y V+ V L D +
Sbjct: 923 PGHVSFATGWSVRTLTLGEPPQEVRQVAYHEQRGIY-------VVATCRDVDFTLHDLDE 975
Query: 1041 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1099
+ D NL V +Y + +L + + ++ M +E +++++ L
Sbjct: 976 RQRDDEPNLKP-------QVPQYTLHLL----SATSHKVIQSLEMPYAEIVTSLKIMPLE 1024
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQNLVTE--VYSKELK 1154
+ T E + +L +G A +GED A+G + +F D+P++ + +E K
Sbjct: 1025 VSEHTHEQKLMLVVGAAAQRGEDAPAKGLLTVFDIIDVVPEPDDPESGIRLHIAAREETK 1084
Query: 1155 GAISALASLQGHLL-IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFIL 1209
GAI+AL S G L+ A G KI++ K GT L +AF DA Y+VSL + L
Sbjct: 1085 GAITALESFSGGLVGTAQGQKIMVRGLKEDGTCLP-VAFLDAQ-TYMVSLKTMGRSGLSL 1142
Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
GD K ++F W E+ +L LL K ++ + EFL L L+V D + ++ + Y
Sbjct: 1143 AGDAWKGLWFGGWTEEPYRLTLLGKSRTKMEVVSAEFLPFDGQLYLLVVDGKMDLHVLQY 1202
Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRL---------QMLATSSDRTGAAPGSDKTN 1320
P+ ++ GQ+LL ++ FH+G L L Q T+ D G G++ +
Sbjct: 1203 DPENPKTVSGQRLLHKSTFHLGHWPVDMLLLPSDLAPFAQQAPLTNGDSNGHTNGTESSA 1262
Query: 1321 R---------FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
F +L G++G I P+DE T+RRL +LQ +L + H AGLNPR++R
Sbjct: 1263 ANAPAPAPSLFHVLTTFQSGAVGLITPVDEATYRRLGALQTQLTSVLEHAAGLNPRAYRA 1322
Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
S R +VD L+ L + E+ + G + S+L
Sbjct: 1323 VESESLGGR----GVVDGMLVQRIGELGAARRAEVLGRAGADAWGLRSDLE 1369
>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1628
Score = 242 bits (618), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 156/546 (28%), Positives = 276/546 (50%), Gaps = 79/546 (14%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG----------------SIV 955
+RI F +ISG +G F+ G +P W + LR+H ++
Sbjct: 1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181
Query: 956 AFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN 1014
FT +N++C GFIY + + ++KIC L + ++N +++IP K + H+I Y +E
Sbjct: 1182 TFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSEAK 1241
Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1074
Y +IVS P +V ++ + + T +++++++++P
Sbjct: 1242 CYVVIVSFP---------------QVTQELQEDSKKPI-----LTDDKFQIKLIDP-TID 1280
Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTT---TKENETLLAIGTAYVQGEDVAARGRVLL 1131
W+ + +Q E L +++V+L T L IGTA+ GED +GRVL+
Sbjct: 1281 WNWKFIDSFSLQDRETVLAMKIVSLKFTEPDGITRARPFLVIGTAFTFGEDTQCKGRVLV 1340
Query: 1132 F------STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
F + + + + + +Y KE KG ++AL+S+ G LL+ GPK+ ++++ L
Sbjct: 1341 FEIVSHKTQFESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFYTGSL 1400
Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
++FYDA +Y+ S+ +KN+I++GD++KS+YFL WK+ LNLL+KD+ +L+ F+TE
Sbjct: 1401 VTLSFYDAQ-IYICSICTIKNYIVIGDMYKSVYFLQWKDNKT-LNLLSKDYQALNIFSTE 1458
Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
F+++ TLS++VSD KNI +F + P+ S GQ Q +
Sbjct: 1459 FIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSGQ------------------INQEING 1500
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
++ P ++ ++FGTLDG + + PLDE + +Q KL +P AGLN
Sbjct: 1501 NNKNDNRLPKKEQ----LVIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY-YLPQTAGLN 1555
Query: 1366 PRSFRQFHSNGKAHRPGPDS-------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
P+ +R F S + P + I+D +L+S + L E+ I++ +T +I+
Sbjct: 1556 PKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEII 1615
Query: 1419 SNLNDL 1424
+L D+
Sbjct: 1616 ESLKDV 1621
Score = 199 bits (505), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 156/585 (26%), Positives = 266/585 (45%), Gaps = 128/585 (21%)
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
+++++++VKDF F+HGY EP ++ LHE TW R++ K TC ++A+S++ K I
Sbjct: 281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W+ N P++ L++VP P+GG LV+ AN + Y +Q++ LA+N YA S+D+S +
Sbjct: 341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399
Query: 359 SFS----------VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
F LD ++ +L++D + S K G+L++ ++ DGR VQR+ +SK
Sbjct: 400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT------SMLSSGLKEE-------FG 455
SVLTS I + N+L FLGSRLGDSLL+Q+T S T S+ K++
Sbjct: 460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYTEKSITDDQLEHENFSNPYKKQKTSEVFDLF 519
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELS-------------LYGSASNNTESAQKTFS 502
D ++ + +++ Q+ + ++ L+ N +S Q
Sbjct: 520 DENSETNNNNNSNNNNNKENQEKSSSSSIASKLLEEIEDEEDQLFKEKKNQLKSYQ---- 575
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY-----ELV--------------- 542
+ D ++NIGP+ D G I+ T Q Y ELV
Sbjct: 576 LGICDQIINIGPIGDIVVGQSIDPTYDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQ 635
Query: 543 -----------ELPGCKGIWTVY------------------HKSSRGHNADSSRMAAY-- 571
ELPG +WTVY K SR N ++ +
Sbjct: 636 NNIKPELVMAFELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQE 695
Query: 572 ----------------DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAA 614
D +H YL +SL + T++ ET L EV + +++
Sbjct: 696 DNEDNEEEEEEEKMQKDKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDI 750
Query: 615 GNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
GNLFGR+R++ +++ G ++++G + Q++ N + S I DP++LL
Sbjct: 751 GNLFGRKRIVVIYQGGIKLINGFDRVIQEIQI------------NEPIKSSYICDPFILL 798
Query: 674 GMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
+G+I++ G D + + + + + S +L+ D+
Sbjct: 799 QFHNGTIQIFKGIDEENQLIQFSINSISNNLNQSIFSSSLFFDRN 843
Score = 91.3 bits (225), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 89/168 (52%), Gaps = 18/168 (10%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGISAA-------SLELVC 105
NLV+ NV++IY +R ++ E +S+ + ++ I+ SLEL+
Sbjct: 32 NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91
Query: 106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+L GN+ES+A + ++ RDS+IL F DAKISVL++D + I S+H FE
Sbjct: 92 EKKLFGNIESMASVRYPNSE----RDSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147
Query: 166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
E+ K GR F PL+KVD Q RC +L+Y + +L + S L
Sbjct: 148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSIL 192
>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit 1
[Botryotinia fuckeliana]
Length = 1447
Score = 241 bits (615), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 338/1519 (22%), Positives = 608/1519 (40%), Gaps = 243/1519 (15%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
NLVV +++++I+ + + E+ S +K+ RV D G+
Sbjct: 28 NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87
Query: 98 -------AASLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
L LV Y L G V SL I S+ G + +I++ F+DAK+S++E
Sbjct: 88 QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQGGSGLVGDEDTFGSG--------------GGFSARIESSHVINLRDLDMK-----HVK 247
Q + D D G G SS V+ L LD H++
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKAAAEDGQLDTPYGSSFVLRLSSLDPSIIFPIHLE 261
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLP 305
F++ Y EP IL + + + H T M+ L + K I S LP
Sbjct: 262 ---FLYEYREPTFGILSSTMAPSSALLQERRDHLTYMVFTLDMHQ--KASTTILSVGGLP 316
Query: 306 HDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
+D ++++ + P+GG L+VG N IH + +A+N +A L ++ + L
Sbjct: 317 YDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTGFSLLDQADLDLRL 376
Query: 365 DAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT---SDI 415
+ L +N L+ +GD+ +L+ DGR V L + + + ++LT S +
Sbjct: 377 EGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGAILTGAASCV 436
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
+++G F+GS + DS+++ + SG + + E D +
Sbjct: 437 SSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVDE---AMFDEEDLEDD 493
Query: 476 QDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSYG---LRINADASA 530
D + G+ ++ + +N T S KT ++F + DS+VNI P+ + ++G L + D
Sbjct: 494 DDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAFGEAALSLGKDEEL 553
Query: 531 TGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGHN 562
QS +LV +LP +GIWT+ K + +G
Sbjct: 554 KSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIWTMSAKRPAPKGLQ 613
Query: 563 ADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLTEVTESVDYF-VQG 609
+ + D +Y +I+S +A + E+A D E ++ G
Sbjct: 614 VNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGFEALTGTEFEPAAG 673
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
TI AG L RV+Q+ + R DG + Q L + E+G+ ++S S AD
Sbjct: 674 STIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA---EPKIISASFAD 728
Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
P++LL D SI + D ++ I S K ++ C LY D +D
Sbjct: 729 PFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD------YSGAFSD 781
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK---FVSGRT 785
+ S GE ++ + GAL I+ +P+ + V + FV
Sbjct: 782 SK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPVYVAEGICFVPPVL 827
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+ A +++ TEI L + P+L
Sbjct: 828 SADYAARKSAARETLTEI-----------------------LVANLGDSVSQSPYLILRP 864
Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
++ + Y+ + + S S L S + +++N ++ P + EE
Sbjct: 865 SNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHLTQAP--DVSAEEQ 910
Query: 906 PHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
GA P + I+ N+ G+ F+ G P + + + L + + +
Sbjct: 911 VDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKVLSLQGTGVRSLSS 967
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLYPL 1018
H C+ GFIY +++GI ++ Q P +T+ D ++KI + H + Y Y +
Sbjct: 968 FHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDVHAVAYHPPLQTYVI 1027
Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL-SSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
S D E+ D+ ++ ++E+ ++++ P W
Sbjct: 1028 GTST------------FTDFELPKDDDHRKTWQEENIALKPSIEKSFLKLVSPVN----W 1071
Query: 1078 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---S 1133
I ++ E ++ + L + T E + L+ +GTA +GED+A GR+ ++ +
Sbjct: 1072 SVIDAIELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDLATTGRLYVYDVVT 1131
Query: 1134 TGRNADNPQN------LVTEVYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGT 1183
D P+ + +E+ ++ G ++ L+ + QG +L+A G K ++ K GT
Sbjct: 1132 VVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQGQKCMVRGLKEDGT 1191
Query: 1184 ELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
L +AF D YV S+ + ++ D K ++F + E+ ++ L K ++
Sbjct: 1192 NLP-VAFMDMN-CYVTSVKELPGTGLCVMADALKGVWFAGYTEEPYRMLLFGKSAAKMEV 1249
Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH-------V 1294
+ L DG L +V +D N+ I Y P+ +S +G LL R F +GAH +
Sbjct: 1250 LCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTFSLGAHHPTTMTLL 1309
Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
L L T+ + + T + LL + G++ ++PL E +RR +L
Sbjct: 1310 PTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLLLTSRTGTLALLSPLTESQYRRFGTLVS 1369
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
L +++ H GLNPR++R + G +I+D +L + L + + E+A + G
Sbjct: 1370 HLTNTLYHPCGLNPRAYR-IDRDANEGIVGGRTIIDGGVLGRWMELGSQRRGEVAGRVGV 1428
Query: 1413 TRSQILSNLNDLALGTSFL 1431
++ L+ L G F+
Sbjct: 1429 DVLELRDELSGLRGGLGFI 1447
>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 2479]
Length = 1358
Score = 240 bits (613), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 359/1466 (24%), Positives = 599/1466 (40%), Gaps = 236/1466 (16%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---------G 95
E+P + +G NLVV + ++ +R EE + + ++ MD
Sbjct: 39 EVPDVKVVG---NLVVAGGQDLRVFEIR--EESTPLPDDESAVPKQEDMDVGDSFFDSAP 93
Query: 96 ISAAS--------LELVCHYRLHGNVESLAILSQGGADNS-RRRDSIILAFEDAKISVLE 146
I A L L+ + LHG V LA L D+S D ++++FE AK S +
Sbjct: 94 IERAPVRYKTTRRLHLLTRHTLHGVVTGLAGLRT--IDSSVDGLDRLLVSFEHAKWSRGD 151
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+ S+H +E + + + + + P+++ DP R + + + +L
Sbjct: 152 -------IATVSLHTYERCQQM-INGNFQGYV--PMLRSDPLSRLAILTLPEDALAVLPI 201
Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
Q S L +D+ S ++K++KDF+F+ G+ P + +L
Sbjct: 202 VQEQSELDAMQDSVSSP------------------EIKNIKDFLFLPGFHSPTIALLFAP 243
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
TWAGR T + +I T+ +PLI S LP D+ L+A PS +GGV+VV
Sbjct: 244 MNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITSVTGLPSDSQYLVACPSEVGGVVVVT 303
Query: 326 ANTIHYHSQSAS-CALALN---NYAVSL--DSSQELPRSSFSVELDAAHATWLQNDVALL 379
A+ I + QS + ++N NY ++ DSS E S + LD +HA ++ + LL
Sbjct: 304 ASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE----SQKLALDNSHAQFVTENDMLL 359
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-SDITTIGNSLFFLGSRLGDSLLVQFT 438
+TG++ + DGR V + + + + +V S + G+ F+GS GDSLL
Sbjct: 360 VLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPSTLVPAGSDGIFVGSVEGDSLLAMVE 419
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE---ELSLYGSASNNTE 495
S +EE P TK+ D +++ G + S
Sbjct: 420 KARDQSA-----QEE--------PETKQQEMDVDDWDEEVATGPVTVSVKAQDVLSGIGR 466
Query: 496 SAQKTFSFAVRD-------SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
A F AV D LV IG S G +N I+K+ +E +L
Sbjct: 467 IADMEFGIAVTDLGTRTYPQLVCIG---GGSQGSTMNVFRRGIPITKRRLFE--QLRTAV 521
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
W + + + NA + ++ + + E T + + +V E + F +
Sbjct: 522 ATWFLPVERA---NAPKFKDIPESEQSTIAIAATQEGSTQIFALS--TRKVQERIAEFPE 576
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES-GSGSENS---TVLSV 664
IA G R R++ V +LD SN+ G+ E S +++
Sbjct: 577 P-AIATGTWLRRTRIVLVLPSQVLLLD------------SNANPVGTICEMSDAPPIVAA 623
Query: 665 SIADPYVLLGMSDGSIRLLVGD-----------PSTCTVSVQTPAAI------------- 700
SIADPYVL+ +DGS+ + VGD P + V A +
Sbjct: 624 SIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMPEGLALPVCQAAEVFTDTTGIYRTFEA 683
Query: 701 -----ESSKKPVSSC----TLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
E KPV + H G E R + +S V + +G
Sbjct: 684 TQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQDSKPAISADVATTESAFNAA---RGT 739
Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
+ + +SG L+I +P+F+ V + + D+ + D +T EEG
Sbjct: 740 QWIALLAQSGELQIRSLPDFDLVLQSNG-------VYDS--EPSFTDDQTGELPELEEG- 789
Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
+ + M + + RP + + G + Y+A P T + D
Sbjct: 790 -----DEVSQMLFCPIGTRTL-----RPHVIVLHRSGRLNIYEAQ----PRFTVDARD-- 833
Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLS 929
+ RSL+V R R + T L + T T P P F +I G G F++
Sbjct: 834 QSRRSLAV------RFRKVH---TQLLSVTPSSTVKPAAIP------FTDIEGLTGAFIT 878
Query: 930 GSRPCWCMVFRERLRVHPQLCDG-SIVAFTVLHNVN-CNHGFIYVTSQGILKICQLPSGS 987
G RP W + HP G A+ + HG ++ + IC +P
Sbjct: 879 GERPHWIISSDS----HPIRAFGLKQAAYAFCKTTHQGGHGEYFLRIEDGSFICYMPPTL 934
Query: 988 TYDNYWPVQKIPLKATPHQITYFAEKNLY--PLIVSVPVLKPLNQVLSLLIDQEVGHQID 1045
D P + ++ T + + Y +SVP + ++ +L+ E +
Sbjct: 935 NTDFAMPCDRYKMERTYTHVAFDPPSCHYVAAAAMSVP-FQAYDEEGEILLGPEGPDLLP 993
Query: 1046 NHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1104
N SS++L ++ R+L+ +E L V VTL ++++
Sbjct: 994 PKNERSSIEL---FSAGSEPFRVLD------------GYDFDQNEEVLCVESVTLESSSS 1038
Query: 1105 KEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP---QNLVTEVYSKE-LKGAISA 1159
+A+GT GED A G V +F N + K+ + +SA
Sbjct: 1039 PTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEVVGTKPGVSNWRLKYRCKDPTRNPVSA 1098
Query: 1160 LASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
+A++ G+++ ++GPKI+ L G+AF D +YV S+ + KN IL+GD KS+
Sbjct: 1099 IANINGYIVHSNGPKILAKGLDYDDRLMGLAFLDVS-MYVTSIRVFKNLILVGDFVKSLI 1157
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
F S +E + + +D L A +FL+ ++ + +D+ N+++ + P +S
Sbjct: 1158 FASLQENPYKFVTIGRDLADLSLTAADFLVHEGQVTFITNDQHGNMRLVDFDPANPDSLN 1217
Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
G+KLL++ EF G VT + T+ + AP S L++ T DG+I +
Sbjct: 1218 GEKLLTQTEFGTGCPVTASCMIARRKTAEEE--FAPQSQ------LIYATADGAITSVVA 1269
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E F+RLQ +Q +LV + HVAGLNPR+FR N RP ++D LL+H+ +
Sbjct: 1270 VKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVR-NDLVPRPLARGVLDGGLLAHFALQ 1328
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDL 1424
PL Q E+ Q GT + S+L L
Sbjct: 1329 PLRRQREMMRQIGTDAVTVGSDLYTL 1354
>gi|116182170|ref|XP_001220934.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
gi|88186010|gb|EAQ93478.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
Length = 1394
Score = 238 bits (607), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 339/1379 (24%), Positives = 549/1379 (39%), Gaps = 242/1379 (17%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
NL V +++++I+ +V S+N+G R DG+ A+
Sbjct: 41 NLAVAKSSLLQIFRTKVIATELDTSQNNGHRTRNANRYESRLANDDDGLEASFLGGDSLA 100
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRR-RDSIILAFEDAKISVLEFDD 149
L LV + L G V L + N+R DS++LAF+DAK+S++E+D
Sbjct: 101 QRTDRANYTKLVLVAEFPLAGTVTGLVRIK---TPNARLGLDSLLLAFKDAKLSLVEWDT 157
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
H L S+H +E E + DP RC + + IL Q
Sbjct: 158 EHHTLSTVSIHYYEQEELQGSPWAAPLSHYANFLAADPGSRCAALKFGARNLAILPFKQA 217
Query: 210 GSGL-VGDEDTFGSGGGFSARIES----------------SHVINLRDLD--MKHVKDFI 250
+ +GD D G + + S S V+ L +LD + H
Sbjct: 218 DEDIDMGDWDEELDGPRPAKDLSSAVINGASNIEDTPYSPSFVLRLSNLDPSLLHPVHLA 277
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP IL H M+ L + K I S LP D ++
Sbjct: 278 FLHEYREPTFGILASTAAASNSLGRKDHFVYMVFTLDLQQ--KASTTILSVTGLPQDLFR 335
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
++ +P+P+GG L+VG+N IH +A+N S + +S ++ L+
Sbjct: 336 VVPLPAPVGGALLVGSNELIHIDQSGKPNGVAVNPMTKHCTSFGLVDQSDLNLRLEGCVI 395
Query: 370 TWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGN 420
L D+ L+ G + ++T+ DGR V L+L + + S++ ++T IG
Sbjct: 396 DVLAADLGELLIILNDGQMAVMTLRIDGRTVSGLELKILPASSGGSIVPGRVSTLSRIGR 455
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA------ 474
+ F G GDS+L FG + +R R+ +A
Sbjct: 456 NAMFAGLEEGDSVL-------------------FGWAKKQTQVGRRKPRTKDNAGDVDVE 496
Query: 475 ---LQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLR 523
+ +E LYG AS S V D L+N+GP++ +Y
Sbjct: 497 EDEDIEEEEEDEDDLYGEASAPQHQPVSAVSGLLSGEASLRVHDRLINLGPIQAMTYSQP 556
Query: 524 INADAS-----------------ATGISKQS-----NYEL-------VELPGCKGIWTVY 554
+ S A G K + N E+ E P +G WT+
Sbjct: 557 VWLPGSEEERNSAGVHSDLQLVCAVGREKSASLVTMNLEIQPKVIGRFEFPEARGFWTMC 616
Query: 555 HKSSRGHNADSSRMAAY-------DDEYHAYLIIS------LEARTMVLETADLLTEVTE 601
K S + + +Y ++I++ E + TA +
Sbjct: 617 AKKPIPKTLQSDKGGNFLGKDYDVSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLGG 676
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
+ G TI AG + R+IQV + R DG + ++Q + + E+G+
Sbjct: 677 TEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIVPM--LDEETGA---EPR 731
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
+S SIADP +L+ D S+ + D S ++ ++ K ++ C LY D
Sbjct: 732 AISASIADPLLLIIRDDSSVFVAQMDSSNELEELEKEDQTLATTKWLTGC-LYAD----- 785
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
+T A+ E + G G P I + SG+L I+ +P+ + V +
Sbjct: 786 -----TTGAF-----AEEVAGKGGKPAQA--ILVFLLSASGSLYIYRLPDLSKPVYVAEG 833
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
+S Y+ L + S+ +GT KE + + V +LA R H+
Sbjct: 834 LS--------YIPPGLS-----ADYSARKGTA---KETVAEILVADLA-NRSQLRHAN-- 874
Query: 841 LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
D TI YQ + + +TS D S++L +L N F+++P +A
Sbjct: 875 -----DDLTI--YQPFRY----STSAGAD---FSKTLFFQ-----KLPNAAFAKSPEEAD 915
Query: 901 TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTV 959
E T H + NI+G+ FL G+ P + + + RV P L ++A +
Sbjct: 916 EDEAT-HQPRMLSMRRCSNIAGYSTVFLPGASPSFIIKSSKSAPRVLP-LQGAGVIAMSP 973
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLYPL 1018
H C +GFIY SQ + ++ QLP Y + V+KIP+ + Y Y
Sbjct: 974 FHTEGCENGFIYADSQHMARVTQLPQDWNYAETGLAVRKIPIGEDIAAVAYHPPMQSY-- 1031
Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1078
+V L+P L D + + NLS TV+ ++++ P W
Sbjct: 1032 VVGCNTLEPFE----LPKDDDYHKEWARENLSF-----KPTVDRGILKLVSPIT----WT 1078
Query: 1079 TRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
++ M+ E L V ++L + T E + L+A+GTA ++GED+ RGRV ++
Sbjct: 1079 VVDSVQMEPCETVLCVATLSLEVSEFTNERKQLIAVGTALIKGEDLPTRGRVYVYDITEV 1138
Query: 1138 ADNPQNLVTEVYSKELK---------GAISALASL--QGHLLIASGPKIILH--KWTGTE 1184
P T SK+LK GA++AL+ + QG +L+A G K ++ K GT
Sbjct: 1139 IPEPGRPET---SKKLKLIAKEEIPRGAVTALSEIGTQGLMLVAQGQKCMVRGLKEDGTL 1195
Query: 1185 LNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
L +AF D YV + + LL D K ++F + E+ ++ L K L+
Sbjct: 1196 LP-VAFMDMN-CYVTNAKELPGTGLCLLADAFKGVWFTGYTEEPYKMMLFGKSSTKLEVL 1253
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFLRL 1300
+FL DG L +V D NI I + P+ +S +G LL R F+ GA H TK L L
Sbjct: 1254 NADFLPDGKDLFIVACDADGNIHILEFDPEHPKSLQGHLLLHRTTFNTGANHPTKSLLL 1312
>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
clavigera kw1407]
Length = 1461
Score = 238 bits (606), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 341/1432 (23%), Positives = 568/1432 (39%), Gaps = 210/1432 (14%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S + L LV + L G V LA + G + +++++A +DA++S+LE+D + L
Sbjct: 100 SISKLVLVAEFPLAGTVTGLARIKIPGTKSGG--EAVLVALKDARLSLLEWDPDQNDLTT 157
Query: 157 TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
S+H +E E + DP RC + + IL Q V
Sbjct: 158 ISIHYYEQEELQGAPWAAPLSDYANFLVADPGSRCAALKFGARNLAILPFRQADEEDVDM 217
Query: 217 ED-------------------TFGSGGGFS-ARIESSHVINLRDLD--MKHVKDFIFVHG 254
+D G G G S V+ L +LD + H F+H
Sbjct: 218 DDWDEELDGPRPAKDPSSAAVVSGPGDGIEDTPFAPSFVLRLSNLDTTLLHPVHLAFLHE 277
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y EP IL T A V + ++ K I S NLP D ++++ +
Sbjct: 278 YREPTFGILSSSVSTSA--VIGRRDKLSYLVFTLDLQQKASTTILSVANLPQDLFRVVPI 335
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
PSPIGG ++VGAN IH + +A+N + S +S ++ L+ L
Sbjct: 336 PSPIGGAILVGANELIHIDQSGRANGVAVNPFTKQSTSFGLADQSDLALRLEGCTVDVLS 395
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLS----KTNPSVLTSDITT---IGNSLFF 424
+ L+ G L +LT+ DGR V L + + V+ S IT IG + F
Sbjct: 396 AEAGELLIVLHDGQLAVLTIRVDGRTVSGLSVKMVRREAGGDVIQSGITCLSRIGRQMLF 455
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVNGEE 483
GS DS+++ ++ G + G D+ AD R + D + +
Sbjct: 456 AGSDQADSVVLGWSRKQGQTARRKPRANRAGLDLGADEEYFDDEREEGEELDDDEDDDDL 515
Query: 484 LSLYGSAS------NNTESAQKTFSFAVRDSLVNIGPLKDFSYG-------LRINADAS- 529
SA+ N T SF + D L++I P++D G L +D +
Sbjct: 516 YGDGPSAAQTLGIDNTTGRGGDDLSFRIHDRLLSIAPIRDMVIGKPALVGELAKRSDQAT 575
Query: 530 ---------ATGISKQSNYELV------------ELPGCKGIWTV-----YHKSSRGHNA 563
A G + L+ E + +WTV ++ +G
Sbjct: 576 IHSELNLVCAVGSGRAGALALLSREINPDPLGAFEFAEAQALWTVSSSKPIPRTIQGEKG 635
Query: 564 DSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQGRTIAA 614
++ Y+ + Y+I++ E ET+D+ + G T+ A
Sbjct: 636 GATVGEDYESPAMHDKYMIVAKEDDDG-FETSDVYAVTASGFETLKGTEFEPAAGFTVQA 694
Query: 615 GNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
G + RR+IQV + R DG ++Q L + +G+E VL SIADPY+LL
Sbjct: 695 GTMGRNRRIIQVLKSEVRCYDGDLGLSQILPM----VDEDTGAE-PRVLFASIADPYLLL 749
Query: 674 GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
D S+ + + ++ +S K V+ C LYHD KTS A+L +
Sbjct: 750 IRDDASVLVAEMNKDFELEELERDDGSLASTKWVAGC-LYHDTAS--VFSKTSILAFLLS 806
Query: 734 GVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS--GRTHIVDTY 791
SG I+ +P+ V + ++ R + D
Sbjct: 807 A-------------------------SGTFYIYALPDLKQPVYVAEGLNYVPRLFLPDHT 841
Query: 792 MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
+R + KE + + V +L A P+L + +
Sbjct: 842 VRRGMA------------------KEPLTEILVADLG----DAVSKAPYLIVRHANDDLT 879
Query: 852 CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGA 909
YQ P+ T SL + S L+ N F+++P+ + + ++
Sbjct: 880 IYQ---------------PLRTPSSLGSLSESLRFLKVPNPVFAKSPV-SISSDDASSQL 923
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
+ + +NI G+ FL GS + + + L ++ + + H + F
Sbjct: 924 RAMPLRVCENIGGYSTVFLPGSSASFVLKSAKSQPRVVSLQGTAVRSLSPFHTESSERSF 983
Query: 970 IYVTSQGILKICQLPSGSTYDNYWP-VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
IYV +G ++C +P+G +K+ L + + Y Y + S
Sbjct: 984 IYVDVEGSGRVCSMPAGWNLTELGVCARKVALDTDANALAYHPPTGTYAVGTSA------ 1037
Query: 1029 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1088
+ + + ++ H+ D + S+ R E + ++ P G W T T+ M+
Sbjct: 1038 --LEAFELPKDDPHRADWNKESTA--FRPL-AERGRLLLMSP----GSWSTIDTVEMEPY 1088
Query: 1089 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QN 1143
E + V+ + L + T E + L+A+GTA +GED+A RGRV +F P N
Sbjct: 1089 EVVMCVKTLNLEVSEATNERKQLVAVGTAISRGEDLAIRGRVYVFDVVSVIPEPGRPETN 1148
Query: 1144 LVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLY 1197
++ +KE +GA++A++ + QG +L+A G K ++ K GT L +AF D Y
Sbjct: 1149 RKLKLIAKEDIPRGAVTAVSEIGTQGLMLVAQGQKCLVRGLKEDGTLLP-VAFMDMN-CY 1206
Query: 1198 VVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
V S + ++ D K ++F + E+ ++ L K L + L DG L +
Sbjct: 1207 VTSAKELPGTGLCVMSDAFKGVWFTGYTEEPYKMILFGKSNTRLHALNVDLLPDGKELFI 1266
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF-LRLQMLATSSDRTGA-- 1312
VV+D N+ + + P+ +S +G LL RA F GAH + L L T +DR A
Sbjct: 1267 VVTDADGNLHVMQFDPEHPKSLQGHILLHRATFCTGAHFSTLSLLLPSTFTPADRPTANG 1326
Query: 1313 -------APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
P + + + LL G+ G + + PL E +RRL SL +L S+ AGLN
Sbjct: 1327 ETNGASSQPEAQQHQQHQLLLGSPTGLLASLVPLSESEYRRLSSLAGQLATSLTQTAGLN 1386
Query: 1366 PRSFRQFHSNGKAH-RPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
P+ +R + A PG D S+VD LL+ + L + EIA + G
Sbjct: 1387 PKGYRMTAGSAAATLAPGVDAAVGRSVVDGALLARWTELGSGRKGEIAGRVG 1438
>gi|9794904|gb|AAF98386.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 507
Score = 237 bits (604), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 162/497 (32%), Positives = 258/497 (51%), Gaps = 49/497 (9%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV ANV+++Y + E S+ K N E + M LE + Y L+GNV S
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
L +S GA RD+++++F+DAK+SVL+ D L+ S+H FE + ++ G
Sbjct: 83 LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
P V+VDP RC +LVYG ++++L + S L + + +R
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
I +S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309
Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
SL+SS + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
K SVLTS I + + FLGSRLG+SLL+ FT +++++
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
L++E ++E + +L + + A + EEL +YGS + + + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488
Query: 508 SLVNIGPLKDFSYGLRI 524
SL+N+ P+ G R+
Sbjct: 489 SLMNVAPINYMCAGERV 505
>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
var. asahii CBS 8904]
Length = 1339
Score = 234 bits (596), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 315/1251 (25%), Positives = 519/1251 (41%), Gaps = 183/1251 (14%)
Query: 242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWS 300
++K++KDF+F+ G+ P + +L TWAGR T + +I T+ +PLI S
Sbjct: 200 EIKNIKDFLFLPGFHSPTIALLFAPMNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITS 259
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALN---NYAVSL--DSSQE 354
LP D+ L+A PS +GGV+VV A+ I + QS + ++N NY ++ DSS E
Sbjct: 260 VTGLPSDSQYLVACPSEVGGVVVVTASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE 319
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-S 413
S + LD +HA ++ + LL +TG++ + DGR V + + + + +V S
Sbjct: 320 ----SQKLALDNSHAQFVTENDMLLVLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPS 375
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ G+ F+GS GDSLL S +EE P TK+ D
Sbjct: 376 TLVPAGSDGIFVGSVEGDSLLAMVEKARDQSA-----QEE--------PETKQQEMDVDD 422
Query: 474 ALQDMVNGE---ELSLYGSASNNTESAQKTFSFAVRD-------SLVNIGPLKDFSYGLR 523
+++ G + S A F AV D LV IG S G
Sbjct: 423 WDEEVATGPVTVSVKAQDVLSGIGRIADMEFGIAVTDLGTRTYPQLVCIG---GGSQGST 479
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
+N I+K+ +E +L W + + + NA + ++ + +
Sbjct: 480 MNVFRRGIPITKRRLFE--QLRTAVATWFLPVERA---NAPKFKDIPESEQSTIAIAATQ 534
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
E T + + +V E + F + IA G R R++ V +LD
Sbjct: 535 EGSTQIFALS--TRKVQERIAEFPEP-AIATGTWLRRTRIVLVLPSQVLLLD-------- 583
Query: 644 SFGPSNSES-GSGSENS---TVLSVSIADPYVLLGMSDGSIRLLVGD-----------PS 688
SN+ G+ E S +++ SIADPYVL+ +DGS+ + VGD P
Sbjct: 584 ----SNANPVGTICEMSDAPPIVAASIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMPE 639
Query: 689 TCTVSVQTPAAI------------------ESSKKPVSSC----TLYHDKGPEPWLRKTS 726
+ V A + E KPV + H G E R
Sbjct: 640 GLALPVCQAAEVFTDTTGIYRTFEATQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQD 698
Query: 727 TDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH 786
+ +S V + +G + + +SG L+I +P+F+ V +
Sbjct: 699 SKPAISADVATTESAFNAA---RGTQWIALLAQSGELQIRSLPDFDLVLQSNG------- 748
Query: 787 IVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILT 846
+ D+ + D +T EEG + + M + + RP + +
Sbjct: 749 VYDS--EPSFTDDQTGELPELEEG------DEVSQMLFCPIGTRTL-----RPHVIVLHR 795
Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET- 905
G + Y+A P T + D + RSL+V R R + T L + T T
Sbjct: 796 SGRLNIYEAQ----PRFTVDARD--QSRRSLAV------RFRKVH---TQLLSVTPSSTV 840
Query: 906 -PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG-SIVAFTVLHNV 963
P P F +I G G F++G RP W + HP G A+
Sbjct: 841 KPAAIP------FTDIEGLTGAFITGERPHWIISSDS----HPIRAFGLKQAAYAFCKTT 890
Query: 964 N-CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLY--PLIV 1020
+ HG ++ + IC +P D P + ++ T + + Y +
Sbjct: 891 HQGGHGEYFLRIEDGSFICYMPPTLNTDFAMPCDRYKMERTYTHVAFDPPSCHYVAAAAM 950
Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1079
SVP + ++ +L+ E + N SS++L ++ R+L+
Sbjct: 951 SVP-FQAYDEEGEILLGPEGPDLLPPKNERSSIEL---FSAGSEPFRVLD---------- 996
Query: 1080 RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1138
+E L V VTL ++++ +A+GT GED A G V +F
Sbjct: 997 --GYDFDQNEEVLCVESVTLESSSSPTGFRDFIAVGTGKNFGEDRATSGAVYVFEVVEVV 1054
Query: 1139 DNP---QNLVTEVYSKE-LKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDA 1193
N + K+ + +SA+A++ G+++ ++GPKI+ L G+AF D
Sbjct: 1055 GTKPGVSNWRLKYRCKDPTRNPVSAIANINGYIVHSNGPKILAKGLDYDDRLMGLAFLDV 1114
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
+YV S+ + KN IL+GD KS+ F S +E + + +D L A +FL+ +
Sbjct: 1115 S-MYVTSIRVFKNLILVGDFVKSLIFASLQENPYKFVTIGRDLADLSLTAADFLVHEGQV 1173
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
+ + +D+ N+++ + P +S G+KLL++ EF G VT + T+ + A
Sbjct: 1174 TFITNDQHGNMRLVDFDPANPDSLNGEKLLTQTEFGTGCPVTASCMIARRKTAEEE--FA 1231
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
P S L++ T DG+I + + E F+RLQ +Q +LV + HVAGLNPR+FR
Sbjct: 1232 PQSQ------LIYATADGAITSVVAVKEARFKRLQLVQDQLVRNAQHVAGLNPRAFRTVR 1285
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
N RP ++D LL+H+ + PL Q E+ Q GT + S+L L
Sbjct: 1286 -NDLVPRPLARGVLDGGLLAHFALQPLRRQREMMRQIGTDAVTVGSDLYTL 1335
>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
[Phytophthora sojae]
Length = 1752
Score = 233 bits (594), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 171/600 (28%), Positives = 289/600 (48%), Gaps = 93/600 (15%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNH 967
+T F N++ G F G+ P W + R + P +C + +++FT H+ NC +
Sbjct: 1159 LTTFYNVNNMSGAFFRGAHPMWILGDRGQPTFIP-MCSAAPKVSVPVLSFTPFHHWNCPN 1217
Query: 968 GFIYVTSQGILKICQLPSGSTY-----DNYWPVQKIPLKATPHQITYFA----------- 1011
GFIY S+G L++C+LPS T + +QK AT H + Y
Sbjct: 1218 GFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGNHGPGGVSEAL 1277
Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS-------------------- 1051
E Y ++ SV +KP + + + ++ + + NL +
Sbjct: 1278 EAPTYAVVCSVK-MKPTDAERATEV-EDADEEKEPENLDANGNPVGSNVMAPTAEMFPDF 1335
Query: 1052 -VDLHRTYTVEEYEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT----- 1103
+D E YE+R+++ + G W R + + E L+V+++ L++++
Sbjct: 1336 EIDQMAHTEEEVYELRLVQTNEFG-EWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEE 1394
Query: 1104 --------TKENETLLAIGTAYV--QGEDVAARGRVLLFS---------TGRNADNPQNL 1144
K+ L IGT +V GED + RGR+LL+ G + +
Sbjct: 1395 VASTSAEWNKKKRPYLVIGTGWVGPHGEDESGRGRLLLYELDYAQYVDEEGGSTSSKLPK 1454
Query: 1145 VTEVYSKELK-GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1203
+ V+ KE + GAIS++ L ++L A G K+I++++ +L G AFYDA +++V+LN+
Sbjct: 1455 LRLVFIKEHRQGAISSVVQLGPYVLAAVGSKLIVYEFKSEQLIGCAFYDAQ-MFIVTLNV 1513
Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
VK+F++ GD++KS++FL W+E QL LLAKD+ L ATEF + L+L+ D +N
Sbjct: 1514 VKDFVMYGDVYKSVHFLRWREMQRQLVLLAKDYEPLAVSATEFSVFEKKLALLAVDMDEN 1573
Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-----SDRTGAAPGSDK 1318
+ + +AP+ ES GQ+LL ++FH+G V R ++ + R AP S
Sbjct: 1574 LHVMQFAPQDIESRGGQRLLRVSDFHLGVQVASMFRKRVDGPGGHVAVNGRGPRAPPSYY 1633
Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
N + G +G +G + P+ E FRRL +LQ +V+++P LNPR FR +N +
Sbjct: 1634 VN----VMGNSEGGVGALIPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQR 1689
Query: 1379 HRPGPDS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
PD+ +D +L + L Q E+A GTT ++ NL ++ T+
Sbjct: 1690 RCGRPDAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVIHNLLEVQHATA 1749
Score = 159 bits (403), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 159/628 (25%), Positives = 262/628 (41%), Gaps = 143/628 (22%)
Query: 235 VINLRDLD-MKHVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+L+ M V D F+ GY+EP +++LHE + + GR++ T I+ +SI+
Sbjct: 277 LLRLRELEIMGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 336
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
+ HP IW+ NLP D +KL +P+GGV+V+ AN Y +Q+ LA N +A
Sbjct: 337 TRLHPKIWTVKNLPSDCFKLFPCRAPLGGVVVLSANAFLYFNQTQFHGLATNVFASKTVN 396
Query: 350 -------DSSQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
D+ E P + + L +L LL+ GD +L++ Y+ +
Sbjct: 397 QSVFPLSDAVYETPDHEMAQLHIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDTSSR 456
Query: 400 RL----DLSKTNPSVLTSDITTIG-----------NSLFFLGSRLGDSLLVQFTCGSGTS 444
L S + + L+ + G F+GSR GDS+L TS
Sbjct: 457 GLYGFGGASSSRNASLSLRMLRSGIQAHCLCVNEEKKTLFVGSRSGDSVLYALDQKKLTS 516
Query: 445 MLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA--------SN 492
K EE E + A ++ + ++L LYG+A S
Sbjct: 517 AGGEASKQQEDEEMLIKEEVVKEEVTAEVKAEPAEEEEEDEDDLFLYGAAPTKEEPTTSG 576
Query: 493 NTESAQKTFSFAVR----------------------DSLVNIGPLKDFSYGLRINADAS- 529
+TE+ T AV+ D L +IG + G+ NAD++
Sbjct: 577 STEAVNGTNGSAVKKEENGHAVEEESGPYDYVLHQIDVLPSIGQITSIELGIENNADSNE 636
Query: 530 -------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
+ G + ++ EL GC+ +WTV + R
Sbjct: 637 KREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR--- 693
Query: 571 YDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
Y+AYLI+S+ RTMVL T + + + + ++ G T+AA NLF ++R++Q+F++G
Sbjct: 694 ---SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIVQIFKQG 750
Query: 631 ARIL------------DGS----------------------YMTQDLSFGPSNSESGSGS 656
AR++ DG+ TQ+++ G
Sbjct: 751 ARVMMEVPDEETSNGNDGAEKTAKPEDEEVDDEDDGPKVKLVCTQEITLEGDVECGGMNV 810
Query: 657 ENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP------------AAIES 702
+ +TV +SV + DPY+LL ++DGS+RLL+GD ++V P ++
Sbjct: 811 DTATVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDMELTVIDPEIDYLDGVTESNGTADA 870
Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
SK SS L++D W +AW
Sbjct: 871 SKHGSSSACLFYD-----WAGMFRENAW 893
>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Piriformospora indica DSM 11827]
Length = 1324
Score = 233 bits (593), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 328/1432 (22%), Positives = 596/1432 (41%), Gaps = 210/1432 (14%)
Query: 55 VPNLVVTAANVIEIYVVR---VQEEGSKES--KNSGETKRRVLMDGISAASLELVCHYRL 109
V NLVV N + IY VR EE ES K+SG ++ + L LV + L
Sbjct: 36 VTNLVVGRNNRLRIYDVRRTIYTEETHVESDLKSSGPSRH--------SHRLCLVREHLL 87
Query: 110 HGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
HG + LA + D ++++F+D+K++++E+ ++++ + S+H +E L
Sbjct: 88 HGIIIGLAAVRTANPGLGSP-DRLLVSFQDSKLALMEWSNTLYDISTVSIHSYERSPLLL 146
Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSA 228
E A ++ DP RC +++ + +L Q + L V D G +
Sbjct: 147 NSDFTECRA---YLRTDPANRCAALVMPRDNIALLPWYQPQTELDVQD--------GIQS 195
Query: 229 RIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
E S+V N+ +D ++++ D +F+ G+ P + IL + + TW GR+
Sbjct: 196 IAEELPYSPSYVTNVSAMDERIRNILDLVFLPGFNVPTIAILFQEQRTWTGRLKENKDNT 255
Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCAL 340
+ +S+ + + +I + LP+D+ + + +GGVLVV AN+I H S L
Sbjct: 256 SLFFISLDLVSRSYQVIATIEKLPYDSLYMSPCHAKLGGVLVVTANSILHVDQASKITTL 315
Query: 341 ALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
++ +A + D+S + + L+ + ++ + +LS G + + + ++GR V
Sbjct: 316 PMSGWAARVSDTSHGFQDAVDDIHLEGSRMGYISDSQVILSLSNGKCLHIRIDHEGRTVW 375
Query: 400 RLDLSKT-----NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF 454
L T PSVL + + L FLGS GDS+L ++
Sbjct: 376 GLTAVHTFGISSPPSVLIAK-----DGLVFLGSTAGDSVLFEYA---------------- 414
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
QD+ + + L N +E+ +FS D+L + G
Sbjct: 415 ---------------------QDLSSHRDFML----PNASETIPTSFSLLPVDNLQDSGS 449
Query: 515 LKDFSY-GLRINADAS---ATGISKQSNYELV----------ELP---GCKGIWTVYHKS 557
S+ GLR + + + A G+ + V +LP G +GIW S
Sbjct: 450 YTAASFFGLRGSEEPALIAANGLDDLGGFSTVHKTMPLRLRKKLPAIAGRQGIW-----S 504
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIA 613
R H + + + ++S +A T + + T+ +D + R TIA
Sbjct: 505 MRVHQGNGIELPLGHNT-----LLSTDA-TPTPGASRIATKSQARLDINITTRIPMLTIA 558
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
F ++QV R+L T D S + + + + + +I DPYVL+
Sbjct: 559 VAPFFDGTHLLQVTSNSLRLL-----TTDGSEKQVIPDRDNSTARARIRHAAICDPYVLI 613
Query: 674 GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-----KGPEPWLRKTSTD 728
D ++ L VG+P+ + + + + K + T Y D K E +R+T
Sbjct: 614 LREDDTLGLFVGEPTRGKLRRKDMSPLGDKKLCYWAATFYDDLTGRLKIDEDLMRETK-- 671
Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
VG ++G+ + +C +G LEI+ +P VF V G +
Sbjct: 672 -----AVG-----------NRGEKWLALCRSTGTLEIWSLPKLALVF-VSSISLGPS--- 711
Query: 789 DTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
LK D + E++S+++ G + + + +L S H L +
Sbjct: 712 ------VLKHDQKKEVDSATKTELPVG-ATTLQQVIITDLGEIEPSPH-----LIVLYDS 759
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVS-ASRLRNLRFSRTPLDAYTREETP 906
++ YQ P K+ P RS+ +S R+ + + TP + T +
Sbjct: 760 NLLIVYQMV----PLEPDKAGLPQLDRRSVPSLRISFVKRMVHHLANPTPDENQTSGGSN 815
Query: 907 HGAPCQRITIFKNISGH----QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHN 962
+ I F + G F++G P W + +H ++ +FT
Sbjct: 816 EKRLPKTIVPFSVLDWEGNSIYGAFVTGDNPAWILSKNHSGLLHLPCGYEAVHSFTPCSM 875
Query: 963 VNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
+ + F+ T +G + Q G T+ +P K T I Y N L+V+
Sbjct: 876 WDFSPTFLMSTEEGSC-LVQWTPGITFHGQYPCSKTRKGRTQTNIAY---SNTTGLLVAA 931
Query: 1023 PVLKPLNQVLSLLIDQEVGH--QIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
N LL D+E + + D N+S L + + +L+P+ W T
Sbjct: 932 SS----NDRDFLLFDEEGTNSWEPDGVNVSLPKLGAS------ALELLDPET----WVTI 977
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
++E V V L +T+ N+ +A+GT+ +GED+A RG +F
Sbjct: 978 DGYEFAANEVVNIVESVKLETLSTQTGNKEFIAVGTSIHRGEDLAVRGGTYIFEIAEVIQ 1037
Query: 1140 NPQ------NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYD 1192
+ + + + + E KG ++A+ + G+L+ + G KI + + E L G+AF D
Sbjct: 1038 DTEERGRRRHRLKLLCKDEAKGPVTAVCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLD 1097
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
A +YV S+ +KN +++ D K ++F++++E +L +L+K+ +F +
Sbjct: 1098 AG-VYVTSIRCLKNLLVITDAIKGVWFVAFQEDPFKLVILSKEVRPTSIPQGDFFFAHND 1156
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
+ L+ D + +++ Y P ++ +G +LL EF +R+ M SS
Sbjct: 1157 MELLTIDLRGVLRLHSYDPTHVDTEEGARLLCSVEFQTHVEPVTIVRVAMEQPSS----- 1211
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
++ LL +DGS+ ++PLD F+RL LQ +LV H+A LNP+++R
Sbjct: 1212 ---DSASDASRLLIPRVDGSLASLSPLDMDIFKRLYLLQAQLVRHTHHIAALNPKAYRAV 1268
Query: 1373 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ R ++D LL ++ L + Q IA+Q G T ++ + L
Sbjct: 1269 QGSSTT-RTMSRRMLDFGLLVGFKKLSFDRQQGIANQIGETWETLIRDCTQL 1319
>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
Length = 1377
Score = 232 bits (592), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 326/1429 (22%), Positives = 585/1429 (40%), Gaps = 223/1429 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G + L + D+ +++++A +AK+S++E+D H + TS+H
Sbjct: 65 LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122
Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
+E + +++ + A P + VDP RC VL +G + + IL Q G LV
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180
Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
DE +G F SS V+ + L+ M H F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ + T + + + S ++ + + S LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300
Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
P+GG L++G+N +H + A+ +N +A S +S + L+ + L
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
+N LL G + +L+ DGR V + L + S+L + + + F
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
GS GDS+L+ ++ S + + G+I + D
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480
Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
M G + +S+ G+ N+ + F + D L N+GP++D + G + D
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533
Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
S +N ELV G G +VY K + +
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593
Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
S Y YL++S + +++V + E T++ ++ + RTI G L
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653
Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
RV+QV + R D G + Q + SE +V+ S ADPYVL+ D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
SI LL D S +T I S+ S +LY DK
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDKY-------------------R 747
Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
+ + +G P + Q D + + L +F +PN VFT +
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793
Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
D +I S+ +E I + V +L + P+L ++ + Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846
Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
Y + TS ++ S R + ++N + S + ++ + T P +
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895
Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
+ ++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954
Query: 975 QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
++++C+ P + +D W +KI L + Y + Y + + V +L
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKIGLGEQVDAVEYSSSSETYVIGTNQKV------DFNL 1008
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
D E+ + N +S + +++ V++L P W + ++++E + V
Sbjct: 1009 PEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMCV 1059
Query: 1095 RVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEV 1148
+ + L + T E + + +GTA +GED+AARG + +F + D P+ + +
Sbjct: 1060 KCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIFEVIKVVPEVDRPETNRKLKLI 1119
Query: 1149 YSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1204
+E+KGA+++L+ + QG L+ A G K I+ K G+ L +AF D YV L +
Sbjct: 1120 AKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKEL 1177
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
K G I ++ A++ + + D +FL D + L ++V+D+
Sbjct: 1178 KG----GAITNCLF-------SARMTVPSSD-------DADFLPDENRLYILVADDD--- 1216
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA- 1323
S KG +LL R+ F G + L ATSS + G D + +
Sbjct: 1217 --------YPGSSKGDRLLHRSTFQTGHFASTMTLLPRTATSSSQ-GPDADPDMMDLDSS 1267
Query: 1324 -----LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+L + GSI I P+ E ++RRL +LQ +L +++ H GLNPR+FR S+G
Sbjct: 1268 GPLHHVLVTSETGSIALITPVSETSYRRLSALQSQLANTLEHPCGLNPRAFRAVESDGIG 1327
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
R +VD +L+ + L + + EIA++ G +I ++L + G
Sbjct: 1328 GR----GMVDGDLVKRWLDLGTQRKAEIANRVGADVWEIRADLEAIGKG 1372
>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
42464]
Length = 1547
Score = 232 bits (592), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 325/1339 (24%), Positives = 529/1339 (39%), Gaps = 209/1339 (15%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR------------DSIILAFEDAK 141
D + L LV + L G V LA + A+ + DS+++AF DA+
Sbjct: 93 DRANTTKLVLVAEFPLAGTVTGLARIRTPKANRNHDGGAGHAGHAGHGCDSLLIAFRDAR 152
Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVL 195
+S++E+D H L S+H +E E + S PL + DP RC +
Sbjct: 153 LSLVEWDAEQHTLSTISIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALK 206
Query: 196 VYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN----------------- 237
+ IL Q + +GD D G + S+ V+N
Sbjct: 207 FGARNLAILPFRQADEDIDMGDWDEELDGPRPAKDPSSNAVVNGASNIEDTPYSPSFVLR 266
Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
L +LD + H F+H Y EP IL H M+ L + K
Sbjct: 267 LSNLDPSLLHPVHLAFLHEYREPTFGILASATAPSNALGRKDHLVYMVFTLDLQQ--KAS 324
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQE 354
I S LP D ++++ +P+P+GG L+VG+N IH +A+N +
Sbjct: 325 TTILSVSGLPQDLFRVVPLPAPVGGALLVGSNELIHVDQSGKPNGVAVNPMTRQCTNFGL 384
Query: 355 LPRSSFSVELDAAHATWLQNDVALLST--KTGDLVLLTVVYDGRVVQRLDLSKTNPSV-- 410
+ +S ++ L+ L D+ L G ++T DGR V L++ S
Sbjct: 385 VDQSDLNLRLEGCAIDVLTPDLGELFVVLNDGRAAVVTFRIDGRTVSGLEIKMLPESAGG 444
Query: 411 -----LTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGLKEEFGDIEADAP 462
S ++ IG + F G GDSLL+ + +G L + GD++A+
Sbjct: 445 SLIPGRVSTLSRIGRNAVFAGREEGDSLLLGWAKRQAQTGRRRLRARDAAGSGDVDAEG- 503
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-------------FSFAVRDSL 509
L D + + + +E ESA + SF V D L
Sbjct: 504 --AELAEGDEDVVAEGEDEDEDEEDEDDLYGEESAPRQQPVSAASSFLSGDVSFRVHDRL 561
Query: 510 VNIGPLKDFSY----------------GLRINADASAT-GISKQSNYELV---------- 542
+++ P++ +Y G+R + + T G K + V
Sbjct: 562 LSVAPIQALTYSQPVYLAGSEEERNSAGVRSDLNLVCTVGRDKSAALATVNLAIQPRVIG 621
Query: 543 --ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYD--DEYHAYLIIS------LEART 587
E P +G WTV KS +G A +S YD +Y ++I++ E
Sbjct: 622 RFEFPEARGFWTVCAKKPVPKSLQGDKAGNSLSKDYDTAGQYDRFMIVAKVDLDGYEKSD 681
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
+ TA + + G TI AG + R+IQ+ + R DG + + P
Sbjct: 682 VYALTAAGFEGLGGTEFDPAAGITIEAGTMGKGSRIIQILKSEVRCYDGDFGLSQIV--P 739
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
E +G+E V S SI DP++L+ D S + D S + +S K +
Sbjct: 740 MLDEE-TGAEPRAV-SASIVDPFLLIIRDDSSAFIAQVDSSNELEELDKEDPTLASTKWL 797
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
+ C LY D +T A+ G+ GG L Q + + SGAL I+
Sbjct: 798 TGC-LYAD----------TTGAFAEEAPGK------GGKLSQ-SVLMFLLSASGALHIYR 839
Query: 768 VPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVEL 827
+P+ + V + +S Y+ L + S+ +GT KE I + V +L
Sbjct: 840 LPDLSKPVYVAEGLS--------YIPPGLS-----ADYSARKGTA---KETIAEILVADL 883
Query: 828 AMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRL 887
H P L T+ + YQ + + NT + S++L +L
Sbjct: 884 G----DMTHKSPHLILRHTNDDLTLYQPFRY----NTGAG---LEFSKTLFF-----QKL 927
Query: 888 RNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
N F+++P +A E T H + N+ G+ FL G+ P + + + +
Sbjct: 928 PNTVFAKSPEEADDDEAT-HQPRFLSMRRCANVGGYSTVFLPGASPSFIIKSSKSVPKVL 986
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQ 1006
L ++A + H C HGFIY S+ + ++ QLP +Y V+KIP+
Sbjct: 987 PLQGTGVIAMSPFHTEGCEHGFIYADSRDMARVAQLPQDWSYAELGLAVRKIPIGEDIAA 1046
Query: 1007 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVR 1066
Y Y + + P + L D + + NL+ TV+ ++
Sbjct: 1047 AAYHPPMQSYVVGCNTP------EPFELPKDDDYHKEWARENLAF-----KPTVDRGNLK 1095
Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1125
++ P W +I M+ E L V + L + T E + L+A+GTA +GED+
Sbjct: 1096 LVSPIT----WTVVDSIQMEPCETVLCVECLGLEVSEFTNERKQLIAVGTAITKGEDLPT 1151
Query: 1126 RGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISALASL--QGHLLIASGPK 1174
RGRV ++ P T SK+LK GA++AL+ + QG +L+A G K
Sbjct: 1152 RGRVYVYDIADVIPQPGRPET---SKKLKLIAKEDIPRGAVTALSEIGTQGLMLVAQGQK 1208
Query: 1175 IILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLN 1230
++ K G+ L +AF D YV + + L+ D K ++F + E+ ++
Sbjct: 1209 CMVRGLKEDGSLLP-VAFMDM-SCYVTAAKELPGTGLCLMADAFKGVWFTGYTEEPYKMM 1266
Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
L K L+ +FL DG L +VVSD +I I + P+ +S +G LL R F+
Sbjct: 1267 LFGKSATRLEVLNADFLPDGKELFIVVSDADGHIHILQFDPEHPKSLQGHLLLHRTTFNT 1326
Query: 1291 GAHVTKFLRLQMLATSSDR 1309
GAH L + T +D+
Sbjct: 1327 GAHQPTKSLLLPVTTPADQ 1345
Score = 48.1 bits (113), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 49/106 (46%), Gaps = 18/106 (16%)
Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR------------- 1370
L+ G + + L E +RRL SL +L S+PH AGLNPR +R
Sbjct: 1418 LVLAAPTGVLAALRALPESAYRRLSSLAAQLAGSLPHAAGLNPRGYRLPDGVASSSSPWS 1477
Query: 1371 QFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
S+ A PG D +IVD LL + L + ++E+A + G
Sbjct: 1478 SSSSSFSAVVPGVDAGVGRTIVDGALLQRFTELGMARRVELAGRAG 1523
>gi|429851266|gb|ELA26469.1| protein cft1 [Colletotrichum gloeosporioides Nara gc5]
Length = 1411
Score = 231 bits (588), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 323/1345 (24%), Positives = 545/1345 (40%), Gaps = 193/1345 (14%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSR 128
Y R+ ++ ES G V D + L LV Y + G V LA + NS+
Sbjct: 66 YDRRLNDDDGLESSFLGGDGMLVRADRTNNTKLVLVAEYPIFGVVAGLARIK---IQNSK 122
Query: 129 RR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL----- 182
+++++A A++S++++D H L S+H +E E S GPL
Sbjct: 123 SGGEALLIATRVARLSLVQWDPEKHALEDVSIHFYEKEEL------EGSPFDGPLSNYPT 176
Query: 183 -VKVDPQGRCGGV---------LVYGLQMIILKASQGGSGLVGDED---------TFGSG 223
+ DP RC + L + L + + G T G+
Sbjct: 177 HLAADPGSRCAALRFGSRYIAFLPFKLNDEDIDMDDWDEDVDGPRPAKEPSATAATNGTS 236
Query: 224 GGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
+S+V+ L LD + H F+H Y EP I+ + H +
Sbjct: 237 NLADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSMQRRSNTLPRKDHFSY 296
Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
+ L + + I S NLP D +K++A+P PIGG L+VG N IH +
Sbjct: 297 KVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPIGGALLVGTNELIHIDQSGKPNGV 354
Query: 341 ALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVV 398
A+N + S +S + L+ + + +N L+ G L ++ DGR V
Sbjct: 355 AVNAFTKETTSFPLADQSELDLRLEHCYIEQMSPENGELLMVLSDGRLAIIAFKIDGRTV 414
Query: 399 QRLDL----SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
L + ++ +V+ S I+ + + FF+GS DSL+V G + +
Sbjct: 415 SGLSVRIVPAEAGGNVVQCGASSISRLSKNAFFIGSTGSDSLVV------GVTRKQTQNA 468
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS-LYGSASNNTESAQKTFSFAVRDSLV 510
+ + D+ + D D + GE + + S + N SF V DSL+
Sbjct: 469 RKKTRLVDDSFADDLEDEDIDDDDDDDLYGETTTTVQSSTAANGVPKGGEISFRVHDSLL 528
Query: 511 NIGPLKDFSYG--------------------LRINA-----DASATGISKQSNYELV--- 542
++ P+KD + G L++ A +A+A I Q+ V
Sbjct: 529 SLAPVKDMTTGKQAFIPESEDEKNSVGVVADLQLAAAVGKGNAAAIAIMNQNIQPKVIGK 588
Query: 543 -ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETA 593
E P +G WT V + D AA E+ A ++I+S + ET+
Sbjct: 589 FEFPEARGFWTMCVQKPIPKSLQGDKGANAAVGSEFDASSIYDKFMIVS-KVDLDGYETS 647
Query: 594 DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSF 645
D+ + F G T+ AG + R+IQV + R DG ++Q L
Sbjct: 648 DVYALTGAGFEAFTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQILPM 707
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
+ E+G+ V+S SIADPY+LL D SI + D + ++ S +
Sbjct: 708 --LDEETGA---EPRVVSASIADPYLLLVRDDASIMVAQIDNNNELEEMEKQDDTILSTQ 762
Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
++ C LY D +TGV I G P Q I+ + GAL I
Sbjct: 763 WLAGC-LYTD----------------TTGVFAPIQTDKGTPESQ-SIFMFLLSAVGALYI 804
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
+ +P+ + V +G T+ V ++ + + GT Q E + + V
Sbjct: 805 YALPDLSKPVYV---AAGMTY-VPPFL---------SADYAVRRGTVQ---ETLTEVLVA 848
Query: 826 ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
+L A S P+L + I Y+ E D +++L ++
Sbjct: 849 KLG----DATESSPYLILRHANDDITIYEPIRLE------SQDKSEGLAKTLHFQKIT-- 896
Query: 886 RLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
N +++P++ + E P P + NI+G+ FL G+ P + + +
Sbjct: 897 ---NPALAKSPVEVADDDANEQPRFVPLRPCA---NINGYSTVFLPGASPSFIIKSAKSA 950
Query: 944 RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKAT 1003
L + + H C GFIY S+G ++ QLP+ ++++ ++KIP+
Sbjct: 951 PKVLGLQGIGVRGMSSFHTEGCERGFIYADSEGHTRVTQLPADTSFELGVSIRKIPVGDA 1010
Query: 1004 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY-TVEE 1062
I Y Y + SV +P E+ D H + + T+ +E
Sbjct: 1011 IGLIAYHPPMETYAVACSVS--EPF----------ELPKDDDYHKEWAKETITTFPQMER 1058
Query: 1063 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1121
+++L P W T+ + E A+ ++ + L + TKE L+AIGTA +GE
Sbjct: 1059 GIIKLLSP----ATWSVIDTVELDPHEVAMCMKTLHLEVSEETKERRMLIAIGTAINRGE 1114
Query: 1122 DVAARGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGP 1173
D+ RGR+L++ P N ++ +KE +GA++AL + QG +L+A G
Sbjct: 1115 DLPIRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEIPRGAVTALCEVGSQGLMLVAQGQ 1174
Query: 1174 KIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQL 1229
K ++ K GT L +AF D YV S+ V+ + L+ D K ++F+ + E+ ++
Sbjct: 1175 KCMVRGLKEDGTLLP-VAFMDMS-CYVTSVREVRGTGYCLMADAFKGVWFVGYAEEPYKI 1232
Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
L K G + +FL+DG L +VV D+ I + + P+ +S +G LL+RA F
Sbjct: 1233 MLFGKSTGKFEVLTADFLVDGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFS 1292
Query: 1290 VGA-HVTKFLRLQMLATSSDRTGAA 1313
H T L L T++ A+
Sbjct: 1293 AAPNHPTATLSLPRTTTTAQSASAS 1317
>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
Length = 1361
Score = 231 bits (588), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 314/1393 (22%), Positives = 553/1393 (39%), Gaps = 249/1393 (17%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLR 155
S L LV Y L G + SL + NS+ D++++AF DAK+S++E+D ++H +
Sbjct: 49 SETKLVLVAEYNLAGTITSLGRVK---IPNSKSGGDAVLVAFRDAKLSLIEWDPALHSIS 105
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
S+H +E + + + + VDP RC + I+ Q L
Sbjct: 106 TLSIHYYEHHDLQSIPWQPDLSKCVSHLTVDPSSRCAAFNFGVSNLAIIPLHQVRDELAM 165
Query: 216 DE---------DTFGSGGGFSARIES-------SHVINLRDLD--MKHVKDFIFVHGYIE 257
DE + G + +S S V+ L LD + H D F+H Y +
Sbjct: 166 DEFDEVDGEVKERLSPDGQNENKHDSPDTPFKPSFVLPLTALDPGLLHPVDMAFLHEYRD 225
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + IL+ + + + + ++ K + S LP+D Y+++A+P P
Sbjct: 226 PTVGILYSTAARSSNMNHERRDVTIYAVYALDIGQKASTALQSVQKLPNDLYRVMALPPP 285
Query: 318 IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
+GG L++G N IH + A+A+N A S +++ ++L+ L N
Sbjct: 286 VGGALLIGGNELIHIDQSGKTIAIAVNELAKEASSFPMADHANYRLKLEGCQIEHLGNPS 345
Query: 377 A--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGS 427
L+ KTG+L LL+ DGR+V + L + ++ T +G++ F+GS
Sbjct: 346 GDMLVILKTGELALLSFRMDGRMVSSMALRRVGEGQSQGLALGASTCSTNLGSNRLFIGS 405
Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-----LQDMVNGE 482
DS+L+ G T+ L + R++ + DA ++ +
Sbjct: 406 EESDSILL--ATGRKTTQLRR--------------TNSRIQSQADDAGLFDDNEEDGIED 449
Query: 483 ELSLYGSASN----NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ-- 536
E LY ++ N + +F + D L +I P+ D + A + +++Q
Sbjct: 450 EDDLYAELADELNGNASTDVSGHNFRLLDRLPSIAPINDVALANVGKRRAEESEVTRQEL 509
Query: 537 ---------------------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
S ++ G+W + +RG A ++ +
Sbjct: 510 AVAYGRGHAGGLAFLSRKLEPSVTRQIKFERPIGVW-CFSSGNRGQQG------AEEENF 562
Query: 576 HAYLIISL-----EARTMVLETAD-LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
++IS RT +L D L + ES G I L IQV
Sbjct: 563 DDLVMISQTTDDGAGRTKLLRLIDGDLNSMGESEFDESAGAAIGVFKLEATNHTIQVLPT 622
Query: 630 GARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
R+ D + + F + E G + + + VS DPY+++ DGS+ LL D +
Sbjct: 623 ELRVYDAGFALSQI-FPIVDEEEG---QTARAVKVSFVDPYLVVVKDDGSMSLLKADKAG 678
Query: 690 CTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
V+ P + + + S TLY D TD T G
Sbjct: 679 ELDEVELPENLRAWS--ILSATLYQD-----------TDDMFQTSRFY------NGTATP 719
Query: 750 GDIYSVVCYESGALEIFDVPNFNC-VFTVDKFVSGRTHIV-DTYMREALKDSETEINSSS 807
G I +++ + G + +PN + VF D TH++ D + + ++
Sbjct: 720 GPILTILT-QDGHFCLLSLPNVSIQVFQCDSLPFLPTHLMQDLQLPKHWRN--------- 769
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
K+++ + + +L ++ +P+L G ++ Y+++
Sbjct: 770 --------KDDLGEVLLADLG----NSTDRQPYLVVRNLVGDVIIYESFAMP-------- 809
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
D + + R V +A L + EE + Q + N++GH F
Sbjct: 810 -DVLGSFRFKKVFTKAAGELED------------GEEVGQPSTLQPMQAVTNVAGHASVF 856
Query: 928 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 987
+ G +P M + +L + + +H C G + V + +K C +P +
Sbjct: 857 IPGRQPLLIMREASTMPRVYELNPTKLKSMNSVHTGTCRQGLVLVDADDEIKFCNIPDST 916
Query: 988 TYD-NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
+ W ++++PL + YFA + Y L N
Sbjct: 917 VLGLSDWVIRRVPLGQDITSVAYFAPTDSYILAT-------------------------N 951
Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT--IP--MQSSENALTV-------- 1094
H E ++ + D WQ AT +P +QSS L+
Sbjct: 952 HT--------------TEFQLPQDDEWHPEWQGEATKFLPSSIQSSLKLLSAKTHSIISQ 997
Query: 1095 -------RVVTL------FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NA 1138
RV+ L + T E + L+ +GTA V+GE+V RG + +F
Sbjct: 998 YSFDACERVLCLESLNLEVSEETHERKDLIVVGTAIVKGENVTTRGNLYIFDVVDVVPEP 1057
Query: 1139 DNPQ-NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILHKWT-GTELNGIAFYDA 1193
D P+ +L ++ +KE ++GA+SAL + QG LL A G K ++ + +AF D
Sbjct: 1058 DRPESDLKIKLITKEDVRGAVSALCDIGSQGFLLAAQGQKSMVRGLKEDMSILPVAFLDM 1117
Query: 1194 PPLYVVSLNIV-KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
V+ + +LGD ++ + + E+ +L +L +D A EFL DG
Sbjct: 1118 RYYVHVARELPGTGLCILGDAFSGLWLVGYSEEPYKLQILGRDLEDPPVLAAEFLPDGKQ 1177
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDR-- 1309
L ++ SD+ +++ Y P+ ++ +G KLL R+ FH GA TK + L +A+ R
Sbjct: 1178 LYIISSDDDGLLRVLQYDPENPKAERGTKLLLRSTFHSGAAPTKMILLPPQVASGRGRDP 1237
Query: 1310 -------TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP-HV 1361
+GA P + R +L T +GS+ + PL E T+RRL +LQ L+ ++ H
Sbjct: 1238 EIDMDVDSGAGPAA---GRHRILVTTQEGSLCMLTPLSEATYRRLSALQTTLLTTLDFHP 1294
Query: 1362 AGLNPRSFRQFHS 1374
LNPR++RQ +
Sbjct: 1295 CSLNPRAYRQVET 1307
>gi|58702050|gb|AAH90169.1| LOC564406 protein, partial [Danio rerio]
Length = 416
Score = 229 bits (584), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 129/342 (37%), Positives = 203/342 (59%), Gaps = 14/342 (4%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
++LN+ + P+ + LD + A+++ +D ++S K G++ +LT++ DG R
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMR 355
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
V+ K SVLT+ + T+ FLGSRLG+SLL+++T
Sbjct: 356 SVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT 397
>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
Length = 1431
Score = 229 bits (583), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 329/1387 (23%), Positives = 552/1387 (39%), Gaps = 182/1387 (13%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
R ++ ES G V D L L+ L G V LA + + +
Sbjct: 70 RANDDDGLESSFLGVESLIVRADPSHNTKLVLISEIPLAGTVIGLARVKI--KNTPSGGE 127
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++LA++ AK+ + E+D H L TS+H +E E L+ G V + DP
Sbjct: 128 ALLLAYKAAKMCLTEWDPQRHTLETTSIHYYEKDE---LQGAPWEMPFGDYVNYLEADPG 184
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
RC + IL +Q L D ED T G G G +
Sbjct: 185 SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPCPVKEEPPTTNGDGPGDHDLVKS 244
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
R S V+ L LD + H F+H Y EP IL + H T + L
Sbjct: 245 RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSPALGIKDHLTYKVFTL 304
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
+ + I S LP D ++++A+P+P+GG L+VG N IH +A+N+
Sbjct: 305 DLQQ--RASTTILSVTGLPQDLFRVIALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
A + S + +S + L+ L ND+ LL G L ++ DGR V ++ +
Sbjct: 363 AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIICFHIDGRTVSKISI 422
Query: 404 ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
++ +++ S ++ I G++ FLGS DS+++ ++ G K +
Sbjct: 423 RLVSAECGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476
Query: 457 IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
+ + D D + G + SL S + N S SF V+D+L++I P+
Sbjct: 477 LLDPDLALDVDDLDLDDDEDDDLYGNDSSLAKPSQTINGSSKPGEVSFRVQDTLLSIAPI 536
Query: 516 KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
+D + G D+ +SK EL + P
Sbjct: 537 RDVACGAPAFVPDSEEATLSKGVTAELELACAVGRGFSGSVAILNREIQPKVIGRFDFPE 596
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
+G WT+ K A + +Y Y+I++ E + TA +
Sbjct: 597 ARGFWTMCVKKPLSKGAAVASDYDTTAQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
++ G T+ AG + + R+IQV + R DG ++Q L P E +
Sbjct: 657 DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
T S SI DPY+LL D SI + + V P S K S C LY+D
Sbjct: 714 T--SASIVDPYLLLNRDDSSIFIAQIHSNNELEEVFKPDGTLKSTKWASGC-LYND---- 766
Query: 720 PWLRKTSTDAWLSTGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD 778
T + V + D AD I + +GAL ++ +P+ V
Sbjct: 767 -------TQGIFQSNVNKQKADAAD-------RIMMFLLSSAGALHVYALPD------VS 806
Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR 838
K + ++ EAL ++++ G KE+I + V +L A
Sbjct: 807 KPI---------FVAEALTSIPPFLSAAFVARKG-ASKESITEILVADLG----DAISQT 852
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P+L + Y+ P + D ++ L V+ S + P
Sbjct: 853 PYLIVRHASDDLTIYE------PVRCQEEGDAELSASLLFKKCVNTSLAKT-----APEV 901
Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
+ E P P +R N++G+ FL G+ P + + L + +
Sbjct: 902 SEDDAEPPRFVPLRRCA---NVNGYGAVFLPGASPSFVLKSSHSEPRVIGLQGLGVRGMS 958
Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLYP 1017
H C+ GFIYV +GI ++ QLPS +++ D V+KI L I+Y Y
Sbjct: 959 TFHTEGCDRGFIYVDVEGIARVTQLPSNASFTDLGVSVKKIALDGDVGMISYHHPTGTY- 1017
Query: 1018 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGP 1076
+V+ L+P E+ D H + + T ++++ P
Sbjct: 1018 -VVACTKLEPF----------ELPRDDDYHKEWAKETIKFPPTTARGILKLINPVT---- 1062
Query: 1077 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
W + ++ E+ +++ + L + TKE + L+A+GTA +GED+ RGRV +F
Sbjct: 1063 WTVIHELELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQVFDIV 1122
Query: 1136 RNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTEL 1185
P N ++ +KE +G ++AL+ + QG +L+A G K ++ K G+ L
Sbjct: 1123 TVIPEPGRPETNKRLKLIAKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKEDGSLL 1182
Query: 1186 NGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
+AF D +V S+ + ++ D+ K ++F + E+ +L K G L
Sbjct: 1183 P-VAFLDM-NCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGKSSGKLPLLV 1240
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
+FL DG LS+V D + ++ I + P+ +S +G LL R F VT L
Sbjct: 1241 ADFLPDGEDLSMVAVDAEGDMHILEFNPEHPKSLQGHLLLHRTSF----AVTPNTPTSTL 1296
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
+ + P + ++ LL G + ++PL E T+RRL S+ +L ++ G
Sbjct: 1297 LLPRTHSPSYPHASSSSHM-LLLACPSGQVAALSPLAESTYRRLLSVTNQLHPAIVAHCG 1355
Query: 1364 LNPRSFR 1370
L+ ++ R
Sbjct: 1356 LHTKAHR 1362
>gi|147799623|emb|CAN68460.1| hypothetical protein VITISV_027523 [Vitis vinifera]
Length = 558
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/175 (68%), Positives = 142/175 (81%), Gaps = 5/175 (2%)
Query: 424 FLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
F GS+LGDSLLVQFT S+ SS +++ GDIE B PS KR RRSSSDALQDMVNG++
Sbjct: 331 FEGSQLGDSLLVQFT-----SIPSSSVEKRVGDIEGBVPSAKRSRRSSSDALQDMVNGDK 385
Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE 543
L LYGSA N+TE++QKTFSF+V DSL+++GPLKDF+YGLRINAD ATGI KQ VE
Sbjct: 386 LPLYGSAPNSTETSQKTFSFSVNDSLIDVGPLKDFAYGLRINADLKATGIVKQKMITEVE 445
Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
LPGC+ IWTVYHK++RGHNADS++M DDEY AYLIIS E+RTMVLET +LL E
Sbjct: 446 LPGCERIWTVYHKNTRGHNADSTKMITKDDEYCAYLIISPESRTMVLETVELLGE 500
>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Hydra magnipapillata]
Length = 1122
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 193/664 (29%), Positives = 305/664 (45%), Gaps = 124/664 (18%)
Query: 57 NLVVTAANVIEIYVV----RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
NLV + +Y + V +G + SK ++D + LEL+ + L GN
Sbjct: 29 NLVTAGGQRLNVYRLCDADMVVSDGDQSSK---------IVDSVGKRRLELLASFTLFGN 79
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ ++ ++ G S RDS++LAF+ AK+S++EFD H L+ SMH FE+ E+ K
Sbjct: 80 IINMQVVRLG----SNVRDSLLLAFKHAKLSIVEFDPLSHDLKTDSMHYFENDEF---KG 132
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
G PLV+VDP+ RC +L+Y +++L + DE S G +
Sbjct: 133 GLSHNIYLPLVRVDPEQRCACMLIYNRHLVVLPFKHD---IKLDESEELSDGEHIKSVLP 189
Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I+L L+ + ++ + F+HGY +P ++ L E T GRV+ + T +SA+S++
Sbjct: 190 SYMIDLHSLEQPLLNITELQFLHGYHQPTLMFLFEPVQTSTGRVAVRQDTFCVSAISLNM 249
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
T K HP+IWS NLP D + L + PIGGVLV +N++ Y +QS + Y VSL+
Sbjct: 250 TEKVHPVIWSVTNLPFDCHMLRPIEKPIGGVLVFASNSLIYLNQS------IPPYGVSLN 303
Query: 351 SSQE----LP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
S E P + + L + + D +LS K G++ +L+++ DG R V+
Sbjct: 304 SITEGSTMFPLKIQEDVVITLAESSCDAIATDQFILSLKGGEIYVLSLLSDGLRTVRSFH 363
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
K SVL S + I + FLGSRLG+SLL+++T E D+
Sbjct: 364 FEKAAGSVLASCVCWIEHGFVFLGSRLGNSLLLRYT-------------------EKDSA 404
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
S +S ++ M G DSL+NIGP+ + G
Sbjct: 405 SIA--EKSKEAKVEKMYGGGVGGGIIVC----------------DSLLNIGPITKAALGE 446
Query: 523 RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
G S+Q + E+V ELPGC +WTV K
Sbjct: 447 PAFLSEEFFG-SRQIDLEMVCCSGYGKNGTLTVLQRSIRPQVVTTFELPGCVNMWTVCGK 505
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
SS+ + YH+YLI+S + TMVL+T +TE+ S + VQ TI A N
Sbjct: 506 SSKESV----------ENYHSYLILSRDDSTMVLKTGAEITELDNS-GFNVQQPTIFACN 554
Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
+ ++QV + +L+ + +S + + SI+DPYV++ S
Sbjct: 555 HLSNKYILQVCPQSIHLLEDTVQINSISL----------QDTIKITQCSISDPYVVMVDS 604
Query: 677 DGSI 680
G +
Sbjct: 605 TGQL 608
Score = 224 bits (570), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 215/380 (56%), Gaps = 18/380 (4%)
Query: 1060 VEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTA 1116
+E + V ++ P W+T + +Q E+ ++V+ L + + L +GT
Sbjct: 753 IERFVVSLISP----TSWETVPNSRTVLQEFEHVTCMKVLLLHSELVDIGLKQYLVVGTT 808
Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIAS 1171
+ GED+A +GR+L+F P +T+ VY KE KG ++A+ + G+++ A
Sbjct: 809 FNYGEDLACKGRILIFDVLEVVPEPGQPLTKTKCKCVYDKEQKGPVTAICATSGYIIAAV 868
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
G KI K+ +L G+AF D+ ++ V+L ++N I+ DI +SI + ++ + L L
Sbjct: 869 GQKIYAFKYKDNDLVGVAFVDSQ-VFTVNLMAIRNVIVAADISRSISLVRFQVEHKSLAL 927
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
+++D +L+ + +EF IDGS + VVSD ++NI IF Y P+ ES+ G +LL +A+ ++G
Sbjct: 928 VSRDTKTLEAYTSEFFIDGSQVGFVVSDAERNIVIFSYQPEALESFGGHRLLQKADINIG 987
Query: 1292 AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
+HV +R++++ D + S++ R ++ TLDGSIG + PL E FRRL LQ
Sbjct: 988 SHVNTMMRIKLI---QDEQSLSKSSEQ--RQLIILPTLDGSIGILFPLSEKPFRRLTMLQ 1042
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
KLVD +PH AGLNPR+FR + +I+D +LL Y L +E+ +IA + G
Sbjct: 1043 NKLVDCLPHKAGLNPRAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQERFDIAKKMG 1102
Query: 1412 TTRSQILSNLNDLALGTSFL 1431
TT QIL ++ D+ ++ L
Sbjct: 1103 TTSGQILDDMMDIERASNHL 1122
>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
Length = 1436
Score = 228 bits (580), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 338/1441 (23%), Positives = 565/1441 (39%), Gaps = 205/1441 (14%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
R ++ ES G V D + L LV L G V LA + GG
Sbjct: 68 RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124
Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
+++++A++ AK+ + +D L S+H +E E LH SF ++
Sbjct: 125 ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179
Query: 186 DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIES- 232
DP RC + IL Q L D+ +T G S +E
Sbjct: 180 DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETAAVANGDSDTVEPP 239
Query: 233 ---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
S V+ L LD + H F F+H Y EP IL + H T + L
Sbjct: 240 YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLD 299
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + I S +LP D +K+LA+P+P+GG L++G N IH + +A+N+ A
Sbjct: 300 LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357
Query: 347 VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S ++ ++ L+ ++N LL G + +++ + DGR V L +
Sbjct: 358 RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSIK 417
Query: 405 ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ +VL S +T +G + FF+GS +GDS+++ +T G K D
Sbjct: 418 MVTDENGGNVLKSRASTASKLGKNTFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ + D D+ E + + + N SF + D+L++I P+KD
Sbjct: 475 DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534
Query: 518 FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
+ G + +D A G K + ++ E P +
Sbjct: 535 LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETADLLT----- 597
G WT+ K S A DEY A Y+I++ + ET+D+
Sbjct: 595 GFWTMSVKKPLPKALGGS--AGVGDEYEAFGQHDKYMIVA-KVDLDGYETSDVYALTGAG 651
Query: 598 -EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGS 654
E + ++ G T+ AG + + R+IQV + R DG +TQ L + E+G+
Sbjct: 652 FETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA 709
Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
V S SI DPY+LL D S+ L D + V+ A + K + C LY
Sbjct: 710 ---EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYA 765
Query: 715 DKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-C 773
D T + D G ++ I + +GAL ++ +P+ +
Sbjct: 766 D-----------------TEGAFQFNANDKGETEK--IMMFLLSSTGALHVYALPDLSKP 806
Query: 774 VFTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V+ + H+ D +R L KE + + V +L
Sbjct: 807 VYVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG---- 844
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLR 888
P+L + Y+ P+ R SN+SA+ ++
Sbjct: 845 DTISQSPYLILRNQTDDLTIYE---------------PIHHVRPGGESNLSAALSFKKMS 889
Query: 889 NLRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
N+ + TP + E P P +R NI+G+ FL GS P + + + +
Sbjct: 890 NVTLATTPAQTEDDDVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVI 946
Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQ 1006
L I + H C+ GFIY +GI ++ Q PS + + V+K+PL +
Sbjct: 947 GLQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRG 1006
Query: 1007 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEV 1065
I Y Y I +P E+ D H + L T+ +
Sbjct: 1007 IAYHQPTGAY--IAGCMTSEPF----------ELPKDDDYHKEWAKETLSFPPTMPRGVL 1054
Query: 1066 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1124
+++ P W I ++S E+ ++ + L + TKE L+A+GTA +GED+
Sbjct: 1055 KLISPIT----WTVIHDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLP 1110
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKII 1176
RGRV ++ P T K + +G ++A++ + QG +L+A G K +
Sbjct: 1111 IRGRVHVYDIVTVIPEPGKPETNRRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCM 1170
Query: 1177 LH--KWTGTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLA 1233
+ K G+ L +AF D + + + L+ D K ++F + E+ +L
Sbjct: 1171 VRGLKEDGSLLP-VAFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLG 1229
Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
K G L +FL DG L++V +D ++ I + P+ +S +G LL R F V +
Sbjct: 1230 KSHGRLPVVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPN 1289
Query: 1294 -VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
+ L L S T P LL + G + + PL E +RRL S+
Sbjct: 1290 PPSTTLLLPRTTPPSHPTPQDP------PHVLLLASSSGHLSSLIPLPETAYRRLLSVTN 1343
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
+L+ ++ GLN ++ R G +IVD +L+ + L ++ EIA +
Sbjct: 1344 QLLPALTPHGGLNAKAHRLPVGTRTVGVEAAGGRAIVDGAVLARWAELSAAKRAEIAGKG 1403
Query: 1411 G 1411
G
Sbjct: 1404 G 1404
>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
206040]
Length = 1441
Score = 227 bits (579), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 334/1473 (22%), Positives = 578/1473 (39%), Gaps = 200/1473 (13%)
Query: 57 NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
NLVV ++++I+ V+ V ++ ES G +
Sbjct: 28 NLVVAKGSLLQIFTVKAISTELDPEFQPSQPTETETRFDRQVNDDDGLESSFLGGESMFM 87
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
D + L L+ L G V LA + + + ++++LA++ AK+ + E+D
Sbjct: 88 RTDRTNNTKLVLIAEIPLAGTVIGLARVKT--KNTASGGEALLLAYKAAKMCLAEWDPKK 145
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQGG 210
+ L S+H +E E + E F ++ DP RC + IL ++
Sbjct: 146 NELETISIHYYEKEE-MQGSPWEEVFGEYVNYLEADPGSRCAAFKFGTRNLAILPFTRSE 204
Query: 211 SGLVG---DED-------------TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFV 252
L DED G G A S V+ L LD + H F+
Sbjct: 205 EDLEMEDWDEDLDGPRPVKEHTAAANGDGNNVEAAYTPSFVLRLPLLDPSLLHPVHLTFL 264
Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
H Y EP +L + + S H + + L + + I S LPHD YK++
Sbjct: 265 HEYREPTFGVLSSSQAPASSLGSKDHLSYKVFTLDLQQ--RASTTILSVTGLPHDLYKVI 322
Query: 313 AVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
A+P+P+GG L+VG N IH +A+N A S +S ++ L++
Sbjct: 323 ALPAPVGGALLVGQNELIHVDQSGKPNGVAINPMAKLATSFNLTDQSDLNLRLESCAIEL 382
Query: 372 L--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI---GNSL 422
L +N LL G L +++ DGR V L + + +++ S +T I G +
Sbjct: 383 LAIENGELLLILNDGRLGIISFKIDGRTVSGLGVKLVGADCGGNIIKSRVTCISRLGKNA 442
Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
FFLGS DS+++ + S K D + + + +
Sbjct: 443 FFLGSETSDSVVLGW---SRKQTQEKRRKSRLIDTDLALDVDELDLEDDEEDDDLYGDDS 499
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------INAD- 527
+ +N SF + D+L++I P++D + G ++AD
Sbjct: 500 ATTKPNQTANGGTVKSGDISFRIHDTLLSIAPIQDITCGQSAFLPDSEEATLNKGVSADL 559
Query: 528 --ASATGISKQSNYELV------------ELPGCKGIWTVYHK----SSRGHNADSSRMA 569
A A G + + ++ E P +G WT+ K S G NA ++
Sbjct: 560 QLACAVGRGEAGSIAVINREIQPKVIGRFEFPEARGFWTMCVKKPVPKSLGTNAGAAGDY 619
Query: 570 AYDDEYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
++ ++I++ E + TA + E+ G T+ AG + + V
Sbjct: 620 DAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGNQMVV 679
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
IQV + R +G + Q L + +G+E V S SI DPY+L+ D S+ L
Sbjct: 680 IQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDASVFL 734
Query: 683 LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA 742
D + ++ + +S K + C LY D + GV +A G
Sbjct: 735 AQIDSNNEIEEIEKTDSGLTSTKWAAGC-LYKD----------------TKGVFQANQG- 776
Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSET 801
D ++ + +GAL I+ +P+ + V+ + S H+ ++ + +
Sbjct: 777 DQAKKSGEEVMMFLLNTAGALHIYALPDLSKPVYVAEGLSSIPPHLSADFVAKKV----- 831
Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
+E + + V +L H P+L + + Y+
Sbjct: 832 ------------ASREALTELVVADLG----DTVHYSPYLILRHSTDDLTIYEPIRL--- 872
Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
+D P SA+ + PL+ ++ P P + I N+
Sbjct: 873 ----PTDSPTRNLSDTLFFKKSANSILAKSTVEDPLEDTAQQ--PRYVP---LRICANVG 923
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G+ FL G P + + + + + + + + C+ GFIY S+GI ++
Sbjct: 924 GYSTVFLPGPSPAFILKSSKSVPRVVGVQGLGVRGMSTFNTEGCDRGFIYSDSEGIARVT 983
Query: 982 QLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1040
QLPS + + V+K+PL + Y Y +V + L D +
Sbjct: 984 QLPSKTNFTELGVSVKKVPLGNDVRHVAYHHPTETYIAGCAV------TEGFELPKDDDY 1037
Query: 1041 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1099
+ +LS H + ++++ P W +I M+ E+ ++ + L
Sbjct: 1038 HKEWAKESLS---FHPSTV--RGSLKLISPVT----WTVIHSIDMEPGESIECMKTLHLE 1088
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE--L 1153
+ TKE LLA+GTA +GED+ RGRV ++ P N ++ +KE
Sbjct: 1089 VSEETKERRMLLAVGTALTRGEDLPTRGRVQVYDIVTVIPEPGKPETNKKLKLLAKEEIP 1148
Query: 1154 KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVK--NF 1207
+G ++AL+ + QG +L+A G K ++ K G+ L +AF D +V S +
Sbjct: 1149 RGGVTALSEIGTQGLMLMAQGQKCMVRGLKEDGSLLP-VAFLDM-SCHVASARELPGTGL 1206
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
L+ D K ++F + E+ +L K GSL +FL DG LS+V D +I +
Sbjct: 1207 CLIADAFKGLWFAGYTEEPYTFKVLGKSSGSLPLLVADFLPDGEDLSMVAVDADGDIHVL 1266
Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF 1326
+ P+ +S +G LL R F V + T L L +S A+ S LL
Sbjct: 1267 EFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATASQDSSTPQPHLLLL 1326
Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDS-VPHVAGLNPRSFRQFHSNGKAHR----- 1380
+ GS+ + PL E +RRL S+ +L+ + VPH GL+ R+ R G R
Sbjct: 1327 ASPSGSLAALTPLPESAYRRLLSVTNQLLPALVPH-GGLHARAHRAPEGGGGMSRMVGVE 1385
Query: 1381 --PGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
+IVD +L+ + L ++ E+A + G
Sbjct: 1386 TAASGRAIVDGAILTRWNELGAAKRAEVASRGG 1418
>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
Length = 1440
Score = 226 bits (577), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 339/1477 (22%), Positives = 591/1477 (40%), Gaps = 209/1477 (14%)
Query: 57 NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
NLVV ++++I+ V+ V ++ ES G +
Sbjct: 28 NLVVAKGSLLQIFTVKSISTELDPEFQPNQPAEVDTRFDRQVNDDDGLESSFLGGETMFM 87
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
D + L L+ L G V LA L + + ++LA++ AK+ + ++D
Sbjct: 88 RTDRTNNTKLVLIAEIPLAGTVIGLARLKTN--KTASGGEVLLLAYKAAKMCLAQWDPKK 145
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMIILKASQ 208
+ L S+H +E E L E F G V + DP RC + IL +
Sbjct: 146 NELETISIHYYEKEE-LQGSPWEEVF--GEYVNHLEADPGSRCAAFKFGTRNLAILPFRR 202
Query: 209 GGSGLVG---DEDTFG---------SGGGFSARIESSHV------INLRDLDMKHVKDFI 250
L DED G + G S +E+++ + L D + H
Sbjct: 203 SEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTPSFVLRLPLLDPSLLHPVHLT 262
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP +L + A + K H ++ + I S LPHD YK
Sbjct: 263 FLHEYREPTFGVLSSSQAP-AASLGLKDHLSY-KVFTLDLQQRASTTILSVTGLPHDLYK 320
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AA 367
++A+P+P+GG L+VG N IH +A+N A + S ++ ++ L+ A
Sbjct: 321 VIALPAPVGGALLVGQNELIHVDQSGKPNGVAVNPMAKLVTSFSLTDQADLNLRLENCAI 380
Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSD---ITTIGN 420
++N LL G L +++ DGR V RL + +V+ S I+ +G
Sbjct: 381 ELLAVENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVGADCGGNVIKSRAACISRLGK 440
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD-APSTKRLRRSSSDALQDMV 479
+ FF+GS GDS+++ ++ + + + I+ D A L + D+
Sbjct: 441 NTFFVGSETGDSVVLGWS-----RRQTQEKRRKSRLIDPDLALEVDELDLEDDEEDDDLY 495
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------IN 525
+ + +N + SF + D L++I P++D + G ++
Sbjct: 496 GDDSAATKPQTTNGGAAKSGDLSFRIHDVLLSIAPIQDITCGQAACLPDSEEATLIKGVS 555
Query: 526 AD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
+D A A G + + ++ E P +G WT+ K + S+ A
Sbjct: 556 SDLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFWTMCVKKPVPKSLGSNVGVA 615
Query: 571 YDDE---YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ-------GRTIAAGNLFGR 620
D + H +I + ET+D+ + + G T+ AG + +
Sbjct: 616 GDYDAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGKQ 675
Query: 621 RRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
VIQV + R +G + Q L + +G+E V S SI DPY+L+ DGS
Sbjct: 676 MMVIQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDGS 730
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
+ L D + ++ +S K V+ C LY D + GV ++
Sbjct: 731 VFLAQIDSNNEIEEMEKADGGLTSTKWVAGC-LYKD----------------TKGVFQSN 773
Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--REAL 796
+ G D+G + + +GAL I+ +P+ + V+ + S H+ ++ R A
Sbjct: 774 LNSAAGKADEG-VMMFLLNSAGALHIYSLPDLSKAVYIAEGLSSIPPHLSAGFVARRGAT 832
Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+++ TEI V +L + HS P+L + + Y+
Sbjct: 833 RETLTEI-------------------VVADLG----DSVHSSPYLILRHSTDDLTIYEPI 869
Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
T D + +S + S+++ S + + P D + P P +
Sbjct: 870 RLPTASATHALSDTLFFKKSAN-SSLAKSAVED------PSD--DTAQPPRYVPLRTCA- 919
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
N+ G+ FL G P + + + + L + + H C+ GFIY S+G
Sbjct: 920 --NVGGYSAVFLPGPSPAFIIKSSKSIPRVVGLQGLGVRGMSTFHTEGCDRGFIYADSEG 977
Query: 977 ILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
I ++ QLPS + V+K+PL + Y Y ++ + L
Sbjct: 978 IARVTQLPSKTNLTELGVSVKKVPLGHDIRHVAYHHPTETYIAGCTI------TENFELP 1031
Query: 1036 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1095
D + + +LS + ++ ++++ P W +I M+ E+ ++
Sbjct: 1032 KDDDYHKEWARESLSFLP-----SMARGALKLINPIT----WTVIHSIDMEPGESIECMK 1082
Query: 1096 VVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNLVTEVYS 1150
+ L + TKE LLA+GTA +GED+ RGRV ++ P N ++ +
Sbjct: 1083 TLHLEVSEETKERRMLLAVGTALTRGEDLPTRGRVQVYDIVTVIPEPGKPETNKRLKLLA 1142
Query: 1151 KE--LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV 1204
KE +G ++AL+ + QG +L+A G K ++ K G+ L +AF D + +
Sbjct: 1143 KEEIPRGGVTALSEIGTQGLMLVAQGQKCMVRGLKEDGSLLP-VAFLDMSCHVSTARELP 1201
Query: 1205 -KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
L+ D K ++F + E+ +L K GSL +FL DG LS+V D +
Sbjct: 1202 GTGLCLIADAFKGLWFAGYTEEPYTFKVLGKSSGSLPLLVADFLPDGEDLSMVAVDADGD 1261
Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAH-VTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
I + + P+ +S +G LL R F V + T L L +S +P S +
Sbjct: 1262 IHVLEFNPEHPKSLQGHLLLHRTTFSVTPNPPTSTLLLPRTLPASQSATTSPDSSSSQPH 1321
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS-VPHVAGLNPRSFRQFHSNGKAHR- 1380
LL + G + + PL E +RRL S+ +L+ + VPH GL+ R+ R G R
Sbjct: 1322 LLLLASPSGCLASLTPLPESAYRRLLSVTNQLLPALVPH-GGLHARAHRTPEGGGGMSRT 1380
Query: 1381 ------PGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
+IVD +L+ + L ++ E+A + G
Sbjct: 1381 VGVETAASGRAIVDGAILARWNELGAAKRAEVATRGG 1417
>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
Length = 1427
Score = 226 bits (576), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 335/1440 (23%), Positives = 560/1440 (38%), Gaps = 203/1440 (14%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
R ++ ES G V D + L LV L G V LA + GG
Sbjct: 68 RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124
Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
+++++A++ AK+ + +D L S+H +E E LH SF ++
Sbjct: 125 ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179
Query: 186 DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
DP RC + IL Q L D+ +T G S +E
Sbjct: 180 DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETATVANGDSDTVEPP 239
Query: 234 HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
+ + L D + H F F+H Y EP IL + H T + L
Sbjct: 240 YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQEPAHSLGQKDHLTYKVFTLD 299
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + I S +LP D +K+LA+P+P+GG L++G N IH + +A+N+ A
Sbjct: 300 LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357
Query: 347 VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S ++ ++ L+ ++N LL G + +++ + DGR V L +
Sbjct: 358 RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSVK 417
Query: 405 ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ +VL S +T +G + FF+GS +GDS+++ +T G K D
Sbjct: 418 MVTDENGGNVLKSRASTASKLGKNAFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
+ + D D+ E + + + N SF + D+L++I P+KD
Sbjct: 475 DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534
Query: 518 FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
+ G + +D A G K + ++ E P +
Sbjct: 535 LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEY-----HAYLIISLEARTMVLETADLLT------ 597
G WT+ K S A DEY H +I + ET+D+
Sbjct: 595 GFWTMSVKKPLPKALGGS--AGVGDEYETFGQHDKYMIVAKVDLDGYETSDVYALTGAGF 652
Query: 598 EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E + ++ G T+ AG + + R+IQV + R DG +TQ L + E+G+
Sbjct: 653 ETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA- 709
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
V S SI DPY+LL D S+ L D + V+ A + K + C LY D
Sbjct: 710 --EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYAD 766
Query: 716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CV 774
T + +D G ++ I + +GAL ++ +P+ + V
Sbjct: 767 -----------------TKGAFQLSASDKGETEK--IMMFLLSSTGALHVYALPDLSKPV 807
Query: 775 FTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
+ + H+ D +R L KE + + V +L
Sbjct: 808 YVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG----D 845
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLRN 889
P+L + Y+ P+ R SN+SA+ + N
Sbjct: 846 TISQSPYLILRNQTDDLTIYE---------------PIRHVRPGGESNLSAALSFKKTSN 890
Query: 890 LRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
+ + TP E E P P +R NI+G+ FL GS P + + + +
Sbjct: 891 VTLATTPAQTEDDEVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVIG 947
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQI 1007
L I + H C+ GFIY +GI ++ Q PS + + V+K+PL + I
Sbjct: 948 LQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRGI 1007
Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVR 1066
Y Y I +P E+ D H + L T+ ++
Sbjct: 1008 AYHQPTGAY--IAGCMTSEPF----------ELPKDDDYHKEWAKETLSFPPTMPRGILK 1055
Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1125
++ P W I ++S E+ ++ + L + TKE L+A+GTA +GED+
Sbjct: 1056 LISPIT----WTVIHDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLPI 1111
Query: 1126 RGRVLLFSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIIL 1177
RGRV ++ P T K + +G ++A++ + QG +L+A G K ++
Sbjct: 1112 RGRVHVYDIVTVIPEPGKPETNRRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMV 1171
Query: 1178 H--KWTGTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
K G+ L +AF D + + + L+ D K ++F + E+ +L K
Sbjct: 1172 RGLKEDGSLLP-VAFLDMSCHVSSARELSRTGLCLMADAFKGVWFAGYTEEPYTFKVLGK 1230
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH- 1293
G L +FL DG L++V +D ++ I + P+ +S +G LL R F V +
Sbjct: 1231 SHGRLPVVVADFLPDGDDLAIVAADVDGDLHILEFNPEHPKSLQGHLLLHRTSFSVSPNP 1290
Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
+ L L S T P LL + G + + PL E +RRL S+ +
Sbjct: 1291 PSTTLLLPRTTPPSHPTPQDP------PHVLLLASSSGHLSSLIPLPETAYRRLLSVTNQ 1344
Query: 1354 LVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
L+ ++ GLN ++ R G +IVD +L+ + L ++ EIA + G
Sbjct: 1345 LLPALTPHGGLNAKAHRLPVGTRTVGVEAAGGRAIVDGAVLARWAELSAAKRAEIAGKGG 1404
>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Ornithorhynchus anatinus]
Length = 831
Score = 225 bits (573), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 215/738 (29%), Positives = 340/738 (46%), Gaps = 141/738 (19%)
Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I++R LD K ++ D F+HGY EP ++IL+E TW GRV+ + TC I A+S++
Sbjct: 8 SYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILYEPNQTWPGRVAVRQDTCSIVAISLNI 67
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS + Y VSL+
Sbjct: 68 LQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQS------VPPYGVSLN 121
Query: 351 S----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
S + P R + LD A A ++ D ++S K G++ +LT++ DG R V+
Sbjct: 122 SLTAGTTAFPLRLREGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFH 181
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA- 461
K SVLT+ + T+ FLGSRLG+SLL+++T S +E D AD
Sbjct: 182 FDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPAGSA-REPARDSGADKQ 240
Query: 462 -PSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
P K+ R + A QD V+ E+ +YGS A + T+ A T+SF V DS++NI
Sbjct: 241 EPPVKKKRVEQALSWAGGKSAAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 296
Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
GP + + G L I + + + + K ++V ELPGC
Sbjct: 297 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 356
Query: 548 KGIWTVYHK-------SSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLT 597
+WTV S +G A+S D + H +LI+S E TM+L+T +
Sbjct: 357 YDMWTVIAPVRKEEGDSPKGEGAESEPTPPEPEDDGKRHGFLILSREDSTMILQTGQEIM 416
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
E+ S + QG T+ AGN+ R ++QV G R+L+G L F P +
Sbjct: 417 ELDTS-GFATQGPTVYAGNIGDDRYIVQVSPLGLRLLEG---VNQLHFIPVDL------- 465
Query: 658 NSTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPVSSCTL 712
S ++ ++ADPYV++ ++G + LL D T +++ P + S K ++ C +
Sbjct: 466 GSPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRTHRLALHKP-PLHSQSKVIALC-V 523
Query: 713 YHD-----------KGP--EPWLRKTSTDAWLSTGVGEAIDGADG--------------- 744
Y D GP +P LR S L + +D +
Sbjct: 524 YRDVSGMFTTESRASGPRDDPSLRGQSEAEPLLQELSHTVDDEEEMLYGDSSSLFSPSRD 583
Query: 745 -------GPLDQGDI--------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
P D+ + V+ ++GA+EI+ +P + VF V F G+ +VD
Sbjct: 584 EPRRSSLPPADRDAPQYRAEPTHWCVLVRDNGAMEIYQLPEWRLVFLVKNFPMGQRVLVD 643
Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAI----- 844
+ + S + + EE QG + + +V L +RP+L +
Sbjct: 644 SSFGQPAA-SAAQAEAKKEEPARQGELPLVKEVLLVALG-----NRQTRPYLLRLKWAIR 697
Query: 845 ---LTDGTILCYQAYLFE 859
LT T + Q Y+ +
Sbjct: 698 DSELTSITFIDMQLYIHQ 715
Score = 107 bits (267), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 55/140 (39%), Positives = 85/140 (60%), Gaps = 8/140 (5%)
Query: 1167 LLIASG-----PKIILHKWT--GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
LL+A G P ++ KW +EL I F D LY+ + VKNFIL D+ KSI
Sbjct: 676 LLVALGNRQTRPYLLRLKWAIRDSELTSITFIDMQ-LYIHQIISVKNFILAADVMKSISL 734
Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G
Sbjct: 735 LRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGG 794
Query: 1280 QKLLSRAEFHVGAHVTKFLR 1299
+LL RA+FHVGAHV F R
Sbjct: 795 MRLLRRADFHVGAHVNAFWR 814
>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
Length = 1025
Score = 223 bits (569), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 253/998 (25%), Positives = 418/998 (41%), Gaps = 149/998 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS +L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARISQNPDCDSFLLGFDDAKLSIVAVNPADRCLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + + L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 S-CALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F ++ LD A T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + C S L
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS---ASNNTESAQKTFSFAVRDS 508
+TK+++ S+ + E+ LYG + ++ + V D
Sbjct: 389 -----------ATKKIKLSTEPNANE--EDEDFELYGEMLPKVAKPDITEELLNIRVLDK 435
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G + +S R SS
Sbjct: 436 LLNVGPCKKITGGCPSISAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 496 SIEGVVQYWAIGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554
Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSES-----GSGSENSTVLSVS 665
+QV G +I Y+ L+F N+ ++N +L
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTFPVRNASIVDPYIAICTQNGRLLMYE 611
Query: 666 IAD-PYVLLGMSDGSIRL------------------LVGDPSTCTVSVQTPAAIESSKKP 706
+ + P+V L D S RL ++ S +S Q A + P
Sbjct: 612 LTNHPHVHLKEIDISKRLRHETSPITSLSVYRDMSGIIRFCSAANMSQQQQATGANMHIP 671
Query: 707 -------VSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGPLDQGDI----Y 753
V LY D RK + G+ E D +D I +
Sbjct: 672 EQEDFEDVDDLLLYGDSKKS---RKETLSKRRIVGMKLTEQNTHFDTDVIDPNTIVPSHW 728
Query: 754 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG- 812
+ E+G + I+ +P + V+ V K +H+ D + D E ++ EGT
Sbjct: 729 IAIARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDE----PATGEGTDA 780
Query: 813 -QGRKENIHSMK----VVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
G + ++K ++EL + + RP LF +L D T+ Y+ + + N
Sbjct: 781 MSGTMTDTFAVKPEEVIMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQ 835
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQ 924
+ L + V+ R+ RF T D E+ A + + F+ I
Sbjct: 836 GHLAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLN 889
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQL 983
G F+ S PC + R+HP DG I++FT +N C +GFIY+T + ++++ +L
Sbjct: 890 GVFICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERDRLMRVAKL 949
Query: 984 PSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
PS D +PV++I + AT H + Y N Y ++ S
Sbjct: 950 PSDMILDASYPVKRINVGATVHSVVYLLHSNTYAVLTS 987
>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1429
Score = 221 bits (564), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 332/1437 (23%), Positives = 562/1437 (39%), Gaps = 202/1437 (14%)
Query: 75 EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII 134
++G + S G V D + L LV L G V LA + + ++++
Sbjct: 72 DDGLESSFLGGGESMLVRTDRTNNTKLVLVAELPLTGTVIGLAKIKTKYTKSGG--EALL 129
Query: 135 LAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQG 189
LA++ AK+ + E+D + L S+H +E E W + + + ++ DP
Sbjct: 130 LAYKAAKMCLCEWDPKKNTLETLSIHYYEKDELQGAPW---EVAFDEYVN--FLEADPGS 184
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGD---EDTFGS---------GGGFSARIESSHV-- 235
RC + IL Q L D ED G G S +E+++
Sbjct: 185 RCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKESTAVANGDSDTLEAAYTPS 244
Query: 236 ----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
+ L D + H F+H Y EP IL + H T + L +
Sbjct: 245 FVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ- 303
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
+ I S +LP D YK++A+P+P+GG L++G N IH + +A+N+ A +
Sbjct: 304 -RASTTILSVTDLPRDLYKMIALPAPVGGALLIGENEFIHIDQSGKANGVAVNSMARQMT 362
Query: 351 SSQELPRSSFSVELDAA--HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS---K 405
S ++ ++ L+ +++N LL G L +++ DGR V + + +
Sbjct: 363 SFSLSDQADLNLRLEGCIIEQLYIENGELLLILNDGRLGIVSFRIDGRTVSGISIKMIPE 422
Query: 406 TNPSVL----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
N L S + +G + FF+GS GDS+++ G M ++
Sbjct: 423 ENGGRLIKSRASTASKLGKNTFFIGSETGDSVVL----GWSRKMSQEKRRKTRLVDADLG 478
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
L D D + G E + + + N SF + D+L++I P++D + G
Sbjct: 479 LDVDDLDLEDDDDEDDDLYGTETAAKPTQALNGAGKSGELSFRIHDTLLSIAPIRDLTSG 538
Query: 522 -LRINADASATGISKQ--SNYELV--------------------------ELPGCKGIWT 552
D+ +SK S+ +L E P +G WT
Sbjct: 539 KAAFLPDSEEATLSKGVVSDLQLACVVGRGNSGSLAILNRHIQPKIIGRFEFPEARGFWT 598
Query: 553 VYHK----SSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVTES 602
+ K S G N ++ Y+I++ E + TA + E+
Sbjct: 599 MCVKKPVPKSLGGNVTVGNDYETFGQHDKYMIVAKVDLDGYETSDVYALTAAGFETLKET 658
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
G T+ AG + + RVIQV + R DG +TQ L + E+G+ V
Sbjct: 659 EFDPAAGFTVEAGTMGKQMRVIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRV 713
Query: 662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 721
+S SIADPY+LL D S+ + D + V+ + S K + C LY D
Sbjct: 714 ISASIADPYLLLIRDDSSVLIAQIDSNNELEEVEKTDSTLQSTKWHAGC-LYTD------ 766
Query: 722 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKF 780
+ GV + G G D I + +GAL ++ +P+ + V+ +
Sbjct: 767 ----------TKGVFQPSVGDKGA--DTSKIMMFLLSSTGALHVYALPDLSKPVYVAEGL 814
Query: 781 VSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
H+ D +R L KEN+ + V +L P
Sbjct: 815 CYVPPHLSADYTLRRGLA------------------KENLRELLVADLG----DTVSQSP 852
Query: 840 FLFAILTDGTILCYQA--YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
+L + Y+ Y EG E T S +L+ S + L +
Sbjct: 853 YLILRNQTDDLTIYEPLRYQPEGAEPT--------LSATLTFKKTSNAALATSPVETSQE 904
Query: 898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
DA + P P + N++G+ FL G P + + + + L I
Sbjct: 905 DAV---QQPRFVPLRTCA---NVNGYSTVFLPGPSPSFILKSSKSIPRVIGLQGLGIRGM 958
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPHQITYFAEKNLY 1016
+ H C+ GFIY +GI ++ QLPS + + D V+K+PL + I Y Y
Sbjct: 959 STFHTEGCDRGFIYADDEGIARVTQLPSETNFTDLGISVKKVPLDSDVCGIAYHQPTGTY 1018
Query: 1017 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1076
+ N+ L D + + L+ T+ ++++ P
Sbjct: 1019 IAGCTT------NEPFELPRDDDYHKEWAKETLTFAP-----TMPRGVLKLISP------ 1061
Query: 1077 WQTRATI----PMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1131
T+ ++S E+ ++ + L + TKE LL +GTA +GED+ RGRV +
Sbjct: 1062 --VSLTVIHDQELESCESIECMKTLQLEVSEETKERRFLLTVGTALSKGEDLPIRGRVHV 1119
Query: 1132 FSTGRNADNPQNLVTEVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWT 1181
F P T K + +G ++A++ + QG +L+A G K ++ K
Sbjct: 1120 FDIVTVIPEPGKPETNKRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKED 1179
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
G+ L +AF D + + + ++ D K ++F + E+ +L K G L
Sbjct: 1180 GSLLP-VAFLDMSCHVSSARELPRTGLCVMADAFKGVWFAGYTEEPYTFKILGKSHGRLP 1238
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
+FL DG L++V +D ++ I + P+ +S +G LL R F V +
Sbjct: 1239 LLVADFLPDGEDLAIVAADADGDLHILEFNPEHPKSLQGHLLLHRTTFSVSPNPPT---- 1294
Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
ML A P ++ LL + G + + PL E T+RRL S+ +L+ ++
Sbjct: 1295 SMLLLPRTTPPAHPSPSDPSQI-LLLASPSGHLSTLVPLPEATYRRLLSVTNQLLPALTP 1353
Query: 1361 VAGLNPRSFRQFHSNGKAHRP-GPD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
GLN + +R RP G D +IVD +L+ + L ++ EIA + G
Sbjct: 1354 YGGLNAKGYRL----PSGTRPVGVDAAAGRTIVDGAILARWAELGAAKRAEIAGKGG 1406
>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 880
Score = 221 bits (564), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 161/497 (32%), Positives = 252/497 (50%), Gaps = 62/497 (12%)
Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 8 SYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNI 67
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSL 349
T K HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+
Sbjct: 68 TQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGT 127
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
+ + + LD A A ++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 128 TAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAA 187
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SVLT+ + T+ FLGSRLG+SLL+++T S+ E D E KR+
Sbjct: 188 SVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVD 245
Query: 469 RS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
+ S QD V+ E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 246 ATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 301
Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHKS 557
L I + + + + K ++V ELPGC +WTV
Sbjct: 302 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 361
Query: 558 SR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+ G + A DD H +LI+S E TM+L+T + E+ S +
Sbjct: 362 RKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFAT 420
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
QG T+ AGN+ R ++QV G R+L+G L F P + S ++ ++A
Sbjct: 421 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVA 470
Query: 668 DPYVLLGMSDGSIRLLV 684
DPYV++ ++G + + +
Sbjct: 471 DPYVVIMSAEGHVTMFL 487
Score = 129 bits (324), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 128/270 (47%), Gaps = 15/270 (5%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
+ ++ E+GA+EI+ +P++ VF V F G+ +VD+ + T+ + EE T
Sbjct: 602 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 657
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
QG + + +V L + RP+L + D +L Y+A+ P ++ +
Sbjct: 658 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 707
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
N++ + + T E T R F++I G+ G F+ G
Sbjct: 708 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 767
Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P W +V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD
Sbjct: 768 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 827
Query: 992 YWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
WPV+KIPL+ T H + Y E +Y + S
Sbjct: 828 PWPVRKIPLRCTAHYVAYHVESKVYAVATS 857
>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1433
Score = 221 bits (564), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 333/1393 (23%), Positives = 550/1393 (39%), Gaps = 192/1393 (13%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
R ++ ES G V D + L L+ L G V LA + + +
Sbjct: 70 RANDDDGLESSFLGGESLIVRADPSNITKLVLITEIPLAGTVIGLARVKV--KNTPSGGE 127
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++LA++ AK+ + E+ H L TS+H +E E L+ + G V + DP
Sbjct: 128 ALLLAYKAAKMCLTEWHPQRHTLETTSIHYYEKDE---LQGAPWEMSFGDYVNYLEADPG 184
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
RC + IL +Q L D ED T G G G +
Sbjct: 185 SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPRPVKEELPLTNGDGPGDHDLVKS 244
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
R S V+ L LD + H F+H Y EP IL + A H T + L
Sbjct: 245 RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSAALGIKDHLTYKVFTL 304
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
+ + I S LP D ++++A+P+P+GG L+VG N IH +A+N+
Sbjct: 305 DLQQ--RASTTILSVTGLPQDLFRVMALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
A + S + +S + L+ L ND+ LL G L ++ DGR V ++ +
Sbjct: 363 AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIVCFHIDGRTVSKISI 422
Query: 404 ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
++ +++ S ++ I G++ FLGS DS+++ ++ G K +
Sbjct: 423 RLVSAEYGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476
Query: 457 IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
+ + D D + G + SL S + N S SF ++DSL++I P+
Sbjct: 477 LLDPDLALDVDDLDLDDDEDDDLYGNDASLAKPSQTINGGSKPGEVSFRIQDSLLSIAPI 536
Query: 516 KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
+D + G + D+ +SK EL + P
Sbjct: 537 RDVACGAPALVPDSEEATLSKGVTAELELACAVGRGSSGSVAILNREIQPKVIGRFDFPE 596
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
+G WT+ K A + +Y Y+I++ E + TA +
Sbjct: 597 ARGFWTMCAKKPLSKGAAVASDFDTTGQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
++ G T+ AG + + R+IQV + R DG ++Q L P E +
Sbjct: 657 DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
T S SI DPY+LL D SI + + V P S K S C LY+D
Sbjct: 714 T--SASIVDPYLLLIRDDSSIFIAQIHSNNELEEVLKPDGTLKSTKWASGC-LYND---- 766
Query: 720 PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD-IYSVVCYESGALEIFDVPNFN-CVFTV 777
T V E D+ D I + GAL ++ +P+ + VF
Sbjct: 767 -------TQGIFQNNVNEQ-------QADETDRIMMFLLSSVGALHVYALPDVSRPVFVA 812
Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
+ S + ++ A + +G KE+I + V +L A
Sbjct: 813 EALTS-----IPPFLSAAF---------VARKGAS---KESITEILVADLG----DAISQ 851
Query: 838 RPFLFAILTDGTILCYQA--YLFEGPENTSKS---DDPVSTSRSLSVSNVSASRLRNLRF 892
P+L + Y+ Y EG S S V+TS + + VS
Sbjct: 852 TPYLIVRHASDDLTIYEPVRYQAEGDAELSASLLFKKCVNTSLAKTAPEVSED------- 904
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG 952
DA E P P +R N++G+ FL + P + + L
Sbjct: 905 -----DA----EPPRFVPLRRCA---NVNGYGAVFLPNASPSFVLKSSHSEPRVMGLQGL 952
Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFA 1011
+ + H C+ GFIYV +GI ++ QLPS + V+KI L I+Y
Sbjct: 953 GVRGMSTFHTEGCDRGFIYVDMEGIARVTQLPSNANLTELGVSVKKIALDGDVGMISYHH 1012
Query: 1012 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1071
Y +V L+ +E + N T+ ++++ P
Sbjct: 1013 PTGTY--VVGCTKLEQFELPRDDDYHKEWAKETSNF---------PPTMARGILKLINPV 1061
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
W + ++ E+ +++ + L + TKE + L+A+GTA +GED+ RGRV
Sbjct: 1062 T----WTVIHELELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQ 1117
Query: 1131 LFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILH--KW 1180
+F P N ++ +KE +G ++AL+ + QG +L+A G K ++ K
Sbjct: 1118 VFDIVTVIPEPGRPETNKRLKLIAKEEIPRGGVTALSEVGAQGLMLVAQGQKCMVRGLKE 1177
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
G+ L +AF D +V S+ + ++ D+ K ++F + E+ +L K G
Sbjct: 1178 DGSLLP-VAFLDM-SCHVASVKELPGTGLCVMADVFKGLWFAGYTEEPYTFKILGKSSGK 1235
Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1298
L A +FL DG LS+V D + ++ I + P+ +S +G LL R F V +
Sbjct: 1236 LPLLAADFLPDGEDLSMVAVDAEGDLHILEFNPEHPKSLQGHLLLHRTSFAVTPNTPSS- 1294
Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS- 1357
+L + S ++ LL G + ++PL E T+RRL S+ +L +
Sbjct: 1295 --TLLLPRTHSPSYPQASSSSSSHMLLLACPSGQLAALSPLAESTYRRLLSVTNQLHPAI 1352
Query: 1358 VPHVAGLNPRSFR 1370
VPH GL+ ++ R
Sbjct: 1353 VPH-GGLHSKAHR 1364
>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
Length = 1376
Score = 221 bits (564), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 168/629 (26%), Positives = 295/629 (46%), Gaps = 62/629 (9%)
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSK----------------------------SDD 869
RPFLFA++ + +L Y+A+ + P+ + +DD
Sbjct: 755 RPFLFAVVEE-QLLIYEAFHYPYPQQRYRLSVRFKKVRHTAILQRFRRIGRDDFKLLADD 813
Query: 870 -----PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISG 922
R S + + SR R R S +A+ E + AP ++++ F+N++G
Sbjct: 814 FQFSEQYRRRRKRSKHDSNRSR-RGDRHSGRRQEAHEHEPYRLTYEAPARQLSPFENVAG 872
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
+ G F+ G P +C + ++ LR+HP DG +VAF + F Y T+ G++++
Sbjct: 873 YAGLFIGGGYPYFCFLSKQGDLRLHPMHIDGPVVAFAPYCSPKQLRAFAYFTADGMMRVS 932
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
LPS +D P K+ L H + Y E + Y L S + P ++V++L+ D +
Sbjct: 933 SLPSKFDFDRSIPSMKVELGRAAHFVVYLMESHTYALTTSEQM--PCHKVVTLIGDDKQF 990
Query: 1042 HQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTR--ATIPMQSSENALTVRVVT 1098
D H Y T+E++++++ D W A + E+ + V
Sbjct: 991 ETFDREAP-----HFIYPTMEQFKLQLYSADT----WLPVPGAELDFDEFEHVTACQEVQ 1041
Query: 1099 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKE 1152
L + + ++ LAIGT GE+V RGR+L+ P +T+ VYSKE
Sbjct: 1042 LKSEGSASGLQSYLAIGTVLNYGEEVLIRGRLLIIDVVEVVPEPDRPMTKFKLKVVYSKE 1101
Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
KG +++L SL+G+LL G K+ + ++ L GI+F D +YV + ++ L D
Sbjct: 1102 QKGPVTSLCSLRGYLLTGMGQKVYIWQYKDNALVGISFLDLQ-VYVHQMASIRYLALTAD 1160
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ L ++E+ L+L+++D + A EFL+D + LS +++ +I + Y P+
Sbjct: 1161 AFFGVSLLRYQEEYKALSLVSRDPRPDEVLAVEFLVDRTDLSFLMTSAAGDILTYVYLPE 1220
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+S+ GQ+L+ +A++H G+ V F+R++ + + R L+F + DGS
Sbjct: 1221 SLDSFGGQRLVPQADYHFGSQVNAFVRMR---CHAQEIAGRKRQEVLQRQGLIFASSDGS 1277
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+ + PL E +R L LQ L+D +P AGLN +R R +I+D +
Sbjct: 1278 VNYLLPLPEREYRLLGMLQSLLIDMLPSFAGLNVDDYRTVRFPNSCLREPTKNIIDGNIC 1337
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
Y + +Q +I Q G++ SQI+ L
Sbjct: 1338 MLYLYIDALQQEDIVRQIGSSHSQIMLEL 1366
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 165/616 (26%), Positives = 286/616 (46%), Gaps = 74/616 (12%)
Query: 99 ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
AS ELV +++G + S+AI G + D I+LA +DAK+SV+ +D H L S
Sbjct: 65 ASFELVLSEQVYGRLASVAIARLTGF----QLDVILLAIDDAKLSVVGYDIETHSLVTLS 120
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVG 215
MH +E + K G F P++++DP+ RC + +YG +++L + S S +
Sbjct: 121 MHYYEDDLF---KLGFTRFEIPPMLRMDPERRCAAMTIYGAHLVVLPLVRESLYESMNIV 177
Query: 216 DEDTFGSGGGFSARIESSHV-INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
D G FS R+ S V N D M +V D F+HG+ EP +++L+E T AGRV
Sbjct: 178 DPSQ-RPGWPFSLRLTSYTVAFNAIDAKMHNVTDMCFLHGFYEPTVLLLYEPTQTTAGRV 236
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
+ T I A+S++ K H +IW+ NLP DA+ LLA+P P+GGVL+ N+I Y +Q
Sbjct: 237 VVRQDTYQILAVSLNPKDKTHAVIWTLGNLPFDAFALLALPKPLGGVLLFSVNSIIYLNQ 296
Query: 335 SA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
S C + +N+ + RS V LD +HA + + A L ++G + ++++++
Sbjct: 297 SVPCCGILINDNGRGFTNYPLRDRSELMVTLDGSHAALIDSANAALVLRSGLVFVVSLLF 356
Query: 394 DG-RVVQRLDLSKTN-----PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
D +V+ + L+ ++ PS +++ + ++ F+GS +G+S L + +++
Sbjct: 357 DRLNMVKEILLTASSVRGAAPSTVSA---CVSSNCLFVGSAIGNSALYAYEAIEQVDVVA 413
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN--NTES-AQKTFSFA 504
L R + + L DM LYG TE+ Q F F
Sbjct: 414 VTLPA---------------RDTGLNLLDDM------QLYGELIRPCTTETLVQTKFEFR 452
Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
D L ++GP + + G A + S++ + PG G +TV +S R
Sbjct: 453 RLDQLASLGPCRAITVGESSVAMVNNFYEDYVSDWLVAGGPGTDGSFTVMQRSVRPRLLT 512
Query: 565 SSRMAAYDDEYHA-----------------YLIISLEARTMVLETADLLTEVTESVDYFV 607
+R+ + + Y++++ + RT+V + +TE+ ++ + +
Sbjct: 513 QTRVEDVLNAWSVGAQLIGSVDRSASPRPQYMLLTTKQRTVVFTLSSGITEIFDT-GFEI 571
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
+ TIA G++ V+QV + +L Q ++ V S+
Sbjct: 572 RFETIACGDMMNGAYVVQVTKENLVLLHRGQQVQCINL----------RVFEEVCQASVI 621
Query: 668 DPYVLLGMSDGSIRLL 683
DPYV L + G + L
Sbjct: 622 DPYVALIVRHGHVLLF 637
>gi|119602515|gb|EAW82109.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
CRA_b [Homo sapiens]
Length = 377
Score = 218 bits (555), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 205/369 (55%), Gaps = 31/369 (8%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
+ + + LD A AT++ D ++S K G++ +LT++ DG R V+ K
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366
Query: 408 PSVLTSDIT 416
SVLT+ ++
Sbjct: 367 ASVLTTSVS 375
>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
Length = 1269
Score = 218 bits (554), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 311/1370 (22%), Positives = 548/1370 (40%), Gaps = 215/1370 (15%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
A LEL+ Y L G V + + DN DS+ ++ + AK ++ ++ S +
Sbjct: 51 APRLELITEYYLDGTVTGVTRIKT--IDN-YDLDSLYISVKHAKAVIVAWNASSFTIDTK 107
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
S+H +E + L E V + +L +M L + G + D+
Sbjct: 108 SLHYYE--KGLVESNFFEPECSSVAVSDEANSFYTCLLFQNDRMAFLPIIEKG---LDDD 162
Query: 218 DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ SG F + S ++ LD +++V D F+H Y E M IL + + W G +
Sbjct: 163 EMPESGQVF----DPSFIVKASRLDKRIENVMDICFLHEYRETTMGILFQPKRAWVGMKN 218
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
T + +S+ K +I + LP DA K++ +P+P+GG L++ ANTI Y S
Sbjct: 219 ILKDTVSYAIVSVDVHQKNSTVIGTLNGLPVDAQKVIPLPAPLGGSLIICANTILYIDSS 278
Query: 336 ASCALALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQN--DVALLSTKTGDLVLLTV 391
AS + N +S + R S+ + L+ A ++Q + ALL T+ G L
Sbjct: 279 ASYTGVMVNNTHRQNSDLIVSRDQSTLDLRLEGAEVCFIQELGNTALLVTEDGQFFSLLF 338
Query: 392 VYDGRVVQRLDLSKTNPS--VLT--SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
DGR V L+L P +L+ S + + FLGSR GDSLLV++ G S
Sbjct: 339 NKDGRRVASLELRPIEPDNFILSQPSSVAAGPDGTIFLGSRAGDSLLVKWYHGEPESQPE 398
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVR 506
L D N + LYG + TE + + +
Sbjct: 399 ETL--------------------------DDGNESDDDLYGGDTAQTEDTTNRPLKLRLA 432
Query: 507 DSLVNIGPLKDFSYGLRINAD----ASATGISKQSNYELV--------------ELPGCK 548
D ++ +GP++ + G + + TG+ S ++ ++PG +
Sbjct: 433 DRMLGMGPMQSLALGKNRGSQGVEFVTTTGVGANSALAILTSALMPYKRKSLYKDMPGGQ 492
Query: 549 GIWTVYHK-SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
W+V + G A S D ++YL A V+E L T+ ++ +FV
Sbjct: 493 -FWSVPVRFEEEGEVAKSRTYVVSSDSENSYLYYVDAAG--VIEDVSLSTKKKKTKKHFV 549
Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
T + ++QV I D S + +T + +
Sbjct: 550 SNVTTIFSSSMLDSALLQVCLETVNIYDAKI---------GQPHKYSLPQGTTAVEARVL 600
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
YVL+ +SDG +++L VS+ +++++ + + G +T
Sbjct: 601 GNYVLVLLSDGQVKILEA------VSINKRPFLKAAQVSIEPASESKAIG------IYAT 648
Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
D+ L+ G G P VVCY G+L + S I
Sbjct: 649 DSSLTFGAPSKKRTRQGSPAQDSRPVVVVCYADGSL------------LLQGLNSDDRLI 696
Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
+D ++++ +E GQ +++V++A+ H +LT
Sbjct: 697 LDA----------SDLSGFIKEKDGQLYDA---PLELVDIALSPLGDDHILRDYLVLLTP 743
Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
++ Y+ Y + LRF + L E TP
Sbjct: 744 QQLVVYEPYHYND----------------------------KLRFRKIFL-----ERTPT 770
Query: 908 GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD----GSIVAFTVLHNV 963
+R+T I+G ++G + + L P+L + VAFT
Sbjct: 771 INSDRRLTQVPLINGKHTLGVTGET---AYILVKTLHTSPRLIEFGETKGAVAFT----- 822
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL-KATPHQITYFAEKNLYPLIVSV 1022
+ + F Y+T G + C+ + + WPV+ + L T ++TY ++Y
Sbjct: 823 SWDGKFAYLTQAGEVAECRFDPSFSLETNWPVKHVQLCGETISKVTYHETMDVY------ 876
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAGGPWQTR 1080
V+ V ++ D+ D+ + S+ D+ T + +RI+ P W
Sbjct: 877 -VIATHKTVPHVVRDE------DDEVIESLTPDIMPATTYQG-AIRIVNP----YSWTVI 924
Query: 1081 ATIPMQ-SSENALTVRVVTLFNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGR-- 1136
+ + +E AL V L + K + ++A+GT+ ++GED+AARG + LF
Sbjct: 925 DSYEFEMPAEAALCCESVKLSISDRKSQKREVVAVGTSILRGEDLAARGALYLFDVIEIV 984
Query: 1137 -NADNPQN--LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-ELNGIAFYD 1192
+ P+ + ++ ++GA +A+ + G LL G K+++ L +AF D
Sbjct: 985 PEKERPETNRRLKKLVQDRVRGAFTAVCEVSGRLLAVQGQKLLVQALQDDLTLVPVAFLD 1044
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
YV + + +LLGD +S+ F+ + Q+ A+D + +F I+G
Sbjct: 1045 MQ-TYVAVAKSLNSMLLLGDATRSVQFVGFSMDPYQMIPFARDLQRVLVTTCDFAIEGEN 1103
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
L+ VV+D QK + I Y P +S+ G +LL R+ F+ G + D +
Sbjct: 1104 LTFVVADLQKRLHILEYDPDDPQSYSGARLLRRSVFYSGKVI-------------DSSAM 1150
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
P ++ +RF ++ DGS+ + P E +RRL ++Q ++ D HV GL+PR++R
Sbjct: 1151 VPINE--DRFMVIGVCSDGSVTDVVPCPEDAYRRLYAIQTQITDKEAHVCGLHPRAYRYD 1208
Query: 1373 H----SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
+ HRP I+D L + LP +Q A++ G Q++
Sbjct: 1209 PILPGTGNSPHRP----ILDGHTLIRFANLPRNKQNVYANRLGQRYQQLI 1254
>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
Length = 1441
Score = 218 bits (554), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 342/1447 (23%), Positives = 575/1447 (39%), Gaps = 203/1447 (14%)
Query: 72 RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
+V ++ ES G V + + L L+ L G V LA L + + +
Sbjct: 68 QVNDDDGLESSFLGGETMLVRTERTNNTKLVLITEIPLAGTVIGLARLRT--SRTASGGE 125
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
+++A++ AK+ + E+D + L S+H +E E L E F G V + DP
Sbjct: 126 VLLIAYKAAKLCMAEWDPRKNELETISIHYYEKEE-LQGAPWEEVF--GEYVNHLEADPG 182
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVG---DEDTFG---------SGGGFSARIESSHV- 235
RC + + IL + L DED G + G S +E+++
Sbjct: 183 SRCAALKFGTRNLAILPFRRSEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTP 242
Query: 236 -----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
+ L D + H F+H Y EP +L + A + H + + L +
Sbjct: 243 SFVLRLPLLDPSLLHPVHLTFLHEYREPTFGVLSSSQAPAASLGARDHLSYKVFTLDLQQ 302
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
+ I S LPHD Y+++A+P+P+GG L+VG N IH S +A+N A
Sbjct: 303 --RASTTILSVTGLPHDLYRVIALPAPVGGALLVGQNELIHVDQSGKSNGVAVNPMAKLA 360
Query: 350 DSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDL 403
S +S + L+ A ++N LL G L +++ DGR V RL
Sbjct: 361 TSFSLTDQSDLKLRLENCAIEVLAIENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVG 420
Query: 404 SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ +VL S T + G + F+GS DS+++ + S K D +
Sbjct: 421 ADCGGNVLKSRATCVSRLGKNTLFVGSETSDSVVLGW---SRRQTQEKRKKSRLIDPDLA 477
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ + + + N + +F + D L++I P++D +
Sbjct: 478 LEVDELDLEDDEEDDDLYGDDSVATKPQQLPNGGPAKSGDLTFRIHDVLLSIAPIQDVTC 537
Query: 521 GLR--------------INAD---ASATGISKQSNYELV------------ELPGCKGIW 551
G + AD A A G + + ++ E P +G W
Sbjct: 538 GQAAFPPDSEEATLNRGVRADLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFW 597
Query: 552 TVYHKSS--RGHNADSSRMAAYDD--EYHAYLIIS------LEARTMVLETADLLTEVTE 601
T+ K + A++ YD ++ ++I++ E + TA + E
Sbjct: 598 TMCVKKPVPKSLGANAGVAGDYDTPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKE 657
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
+ G T+ AG + + VIQV + R +G + Q L + +G+E
Sbjct: 658 TEFEPAAGFTVEAGTMGKQMVVIQVLKSEVRCYNGDLNLIQILPM----LDEETGAEPRA 713
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
V S SI DPY+ + DGS+ L D + ++ + +S K V+ C KG
Sbjct: 714 V-SASIVDPYLFIVRDDGSVFLAQIDSNNEIEEMEKTDSSLTSTKWVAGCLYKDTKG--- 769
Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDK 779
+ + +D+ T EA+ + +GAL IF +P+ + V+ +
Sbjct: 770 IFQSSYSDSTKQTS--EAV-------------MMFLLNSTGALHIFALPDLSKAVYVAEG 814
Query: 780 FVSGRTHIVDTYM--REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
S H+ Y R A +++ TEI V +L A H+
Sbjct: 815 LSSIPPHLSAGYAARRGATRETLTEI-------------------VVADLG----DAVHA 851
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP- 896
P+L + + Y+ P N T+ +LS L F ++P
Sbjct: 852 SPYLILRHSTNDLTIYEPIRL--PAN--------ETAHTLS---------DTLFFKKSPN 892
Query: 897 -LDAYTREETPHGAPCQR-----ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
+ A + E P Q + I N+ G+ FL G P + + + L
Sbjct: 893 AVLAKSAVEDPSDDTAQPPRYVPLRICANVGGYSSVFLPGPSPAFVIKSSRSVPRVVGLQ 952
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITY 1009
+ + H C+ GFIY S+GI ++ QLPS + + V+K+PL + Y
Sbjct: 953 GHGVRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNFTELGISVKKVPLGFDVRHVAY 1012
Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN---LSSVDLHRTYTVEEYEVR 1066
Y I V + + E+ D H SV L T ++
Sbjct: 1013 HHPTETY--IAGCAVTE----------NFELPKDDDYHKEWARESVPLPPTAV--RGALK 1058
Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1125
++ P W +I M++ E+ ++ + L + TKE LLA+GTA +GED+
Sbjct: 1059 LINPIT----WTVIHSIDMEAGESIECMKTLHLEVSEETKERRMLLAVGTALSRGEDLPT 1114
Query: 1126 RGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIIL 1177
RGRV ++ P N ++ +KE +G ++AL+ + QG +L+A G K ++
Sbjct: 1115 RGRVQVYDIVTVIPEPGKPETNKRLKLLAKEDIPRGGVTALSEIGTQGLMLVAQGQKCMV 1174
Query: 1178 H--KWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLA 1233
K G+ L +AF D +V S+ + L+ D K ++F + E+ +L
Sbjct: 1175 RGLKEDGSLLP-VAFLDM-SCHVSSVRELPGTGLCLIADAFKGLWFAGYTEEPYTFKVLG 1232
Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
K GSL +FL DG LS+V D ++ + + P+ +S +G LL R F V +
Sbjct: 1233 KSSGSLPLLVADFLPDGEDLSMVAVDADGDMHVLEFNPEHPKSLQGHLLLHRTTFSVTPN 1292
Query: 1294 -VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
T L L +S + + S T LL + GSI + PL E +RRL S+
Sbjct: 1293 PPTSTLLLPRTLPASQSSQDSSSSSSTQPHILLLASPSGSIAALTPLPESAYRRLLSVTN 1352
Query: 1353 KLVDS-VPHVAGLNPRSFRQFHSNGKAHR-------PGPDSIVDCELLSHYEMLPLEEQL 1404
+L+ + VPH GL+ R+ R G R +IVD +L+ + L ++
Sbjct: 1353 QLLPALVPH-GGLHARAHRTPEGGGGMSRTVGVETAATGRAIVDGTVLTRWNELGAAKRA 1411
Query: 1405 EIAHQTG 1411
E+A + G
Sbjct: 1412 EVATRGG 1418
>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
Length = 1451
Score = 217 bits (553), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 179/678 (26%), Positives = 315/678 (46%), Gaps = 60/678 (8%)
Query: 760 SGALEIFDVPNFNCVFTV-DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
+G+LEI+ +P+ C+ D+ + I++T S EG+ +GR+ +
Sbjct: 814 NGSLEIYSLPD--CLLRFGDRNFANAPRILET---------------SRFEGS-EGRRVD 855
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
+ + V E+ + S P++ ++ D ++ Y F N +++ PV + R +
Sbjct: 856 V--LDVQEMNVFNMGPS-SLPYIVVMIGDQLMI----YRFRATLNRFQTESPVLSGRFIK 908
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ + + + LR D ++ + + ++ F NIS H G FL G+ P W
Sbjct: 909 LQD----KTKLLRRIPGVHDESSKTKNRNNKIMRQ---FMNISDHNGIFLGGAYPTWIFC 961
Query: 939 FRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQ 996
+ RL +H +G + AFT N C GF+Y S L + L YD WP +
Sbjct: 962 GQNGRLNIHSMWQEGFVNAFTPFDNEKCADGFLYFRHSTKTLTVANLQPFLKYDADWPFK 1021
Query: 997 KIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL--IDQEVGHQIDNHNLSSVDL 1054
KI L TP +Y E+ + + V ++ + +L I+ E GH+ + +L V
Sbjct: 1022 KIKLNYTPCFSSYDLEQKV------LTVCGSRSEKIEMLPKINAE-GHK-EYEDLPEVQN 1073
Query: 1055 HRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE-NETLL 1111
T ++ V + P W+ + I M + E+ L R V L + + + +
Sbjct: 1074 VETQLFPQFFVEMFSP----ASWEVIPNSRIEMDAHEHILCCRSVYLKSEASMSGRKQYI 1129
Query: 1112 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGH 1166
AIGT+ + GED +RGR++L P +T V+ +G +SA+ SL G
Sbjct: 1130 AIGTSNICGEDFQSRGRLILLEVIDVVPEPGKPLTRYKYKTVFDASQRGPVSAVDSLDGA 1189
Query: 1167 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
L+ A G K+ +H + L F D LY + + KN+ L+GDI + I L + +
Sbjct: 1190 LIAAIGQKVFIHAFQDDNLRATGFVDTQ-LYTHATHCFKNYALVGDIQQGITLLRHQGER 1248
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
++ +++ + + A L+DG+ + LV +D Q+N+Q++ Y P ES G++L+ +A
Sbjct: 1249 NCISQISRARRAGEVTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQA 1308
Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
+ ++G V L +D ++ R + LDGSIG I P+ E FRR
Sbjct: 1309 DINLGKRVISIW--NSLGRQNDTFTKVALTENDARHVTFYAGLDGSIGDIVPVSEKVFRR 1366
Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1406
L+ LQ + +PH GLNPR +R + + +I+D +LL + L EQ ++
Sbjct: 1367 LEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSFTEQTDL 1426
Query: 1407 AHQTGTTRSQILSNLNDL 1424
+ + G TR +L ++ D+
Sbjct: 1427 SRKIGVTREALLDDMMDV 1444
Score = 188 bits (478), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 182/695 (26%), Positives = 313/695 (45%), Gaps = 77/695 (11%)
Query: 57 NLVVTAANVIEIYVVR--VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
NL V A N++ +Y +R V E G+ + EL + L G V
Sbjct: 45 NLAVAAGNMLSVYRIRSSVDEAGNHFDR------------------FELCDEFELWGIVV 86
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
+ L G+ RDS++L+ E++K ++E++ L SMH F+ + L+RG
Sbjct: 87 CMTRLRLAGS----VRDSLLLSIEESKCVIVEYEPDTGSLSTISMHFFQDED---LRRGF 139
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS-GLVGD--EDTFGSGGGFSARIE 231
+ L +VD RC VLVYG + +L + L G + F GF A
Sbjct: 140 RKLSSMALARVDGFNRCAAVLVYGSYLAVLPFRRSTERDLSGQRHQAVFYENSGFIA--- 196
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
++I+L+ L +K V DF F+ GY +P +++L+E TW GRV+ + TC + ALSI+
Sbjct: 197 --NMIDLQSLPVKIASVLDFQFLEGYNDPTILLLYEALPTWTGRVTERQDTCGMVALSIN 254
Query: 290 TTLKQHPLIWSAMNLPH-DAYK--LLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNY 345
+ HP+IW LP + Y L +P P+GG L+ N++ Y QS +ALN+
Sbjct: 255 LIDETHPVIWQMAGLPFPNPYSSALFPIPKPLGGSLLFATNSLIYLDQSVPPYGVALNSL 314
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
+ + + + L A L +D +S ++GD+ ++T+ D V+R L
Sbjct: 315 PLGCTNFALKTQDVAPLNLQNCKACMLSDDSICVSLESGDVYIITLKKDSLNNVRRFYLD 374
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+ SV+ + ++ + ++L FLGSRLG+SLL+++ C + S+ L+ D
Sbjct: 375 QVASSVIPTTLSKLSDNLIFLGSRLGNSLLLRYKCKENSKKSSTSLENGEKDGVEIENKE 434
Query: 465 KRLRRSSSDALQDMVNG------EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
+ + + + NG +++ YG N + ++ F D+L NIGP
Sbjct: 435 EEKNELNFEIEKSSENGSPENKRKKMRYYGDEIFNLD-VNTSYDFETMDNLSNIGPCGPV 493
Query: 519 SYGLRINADASATGI---SKQSNYELVELPGCK----GIWTVYHKSSRGHNA-------- 563
N + + + ++ N ++ L G G TV HKS R A
Sbjct: 494 ELIHTANHNDNYDHVGSDARDRNIDVCVLSGKDKTGFGSITVLHKSVRPSIASQFPFPMN 553
Query: 564 --DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV-TESVDYFVQGRTIAAGNLFGR 620
D + ++E H+ L+++ + +TMV +T +L E+ E +TI +
Sbjct: 554 FSDMWTLRRSEEETHSLLVMTKKDQTMVFQTGAILEELKKEECGLATNAKTIFCATIGNG 613
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS- 679
+ ++QV R ++D TQ+ SG ++ V+ DPYV++ S G+
Sbjct: 614 KYIVQVLPRAVVLVDMD--TQETIQNKPFDLSGQ------IIQVA-CDPYVVILASKGTI 664
Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
I L++ + S T ++T A E + + H
Sbjct: 665 ISLVLFENSDGTAMLKTSTAPECKNQDDPEKKIMH 699
>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1283
Score = 217 bits (552), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 242/987 (24%), Positives = 419/987 (42%), Gaps = 144/987 (14%)
Query: 501 FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYELVELPG----------- 546
+ F + D L N+GP++D + G + D S +N ELV G
Sbjct: 376 YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQPVSSILANLELVTTQGYGKAGGLAILR 435
Query: 547 ---------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
G +VY K + + S Y YL++S + +
Sbjct: 436 REIDPFVIDSLMIKDTDGARSVYVKDPKLPSQSGSLPLNPGSNYDHYLLLSKSKGLDKEK 495
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLS 644
++V + E T++ ++ + RTI G L RV+QV + R D G + Q
Sbjct: 496 SVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLASGTRVVQVLKGEVRSYDSGLGLAQIFP 555
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
+ SE +V+ S ADPYVL+ D SI LL D S +T I S+
Sbjct: 556 VWDEDM-----SEEKSVVHTSFADPYVLIIRDDQSILLLQADDSGDLDEAETDGIINSTT 610
Query: 705 KPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGD-IYSVVCYESGA 762
S +LY DK + +G P + Q D + +
Sbjct: 611 --WISGSLYQDKY-------------------RSFKSHEGPPNMKQSDNVLLFLLSSESK 649
Query: 763 LEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
L +F +PN VFT + D +I S+ +E I
Sbjct: 650 LYVFHLPNAREPVFTTESI-----------------DLLPQILSTEPPPRRVTYRETITE 692
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L + P+L ++ ++ Y+ Y + ST R S
Sbjct: 693 LLVADLG----DSVSRSPYLILRSSNSDLILYEPYHYTS-----------STERQFS--G 735
Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAPCQRIT----IFKNISGHQGFFLSGSRPCWCM 937
+ ++ N F ++ ++ + H A C I+ + ++ G++ F+ G+ PC+ +
Sbjct: 736 LRFVKIANHHFPKSHSESNAGK---HPANCTAISKPLRVLGDVCGYRTVFMPGNSPCFII 792
Query: 938 VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
+ L ++ + + + C GF+YV + ++++C+ P + +D W +K
Sbjct: 793 KSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARK 852
Query: 998 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
I L + Y + Y + + V +L D E+ + N +S +
Sbjct: 853 IGLGEQVDAVEYSSSSETYVIGTNQKV------DFNLPEDDEIHPEWRNEVISFLP---- 902
Query: 1058 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1116
+++ V++L P W + ++++E + V+ + L + T E + + +GTA
Sbjct: 903 -QIDKGSVKLLTPRT----WSIIDSYNLRTAERIMCVKCLNLEVSEITHERKDTIVVGTA 957
Query: 1117 YVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKELKGAISALASL--QGHLLI 1169
+GED+AARG + +F D P+ + + +E+KGA+++L+ + QG L+
Sbjct: 958 LTKGEDIAARGCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGSLIA 1017
Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
A G K I+ K G+ L +AF D YV L +K ++GD K ++F + E+
Sbjct: 1018 AQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVNVLKELKGTGMCIMGDALKGLWFAGYSEE 1075
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
+L+L +KD G+L A +FL DG+ L ++V+D+ NI + Y P+ S KG +LL R
Sbjct: 1076 PYKLSLFSKDDGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHR 1135
Query: 1286 AEFHVGAHVTKFLRLQMLAT-SSDRTGAAPGSDKTNRFALLFGTL----DGSIGCIAPLD 1340
+ F G + L AT SS R A P + L L GSI I P+
Sbjct: 1136 STFQTGHFASTMTLLPRTATSSSQRPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVS 1195
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
E ++RRL +LQ +L +++ H GLNPR+FR S+G R +VD +L+ + L
Sbjct: 1196 ETSYRRLSALQSQLTNTLEHPCGLNPRAFRAVESDGIGGR----GMVDGDLVKRWLDLGT 1251
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALG 1427
+ + EIA++ G +I ++L + G
Sbjct: 1252 QRKAEIANRVGADVWEIRADLEAIGKG 1278
Score = 47.4 bits (111), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 45/180 (25%), Positives = 84/180 (46%), Gaps = 21/180 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V +++++ + GS + +T+ + L LV Y L G + L
Sbjct: 28 NLIVAKTTLLQVFNLVNVVYGSGPGQPDEKTRSQY-------TKLVLVAEYALSGTITDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D H + TS+H +E + +++ +
Sbjct: 81 GRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIHYYERDD-VNISPWTPN 137
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIESSH 234
A P + VDP RC VL +G + + IL Q G LV D+ F + +E H
Sbjct: 138 LASCPSYLTVDPNSRC-AVLNFGKKNLAILPFHQVGDDLVMDD--------FDSDVEEQH 188
>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
Length = 1153
Score = 214 bits (546), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 272/1232 (22%), Positives = 503/1232 (40%), Gaps = 181/1232 (14%)
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDS 351
K I S LP+D ++++ + P+GG L+VG N IH + +A+N +A
Sbjct: 10 KASTTILSVGGLPYDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTG 69
Query: 352 SQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP- 408
L ++ + L+ L +N L+ +GD+ +L+ DGR V L + + +
Sbjct: 70 FSLLDQADLDLRLEGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAE 129
Query: 409 ---SVLT---SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
++LT S ++++G F+GS + DS+++ + SG + + E D
Sbjct: 130 LGGAILTGAASCVSSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVD-- 187
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
+ D + G+ ++ + +N T S KT ++F + DS+VNI P+ + ++
Sbjct: 188 -EAMFDEEDLEDDDDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAF 246
Query: 521 G---LRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
G L + D QS +LV +LP +GIW
Sbjct: 247 GEAALSLGKDEELKSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIW 306
Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
T+ K + +G + + D +Y +I+S +A + E+A D
Sbjct: 307 TMSAKRPAPKGLQVNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGF 366
Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E ++ G TI AG L RV+Q+ + R DG + Q L + E+G+
Sbjct: 367 EALTGTEFEPAAGSTIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA- 423
Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
++S S ADP++LL D SI + D ++ I S K ++ C LY D
Sbjct: 424 --EPKIISASFADPFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD 480
Query: 716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
+D+ S GE ++ + GAL I+ +P+ +
Sbjct: 481 ------YSGAFSDSK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPV 520
Query: 776 TVDK---FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
V + FV + A +++ TEI L
Sbjct: 521 YVAEGICFVPPVLSADYAARKSAARETLTEI-----------------------LVANLG 557
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ P+L ++ + Y+ + + S S L S + +++N
Sbjct: 558 DSVSQSPYLILRPSNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHL 605
Query: 893 SRTPLDAYTREETPHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
++ P + EE GA P + I+ N+ G+ F+ G P + + +
Sbjct: 606 TQAP--DVSAEEQVDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKV 660
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKIPLKATPH 1005
L + + + H C+ GFIY +++GI ++ Q P +T+ D ++KI + H
Sbjct: 661 LSLQGTGVRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDVH 720
Query: 1006 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL-SSVDLHRTYTVEEYE 1064
+ Y Y + S D E+ D+ ++ ++E+
Sbjct: 721 AVAYHPPLQTYVIGTST------------FTDFELPKDDDHRKTWQEENIALKPSIEKSF 768
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDV 1123
++++ P W I ++ E ++ + L + T E + L+ +GTA +GED+
Sbjct: 769 LKLVSPVN----WSVIDAIELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDL 824
Query: 1124 AARGRVLLF---STGRNADNPQN------LVTEVYSKELKGAISALASL--QGHLLIASG 1172
A GR+ ++ + D P+ + +E+ ++ G ++ L+ + QG +L+A G
Sbjct: 825 ATTGRLYVYDVVTVVPEPDRPETNKKLKLISSEIITRGAGGPVTGLSEIGTQGFMLVAQG 884
Query: 1173 PKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQ 1228
K ++ K GT L +AF D YV S+ + ++ D K ++F + E+ +
Sbjct: 885 QKCMVRGLKEDGTNLP-VAFMDMN-CYVTSVKELPGTGLCVMADALKGVWFAGYTEEPYR 942
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
+ L K ++ + L DG L +V +D N+ I Y P+ +S +G LL R F
Sbjct: 943 MLLFGKSAAKMEVLCADLLPDGKDLFIVAADANGNLHIMQYDPEHPKSLQGHLLLHRTTF 1002
Query: 1289 HVGAH-------VTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPL 1339
+GAH + L L T+ + + T + LL + G++ ++PL
Sbjct: 1003 SLGAHHPTTMTLLPTTRPLPQLTTAPSPSPDPSPQEDTPSPSQPLLLTSRTGTLALLSPL 1062
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
E +RR +L L +++ H GLNPR++R + G +I+D +L + L
Sbjct: 1063 TESQYRRFGTLVSHLTNTLYHPCGLNPRAYR-IDRDANEGIVGGRTIIDGGVLGRWMELG 1121
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
+ + E+A + G ++ L+ L G F+
Sbjct: 1122 SQRRGEVAGRVGVDVLELRDELSGLRGGLGFI 1153
>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
Length = 1454
Score = 213 bits (542), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 193/748 (25%), Positives = 344/748 (45%), Gaps = 83/748 (11%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
++ DA +S+ GE D +D YS VV +++G + I +P+ V+
Sbjct: 737 KRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLVVAHDNGRITIHSLPDLELVYQ 791
Query: 777 VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT--------GQGRKENIHSM------ 822
+ +F + +VD + E K+ + + ++ E+ + ++ ++S
Sbjct: 792 IGRFSNVPELLVDMTVEEEEKEKKAKQTAAQEKEKETEKKKDDAKNEEDQVNSEMKKLCE 851
Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
KVVE + + + P L AI+ D ++ Y+ + P+ V+ + + +
Sbjct: 852 KVVEAQIVGMGINQAHPVLIAII-DEEVVLYEMFASYNPQPGHLG---VAFRKLPHLIGL 907
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE 941
S N+ R P + E HG I F+ IS + G + G+ P +V+
Sbjct: 908 RTSPYVNIDGKRAPFEM----EMEHGKRYTLIHPFERISSINNGVMIGGAVPTL-LVYGA 962
Query: 942 --RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKI 998
++ H DGSI AFT +N N HGF+Y+T Q L+I ++ YD +PV+KI
Sbjct: 963 WGGMQTHQMTIDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKKI 1022
Query: 999 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHR 1056
+ T H + Y ++Y ++ SVP KP N++ ++ D QE H+ D + + +
Sbjct: 1023 EVGKTVHNVRYLMNSDIYAVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPP 1078
Query: 1057 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN------ETL 1110
YT+ + Q A +P E V + + K +T
Sbjct: 1079 KYTLNLFSS------------QDWAAVPNTEFEFEDMEAVTAMEDVPLKSESRYGGLDTY 1126
Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1165
LA+ T GE+V RGR++L P + +Y KE KG ++ L ++ G
Sbjct: 1127 LALATVNNYGEEVLVRGRIILCEVIEVVPEPGQPTSNRKIKVLYDKEQKGPVTGLCAING 1186
Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
LL G K+ + ++ +L GI+F D YV L+ ++ L D +S+ + ++E+
Sbjct: 1187 LLLSGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSIRTIALALDARESMSLIRFQEE 1245
Query: 1226 GAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+++ ++D C A+EFL+DG + ++SDE NI +F Y+P+ ES G++
Sbjct: 1246 NKAMSIASRD--DRKCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGER 1303
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
L +A ++G ++ FLR++ + D + + R +FG+LDGS G I PL E
Sbjct: 1304 LTVKAAINIGTNINAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTE 1363
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYE 1396
++RRL LQ + P +AGL+ + R Q NG+ R +++D +++ Y
Sbjct: 1364 KSYRRLHFLQTFIGSVTPQIAGLHIKGARSSKPSQPIVNGRNAR----NLIDGDVVEQYL 1419
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
L + ++ ++A + G R IL +L L
Sbjct: 1420 HLSVYDKTDLARRLGVGRYHILDDLMQL 1447
Score = 188 bits (478), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 150/576 (26%), Positives = 266/576 (46%), Gaps = 81/576 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G ++ P+V+ DP
Sbjct: 92 QDSILMTFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFTTYFNPPIVRTDPAN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + ++ S++I L+ +D + +V
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLEGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL++P P+GG +V G+NTI Y +Q+ C + LN+ D + P +
Sbjct: 251 CNSLLSIPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + ++ GDL LL +V G V+ L+ SK + + +T
Sbjct: 308 LDCSTSVYMEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG 481
F+GSRLGDS L+++T ++ S K+ R + + ++
Sbjct: 368 HLFVGSRLGDSQLLEYTL-----------------LKVTKESAKKQRLEQQNPSEIELDE 410
Query: 482 EELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
+++ LYG A +++ E ++ F D L+N+GP+K +G R N ++ +K+
Sbjct: 411 DDIELYGGAIEMQQNDDDEQISESLQFRELDRLLNVGPVKSMCFG-RPNYMSNDLIDAKR 469
Query: 537 SN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLIISL 583
+ ++LV G G V+ +S R SS + ++E H YLI+S
Sbjct: 470 KDPVFDLVTASGHGKNGALCVHQRSMRPEIITSSLLEGAEQLWAVGRKENESHKYLIVS- 528
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQD 642
R+ ++ E + T+AAG L +QV A + DG M Q+
Sbjct: 529 RVRSTLILELGEELVELEEQLFVTNEPTVAAGELLQGALAVQVTSTCIALVTDGQQM-QE 587
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
+ N V+ SI DPYV + +G
Sbjct: 588 VHI----------DSNFPVVQASIVDPYVAVLTQNG 613
>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
Length = 1448
Score = 211 bits (536), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 185/736 (25%), Positives = 339/736 (46%), Gaps = 69/736 (9%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
R+ DA +S+ GE D +D YS +V +++G L I +P+ V+
Sbjct: 741 RRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLMVAHDNGRLSIHSLPDMELVYQ 795
Query: 777 VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR----W 832
+ +F + ++D E K+ + + ++++ + K+ E M+
Sbjct: 796 IGRFSNVPELLMDMTTDEEEKERKAKAQQAAKDTAADEDQLTTEMKKLCERVMEAQIVGM 855
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
+ S P L AI+ D ++ Y+ + P+ ++ + + S N
Sbjct: 856 GINQSHPVLMAIV-DEQVVMYEMFSHYNPQAGHLG---IAFRKLPHFICLRTSSHLNSDG 911
Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR-ERLRVHPQLC 950
R P + E +G I F+ IS + G + G+ P + ++ H
Sbjct: 912 KRAPFEM----EVENGKRYTLIHPFERISSINNGVMIGGAVPTLVVYGAWGGMQTHQMTI 967
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
DG I AFT +N N HGF+Y+T Q L+I ++ Y+ +P++KI + T H + Y
Sbjct: 968 DGPIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYEMPYPMKKIEVGRTIHNVRY 1027
Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1067
++Y ++ S+P KP N++ ++ D QE H+ D + + + YT+ +
Sbjct: 1028 LMNSDVYVVVSSIP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLF---- 1079
Query: 1068 LEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVA 1124
+ W+ I + E V+L + +T ET LA+GT GE+V
Sbjct: 1080 -----SSQDWKAVPNTEIEFEDMEAVTACEDVSLKSESTISGVETYLAVGTVNNYGEEVL 1134
Query: 1125 ARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHK 1179
RGR++L P + ++ KE KG ++ L ++ G LL G K+ + +
Sbjct: 1135 VRGRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLSGMGQKVFIWQ 1194
Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--FG 1237
+ +L G++F D YV L+ ++ L D +S+ + ++E+ +++ ++D
Sbjct: 1195 FKDNDLMGLSFLDMH-YYVYQLHSLRTIALACDARESMSLIRFQEENKAMSIASRDDRRT 1253
Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
+ A +F++DG+ L ++SDE NI +F Y+P+ ES G++L RA ++G +V F
Sbjct: 1254 AKPPMAAQFVVDGAHLGFLLSDENGNITLFNYSPEAPESNGGERLTVRAAMNIGTNVNAF 1313
Query: 1298 LRLQ----MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
LR++ +L SD + R + +FG+LDGS G + PL E ++RRL LQ
Sbjct: 1314 LRVKGHTSLLNLQSDEEKES----VEQRMSTIFGSLDGSFGFVRPLSEKSYRRLHFLQTF 1369
Query: 1354 LVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1408
+ P +AGL+ + R Q NG+ R +++D +++ Y L L ++ ++A
Sbjct: 1370 IGSVTPQIAGLHIKGARSARPAQPIVNGRNAR----NLIDGDVVEQYLHLSLYDKTDLAR 1425
Query: 1409 QTGTTRSQILSNLNDL 1424
+ G R I+ +L L
Sbjct: 1426 RLGVGRYHIIDDLMHL 1441
Score = 181 bits (460), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 155/578 (26%), Positives = 263/578 (45%), Gaps = 86/578 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++AF+DAK+S++ ++ ++ S+H FE+ +L+ G ++ P+V+ DP
Sbjct: 92 QDSILMAFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFINYFHPPIVRTDPSN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + ++ S++I L+ +D + +V
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL +P P+GG LV G+NTI Y +Q+ C + LN+ D + P +
Sbjct: 251 CTSLLPIPKPLGGALVFGSNTIVYLNQAVPPCGVVLNS---CYDGFTKFPLKDMKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD A + ++++ + + G L LL +V G V+ ++ S+ + + +T
Sbjct: 308 LDCATSVYMEDGRIAVGGRDGVLYLLRLVTSSGGATVKSMEFSRVWETSIAYCLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
F+GSRLGDS LV++T T S ++++ G+IE D
Sbjct: 368 HLFIGSRLGDSQLVEYTLLKMTKESAKRQKIEKDPGEIELDE------------------ 409
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+++ LYG A +++ E ++ F D L N+GP+K +G R N +S
Sbjct: 410 --DDMELYGGAIEMQLNDDEEQILESLEFRELDRLRNVGPVKSMCFG-RPNYMSSDLAEM 466
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++LV G G V+ +S R SS + ++E H YLI+
Sbjct: 467 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSILEGAEQLWAVGRKENESHKYLIV 526
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
S R+ ++ E + T+AAG L +QV A + DG M
Sbjct: 527 S-RVRSTLVLELGEELVELEEQLFVTNEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 584
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
Q++ N V+ SI DPYV + +G
Sbjct: 585 QEVHI----------DSNFPVVQASIQDPYVAVLTQNG 612
>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
Silveira]
Length = 1387
Score = 210 bits (535), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 153/533 (28%), Positives = 266/533 (49%), Gaps = 41/533 (7%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
+ +I G++ F+SGS PC+ M +L ++ + + H C GF YV +
Sbjct: 878 YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASN 937
Query: 977 ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
++++C+LP + +DN W +K+ + + YFA +Y L S V L +
Sbjct: 938 MVRMCRLPGNTRFDNSWVTRKVHVGDQIDCVEYFAHSEIYALGSSHKVDFKLPE------ 991
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
D E+ + + +S + +E +++L P W + + +E + ++
Sbjct: 992 DDEIHPEWRSEVISFMP-----QLERGCIKLLSPRT----WSVVDSYELGDAERVMCMKT 1042
Query: 1097 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYSK 1151
+ + + T E + +L +GTA V+GED+ RG + +F A +P N ++++K
Sbjct: 1043 INMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEIIEVAPDPDRPETNRKLKIFAK 1102
Query: 1152 E-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
+ +KGA++A++ + QG L++A G K ++ K G+ L +AF D YV L ++
Sbjct: 1103 DDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVKVLKELQG 1160
Query: 1207 --FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
++GD K I+F + E+ +L L KD L A +FL DG L ++V+D+ I
Sbjct: 1161 TGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAADFLPDGKRLYILVADDDCTI 1220
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN---- 1320
+ Y P+ S KG +LL R+ FH+G H T + L + SS + PG D +
Sbjct: 1221 HVLEYDPEDPTSSKGDRLLHRSSFHMG-HFTSTMTL-LPQHSSSPSADDPGEDDMDVDYV 1278
Query: 1321 --RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+ +L + +GSIG + PL E ++RRL +LQ +LV S+ H GLNP+++R S+G
Sbjct: 1279 PKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSMEHPCGLNPKAYRAVESDGFG 1338
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
R IVD LL + + ++ + EIA + G I +L ++ G FL
Sbjct: 1339 GR----GIVDGNLLLRWLDMGVQRKAEIAGRVGADIESIRVDLEKISGGLDFL 1387
Score = 116 bits (290), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 175/731 (23%), Positives = 291/731 (39%), Gaps = 94/731 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV GD D G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + S ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ + S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437
Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
S L K E D + D VN LS S +N +
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
F F D L N+GP+ D + G R +AD + N
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
+ +++ G+W++ + DS+ Y YL+ S + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
++V E ++ ++ + T+ G L G RV+QV + R D + +
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
P E S+ +V+S S A+PYVL+ D S+ LL D S V I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725
Query: 706 PVSSCTLYHDK 716
+S C LY DK
Sbjct: 726 WLSGC-LYLDK 735
>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
Length = 1387
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 160/559 (28%), Positives = 275/559 (49%), Gaps = 49/559 (8%)
Query: 891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
RF +P AY PH + + + +I G++ F+SGS PC+ M +L
Sbjct: 860 RFDPSP-KAYM----PHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLR 911
Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
++ + + H C GF YV + ++++C+LPS + +DN W +K+ + + YF
Sbjct: 912 GEAVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKVHVGDQIDCVEYF 971
Query: 1011 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1070
A +Y L S V L + D E+ + + +S + +E +++L P
Sbjct: 972 AHSEIYALGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLSP 1020
Query: 1071 DRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRV 1129
W + + +E + ++ + + + T E + +L +GTA V+GED+ RG +
Sbjct: 1021 RT----WSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSI 1076
Query: 1130 LLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KW 1180
+F A +P N ++++K+ +KGA++A++ + QG L++A G K ++ K
Sbjct: 1077 YVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKE 1136
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
G+ L +AF D YV L ++ ++GD K I+F + E+ +L L KD
Sbjct: 1137 DGSLLP-VAFMDMQ-CYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEY 1194
Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1298
L A +FL DG L ++V+D+ I + Y P+ S KG +LL R+ FH G H T +
Sbjct: 1195 LQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTG-HFTSTM 1253
Query: 1299 RLQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
L + SS + P D + + +L + +GSIG + PL E ++RRL +LQ
Sbjct: 1254 TL-LPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQS 1312
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
+LV S+ H GLNP+++R S+G R IVD LL + + ++ + EIA + G
Sbjct: 1313 QLVTSMEHPCGLNPKAYRAVESDGFGGR----GIVDGNLLLRWLDMGVQRKAEIAGRVGA 1368
Query: 1413 TRSQILSNLNDLALGTSFL 1431
I +L ++ G FL
Sbjct: 1369 DIESIRVDLETISGGLDFL 1387
Score = 113 bits (282), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 172/731 (23%), Positives = 294/731 (40%), Gaps = 94/731 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSAPPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++++++ +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVSTRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE-----DTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV DE D G
Sbjct: 142 LRLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDEDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QAAVANDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + + ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LHDRKDIVSYAVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ ++ S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-Q 439
Query: 444 SMLSSGLK---EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
SM ++ + FG + D VN LS S +N +
Sbjct: 440 SMRKPKVESADDVFG--DHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
F F D L N+GP+ D + G R +AD + N
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
+ +++ G+W++ + DS+ Y YL+ S + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
++V E ++ ++ + T+ G L G RV+QV + R D + +
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
P E S+ +V+S S A+PYVL+ D S+ LL D S V I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725
Query: 706 PVSSCTLYHDK 716
+S C LY DK
Sbjct: 726 WLSGC-LYLDK 735
>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
Length = 1452
Score = 207 bits (527), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 330/1440 (22%), Positives = 574/1440 (39%), Gaps = 223/1440 (15%)
Query: 91 VLMDGISAASLELVCHYRLHGNVESLAIL-----SQGGADNSRRRDSIILAFEDAKISVL 145
+L D L LV + G + LA L S GG ++++LA+ AK+ +
Sbjct: 94 LLRDRSQHTKLVLVAELPVAGTIIGLARLKLPHTSSGG-------EALLLAYRGAKMCLT 146
Query: 146 EFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMI 202
E++ L S+H +E E L+ G V + DP RC +
Sbjct: 147 EWNPRRAALETVSIHFYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNLA 203
Query: 203 ILKASQGG--------------------SGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
IL Q + L + D G G ++ S V+ L LD
Sbjct: 204 ILPFRQAEEDLEMEDWDEALDGPKPPKEASLATNGDANGDANGTQSQHSPSFVLRLPLLD 263
Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
+ H F+H Y EP IL + T H T + L + + I S
Sbjct: 264 PTLLHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHLTYKVFTLDLKQ--RASTTILS 321
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA-----VSLDSSQE 354
LP D +++ +P+P+GG L+VGAN IH + +A+N A SL+ E
Sbjct: 322 VTGLPQDLSRVIPLPTPVGGALLVGANELIHIDQSGKANGVAVNPMARQMTSFSLNDQSE 381
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL 411
L ++ +E A +++ LL L +++ DGR V + L S+ N L
Sbjct: 382 L---NYRLEGCAIEPVSMESGELLLILNDASLAIVSFKIDGRTVSGISLVPVSQENGGNL 438
Query: 412 ----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
S I+ IG S F+GS GDS+++ ++ S +++ ++A+
Sbjct: 439 LKSHVSCISRIGKSSMFIGSEYGDSVVLGWS-----RKQSQEKRKKSRVLDAELALDVDD 493
Query: 468 RRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
D + G E + S + N + F ++DSL+ + P+ D + G
Sbjct: 494 IDLDDFDEDDDLYGTESTAAKPSLATNGVTKGGELIFRLQDSLLCLAPIHDVAPGKAVFP 553
Query: 522 -------LRINAD-----ASATGISKQS-----NYEL-------VELPGCKGIWTVYHKS 557
LR A A G K N E+ E P +G WT+ K
Sbjct: 554 LDSEEVVLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWTMCVKK 613
Query: 558 ----SRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
+ G NA S + YD E + +I + ET+D+ +
Sbjct: 614 PLPKALGSNAVVS--SEYDSMELYDRFMIVAKVDLDGYETSDVYALTDAGFESLKDTEFE 671
Query: 607 -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
G T+ AG + + R+IQV + R DG ++Q L + +G+E V+S
Sbjct: 672 PAAGFTVMAGTMGKQMRIIQVLKSEVRCYDGDLGLSQILPM----MDEDTGAE-PRVVSA 726
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SIADPY+++ D SI + A I+S+ + + DKGP ++
Sbjct: 727 SIADPYLMVIRDDNSIFI---------------AKIDSNDE---LDEVEKDKGPLASIK- 767
Query: 725 TSTDAWLSTGVGEAIDGADGGPL--DQGDIYSVVCY---ESGALEIFDVPNFNCVFTVDK 779
W TG A P D+G ++ + +GAL I+D+ N +
Sbjct: 768 -----W-QTGCLYADHDGHFQPKQPDEGSSPRILMFLMSTTGALHIYDLDNLS------- 814
Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
Y+ E L S S++ G KE + + V +L P
Sbjct: 815 --------EPVYVAEGLT-STPPFLSANFTGRKAAAKETLTEILVADLG----DVVAKSP 861
Query: 840 FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
+L + Y+ + P ++S S +L + S + + D
Sbjct: 862 YLILRHDTDDLTLYEPVRYHEPNSSS-----APLSDTLFFKKSTNSTIAKSAPASDKEDD 916
Query: 900 YTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
T+++ P Q + N+ G+ FLSG P + + + + L + +
Sbjct: 917 ETQQK--RFVPLQ---LCANVGGYSAVFLSGDSPSFILKSAKSIPRIVGLQGQGVQGMST 971
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPL 1018
H C+ GFIY ++GI ++ QLP+ + Y V+KIPL +++++ + Y
Sbjct: 972 FHTEGCDRGFIYADTKGIARVSQLPTDTNYAELGISVKKIPLDCDVNRVSFHSHTATY-- 1029
Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
I + +P E+ D H + ++ T+ ++++ P W
Sbjct: 1030 IAACSTREPF----------ELPKDDDYHKEWARETVNFAPTMPRGILKLISP----AAW 1075
Query: 1078 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
++ ++S E ++ + L + TKE ++A+G+A +GED+ RGRV +F
Sbjct: 1076 TVIHSLDLESCETIESMMALHLEISEETKERRMVVAVGSAICKGEDLPTRGRVQVFDIVT 1135
Query: 1137 NADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKIILHKWTGTELNG- 1187
P N ++ +KE +G +++L+ + G LLIA G K ++ G +G
Sbjct: 1136 VIPEPGRPETNKRLKLLAKEELPRGGVTSLSEIGTSGLLLIAQGQKCMVR---GLREDGG 1192
Query: 1188 ---IAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
+AF D +++ + ++ L+ D K ++F + E+ +L K G +
Sbjct: 1193 LLPVAFLDMN-CHILGVRELRGTGLCLMADAFKGMWFAGYTEEPYTFKVLGKSGGQIPML 1251
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKFLRL 1300
+FL DG L+++ D ++ +F + P +S +G LL R F + + T L
Sbjct: 1252 VADFLPDGEDLNMIGVDADGDLHVFEFNPDHPKSLQGHLLLHRTTFSLSPNEPTTTVLLE 1311
Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
+ + S + G++ + LL G + + PL E +RRL SL +L+ +V
Sbjct: 1312 RTIPASQPQPQGTTGAETPH--TLLLSCPTGQLAALTPLSESAYRRLLSLANQLMPAVVP 1369
Query: 1361 VAGLNPRSFRQFHSNGK---AHRPGPDS------IVDCELLSHYEMLPLEEQLEIAHQTG 1411
GL+P++ R G A G ++ IVD +L+ + L ++ E+A ++G
Sbjct: 1370 YGGLHPKAHRLPEGRGAQSHARAVGVETAASGRMIVDGAVLARWTELGAAKRAEMATKSG 1429
>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1444
Score = 207 bits (527), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 330/1450 (22%), Positives = 575/1450 (39%), Gaps = 236/1450 (16%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
GET +L D L LV + G V LA L ++ ++++LA+ AK+ +
Sbjct: 85 GET--LLLRDRAQNTKLVLVAEIPVAGTVIGLARLKLQNTESGG--EALLLAYRGAKMCL 140
Query: 145 LEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQM 201
E++ L S+H +E E L+ G V + DP RC +
Sbjct: 141 TEWNPQKAALDTVSIHYYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNL 197
Query: 202 IILKASQGGSGL-VGDEDTFGSG-----------GGFSARIESSH----VINLRDLD--M 243
IL Q L + D D G G ES H V+ L LD +
Sbjct: 198 AILPFRQAEEDLEMEDWDEALDGPKPAKEAALATNGDDHETESQHSPSFVLRLPLLDPTL 257
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
H F+H Y EP IL + T H T + L + + I S
Sbjct: 258 LHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHMTYKVFTLDLKQ--RASTTILSVTG 315
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
LP D +++ +P+P+GG L+VG N IH + +A+N A + S +S +
Sbjct: 316 LPQDLKRVIPLPTPVGGALLVGENELIHIDQSGKANGVAVNPMARQMTSFSLADQSELNY 375
Query: 363 ELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL----TS 413
L+ A +++ LL L +++ DGR V + L S+ N L S
Sbjct: 376 RLEGCAIEPISMESGELLLILNDASLAIISFKIDGRTVSGISLAAVSQENGGNLLKSRVS 435
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
I+ IG + F+GS GDS+++ ++ S +++ ++ D L D
Sbjct: 436 CISRIGKASMFIGSESGDSVVLGWS-----RKQSQEKRKKSRALDTD------LALDVED 484
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLRIN 525
D E+ LYG+ S + +Q F ++D+L+ + P+ D + G +
Sbjct: 485 IDLDDDFDEDDDLYGTESAAAKPSQAGAGATKGGEPVFRLQDALLCLAPIHDVAPGKAVF 544
Query: 526 AD-----------------ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK 556
A A G K N E+ E P +G W + K
Sbjct: 545 PSDSEEAFLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWAMCVK 604
Query: 557 SSRGHNADSSRM--AAYD--DEYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
SS + + YD ++Y ++I++ + ET+D+ +
Sbjct: 605 KPVPKALGSSAVISSEYDSTEQYDRFMIVA-KVDLDGYETSDVYALTDAGFESLKDTEFE 663
Query: 607 -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
G T+ AG + + R++QV + R DG ++Q L + +G+E V+S
Sbjct: 664 PAAGFTVMAGTMGKQMRIVQVLKSEVRCYDGDLGLSQILPM----LDEDTGAE-PRVVSA 718
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SIADPY+++ D S+ + + + +E +K DKGP
Sbjct: 719 SIADPYLMIIRDDNSVFI---------AKIGSNDELEEVEK---------DKGP------ 754
Query: 725 TSTDAWLSTGVGEAIDGA--DGGPLDQGDIYSVVCYES--GALEIFDVPNFNCVFTVDKF 780
+ W + + DG P D +++ S GAL ++D+ N +
Sbjct: 755 LVSTKWQTGCLYTDYDGTFQAKKPDDNASPRTMMFLMSTAGALHMYDLDNLS-------- 806
Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
Y+ E L S S++ G KE + + V +L PF
Sbjct: 807 -------EPVYVAEGLT-STPPFLSANFTGRKAAAKERLTEILVADLG----DVVSKSPF 854
Query: 841 LFAILTDGTILCYQAYLFEGPENTS---------KSDDPVSTSRSLSVSNVSASRLRNLR 891
L + Y+ ++ P ++S K + ++S S + + R
Sbjct: 855 LILRHDTDDLTLYEPVRYQEPNSSSPPLTDTLFFKKSANATIAKSASAFDKEEDETQQRR 914
Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
F PL PC N+ G+ FLSG P + + + + L
Sbjct: 915 F--VPLQ-----------PC------GNVGGYSTVFLSGDSPSFVLKSAKSIPRIVGLQG 955
Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYF 1010
+ + H C+ GFIY ++GI ++CQLP+ + Y V+KIPL +++++
Sbjct: 956 QGVQGMSTFHTAGCDRGFIYADTKGIARVCQLPTDTNYAELGISVKKIPLDCDVNRVSFH 1015
Query: 1011 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL-HRTYTVEEYEVRILE 1069
+ Y I + +P E+ D H + ++ T+ ++++
Sbjct: 1016 SHTATY--IAACSTREPF----------ELPKDDDYHKEWAREVVSFAPTMPRGMLKLIS 1063
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1128
P W ++ ++S E ++ + L + TKE L+A+G+A +GED+ RGR
Sbjct: 1064 P----AAWTVIHSLDLESCETIESMMALHLEISEETKERRMLVAVGSAICKGEDLPTRGR 1119
Query: 1129 VLLFST-------GRNADNPQNLVTEVYSKELKGAISALASL--QGHLLIASGPKIILHK 1179
V +F GR N + L + + +G +++L+ + G LLIA G K ++
Sbjct: 1120 VQVFDIVTVIPEPGRPETN-KRLKLQAKEELPRGGVTSLSEIGTSGLLLIAQGQKCMVR- 1177
Query: 1180 WTGTELNG----IAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLA 1233
G +G +AF D +++ + ++ L+ D K ++F + E+ +L
Sbjct: 1178 --GLREDGGLLPVAFLDMN-CHILGVRELRGTGLCLMADAFKGMWFAGYTEEPYTFKVLG 1234
Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
K G + +FL DG LS++ D ++ +F + P +S +G L+ R F + +
Sbjct: 1235 KSGGQIPMLVADFLPDGEDLSMIGVDADGDLHVFEFDPDHPKSLQGHLLIHRTTFSLSPN 1294
Query: 1294 --VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
T L + + S + G++ + LL G + + PL E +RRL SL
Sbjct: 1295 EPTTTVLLERTIPASQPQPKGTTGAETPH--TLLLSCPTGQLAALTPLSESAYRRLLSLT 1352
Query: 1352 KKLVDS-VPHVAGLNPRSFR-------QFHSN--GKAHRPGPDSIVDCELLSHYEMLPLE 1401
+++ + VPH GL+P++ R Q HS G IVD +L+ + L
Sbjct: 1353 NQVLPAVVPH-GGLHPKAHRLPEGRGAQSHSRAVGVETAASGRMIVDGAVLARWTELGAA 1411
Query: 1402 EQLEIAHQTG 1411
++ E+A ++G
Sbjct: 1412 KRAEMALKSG 1421
>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
Length = 1399
Score = 206 bits (523), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 322/1424 (22%), Positives = 547/1424 (38%), Gaps = 193/1424 (13%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GG 123
Y R ++ ES G V D + L LV L G V LA + GG
Sbjct: 65 YDHRANDDDGLESSFLGGESMLVRTDRTNLTKLVLVAELPLSGTVTGLAKVKTKHSKCGG 124
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPL 182
+++++A++ AK+ + +D L S+H +E E LH SF
Sbjct: 125 -------EALLIAYKAAKLCMAVWDPEKSNLETISIHYYEKEE-LHGAPWEVSFDEYTNY 176
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
++ DP RC + IL Q L D+ G + ES+ V N D D
Sbjct: 177 LEADPGSRCAAFQFGSRNLAILPFRQAEEDLEMDDWDEDLDGPRPVK-ESTTVAN-GDSD 234
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
EP IL + H T + L + + I S
Sbjct: 235 TLEPA---------EPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ--RASTTILSVT 283
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
+LP D +K++ +P+P+GG L++G N IH S +A+N+ A + S ++ +
Sbjct: 284 DLPRDLFKIIPLPAPVGGSLLIGENELIHVDQSGKSNGVAVNSMARQITSFSLTDQADLN 343
Query: 362 VELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSDI 415
+ L+ ++N LL G + ++T DGR V R+ + +++ S
Sbjct: 344 LRLEHCVIETLSIENGELLLVLNDGRIGIVTFQIDGRTVSGLTVRMVADENGGNLIKSRA 403
Query: 416 TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+T +G + +F+GS +GDS+++ +T G K D E
Sbjct: 404 STASKLGKNAYFVGSEVGDSVVLGWTRKMGQEKRR---KPRLIDAEIGLEMDDLDLEDED 460
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-LRINADASAT 531
D D+ E + + + N SF + D+L++I P+KD + G + + D+
Sbjct: 461 DEDDDLYGTESAAAKPAQALNGGGKTGELSFRIHDTLLSIAPIKDLTPGKVSFHPDSEEA 520
Query: 532 GISKQSNYEL----------------------------VELPGCKGIWTVYHKS----SR 559
+S+ +L E P + WT+ K +
Sbjct: 521 TLSQGVVSDLHLACVVGRGKAGSLAILNRNIQPKIIGRFEFPEARDFWTMSVKKPMPKAL 580
Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT------EVTESVDY-FVQGRTI 612
G N ++ Y+I++ + ET+D+ E + ++ G T+
Sbjct: 581 GGNVGMGNEYETFGQHDKYMIVA-KVDLDGYETSDVYALTGAGFETLKDTEFDPAAGFTV 639
Query: 613 AAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
AG + + R+IQV + R DG +TQ L + E+G+ V S SIADPY+
Sbjct: 640 EAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRVTSASIADPYL 694
Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
LL D S+ L D + V+ A + K S C KG + ++D
Sbjct: 695 LLIRDDSSLMLAQIDSNNELEEVEKMDATLQNTKWHSGCLYADTKGA---FQPNASDKGA 751
Query: 732 STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHI-VD 789
T I + +GAL ++ +P+ + V+ + H+ D
Sbjct: 752 ETE----------------KIMMFLLSSTGALHVYALPDLSKPVYVAEGLCYVPPHLSAD 795
Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
+R L KEN+ + V +L P+L
Sbjct: 796 YTLRRGLA------------------KENLREILVADLG----DTTSQSPYLILRNQTDD 833
Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
+ Y+ P + S S +L+ S + L + D E P
Sbjct: 834 LTIYE------PLRHVRDGGETSLSATLTFKKTSNTTLATIPVETEQDDV----EQPRFV 883
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
P + NI+G+ FL G P + + + + L + + H C+ GF
Sbjct: 884 PLRPCA---NINGYSTVFLPGPSPSFVIKSSKSIPRVIGLQGLGVRGMSTFHTEGCDRGF 940
Query: 970 IYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
IY +GI ++ QLP + + V+K+PL A I Y Y I + +P
Sbjct: 941 IYADDKGIARVTQLPPDTNFTELGISVKKVPLGADVRGIAYHQPTGAY--IAGCMISEPF 998
Query: 1029 NQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
E+ D H + L T+ ++++ P W + ++S
Sbjct: 999 ----------ELPKDDDYHKEWAKETLTFPPTMPRGVLKLISPVS----WTVIHEVELES 1044
Query: 1088 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1146
E+ ++ + L + TKE L+ +GTA +GED+ RGRV +F P T
Sbjct: 1045 CESIECMKTLHLEVSEDTKERRFLVTVGTAVSKGEDLPIRGRVHVFDIVTVIPEPGRPET 1104
Query: 1147 EVYSKEL------KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL 1196
K + +G ++A++ + QG +L+A G K ++ K G+ L +AF D
Sbjct: 1105 NKRLKAIAREDIPRGGVTAISEIGTQGLMLVAQGQKCMVRGLKEDGSLLP-VAFLDMSCH 1163
Query: 1197 YVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+ + + L+ D K ++F + E+ +L K G L +FL DG L++
Sbjct: 1164 VSTARELPRTGLCLMADAFKGVWFAGYTEEPYTFKVLGKSHGRLPVLVADFLPDGEDLAI 1223
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKFLRLQMLATSSDRTGAA 1313
V +D ++ I + P+ +S +G LL R F V + T L + L S
Sbjct: 1224 VAADADGDLHILDFNPEHPKSLQGHLLLHRTSFSVSPNPPSTTLLLPRTLPPSHPPP--- 1280
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
+ LL + G + + PL E T+RRL S+ +L+ ++ GLN ++ H
Sbjct: 1281 ----QDPPHILLLASSSGHLATLVPLPETTYRRLLSVTNQLLPALTPHGGLNAKA----H 1332
Query: 1374 SNGKAHRP------GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
RP G +IVD +L+ + L ++ EIA + G
Sbjct: 1333 RLPDGIRPVGVEAAGGRTIVDGAILARWAELGAAKRAEIAGKGG 1376
>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 1342
Score = 204 bits (519), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 331/1438 (23%), Positives = 568/1438 (39%), Gaps = 253/1438 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G V SLA DN+ D+II+AF DAK+S++E+D H + S+H
Sbjct: 46 LSLVAEYPLAGTVISLA--RTKPRDNASGGDAIIIAFRDAKLSLVEWDPENHRISTISLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+E + G ++ VDP RC + Q+ IL G L G+E+
Sbjct: 104 YYEGDNVITPPFGPTLAESESILTVDPSSRCAALKFGARQLAILPFRHFGDELAGEEEED 163
Query: 221 GSG----GGFSARIESSH--------------VINLRDLD--MKHVKDFIFVHGYIEPVM 260
G S R ES+H V+ L LD + H F+H Y EP
Sbjct: 164 GFENEPMSAVSKRRESTHLNGEEEQTPYKASFVLPLTALDPTLSHTVHLAFLHEYREPTF 223
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
IL + + + ++ + + + LP +K+ +P PIGG
Sbjct: 224 GILSAPMEPSNALLEERKDVLTYTVYTLDLEQRASTNLITVPKLPSTLWKVKPLPLPIGG 283
Query: 321 VLVVGANTIHYHSQSASC-ALALNNYA-------VSLDSSQELPRSSFSVELDAAHATWL 372
L+VG N + + QS A A+N +A +S S L S+E + L
Sbjct: 284 ALLVGTNELVHVDQSGKANATAVNEFAKLESDFGMSDQSHLNLKLEDCSIETIDPKSGQL 343
Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-------TNPSVLTSDITTIGNSLFFL 425
LL T G L ++ GR + ++++ T+ S S I + N F+
Sbjct: 344 -----LLVTSDGALAIIEFKLLGRSISAINVTPVTEDNGVTSLSAAPSCIANLANGSVFI 398
Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD------IEADAPSTKRLRRSSSDALQDMV 479
GS G S L+ ++ + + G +A L ++ +A + V
Sbjct: 399 GSEDGASSLMGWSQPTAPLTRKRSHAQMLGKDGDEEDEDAIEEDDDDLYDAAPEAKKRAV 458
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD-----ASATGIS 534
+ EL S+ + F +RD L ++GP+ G + + A+ATG
Sbjct: 459 SDTELG----------SSNAAYQFEIRDHLQSLGPIHRMCVGRQGKSSDKLQLAAATG-R 507
Query: 535 KQS------NYELVELPGCKGIWTVYHKSSRGHNADSS---RMAAYDDEYHAYLIISLEA 585
KQS N ++V PG ++SR NA S+ R DE +L+
Sbjct: 508 KQSGRLTLLNRDVVPTPG---------RASRFENAKSAWAVRAHQAGDES------TLDN 552
Query: 586 RTMVLETADLLT-EVTESVDYFV--------------QGRTIAAGNLFGRRRVIQVFERG 630
+ V E A+ E++ + ++FV +G T+ L + ++Q ++
Sbjct: 553 KLFVFEGANTKAYEISSADEHFVEDRYPEHAKSEWESEGETLEVVALADGKIIVQFRKQE 612
Query: 631 ARILDGSY-MTQDLSFGPSNSESGSGSENS-TVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
R D + M Q L P E +EN ++ +++ DPYVL+ D SI++L
Sbjct: 613 VRTYDANLAMNQIL---PMEDE----AENELNIVHIAVCDPYVLVIRDDSSIQIL----- 660
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
SVQ + +P+ + +K WL+ + G L
Sbjct: 661 ----SVQG-----NELEPLEAEGSVAEK------------KWLTGSLY-------AGTLT 692
Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI-VDTYMREALKDSETEINSSS 807
QG + G L F +P+ +F + I VD R A
Sbjct: 693 QGSAAVFLLNADGGLHAFALPDLQPLFAIPTLPHLPPVIAVDAAQRRA------------ 740
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
G +E + + V +L ++P+L ++ Y+ + + P+ + +
Sbjct: 741 ------GTRETLTEVLVSDLGQHGV----TQPYLVLRTAMDDVVLYEPFHY--PQTSGRK 788
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
S + L R R + FS P + + E+ P ++ I +
Sbjct: 789 ----SWHQDL--------RFRKVPFSHIPKYSESIAESQSARPPPLKSV--KIDTYSAIA 834
Query: 928 LSGSRPCWCM----VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
+ G+ PC + + L + + ++ V C +GF + + L+ QL
Sbjct: 835 IPGAPPCLLLKEPSTLPKVLEIRQSAELNRLSMLCPINRVGCENGFFMINADEELEEQQL 894
Query: 984 PSGSTYDNYWPVQKIPLKATPHQ------ITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
P + Y W V ++P+ P+Q I Y E+ LY V+ +V +
Sbjct: 895 PLNTWYGTGWSVHQVPI-GHPNQIEDVRRIAYHEERGLY-------VVATCREVDFYFAE 946
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP------DRAGGPWQTRAT----IPMQS 1087
++ H + D+ V +Y V ++ D P+ T + +++
Sbjct: 947 EDGRHPEQD------DITLRPKVPQYNVHLISAISHHIIDTVHMPYLAAITDLQVMMLEA 1000
Query: 1088 SENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL--- 1144
SEN T E + L+ + A +GED+ A+G + ++ +P
Sbjct: 1001 SEN-------------THEQKPLVVVSAAAQRGEDMPAKGTLYVYDIIDVVPDPDIAESG 1047
Query: 1145 --VTEVYSKELKGAISALAS--LQGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL-Y 1197
+ ++ +E +GAI+ALA G + A G K+++ K G+ L +AF DA +
Sbjct: 1048 VKLHQLAREENRGAITALAGPFPGGFIGTAQGLKVMIRGMKEDGSCLP-VAFLDAQSYTH 1106
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG-SLDCFATEFLIDGSTLSLV 1256
V+ + L GD K ++F + E+ ++ +L K ++ + EFL L +V
Sbjct: 1107 VLKTLPGRGMWLAGDAWKGLWFGGFTEEPYRVTVLGKAPKMHMEVMSAEFLPFDGALYIV 1166
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDR-----T 1310
V D ++ + Y P+ +S G +LL R+ FH+G T + L LA+ + +
Sbjct: 1167 VLDADCDMHVLQYDPENPKSLNGMRLLHRSTFHIGHFTTNSMLLPSTLASFAAQQHEMMN 1226
Query: 1311 GAAPGSDKTNRFA-LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
G + K + +L + G+IG I PLDE +RRL +LQ L + H AGLNPR++
Sbjct: 1227 GGSKAEVKPDPLQHVLTSSTSGAIGLITPLDEQAYRRLSALQTHLTSILEHAAGLNPRAY 1286
Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
R S G +VD L+ L + ++ + G + + S+L + G
Sbjct: 1287 RSIESESFG---GARGVVDGLLVRRIHELGAARRADVLGRAGVSAWGLRSDLEIIGGG 1341
>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 1408
Score = 202 bits (515), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 149/529 (28%), Positives = 265/529 (50%), Gaps = 39/529 (7%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV + ++
Sbjct: 900 DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 959
Query: 979 KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
++C+ P + +D W +KI L + Y + Y L S V L + D
Sbjct: 960 RMCRFPRNTHFDGSWAARKIGLGEQVDSVEYSSSSETYVLGTSQKVDFKLPE------DD 1013
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E+ + N +S +++ V++L P W + +++SE + V+ +
Sbjct: 1014 EIHPEWRNEVISFFP-----QIDKGSVKLLNPRT----WSIIDSYQLRTSERVMCVKCLN 1064
Query: 1099 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKE 1152
L + T E + ++A+GTA +GED+AARG + +F + D P+ + + +E
Sbjct: 1065 LEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKEE 1124
Query: 1153 LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN-- 1206
+KGAI++L+ + QG L+ A G K I+ K G+ L +AF D YV L +K
Sbjct: 1125 VKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVSVLKELKGTG 1182
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
++GD K ++F + E+ +L+L +KD GSL A +FL DG L ++V+D+ NI +
Sbjct: 1183 MCIMGDALKGLWFAGYSEEPYKLSLFSKDDGSLQVMAADFLPDGKRLYIMVADDDCNIHV 1242
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF- 1322
Y P+ S KG +LL R+ FH G + +T R +L+ + A D +
Sbjct: 1243 LQYDPEDPGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPETEANAMDLDLSGPLH 1302
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
+L + GSI I P+ E+ +RRL +LQ ++++++ H GLNPR+FR S+G R
Sbjct: 1303 QVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR-- 1360
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
+VD +L+ + L + + EIA + G +I ++L A+G + L
Sbjct: 1361 --GMVDGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1405
Score = 125 bits (314), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 184/750 (24%), Positives = 304/750 (40%), Gaps = 121/750 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++Y + GS ++ +T+ + + L LV Y L G + L
Sbjct: 28 NLIVAKTTLLQVYNLVNVVYGSSPGQSDEKTRSQY-------SKLVLVAEYALSGTITDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++I++A +AK+S++E+D H + TS+H +E + +H+ +
Sbjct: 81 GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137
Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-------------------- 214
A P + VDP RC VL +G + + IL Q G LV
Sbjct: 138 LAACPSHLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMDDFDSDHDDERQIDTNHTA 196
Query: 215 --GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
DE G + SS V+ + L+ M H F++ Y EP IL+ +
Sbjct: 197 EERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQVAAS 256
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-I 329
+ + + S ++ + + S LP+D +K++ +P P+GG L+VG+N +
Sbjct: 257 SALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSNELV 316
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLV 387
H + A+ +N +A S +S + L+ L +N LL G +
Sbjct: 317 HVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNGVMA 376
Query: 388 LLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
+++ DGR V + L +T PS +G F GS GDS+L+
Sbjct: 377 VVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SAPVGRGKIFFGSEEGDSILI- 431
Query: 437 FTCGSGTSMLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY----- 487
G S LS+G K E G+ D + D D + E LY
Sbjct: 432 -----GWSRLSAGAKVSPAPETGE---DNVAELSEDEEDDDDDDDEEDAYEDDLYATPVT 483
Query: 488 -GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNY 539
G NT S T + F + D L N+GP++D + G + D + S +
Sbjct: 484 PGINPRNTASMNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYL 543
Query: 540 ELVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDD 573
ELV G G+ +V+ K + + S A
Sbjct: 544 ELVTTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPSQSGSLPANAGS 603
Query: 574 EYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVF 627
Y YL++S + +++V + + E T + ++ + RTI G L G RV+QV
Sbjct: 604 NYDHYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVL 663
Query: 628 ERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
+ R D G + Q ++ SE +V+ S A+PYVL+ D SI LL D
Sbjct: 664 KGEVRSYDSGLGLAQIYPVWDEDT-----SEERSVMHASFAEPYVLIIRDDSSILLLQAD 718
Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
S ++T I+S+ S +LY DK
Sbjct: 719 ESGDLDEIETDGIIKSTT--WISGSLYQDK 746
>gi|147827332|emb|CAN62175.1| hypothetical protein VITISV_001516 [Vitis vinifera]
Length = 1989
Score = 201 bits (511), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 121/228 (53%), Positives = 148/228 (64%), Gaps = 49/228 (21%)
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
+G+L+LLT+V DGRVV +L LSK+ SV TS I IG+SL F GS+LGDSLLVQF
Sbjct: 1657 SGELLLLTLVCDGRVVYKLGLSKSRASVFTSGIAAIGSSLSFPGSQLGDSLLVQF----- 1711
Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
T++ SS ++++ GD E D PSTKR RRSSSDALQDM NG++L LY
Sbjct: 1712 TAIPSSSVEKKVGDSEGDVPSTKRSRRSSSDALQDMDNGDKLPLY--------------- 1756
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
V DSL+N+GPLKDF+YGLRIN D ATGI KQSNYEL
Sbjct: 1757 --VSDSLINVGPLKDFAYGLRINTDLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRP 1814
Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMA-AYDDEYHAYLIISL 583
VELPGCKGIWTVYHK++RGHNADS +M+ +D E+ A++ SL
Sbjct: 1815 ERITEVELPGCKGIWTVYHKNTRGHNADSIKMSHVFDLEFRAFIFFSL 1862
>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
Length = 1394
Score = 201 bits (510), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 154/552 (27%), Positives = 267/552 (48%), Gaps = 60/552 (10%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-- 974
+ +I G++ F+SGS PC+ M +L ++ + + H C GF YV +
Sbjct: 866 YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASV 925
Query: 975 -----------------QGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYP 1017
Q ++++C+LP + +DN W +K+ + + YFA +Y
Sbjct: 926 CVPKQYFVPWNKLILVIQNMVRMCRLPGNTRFDNSWVTRKVHVGDQIDCVEYFAHSEIYA 985
Query: 1018 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
L S V L + D E+ + + +S + +E +++L P W
Sbjct: 986 LGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLSPRT----W 1030
Query: 1078 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
+ + +E + ++ + + + T E + +L +GTA V+GED+ RG + +F
Sbjct: 1031 SVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEIIE 1090
Query: 1137 NADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNG 1187
A +P N ++++K+ +KGA++A++ + QG L++A G K ++ K G+ L
Sbjct: 1091 VAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRGLKEDGSLLP- 1149
Query: 1188 IAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
+AF D YV L ++ ++GD K I+F + E+ +L L KD L A +
Sbjct: 1150 VAFMDMQ-CYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKDNEYLQVIAAD 1208
Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
FL DG L ++V+D+ I + Y P+ S KG +LL R+ FH+G H T + L +
Sbjct: 1209 FLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMG-HFTSTMTL-LPQH 1266
Query: 1306 SSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1359
SS + PG D + + +L + +GSIG + PL E ++RRL +LQ +LV S+
Sbjct: 1267 SSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQSQLVTSME 1326
Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
H GLNP+++R S+G R IVD LL + + ++ + EIA + G I
Sbjct: 1327 HPCGLNPKAYRAVESDGFGGR----GIVDGNLLLRWLDMGVQRKAEIAGRVGADIESIRV 1382
Query: 1420 NLNDLALGTSFL 1431
+L ++ G FL
Sbjct: 1383 DLEKISGGLDFL 1394
Score = 116 bits (290), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 175/731 (23%), Positives = 291/731 (39%), Gaps = 94/731 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + G+ N+ + R ++ L LV Y L G + L
Sbjct: 28 NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E +H
Sbjct: 85 GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
P L+ VDP RC +L +G+ + IL Q G LV GD D G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200
Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ SS V+ L LD + H F++ Y EP IL+ T +
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ + S ++ + + + LP D +K++ +P PIGG L++G+N IH
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + + +S + L+ L D LL G + +L
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380
Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ DGR V +L K S+L + + ++G F GS DSLL+ + S
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437
Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
S L K E D + D VN LS S +N +
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495
Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
F F D L N+GP+ D + G R +AD + N
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555
Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
+ +++ G+W++ + DS+ Y YL+ S + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610
Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
++V E ++ ++ + T+ G L G RV+QV + R D + +
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
P E S+ +V+S S A+PYVL+ D S+ LL D S V I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725
Query: 706 PVSSCTLYHDK 716
+S C LY DK
Sbjct: 726 WLSGC-LYLDK 735
>gi|38014465|gb|AAH60475.1| LOC398931 protein, partial [Xenopus laevis]
Length = 363
Score = 199 bits (507), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 178/303 (58%), Gaps = 23/303 (7%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL+ + GN+ S+A + GA +RD+++L+F++AK+SV+E+D H L+ S+H
Sbjct: 66 LELMASFSFFGNIMSMASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVG 215
FE PE L+ G P V+VDP GRC +L+YG Q+++L ++ GLVG
Sbjct: 122 YFEEPE---LRDGFVQNVHIPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHEGLVG 178
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
+ G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GR
Sbjct: 179 E--------GQKSSFLPSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGR 230
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
V+ + TC I A+S++ K HP+IWS +LP+D + LAVP P+GGV++ N++ Y +
Sbjct: 231 VAVRQDTCSIVAISLNIMQKVHPIIWSLNSLPYDCTQALAVPKPVGGVVIFAVNSLLYLN 290
Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
QS ++LN+ S P+ + LD + AT++ D ++S K G++ ++T++
Sbjct: 291 QSVPPYGVSLNSLTNGTTSFPLKPQEEVRITLDCSQATFISYDKMVISLKGGEIYVVTLI 350
Query: 393 YDG 395
DG
Sbjct: 351 TDG 353
>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
Length = 1204
Score = 199 bits (507), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 144/547 (26%), Positives = 262/547 (47%), Gaps = 46/547 (8%)
Query: 898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
D + EE P + I NISG F G P + + L G +
Sbjct: 678 DQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKGGYAQSL 734
Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYP 1017
+ C GFI + S+G++ +CQ+P G D W +Q+IP+ + Y + +Y
Sbjct: 735 SKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQIPIGEQVDHLAYSSSSGMYV 794
Query: 1018 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
+ S L D E+ + N S V+ ++++ P W
Sbjct: 795 IGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRSSLKVVSPKT----W 839
Query: 1078 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
+ + +E+ + V+ ++L + T E + ++ +GTA+ +GED+A+RG V +F +
Sbjct: 840 TVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVYVFEVIK 899
Query: 1137 NADNPQNLVTE-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNG 1187
+P+ + V + +KGA++AL+ + QG L++A G K I+ K G+ L
Sbjct: 900 VVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGLKEDGSLLP- 958
Query: 1188 IAFYDAPPLYVVSLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
+AF D +++VK ++ D K ++F + E+ +++L AKD L+
Sbjct: 959 VAFMDVQ----CHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDLDYLEVL 1014
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
A +FL DG+ L ++V+D N+ + Y P+ +S G +LLSR++FH G ++ L
Sbjct: 1015 AADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFISTLTLLPR 1074
Query: 1303 LATSSDR----TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1358
+ SS++ A K R +L + +GS+G + + E ++RRL +LQ +L +++
Sbjct: 1075 TSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLSALQSQLTNTI 1134
Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
H GLNPR+FR S+G A R ++D +LL + + + ++EIA + G +I
Sbjct: 1135 EHPCGLNPRAFRAVESDGTAGR----GMLDGKLLFQWLDMSKQRKVEIASRVGANEWEIK 1190
Query: 1419 SNLNDLA 1425
++ ++
Sbjct: 1191 ADFEAIS 1197
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 159/355 (44%), Gaps = 40/355 (11%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++I+LAF +AK++++E+D +G+ S+H +E + + + G ++ VDP R
Sbjct: 88 EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
C V +G++ + IL Q G LV D+ + GS G A
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206
Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
SS V+ L LD + H F++ Y EP IL+ + T + + + +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
+ + + S LP D +K++A+P P+GG L++G+N +H + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326
Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
+ S +S ++ L+ L N LL TG++VL+ DGR V + +
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386
Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
P S +G+ FLGS DS+L+ G S+ SSG K+
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKK 435
>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1377
Score = 198 bits (503), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 146/529 (27%), Positives = 263/529 (49%), Gaps = 49/529 (9%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV + ++
Sbjct: 877 DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 936
Query: 979 KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
++C+ P + +D W +KI L + Y + Y L S Q
Sbjct: 937 RMCRFPRNTHFDGSWAARKIGLGEQVDSVEYSSSSETYVLGTS----------------Q 980
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
+V ++ + ++H + EE V++L P W + ++++E + V+ +
Sbjct: 981 KVDFKLPEDD----EIHPEWRNEE-SVKLLNPRT----WSIIDSYQLRTAERVMCVKCLN 1031
Query: 1099 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKE 1152
L + T E + ++A+GTA +GED+AARG + +F + D P+ + + +E
Sbjct: 1032 LEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKEE 1091
Query: 1153 LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN-- 1206
+KGAI++L+ + QG L+ A G K I+ K G+ L +AF D YV L +K
Sbjct: 1092 VKGAITSLSGIGGQGFLIAAQGQKCIVRGLKEDGSLLP-VAFMDMQ-CYVSVLKELKGTG 1149
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
++GD K ++F + E+ +L+L +KD GSL A +FL G L ++V+D+ NI +
Sbjct: 1150 MCIMGDALKGLWFAGYSEEPYKLSLFSKDDGSLQVMAADFLPHGKRLFIMVADDDCNIHV 1209
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF- 1322
Y P+ S KG +LL R+ FH G + +T R +L+ + A D +
Sbjct: 1210 LQYDPEDPGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPEAEANAMDLDSSGPLH 1269
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
+L + GSI I P+ E+ +RRL +LQ ++++++ H GLNPR+FR S+G R
Sbjct: 1270 QVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR-- 1327
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
+VD +L+ + L + + EIA + G +I ++L A+G + L
Sbjct: 1328 --GMVDGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1372
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 181/747 (24%), Positives = 302/747 (40%), Gaps = 110/747 (14%)
Query: 53 GPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
G V V ++++Y + GS ++ +T+ + + L LV Y L G
Sbjct: 4 GAVAAFRVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGT 56
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
V L + D+ ++I++A +AK+S++E+D H + TS+H +E + +H+
Sbjct: 57 VTDLGRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISP 113
Query: 173 GRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GDEDTFGSGGGFSAR 229
+ A P + VDP RC VL +G + + IL Q G LV GD F S +
Sbjct: 114 WTPNLAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGD---FDSDHDEERQ 169
Query: 230 IESSHVINLRD--------------------------LDMKHVKDFIFVHGYIEPVMVIL 263
I+++H RD M H F++ Y EP IL
Sbjct: 170 IDTNHTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGIL 229
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
+ + + + + S ++ + + S LP+D +K++ +P P+GG L+
Sbjct: 230 YSQVAASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALL 289
Query: 324 VGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLS 380
VG+N +H + A+ +N +A S +S + L+ L +N LL
Sbjct: 290 VGSNELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLV 349
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRL 429
G + +++ DGR V + L +T PS +G F GS
Sbjct: 350 LLNGVMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEE 405
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
GDS+L+ ++ S + + E D A+ + DA +D + ++ G
Sbjct: 406 GDSMLIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGI 463
Query: 490 ASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELV 542
S NT S T + F + D L N+GP++D + G + D + S + ELV
Sbjct: 464 NSRNTASVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELV 523
Query: 543 ELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYH 576
G G+ +V+ K + S Y
Sbjct: 524 TTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYD 583
Query: 577 AYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERG 630
YL++S + +++V + + E T + ++ + RTI G L G RV+QV +
Sbjct: 584 HYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGE 643
Query: 631 ARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
R D G + Q ++ SE +V+ S ADPYVL+ D SI LL D S
Sbjct: 644 VRSYDSGLGLAQIYPVWDEDT-----SEERSVVHASFADPYVLIIRDDSSILLLQADESG 698
Query: 690 CTVSVQTPAAIESSKKPVSSCTLYHDK 716
++T IES+ S +LY DK
Sbjct: 699 DLDEIETDGIIESTT--WISGSLYQDK 723
>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
Length = 1065
Score = 197 bits (500), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 186/696 (26%), Positives = 307/696 (44%), Gaps = 99/696 (14%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
+ E+G + I+ +P + V+ V K +H+ D + D E + S++
Sbjct: 445 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMT 500
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
T + E + ++EL M + RP LF +L D T+ Y+ + + N
Sbjct: 501 DTFAAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGH 551
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
+ L + V+ R+ RF LD E+ A + + F+ I G
Sbjct: 552 LAVRFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 605
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPS 985
F+ S PC + R+HP DG I++FT +N C +GFIY+T + ++++ +L
Sbjct: 606 FICSSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKL-- 663
Query: 986 GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1045
PV K+ + + F E KP D V ++D
Sbjct: 664 --------PVTKMCVLINDDKT--FEEHE-----------KP---------DTFVYPEMD 693
Query: 1046 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
+ L + Y+ E+++ P Q + + VV T
Sbjct: 694 QYKL------QLYSPEDWK-----------PVQNVEVLFEEFEVVTCCEEVVLRSEGTVS 736
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISAL 1160
+ LA+GTA GE+V RGR+++ P ++ + +Y KE KG +++L
Sbjct: 737 GVQNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSL 796
Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
S G+LL G K+ + + L GI+F D YV L V+N L D+++S+ L
Sbjct: 797 CSCNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYVHQLIGVRNLALACDMYRSVALL 855
Query: 1221 SWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
++E+ L+L ++D S A +F+ID + V+SDE NI IF Y P+ ES
Sbjct: 856 RYQEEYKALSLASRDMRSDVQPPMAAQFIIDNKQMGFVMSDEAANIAIFNYLPETLESLG 915
Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL-----LFGTLDGSI 1333
G+KL RAE ++G V F+R++ +S G + F+L LF +LDGS
Sbjct: 916 GEKLTLRAEINIGTVVNSFIRVKGHISS--------GFVENELFSLERQSVLFASLDGSF 967
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
G + PL E FRRL LQ+ + VP AGLN + R H ++VD +++
Sbjct: 968 GFLRPLTEKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVM 1027
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
Y L L E+ ++A + GT+R I+ +L ++ T+
Sbjct: 1028 QYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTA 1063
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/387 (23%), Positives = 150/387 (38%), Gaps = 49/387 (12%)
Query: 349 LDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
+D + P F + LD T + + LL + G L L +V D V+ L+L
Sbjct: 1 MDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLVTDATNSVKSLELK 60
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+V+ +T+ F+GSRL DS+ + T ++ AP
Sbjct: 61 FQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------------LDESAPKK 107
Query: 465 KRLRRSSSDALQDMVNGEELSLYGSA---SNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+L + +A +D E+ LYG +SA++ + V D L+N+GP K + G
Sbjct: 108 IKLN-TELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDKLLNVGPCKKITGG 162
Query: 522 LRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSSRMAAY-------- 571
+ K ++LV G G ++ +S R SS +
Sbjct: 163 CPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSSSIEGVVQYWAVGR 222
Query: 572 -DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
+D+ H Y I S E T+ LET + L E+ E+ + TIAAG L +QV
Sbjct: 223 REDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGELADGGLAVQVTTSS 281
Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPS 688
++ Q + + V S SI DPY+ + +G + + L P
Sbjct: 282 LVMVAEGQQIQHIPLQLT----------FPVRSASIVDPYIAICTQNGRLLMYELTSHPH 331
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ + P++S ++Y D
Sbjct: 332 VHLKEIDISKRLRHETSPITSLSIYRD 358
>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1392
Score = 196 bits (497), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 154/533 (28%), Positives = 265/533 (49%), Gaps = 45/533 (8%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLCDGSIVAFTVLHNVNC--NHG 968
+ I NISG F+ G+ + VFR + P +L G + +V+ ++G
Sbjct: 877 LRILPNISGFSTIFMPGASSSF--VFRTA-KSSPHIIRLRGGFTRWLSSFDSVDTGRDNG 933
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
FIYV SQ ++ CQLPS + +D W ++K+P++ + Y Y L S
Sbjct: 934 FIYVDSQNCVRACQLPSQTQFDYPWTLRKVPIEEQVDFLAYSTSSETYVLGTS------R 987
Query: 1029 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1088
L ++ + N LS + E ++++ P W + P+
Sbjct: 988 EGDFKLPEGDDLHPEWRNEELSFCP-----KIPESSIKVVSPKT----WTIIDSYPLDPD 1038
Query: 1089 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT- 1146
E V+ V + + T E L+ +GTA V+GED+ ARG + +F + A +P+ T
Sbjct: 1039 EQVTAVKNVNIEVSENTHERRDLIVVGTAIVKGEDMPARGTIYVFDVIKVAPDPEKPETG 1098
Query: 1147 ---EVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYV 1198
++ KE +KGA++AL+ + QG +++A G K ++ K G+ L +AF D YV
Sbjct: 1099 HKLKLIGKESVKGAVTALSGIGGQGFVIVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV 1156
Query: 1199 VSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
+K ++LGD K ++F + E+ ++ L KD L+ A +FL DG+ L ++
Sbjct: 1157 TVAKELKGTGLVILGDAVKGLWFAGYSEEPYRMTLFGKDPEYLEVVAADFLPDGNKLYML 1216
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAA 1313
V+D N+ + Y P+ +S G +LLSR++F+ G + VT R + + ++ +
Sbjct: 1217 VADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLLPRTAVSSERTESSEEG 1276
Query: 1314 PGSDKT-NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
D+T R +L + +GS+ + + E ++RRL +LQ +L+++V H AGLNPR+FR
Sbjct: 1277 MDLDETFARHQVLIASQNGSLALVTSVAEESYRRLSALQSQLINTVDHPAGLNPRAFRAI 1336
Query: 1373 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
S+G A R +VD LL + + + Q EIA + G T +I ++L +
Sbjct: 1337 ESDGAAGR----GMVDGNLLRLWLNMGKQRQTEIAGRVGATEWEIKADLETIG 1385
Score = 114 bits (286), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 170/731 (23%), Positives = 296/731 (40%), Gaps = 151/731 (20%)
Query: 57 NLVVTAANVIEIY--VVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRL 109
NLVV ++++++ V V + KE S S + + +++++ Y L
Sbjct: 28 NLVVVRTSLLQVFSLVKIVSSQPQKEVPEPLSSQSSQPETKLVLEK----------EYPL 77
Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
G V L S+ N+R ++I++A +AK+S++E+D G+ S+H +E +
Sbjct: 78 SGTVTDL---SRVKILNTRSGGEAILIAVRNAKLSLIEWDPERRGISTISIHYYERDDLT 134
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDED 218
+ G ++ VDP RC V +G++ + IL Q G LV G+
Sbjct: 135 RSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELDGERP 193
Query: 219 TFGSGGGFSARIE-------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
+ SGGG A+IE SS V+ L LD + H F++ Y EP IL
Sbjct: 194 SQNSGGG--AQIEKRKEEPDHQTPYSSSFVLPLTALDPSLLHPISLAFLYEYREPTFGIL 251
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
+ + T + + + ++ + + S LP D +K++A+P P+GG L+
Sbjct: 252 YSQVATSTALLHERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALL 311
Query: 324 VGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
+G+N I H + A+ +N ++ + S +S + L+ L D LL+
Sbjct: 312 LGSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLA 371
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDS 432
+G++ L+ DGR V + + P+ DI T +G+ F+GS DS
Sbjct: 372 LASGNMALIKFKLDGRSVSGITVHSL-PAYAGGDILKSAASCSTCLGDGNVFIGSEDADS 430
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---RSSSDALQDM----VNGEELS 485
+L++++ S ST++ R + ++D L D+ E+
Sbjct: 431 VLLEWSHTSA--------------------STRKARLESKQTADGLDDLSDEDDQMEDDD 470
Query: 486 LYGSASNNTE---------SAQKTFSFAVRDSLVNIGPLKDFSYGL---RINADASATGI 533
LY SA + S + ++F + D L +IGPL+D + G N + AT
Sbjct: 471 LYSSAPGPIQVDNRMGTDSSTPEFYNFRLNDKLSSIGPLRDITLGKAFSNTNRKSQATTG 530
Query: 534 SKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADSSR 567
+ + ELV G +W+ RG
Sbjct: 531 TVAAELELVASQGSDRGGGLVVIKREIDPLTTMSLKVDDADAVWSASVTKRRG------- 583
Query: 568 MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGR 620
++ D+ Y++IS + E ++ +S+ F + T+ G+L G
Sbjct: 584 ASSTDNPSCQYVVISRSTDSE-QEVNEVFIVEEQSLKPFKAPEFNPNEDCTVDIGSLAGN 642
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSE---SGSGSENSTVLSVSIADPYVLLGMSD 677
R++QV R SY D+ G S S+ S S DPY+++ D
Sbjct: 643 TRLVQVLRNEVR----SY---DIDLGLSQIYPVWDEDTSDERVAASASFIDPYLVIIRDD 695
Query: 678 GSIRLLVGDPS 688
S+ LL D S
Sbjct: 696 SSVLLLQADES 706
>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 1348
Score = 194 bits (494), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 151/558 (27%), Positives = 269/558 (48%), Gaps = 59/558 (10%)
Query: 904 ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM--------VFRERLRVHPQLCDGSIV 955
E H P + + +I G++ F+ G PC+ + V R R + ++
Sbjct: 820 EGKHPFPRKPLRALSDICGYKTVFMPGQNPCFILKSAITQPHVLRLRGK--------AVQ 871
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNL 1015
+ + H C GF YV I+++ +LPS + +D+ W +KIPL I Y +
Sbjct: 872 SLSGFHIAACERGFAYVDEDNIIRMSRLPSNTRFDSTWATRKIPLGEQVDCIVYSSASES 931
Query: 1016 YPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1075
Y + SV + L D E + N ++ + +E V++L+P
Sbjct: 932 YVIGTSV------KEDFKLPEDDESHTEWQNEFITFLP-----QLERGTVKLLDPKN--- 977
Query: 1076 PWQTRATIP----MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
W P ++ +E + V+ L + T E + ++ +G+A V+GED+ +G +
Sbjct: 978 -WSIADIAPSSHELEPAERITCIEVIRLEISEITHERKDMVVVGSAIVKGEDIVPKGCIR 1036
Query: 1131 LFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWT 1181
+F +P N +++++E +KGA++AL+ + QG L++A G K ++ K
Sbjct: 1037 VFEIIDVVPDPDHSEMNKRLKLFAREEVKGAVTALSGIGSQGFLIVAQGQKCMVRGLKED 1096
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1239
G+ L +AF DA YV L +K ++GD K ++F + E+ +L+L K+ ++
Sbjct: 1097 GSLLP-VAFKDAQ-CYVSVLKELKGTGMCIVGDAIKGLWFTGYSEEPYKLDLFGKENENI 1154
Query: 1240 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1299
A +FL DG+ L ++V+D+ N+ + Y P+ S KG +LL R FHVG H +
Sbjct: 1155 AVIAADFLPDGNRLYVLVADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVG-HFASTMT 1213
Query: 1300 LQMLATSSDRTGAAPGSDKTN------RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
L + + + A + T+ ++ +L GS+G I PL+E ++RRL +LQ +
Sbjct: 1214 LLPQGSHTPHSPADRDAMDTDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLLALQSQ 1273
Query: 1354 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
LV+++ H GLNPR +R S+G + G ++D LL + + + + EIA + G
Sbjct: 1274 LVNALEHPCGLNPRGYRAVESDGIGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGAD 1330
Query: 1414 RSQILSNLNDLALGTSFL 1431
I +L L G ++L
Sbjct: 1331 VGAIRMDLEKLHGGLAYL 1348
Score = 104 bits (260), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 160/744 (21%), Positives = 297/744 (39%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + + R D A L LV Y++ G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSSLANHPDQKSRH---DRSQHAKLVLVAEYQVSGTITGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ + + D+I+++ +AK+S++E+D HG+ S+H +E E +
Sbjct: 85 ERVKISNSKSGG--DAILVSSRNAKLSLIEWDPRNHGISTISIHYYEGEESHMSPWVPDL 142
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE------------------ 217
+ + VDP G C + +G+ + IL Q G LV D+
Sbjct: 143 GSCASNLTVDPNGNCA-IFNFGIHSLAILPFHQTGDDLVMDDYDSVLNGDSAADTINDTQ 201
Query: 218 -DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
T G S E S V+ L LD + H F+H Y EP IL+ +
Sbjct: 202 KPTAGDSTVHSKPYEPSFVLPLAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLS 261
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ + ++ + + + LP D +K++++P P+GG L++G N +H
Sbjct: 262 IDRKDVVSYAIFTLDLQQRASTSLLTVSRLPSDMFKVVSLPPPVGGALLIGTNELVHVDQ 321
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + + +S + L+ L +D LL G + +LT
Sbjct: 322 AGKTNAVGVNEFARQASAFSMVDQSDLEMRLEDCVVEQLGSDAGEVLLILTDGRMAILTF 381
Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTS 444
DGR V + L ++ S++ + + +G S F GS GDS+L+ ++ S +
Sbjct: 382 KVDGRSVSGISLHYVAEQSGGSIIKARPSCSAGLGRSKLFCGSEEGDSILLGWSKPSSNT 441
Query: 445 MLSSGLKE---EFGDIEADAPSTKRLRR-----------SSSDALQD--MVNGEELSLYG 488
+ E E G E + + + LQ+ +VNG++ +
Sbjct: 442 KKPTKANEDTNEDGTTEFSGEDEQDDDDDDIYEDDLYSANPAPTLQEKRVVNGDDTA--- 498
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------LRINAD-----------ASAT 531
F F + D L ++GP +D + G L+ D +A
Sbjct: 499 -----------DFVFKIHDRLWSLGPFRDITLGRPPKSKLKDKRDNVPSISASLELVAAR 547
Query: 532 GISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYL 579
G K + +++ G+W++ + A ++ +Y YL
Sbjct: 548 GFGKSGGLAVLKREIDPFTIDSLKMDNVYGVWSIRVTDPKSKEASAT---GNSRDYDKYL 604
Query: 580 II-----SLEARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARI 633
++ S + ++V + + ++ ++ + TI G L RV+QV R
Sbjct: 605 LLAKAKCSDKEESVVYSVGNSGLDSIDAPEFNPNEDCTIDIGTLAAGSRVVQVLRTEIRS 664
Query: 634 LDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
D + +TQ ++ SE TV+ S A+PY+L D S+ +L D +
Sbjct: 665 YDYNLGLTQIYPVWDEDT-----SEERTVVQASFAEPYLLAIRDDHSLLVLQADKTGDLD 719
Query: 693 SVQTPAAIESSKKPVSSCTLYHDK 716
V+ + +S VS C LY D+
Sbjct: 720 EVEI-QGLATSADWVSGC-LYEDR 741
>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
plexippus]
Length = 328
Score = 194 bits (494), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 183/320 (57%), Gaps = 20/320 (6%)
Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQG 1165
+AIGT Y GED+ +RGR+L++ P +N E+Y+KE KG ++AL + G
Sbjct: 16 IAIGTNYNYGEDITSRGRILIYDIIDVVPEPGQPLTKNRFKEIYAKEQKGPVTALTQVLG 75
Query: 1166 HLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
L+ A G KI L + +L G+AF D +YV + VKN IL+ D++KSI L ++ Q
Sbjct: 76 FLISAVGQKIYLWQLKDNDLVGVAFIDTQ-IYVHRMLAVKNLILVADVYKSISLLRYQHQ 134
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
L+L+++D + + +F+ID ++L +VS+ + N ++ + P+ ES+ GQ+L+ +
Sbjct: 135 HRTLSLVSRDLRTAQIYDMQFMIDNTSLGFLVSESEGNFAMYMHQPQARESYGGQRLIRK 194
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
++H+G V RL AA G +T+ +F TLDG +G + P+ E +R
Sbjct: 195 CDYHLGQRVHAMFRL-----------AARGERQTH--VTMFTTLDGGVGYVLPVSEKVYR 241
Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG-PDSIVDCELLSHYEMLPLEEQL 1404
RL LQ + + H+AGLNP+++R + + +A G ++D +L+S Y +P EQ
Sbjct: 242 RLLMLQNVINNYCCHLAGLNPKAYRTYKVSRRALCGGAARGVLDGDLVSLYTSMPRTEQQ 301
Query: 1405 EIAHQTGTTRSQILSNLNDL 1424
+IA + GT +I+S+L ++
Sbjct: 302 DIARKIGTKVEEIMSDLYEI 321
>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum NZE10]
Length = 1347
Score = 194 bits (493), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 314/1410 (22%), Positives = 560/1410 (39%), Gaps = 213/1410 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y L G V +LA + D D+++LAF+DAK++++E+D H + S+H
Sbjct: 51 LVLVGEYSLSGTVTNLAQVKL--PDTKTAGDALLLAFKDAKLTLIEWDPENHRISTISIH 108
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+E + G ++ VDP RC + Q+ +L Q L +ED
Sbjct: 109 YYEGDNVVSQPFGPGLGECENILTVDPNWRCAALKFGTRQLAVLPFRQLDDELGVEEDGD 168
Query: 221 GSGGGFSAR-----------------IESSHVINLRDL--DMKHVKDFIFVHGYIEPVMV 261
+ + ++S V+ L L D+++ D F++GY E +
Sbjct: 169 AEPASTTLKRSESILQNVNGEVQQTPYKASFVLALSTLLEDIRYTVDLGFLYGYRESTLG 228
Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
IL + + + + + + LP+ +K++ +P+P+GG
Sbjct: 229 ILSSSLQPSSSLLDIRKDELEYRMFKLELEQGESTELQVVKQLPNSLWKVVPLPAPVGGA 288
Query: 322 LVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP-RSSFSVELDAAHATWL--QNDVA 377
L+VG N+ +H + ++A+N +A +L+S + + +S +++L+ L ++
Sbjct: 289 LLVGTNSFVHVDLNAKVNSVAVNEFA-ALESDRGMEDQSDLNLKLEGCSVEILDAESRQV 347
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLG 426
L+ + G L + GR +Q L DL KT PS + + ++ F+G
Sbjct: 348 LVVLRDGSLATIYFEQSGRSIQGLKVSRVREEHGGDLVKTAPSC----VARLDHNKVFVG 403
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
S G S LV+++ S LS K G + D E
Sbjct: 404 SEDGASSLVRWS--RSISTLSR--KRTHGQMLGQHGDEDDEEALEDDDDDLYDAAPETK- 458
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ATGISKQS--- 537
A++ T++ + SF ++D L ++GP+ D G A TG + S
Sbjct: 459 -KRATSTTDAFETPPSFQIQDVLHSLGPINDVCLGKSDGAQVDKLQMMLGTGRGRSSRIS 517
Query: 538 --NYELVELPG-------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
N ++V + K W V+ K + DD++H L+ + + +
Sbjct: 518 CLNRDIVPVSARKSTIGRAKSAWAVHAKRND-----------RDDDFHDNLLFAYDGQET 566
Query: 589 VLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFG 646
+ D + + + F +G TI L V+Q + R D ++Q +
Sbjct: 567 KIYDVDEVGYMERTAQEFEHEGETIDVQMLAKDTIVVQCRKSEIRTYDADLALSQIIPMV 626
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
++ E ++ +S DPY+L+ +D
Sbjct: 627 DEETD-----EEYEIVYLSFCDPYLLVVRND----------------------------- 652
Query: 707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIF 766
SS + H +G E + D +G +I G L + + G + +F
Sbjct: 653 -SSIQVLHVRGKEIEPLEGEGDIAEKKWLGGSIHT---GSLTKDVPALFLLSAQGTMHVF 708
Query: 767 DVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVE 826
+P+ V Y AL ++S + + G KE + + V E
Sbjct: 709 SLPSLEPV----------------YHAPALPHLPPVLSSDAPQRRA-GPKEALTELLVAE 751
Query: 827 LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
L ++ P+L A ++ Y+ F PE P + + +
Sbjct: 752 LG----ASGVDTPYLVARTALDDLVLYEP--FRHPE-------PAPSDQWYT-------- 790
Query: 887 LRNLRFSRTPL-------DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
NLRF + P+ +A +EE+ P + I ++ + + GS P ++
Sbjct: 791 --NLRFRKVPVTYIPKYNEAIAQEESTRPLPLRSI----HVGDYDAVTIPGSPPL--LLV 842
Query: 940 RER------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
+E L V + +H +C GF V + G+L+ LP + Y W
Sbjct: 843 KEASSLPRVLEVRISNESNRVATLLPIHLDHCKKGFAAVNADGLLEEYHLPLSAWYGTGW 902
Query: 994 PVQKIPLKATPHQITYFA---EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
VQ++ L + ++ + A + +Y + V + + Q G Q D
Sbjct: 903 SVQQVDLGSEDLEVRHLAYHETRGVYVVATCKDVDFYFAEDDHRHLGQSGGGQDDITLRP 962
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1110
V + + V R+++ +RA +P + AL V + + + T E +
Sbjct: 963 QVKQYSIHLVSSKTHRVID---------SRA-MPYLEAITALQVMPLEV-SELTHEQDLR 1011
Query: 1111 LAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQNLVT-EVYSKE-LKGAISALASLQG 1165
+ + TA ++GED+ ARG +++F+ D P++ + V ++E KGAI+ALA G
Sbjct: 1012 ILVSTAAMRGEDMPARGAIIVFNIIDVVPAPDVPESGIKLHVNAREETKGAITALAPFPG 1071
Query: 1166 HLL-IASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNI-VKNFILLGDIHKSIYFLS 1221
+ G KI++ K G+ L +AF DA V + L GD K ++F
Sbjct: 1072 GFVGSGQGQKIMIRGLKEDGSCLP-VAFLDAQCHTTVIKTLGTSGMWLAGDAWKGLWFGG 1130
Query: 1222 WKEQGAQLNLLAK-DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
+ E+ +L +L K ++ A EFL L +++ D ++ + Y P+ +S G
Sbjct: 1131 FTEEPYKLTVLGKAPERQMEVMAAEFLPFDGALYILIIDADMDLHVLQYDPENPKSQNGM 1190
Query: 1281 KLLSRAEFHVGAHVTKFLRL---------QMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
+LL R+ FH+G T L L T+ D G +P + + F +L +L G
Sbjct: 1191 RLLHRSTFHLGHFATNMLLLPSSLNPFGENQPFTNGDTNGESP-EESSPLFHVLTTSLTG 1249
Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
SIG I PLDE ++RRL +LQ L + H A LNPR++R S G +VD +
Sbjct: 1250 SIGMITPLDESSYRRLSALQTHLTTILEHPASLNPRAYRAIESESFG---GARGVVDGNI 1306
Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ L + ++ + G I S+L
Sbjct: 1307 VRRINELGAARRADVLARAGADAWSIRSDL 1336
>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
Length = 1398
Score = 192 bits (488), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 147/538 (27%), Positives = 260/538 (48%), Gaps = 38/538 (7%)
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
C+R+ ++ G++ F+SG PC+ ++ R H G V + + H C GF
Sbjct: 882 CKRLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 940
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
YV ++++ +LPS + +D+ W +KI I Y + Y + S
Sbjct: 941 AYVDEDNVIRMSRLPSNTRFDSGWATRKIAFGEQVDSIVYSSASECYVIGTSA------K 994
Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
+ L D E + N ++ + +E V++LEP W T + ++ +E
Sbjct: 995 EDFKLPEDDESHTEWRNEFITFLP-----QLERGTVKLLEPRN----WSTIDSHELEPAE 1045
Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNL 1144
+ + V+ L + T E + ++ +G++ V+GED+ +G + +F P ++
Sbjct: 1046 RIMCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSK 1105
Query: 1145 VTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
++++KE +KGA++AL+ + QG L++A G K ++ K G+ L +AF D YV
Sbjct: 1106 KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVN 1163
Query: 1200 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
L +K ++GD K ++F + E+ +L+L K+ +L +FL DG+ L ++V
Sbjct: 1164 VLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILV 1223
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--AHVTKFLRLQMLATSS--DRTGAA 1313
+D+ N+ + Y P+ S KG +LL R+ FH G A L TSS D
Sbjct: 1224 ADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVTLLPHGAHTTSSPVDEDAMD 1283
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
S +++ +L GSI I PL E ++RRL +LQ +LV+++ H LNPR +R
Sbjct: 1284 TDSPPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVE 1343
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
S+G + G ++D LL + + + + EIA + G I +L L G ++L
Sbjct: 1344 SDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRIDLEKLHGGLAYL 1398
Score = 106 bits (265), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 176/734 (23%), Positives = 287/734 (39%), Gaps = 96/734 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R + A L L Y + G + SL
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTAAQPDQKGRN---ERSQHAKLVLAAEYEVPGTITSL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ + + D+II++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVKISNSKSGG--DAIIVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141
Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIES-- 232
P + DP G C + +G+ + IL Q G LV D+ G SA + S
Sbjct: 142 LGSCPSSLTADPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGNDSADVVSDP 200
Query: 233 ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKSAPENTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ K + + LP D +K++ +P P+GG L++G N +H
Sbjct: 261 IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIIEQLGSGTGDILLILADGRMSILSF 380
Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
DGR V + L ++N S+ + T ++G + F GS GDS+L+ ++ S T
Sbjct: 381 KVDGRSVSGISLHFVAEQSNGSITIARPTCSASLGRNKLFCGSEEGDSILLGWSRPSSTI 440
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES------AQ 498
S K G E A D D + ++L AS E +
Sbjct: 441 KRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSANLASTRQEKQVVNGDSP 498
Query: 499 KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTV 553
F F D L ++GP +D + G + + S +EL +G V
Sbjct: 499 ADFIFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPEIAAPLELVAARGFGKSGGLAV 558
Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-------------MVLETADLLTEVT 600
+ + DS +M DD Y + I ++ ++ ++ +T D +
Sbjct: 559 LKREIDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAKTKD--DDKE 613
Query: 601 ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
ESV Y V + TI G L RV+QV R D Y
Sbjct: 614 ESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLAAGTRVVQVLRTEIRSYD--YNLGLAQ 671
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAIES 702
P E SE TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 672 IYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA--- 726
Query: 703 SKKPVSSCTLYHDK 716
S K +S C LY DK
Sbjct: 727 SGKWISGC-LYEDK 739
>gi|194374339|dbj|BAG57065.1| unnamed protein product [Homo sapiens]
Length = 330
Score = 191 bits (486), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 114/302 (37%), Positives = 170/302 (56%), Gaps = 35/302 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
P V+VDP GRC +LVYG ++++L ++ GLVG+ G +
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS Y V+L
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVAL 300
Query: 350 DS 351
+S
Sbjct: 301 NS 302
>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
Length = 1440
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 178/623 (28%), Positives = 300/623 (48%), Gaps = 93/623 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++AF+DAK+S++ ++ ++ S+H FE+ +L+ G + P+V+ DP+
Sbjct: 91 QDSILMAFDDAKLSIITINEKERNMQTISLHAFENE---YLRDGFVKYFHPPIVRTDPEN 147
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + S RI S ++I L+ +D + +V
Sbjct: 148 RCAASLVYGKHIAILPFHEN-----------------SKRIHS-YIIPLKQIDPRLDNVA 189
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 190 DIVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 249
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
LL +P P+GG +V G+NTI Y +Q+ C + LN+ D + P S +
Sbjct: 250 CATLLPIPKPLGGAIVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKSMKMT 306
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + T+ G+L LL +V G V+ L+ SK + + +T
Sbjct: 307 LDCSTSVYMEDGRIAVGTRDGELFLLRLVTSSGGATVKSLEFSKVWDTSIAYTLTVCAPG 366
Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
FLGSRLGDS L++++ T S+ +++E +EA+ +
Sbjct: 367 HLFLGSRLGDSQLLEYSLIKTTRESVKRHKMEQEQNHVEAE------------------L 408
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ ++L LYG A +++ E ++ F+ D L NIGP+K G R N ++ +
Sbjct: 409 DEDDLELYGGAIEEQQNDDEEQITESLQFSELDRLRNIGPVKSMCVG-RPNYMSNDLVDA 467
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++++ G G V+ +S R SS + ++E H YLI+
Sbjct: 468 KRRDPVFDVITASGHGKNGSLCVHQRSLRPEIVTSSLLEGAEQLWAVGRKENESHKYLIV 527
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQVFERG-ARILDGSYM 639
S T+VLE + L E+ E + FV G+ T+AAG L +QV A + DG
Sbjct: 528 SRIRSTLVLELGEELIELEEPL--FVTGQPTVAAGELSQGAFAVQVTSTSIALVTDG--- 582
Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSV--- 694
Q L+ +S N V+ SI DPYV + +G + L LV +P +
Sbjct: 583 -QQLAEVKIDS-------NFPVVQASIVDPYVAVLTQNGRLLLYTLVSNPYMQLQEIDLA 634
Query: 695 QTPAA--IESSKKPVSSCTLYHD 715
QTP + I S ++S ++Y D
Sbjct: 635 QTPFSTFIAQSASQITSISMYAD 657
Score = 184 bits (467), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 187/740 (25%), Positives = 334/740 (45%), Gaps = 80/740 (10%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS 782
++ DA S GE D D + V+ +E+G L + +P V+ + +F +
Sbjct: 736 KRLGHDAIQSGRGGEQSDAIDPSSYTSISHWLVLAHENGRLSVHSLPEMELVYQIGRFPN 795
Query: 783 GRTHIVD-----TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
+VD +K +S EE K+N +++E + + S
Sbjct: 796 VPELLVDLTPEEEEKERRIKAQLAAKEASDEEQLNAEMKKNCE--RIMEAQIVGMGINQS 853
Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
P L AI+ D ++ Y+ + +P S L ++ LR S
Sbjct: 854 HPILMAIV-DEQVIMYEMFA-----------NPNSQPGHLGIAFRKLPHFICLRSSPYLK 901
Query: 898 DAYTR------EETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQ 948
R EE P I F+ +S + G + G+ P +V+ ++ HP
Sbjct: 902 SDGKRAAFQIVEEDGKRYPL--IHSFERVSTVNNGVIIGGAVPT-LLVYGAWGGMQTHPM 958
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQKIPLKATPHQI 1007
DGSI AFT + N +GF+Y+T + L+I ++ + Y+ +PV+KI + T H +
Sbjct: 959 TIDGSIKAFTPFNIDNVPYGFVYMTQKKSELRIAKMHADFDYEMPYPVKKIEVGRTIHSV 1018
Query: 1008 TYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEV 1065
Y ++Y ++ SVP KP N++ ++ D QE H+ D + + + YT+ +
Sbjct: 1019 RYLMNSDVYVVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLF-- 1072
Query: 1066 RILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGED 1122
+ W+ I + E V L + +T ET LAIGT GE+
Sbjct: 1073 -------SSQDWKAVPNTEISFEDMEAVTACEDVALKSESTHTGFETYLAIGTVNNYGEE 1125
Query: 1123 VAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIIL 1177
V RGR++L P + ++ KE KG ++ L +++G LL G K+ +
Sbjct: 1126 VLVRGRIILAEVIEVVPEPGQPTSNRKIKVLFDKEQKGPVTGLCAMEGLLLSGMGQKVFI 1185
Query: 1178 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
++ +L G++F D YV L+ +++ L D +S+ + ++E+ +++ ++D
Sbjct: 1186 WQFKDNDLMGLSFLDMH-YYVYQLHSLRSIALACDARESMSLIRFQEENKAMSVASRD-- 1242
Query: 1238 SLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
C A +F++DG+ + ++SDE NI +F YAP+ ES G++L RA ++G +
Sbjct: 1243 DRKCAQAPMAAQFMVDGAHIGFLLSDENGNITLFNYAPEAPESNGGERLTVRAAINIGTN 1302
Query: 1294 VTKFLRLQ----MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
+ FLR++ +L AA R + +F +LDGS G I PL E ++RRL
Sbjct: 1303 INAFLRVKGHTALLNLHEFEKEAA-----EQRMSTIFASLDGSFGFIRPLTEKSYRRLHF 1357
Query: 1350 LQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
LQ + +AGL+ + R Q NG+ R +++D +++ Y L ++
Sbjct: 1358 LQTFIGSVSQQIAGLHIKGARSAKPPQPIVNGRNAR----NLIDGDVVEQYLNLSTYDKT 1413
Query: 1405 EIAHQTGTTRSQILSNLNDL 1424
++A + G + I+ +L +L
Sbjct: 1414 DLARRLGVGKYHIIDDLMEL 1433
>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
127.97]
Length = 1267
Score = 190 bits (483), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 144/538 (26%), Positives = 260/538 (48%), Gaps = 38/538 (7%)
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
C+ + ++ G++ F+SG PC+ ++ R H G V + + H C GF
Sbjct: 751 CKLLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 809
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
YV ++++ +LPS + +D+ W +KI L I Y + Y + S
Sbjct: 810 AYVDEDNVIRMSRLPSNTRFDSGWATRKIALGEQVDSIVYSSASECYVIGTSA------K 863
Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
+ L D E + N ++ + +E V++LEP W T + ++ +E
Sbjct: 864 EDFKLPEDDESHTEWRNEFITFLP-----QLERGTVKLLEPKN----WSTIDSHELKPAE 914
Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNL 1144
+ V+ L + T E + ++ +G++ V+GED+ +G + +F P ++
Sbjct: 915 RITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKSK 974
Query: 1145 VTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
++++KE +KGA++AL+ + QG L++A G K ++ K G+ L +AF D YV
Sbjct: 975 KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVN 1032
Query: 1200 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
L +K ++GD K ++F+ + E+ +L+L K+ +L +FL DG+ L ++V
Sbjct: 1033 VLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILV 1092
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS----DRTGAA 1313
+D+ N+ + Y P+ S KG +LL R+ FH G + L A + D
Sbjct: 1093 ADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGAYTPSAPVDEDAMD 1152
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
S +++ +L GSI I PL E ++RRL +LQ +LV+++ H LNPR +R
Sbjct: 1153 TDSLPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVE 1212
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
S+G + G ++D LL + + + + EIA + G I ++L L G ++L
Sbjct: 1213 SDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRTDLEKLHGGLAYL 1267
Score = 94.7 bits (234), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 133/579 (22%), Positives = 232/579 (40%), Gaps = 65/579 (11%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVVKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E H+
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVP 140
Query: 176 SFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------D 218
P + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 141 DLGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSD 199
Query: 219 TFGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 200 AQKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 259
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ S ++ + + + LP D +K++ +P P+GG L++G N +H
Sbjct: 260 TIDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVD 319
Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
+ A+ +N +A + +S + L+ L + LL G + +L+
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILS 379
Query: 391 VVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 380 FKVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSR 435
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES--- 496
S T+ S K G E+ A D D + ++L AS E
Sbjct: 436 PSSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASIRQEKQVV 493
Query: 497 ---AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI--- 550
+ F+F D L ++GP +D + G + + S + +EL +G
Sbjct: 494 NGDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKS 553
Query: 551 --WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
TV + + DS +M DD Y + I ++ ++
Sbjct: 554 GGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKS 589
>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
Length = 1454
Score = 190 bits (482), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 176/705 (24%), Positives = 323/705 (45%), Gaps = 69/705 (9%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD-TYMREALKDSETEINSSSEEGTGQ 813
+V +E+G L I +P V+ + +F + +VD T E + ++ E
Sbjct: 777 IVSHENGRLSIHSLPEMEVVYQIGRFSNVPELLVDLTVEEEEKERKAKAQQAAKEASVPT 836
Query: 814 GRKENIHSM------KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
E +++ +V+E + + + P L AI+ D ++ Y+ + S
Sbjct: 837 DEAEQLNTEMKQLCERVLEAQIVGMGINQAHPILMAIV-DEQVVLYEMF---------SS 886
Query: 868 DDPVSTSRSLSVSNV-------SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
+P+ +S + ++S L N R P + + +G I F+ +
Sbjct: 887 SNPIPGHLGISFRKLPHFICLRTSSHL-NSDGKRAPFEM----KINNGKRFSLIHPFERV 941
Query: 921 SG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QG 976
S + G + G+ P +V+ ++ H DG I AFT +N N HG +Y+T +
Sbjct: 942 SSVNNGVMIVGAVPTL-LVYGAWGGMQTHQMTVDGPIKAFTPFNNENVLHGIVYMTQHKS 1000
Query: 977 ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
L+I ++ Y+ +PV+KI + T H + Y ++Y ++ S+P KP N++ ++
Sbjct: 1001 ELRIARMHPDFDYEMPYPVKKIEVGRTIHHVRYLMNSDVYAVVSSIP--KPSNKIWVVMN 1058
Query: 1037 D--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
D QE H+ D + + + YT+ + + D A P I + E
Sbjct: 1059 DDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ----DWAAVP---NTEISFEDMEAVTAC 1109
Query: 1095 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1148
V L + +T ETLLA+GT GE+V RGR++L P + +
Sbjct: 1110 EDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIEVVPEPDQPTSNRKIKVL 1169
Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
+ KE KG ++ L ++ G LL G K+ + ++ +L GI+F D YV L+ ++
Sbjct: 1170 FDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSLRTIA 1228
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNI 1264
+ D +S+ + ++E +++ ++D C A++ ++DG+ + ++SDE NI
Sbjct: 1229 IACDARESMSLIRFQEDNKAMSIASRD--DRKCAQPPMASQLVVDGAHVGFLLSDETGNI 1286
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
+F YAP+ ES G++L RA ++G ++ F+RL+ + R
Sbjct: 1287 TMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNEDEKEAIEQRMTT 1346
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSNGKAH 1379
+F +LDGS G + PL E ++RRL LQ + P +AGL+ + R Q NG+
Sbjct: 1347 VFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNA 1406
Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
R +++D +++ Y L L ++ ++A + G R I+ +L L
Sbjct: 1407 R----NLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQL 1447
Score = 186 bits (473), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 162/589 (27%), Positives = 273/589 (46%), Gaps = 84/589 (14%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G + + PLV+ DP
Sbjct: 92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
RC LVYG + IL + S RI S +VI L+ +D + ++
Sbjct: 149 RCAACLVYGKHIAILPFHEN-----------------SKRIHS-YVIPLKQIDPRLDNIA 190
Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
D +F+ GY EP ++ L+E T GR ++ T I +S++ +Q ++W NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSS---FSVE 363
+LL +P P+GG LV G+NT+ Y +Q+ C L LN+ D + P +
Sbjct: 251 CSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS---CYDGFTKFPLKDLKHLKMT 307
Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
LD + + ++++ + ++ GDL LL ++ G V+ L+ SK + + +T
Sbjct: 308 LDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPG 367
Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD--ALQDMV 479
F+GSRLGDS L+++T T + KRL+ + D A + +
Sbjct: 368 HLFVGSRLGDSQLLEYTLLKTTRDC----------------AVKRLKIDNKDPAAAEIEL 411
Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ +++ LYG A +++ E ++ F D L N+GP+K G R N ++ +
Sbjct: 412 DEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVGPVKSMCVG-RPNYMSNDLVDA 470
Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
K+ + ++LV G G V+ +S R SS + ++E H YLI+
Sbjct: 471 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQLWAVGRKENESHKYLIV 530
Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
S R+ ++ E + T+AAG L +QV A + DG M
Sbjct: 531 S-RVRSTLILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 588
Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDP 687
Q++ N V+ SI DPYV L +G + L LV +P
Sbjct: 589 QEVHI----------DSNFPVIQASIVDPYVALLTQNGRLLLYELVMEP 627
>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus A1163]
Length = 1401
Score = 189 bits (480), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 152/538 (28%), Positives = 260/538 (48%), Gaps = 40/538 (7%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
+ I NIS F+ G RP ++ + H G V + L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1030
YV S+ +L+IC+ PS + +D W ++KI + + Y Y L S +
Sbjct: 943 YVDSKDVLRICRFPSDTLFDYTWALRKISIGEQVDHLAYATSSETYVLGTS------HSA 996
Query: 1031 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1090
L D E+ N L L + + ++++ P W + + E
Sbjct: 997 DFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDEY 1049
Query: 1091 ALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
+ V+ + L + T E ++ +GTA+ +GED+ +RG + +F + +P+ T+
Sbjct: 1050 VMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEKPETDRK 1109
Query: 1148 --VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVS 1200
+ KEL KGA++AL+ + QG L+ A G K ++ K G+ L +AF D YV
Sbjct: 1110 LKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVNV 1167
Query: 1201 LNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
L +K ++GD K ++F + E+ +++L KD G L+ A EFL DG L ++V+
Sbjct: 1168 LKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLFILVA 1227
Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS-- 1316
D N+ + Y P+ +S G +LL+R++FH+G T L SS++ A P S
Sbjct: 1228 DSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSME 1287
Query: 1317 --DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
+T +L + GS+G + + E ++RRL +LQ +L +S+ H GLNPR++R S
Sbjct: 1288 IDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVES 1347
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTSFL 1431
+G A R ++D LL + + ++EIA + G +I ++L + A G +L
Sbjct: 1348 DGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLGYL 1401
Score = 136 bits (343), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV +V++I+ +++VQ E+ + + D + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83
Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
+ + S+ G + +++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 84 ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
+ + G ++ VDP RC V +G++ + IL Q G L D+ F
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196
Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
G G ++ SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
T +S + + + ++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316
Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
+H + A+ +N +A + + + +S ++ L+ + + LL +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376
Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
+VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
+ ++ D +S DA + D+ E E G + +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
+ F D L NIGPL+D + G + + + K + EL EL +G
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551
Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
+WT G D ++ + EY Y+I+S +
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607
Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
T ET+++ T+ + F TI G L ++RV+QV R SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663
Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D+ G P E S+ +S S+ADPY+ + D ++ +L D S
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
V+ A + K SC LY DK
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740
>gi|134025022|gb|AAI35011.1| LOC564406 protein [Danio rerio]
Length = 348
Score = 189 bits (479), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 104/283 (36%), Positives = 165/283 (58%), Gaps = 13/283 (4%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
++LN+ + P+ + LD + A+++ +D ++S
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVIS 338
>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
Length = 1401
Score = 189 bits (479), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 152/538 (28%), Positives = 260/538 (48%), Gaps = 40/538 (7%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
+ I NIS F+ G RP ++ + H G V + L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1030
YV S+ +L+IC+ PS + +D W ++KI + + Y Y L S +
Sbjct: 943 YVDSKDVLRICRFPSETLFDYTWALRKISIGEQVDHLAYATSSETYVLGTS------HSA 996
Query: 1031 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSEN 1090
L D E+ N L L + + ++++ P W + + E
Sbjct: 997 DFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDEY 1049
Query: 1091 ALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
+ V+ + L + T E ++ +GTA+ +GED+ +RG + +F + +P+ T+
Sbjct: 1050 VMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEKPETDRK 1109
Query: 1148 --VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVS 1200
+ KEL KGA++AL+ + QG L+ A G K ++ K G+ L +AF D YV
Sbjct: 1110 LKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYVNV 1167
Query: 1201 LNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
L +K ++GD K ++F + E+ +++L KD G L+ A EFL DG L ++V+
Sbjct: 1168 LKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLFILVA 1227
Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS-- 1316
D N+ + Y P+ +S G +LL+R++FH+G T L SS++ A P S
Sbjct: 1228 DSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMANPDSME 1287
Query: 1317 --DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
+T +L + GS+G + + E ++RRL +LQ +L +S+ H GLNPR++R S
Sbjct: 1288 IDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLANSLEHPCGLNPRAYRAVES 1347
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTSFL 1431
+G A R ++D LL + + ++EIA + G +I ++L + A G +L
Sbjct: 1348 DGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLGYL 1401
Score = 136 bits (342), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)
Query: 57 NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
NLVV +V++I+ +++VQ E+ + + D + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83
Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
+ + S+ G + +++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 84 ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
+ + G ++ VDP RC V +G++ + IL Q G L D+ F
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196
Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
G G ++ SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
T +S + + + ++ + + S LP D +K++A+P P+GG L++G+N
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316
Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
+H + A+ +N +A + + + +S ++ L+ + + LL +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376
Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
+VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
+ ++ D +S DA + D+ E E G + +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
+ F D L NIGPL+D + G + + + K + EL EL +G
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551
Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
+WT G D ++ + EY Y+I+S +
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607
Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
T ET+++ T+ + F TI G L ++RV+QV R SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663
Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
D+ G P E S+ +S S+ADPY+ + D ++ +L D S
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718
Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
V+ A + K SC LY DK
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740
>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
Length = 1437
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 169/624 (27%), Positives = 283/624 (45%), Gaps = 68/624 (10%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951
Query: 992 Y-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
V+KIP+ + Y Y +V ++P L D + + N++
Sbjct: 952 LGLSVKKIPVGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1109
+ V+ +++L +G W T+ M+ E L V + L + +T E +
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1160
L+A+GTA ++GED+ RGRV +F P T SK+LK GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113
Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
+ + QG +L+A G K ++ K GT L +AF D YV S+ + L+ D
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
K ++F + E+ ++ L K ++ +FL DG L +V SD +I I + P+
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+S +G LL R F+ GAH L + A + + + S++ + LL + G +
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1387
+ PL E +RRL SL +L + +PH AGLNP+ +R + A PG D+ IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTG 1411
D ++L + L ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374
Score = 127 bits (318), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ I+ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEKERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
2508]
Length = 1437
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 169/624 (27%), Positives = 283/624 (45%), Gaps = 68/624 (10%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951
Query: 992 Y-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
V+KIP+ + Y Y +V ++P L D + + N++
Sbjct: 952 LGLSVKKIPVGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1109
+ V+ +++L +G W T+ M+ E L V + L + +T E +
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1160
L+A+GTA ++GED+ RGRV +F P T SK+LK GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113
Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
+ + QG +L+A G K ++ K GT L +AF D YV S+ + L+ D
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
K ++F + E+ ++ L K ++ +FL DG L +V SD +I I + P+
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+S +G LL R F+ GAH L + A + + + S++ + LL + G +
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1387
+ PL E +RRL SL +L + +PH AGLNP+ +R + A PG D+ IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTG 1411
D ++L + L ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374
Score = 127 bits (318), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + +S + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ I+ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici IPO323]
Length = 1333
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 297/1401 (21%), Positives = 533/1401 (38%), Gaps = 199/1401 (14%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ + L L+ Y L G V S+A + D ++I+LAF++AK+S++E+D H +
Sbjct: 45 AQSKLVLIGGYPLAGTVTSIARVKT--LDTRTGGEAILLAFKNAKLSLIEWDPENHRIST 102
Query: 157 TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL------------ 204
S+H +E + G ++ VDP RC + Q+ IL
Sbjct: 103 VSIHYYEGENVIAQPYGPSLGEYESILTVDPGSRCAALKFGARQLAILPFRQFGDELLGE 162
Query: 205 ------KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
A+ G + D G + S V+ L LD + H D F+H Y
Sbjct: 163 EEGEFENANDGTTSKKHDAMQNGEDEAEQTPYKQSFVLPLTTLDPALSHTIDLAFLHEYR 222
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
EP I+ + + ++ K + + NLP +K++ +PS
Sbjct: 223 EPTFGIISSAIEPSYALFDERKDILSYTVFTLDLEQKASTNLITVPNLPSTLWKVVPLPS 282
Query: 317 PIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND 375
PIGG L++G N IH + A A+N +A+ +S +++L+ L
Sbjct: 283 PIGGALLIGTNEFIHVDQSGKANATAVNEFAMKESDFGMADQSGLNLKLEGCSVEILNAS 342
Query: 376 VA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLG 426
L+ + G L + GR + + ++ + S S ++ + + F+G
Sbjct: 343 TGEMLVVLRDGSLATVDFKMLGRSISAVIVTIISSENGGKVFSTAPSCVSRLDQNNLFIG 402
Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
S G S L+ C + S LS K + + D +
Sbjct: 403 SEDGSSSLL--GCTNSQSGLSR--KRSHAQMLGQNSGDEEEDVLDEDDDDLYDAEPDSKK 458
Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-------SATGISKQSNY 539
++ + + F ++D L +IGP+ + G R NA + TG S+ S
Sbjct: 459 RATSVAEQSAGDSSTHFVIKDDLHSIGPINNTCVG-RSNAAGEDKLQLLAGTGKSRSSRL 517
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
+ +R R ++ HA+ + + E E +
Sbjct: 518 ACI---------------NRDIIPQHIRKNQFEGARHAWAVCAREKNANDDEAGEGGYTE 562
Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSEN 658
+ ++ G T+ L V+QV R D ++Q + +++
Sbjct: 563 RSATEFEHDGETLEVLTLGHGTAVVQVRRMEIRTYDSHLALSQIIPMIDDETDA-----E 617
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP 718
+++ S DPY+L+ D SI++ ++ +K + + D
Sbjct: 618 FSIVHTSACDPYLLVIRDDSSIQV-----------------LQHERKDIEPLDITPDAAD 660
Query: 719 EPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD 778
+ W+ GE DG + + G+L + +P+ V
Sbjct: 661 KQWMSGCVYS-------GEFTDG---------NAALFLLSAEGSLHVLTLPDLQLV---- 700
Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR 838
YM AL + S+ G KE + + V +L +
Sbjct: 701 ------------YMTPALPHLPP-VLSADVSHRRMGVKETLTELLVADLGNDGVL----Q 743
Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
P+L ++ Y+ P +S S S + NLRF + P+
Sbjct: 744 PYLTVRTAMDDVVLYE---------------PFHSSPSASTGPWHS----NLRFRKVPVP 784
Query: 899 AYTR-EETPHGAPCQRITIFK--NISGHQGFFLSGSRPCWCMVFRER------LRVHPQL 949
+ ++P P R + I G+ + G+ C ++ +E L V+
Sbjct: 785 YIPKYNDSPLEDPNARPPALRRMQIGGYNTVSIPGAPSC--LLLKEASGPPKILEVNEPK 842
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITY 1009
+ T L+ + C +GF V G L CQLP + + W +++I L ++ +
Sbjct: 843 RSNATTILTPLNRIGCENGFATVDVNGALHECQLPPDAWFSTGWSIRQIDLGDDAREVRH 902
Query: 1010 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS---SVDLHRTYTVEEYEVR 1066
A + V+ + +E G + ++S V + + + +
Sbjct: 903 LAYHEARGIFVAATC-----TTVDFYFAEEDGRHPEQDDISIRPQVPQYSVHLISAKTHK 957
Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR 1126
I+ + +P + AL V + + + E + ++ + T +GED+ A+
Sbjct: 958 IIHTHK----------LPYLETVTALKVMPAEV-SELSHEVKPVVVVSTGAQRGEDMPAK 1006
Query: 1127 GRVLLFSTGRNADNPQ----NLVTEVYSKE-LKGAISALASLQGHLL-IASGPKIILHKW 1180
G +++F +P L V ++E +GAI+ALAS G ++ A G K+++
Sbjct: 1007 GALIVFDVIDVVPDPDVEESGLHLHVLAREESRGAITALASFPGGMIGTAQGLKLMIR-- 1064
Query: 1181 TGTELNG----IAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
G +G +AF DA Y L + + L GD K ++F + ++ +L LL K
Sbjct: 1065 -GMREDGSCLPVAFLDAQ-CYTSLLKTLDSRGLWLAGDAWKGLWFGGFTQEPYKLTLLGK 1122
Query: 1235 DFGS-LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
+ ++ +FL L L+V D ++ + Y P+ +S GQ+LL R+ FH+G
Sbjct: 1123 SPRTEMEVIEADFLPFDGALFLLVLDADADLHVLQYDPENPKSLNGQRLLHRSTFHIGHF 1182
Query: 1294 VTKFLRL-QMLATSSDRTGAAPGSDKTNR---------FALLFGTLDGSIGCIAPLDELT 1343
T + L LA +++ P D + F +L T GSIG I PLDE T
Sbjct: 1183 PTGSMLLPSTLAPFTEQARDLPNGDSEDTKQEEVNSPLFHVLTTTSSGSIGLITPLDEST 1242
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA---HRPGPDSIVDCELLSHYEMLPL 1400
+RRL +LQ L + + H AGLNPR +R + KA G +VD L+ L
Sbjct: 1243 YRRLSALQGHLTNILEHAAGLNPRMYRT-DTEMKATDSEMGGAKGVVDGSLIRRISELGA 1301
Query: 1401 EEQLEIAHQTGTTRSQILSNL 1421
+ ++ + G Q+ S+L
Sbjct: 1302 ARRADVLSRVGGDVWQLRSDL 1322
>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1437
Score = 187 bits (476), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 169/624 (27%), Positives = 283/624 (45%), Gaps = 68/624 (10%)
Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
KE++ + V +L H P+L + + YQ Y + + + P S S
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
+ ++ N F++ P + ++ PH A P +R + NISG+ FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891
Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
P + + + L + A + H C HGFIY + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951
Query: 992 Y-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
V+KIP+ + Y Y +V ++P L D + + N++
Sbjct: 952 LGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1109
+ V+ +++L +G W T+ M+ E L V + L + +T E +
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK---------GAISAL 1160
L+A+GTA ++GED+ RGRV +F P T SK+LK GA++AL
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVFDIADVIPEPGKPET---SKKLKLVAKEDIPRGAVTAL 1113
Query: 1161 ASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIH 1214
+ + QG +L+A G K ++ K GT L +AF D YV S+ + L+ D
Sbjct: 1114 SEVGTQGLMLVAQGQKCMVRGLKEDGTLLP-VAFMDMN-CYVTSVKELPGTGLCLMADAF 1171
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
K ++F + E+ ++ L K ++ +FL DG L +V SD +I I + P+
Sbjct: 1172 KGVWFTGYTEEPYKMMLFGKSSTRMEVLNADFLPDGKELYIVASDADGHIHILQFDPEHP 1231
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+S +G LL R F+ GAH L + A + + + S++ + LL + G +
Sbjct: 1232 KSLQGHLLLHRTTFNTGAHHPTS-SLLLPAVYPNPSSLSSNSEENSPHILLLASPTGVLA 1290
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA--HRPGPDS-----IV 1387
+ PL E +RRL SL +L + +PH AGLNP+ +R + A PG D+ IV
Sbjct: 1291 TLRPLQENAYRRLSSLAVQLTNGLPHPAGLNPKGYRLPSPSASASMQLPGVDAGIGRNIV 1350
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTG 1411
D ++L + L ++ E+A + G
Sbjct: 1351 DGKILERFLELGTGKRQEMAGRAG 1374
Score = 124 bits (311), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 154/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)
Query: 94 DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
D ++A L LV L G + LA + + +S D ++L+F DA++S++E++ +
Sbjct: 96 DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155
Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
L S+H +E E + L+ DP RC + + IL Q +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215
Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
D +D G+ ++ IE S V+ L L+ + H F+H
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y +P + +L + H T M+ L + + I + LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
P+P+GG L+VGAN IH S +A+N S + ++ + L+ L
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393
Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
++ LL G L L+T DGR V L + P SV+ S +T++ G S F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+GS GDS+L+ +T G + ++ ++ D D + GEE
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507
Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
+ A + ++ + +F + D L++I P++ +YG + S
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567
Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
A G K S ++ E P +G WTV K + +D
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627
Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+YH ++I++ E + TA +T + G T+ AG + R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687
Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
+QV + R DG ++Q + + E+G+ V + SIADP++LL D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740
>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
Length = 1400
Score = 187 bits (474), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 150/542 (27%), Positives = 260/542 (47%), Gaps = 50/542 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRER----LRVHPQLCDGSIVAFTVLHNVNCNHGF 969
+ I NIS F+ G + + + R+ + G ++ L + + + GF
Sbjct: 885 LRILPNISDLSAVFMPGPSASFILKTAKSCPHVFRLRGEFVRG--LSIFDLASPSLDKGF 942
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
IYV S+ +L+IC+ PS + +D W ++KI + + Y Y L S +
Sbjct: 943 IYVDSKDVLRICRFPSETLFDYTWALRKIGIGEQVDHLAYATSSETYVLGTS------HS 996
Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
L D E+ N +S + R +++ R W + + +E
Sbjct: 997 ADFKLPDDDELHPDWRNEVISFLPELRQCSLKVVSPRT---------WTVIDSYSLGPAE 1047
Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE- 1147
+ V+ + L + T E ++ +GTA+ GED+ +RG + +F + +P+ T+
Sbjct: 1048 YVMAVKNMDLEVSENTHERRNMIVVGTAFAWGEDIPSRGCIYVFEVIKVVPDPEKPETDR 1107
Query: 1148 ---VYSKEL-KGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
+ KEL KGA++AL+ + QG L+ A G K ++ K G+ L +AF D YV
Sbjct: 1108 KLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV- 1164
Query: 1200 SLNIVKNF-----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
N+VK ++GD K ++F + E+ +++L KD G L+ A EFL DG L
Sbjct: 1165 --NVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKDQGYLEVVAAEFLPDGDKLF 1222
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
++V+D N+ + Y P+ +S G +LL+R++FH+G T L SS++ A P
Sbjct: 1223 ILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFATTMTLLPRTMVSSEKAMADP 1282
Query: 1315 GS----DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
S +T +L + GS+G + + E ++RRL +LQ +L +S+ H GLNPR++R
Sbjct: 1283 DSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQSQLTNSLEHPCGLNPRAYR 1342
Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL-ALGTS 1429
S+G A R ++D LL + + ++EIA + G +I ++L + A G
Sbjct: 1343 AVESDGTAGR----GMLDGNLLYQWLDMGQHRKMEIAARVGAHEWEIKADLEAIGAEGLG 1398
Query: 1430 FL 1431
+L
Sbjct: 1399 YL 1400
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 181/751 (24%), Positives = 310/751 (41%), Gaps = 127/751 (16%)
Query: 57 NLVVTAANVIEIY-VVRVQEE---GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
NLVV +V++I+ +++VQ G+ E K++ D + L L Y L G
Sbjct: 28 NLVVVKTSVLQIFSLLKVQHHLRGGTIEGKSARP-------DRVETTKLVLEREYPLSGT 80
Query: 113 VESLA---ILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
V + IL+ GG ++++LAF +AK+S++E+D HG+ S+H +E +
Sbjct: 81 VVDICRVKILNPKSGG-------EALLLAFRNAKLSLVEWDPERHGISTLSIHYYERDDL 133
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGD-------EDT 219
+ + G ++ VDP RC V +G++ + IL Q G L D +D
Sbjct: 134 TRSPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDD 192
Query: 220 FGS-----GGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
F G ++ SS V+ L LD + H F++ Y EP +L+
Sbjct: 193 FNQVSDHVGNDLKSKDRTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGVLY 252
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
+ T + + + + ++ + + S LP D +K++A+P P+GG L++
Sbjct: 253 SQIATSHALLPERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLI 312
Query: 325 GANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G+N +H + A+ +N +A + + + +S ++ L+ L + LL
Sbjct: 313 GSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHLSDSTGDLLLVL 372
Query: 382 KTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLL 434
+G++VL+ DGR V + L ++ +++ S ++ +G+ F GS DS+L
Sbjct: 373 SSGNMVLVHFQLDGRSVSGISLRPLPAQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVL 432
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASN 492
+ ++ S + ++ D +S D + D+ E E G +
Sbjct: 433 LSWSSMSSN---PKKPRPRMSNVAEDREEASVDSQSEEDVYEDDLYTAEPETPALGRRPS 489
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG-----------LRINADASATGISKQS---N 538
S + F + D L NIGPL+D + G L NA + I+ Q N
Sbjct: 490 AETSGVGVYIFQILDRLPNIGPLRDITLGKPASTVENTGRLIENACSELELIAAQGSGRN 549
Query: 539 YELV--------------ELPGCKGIWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIIS 582
LV + +G+WT G D R+ + EY Y+I+S
Sbjct: 550 GGLVLMKREIEPDVAASFDAQSVQGVWTAVVALGSGAPLVPDEQRI---NQEYRQYVILS 606
Query: 583 L-------EARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
++ + + DL E + D TI G L +RRV+QV
Sbjct: 607 KPEAPDKEQSEVFIADKQDLKPFKAPEFNPNNDV-----TIEIGTLSCKRRVVQVLRNEV 661
Query: 632 RILDGSYMTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
R SY D+ G P E S+ +S S+ADPY+ + D ++ LL D
Sbjct: 662 R----SY---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMLLQAD 712
Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
S V+ + + K SC LY DK
Sbjct: 713 DSGDLDEVELDDSTRAGK--WRSCCLYWDKA 741
>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
Length = 1506
Score = 186 bits (473), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 180/690 (26%), Positives = 297/690 (43%), Gaps = 128/690 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV NV+ +Y + VQ +G+ + + E +DG+ + V R GN
Sbjct: 29 NLVTVQGNVLSVYNL-VQAQGAADKRCHLEADISFTLDGVP----QDVATVRPRGN---- 79
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLK 171
RD +I F+DA+++++ FD + L S+H FE + W +
Sbjct: 80 ------------SRDLLIFTFKDARVAIVRFDPKMRDLETVSLHAFEDTDTKLGGWHSEQ 127
Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF-GSGGGFSARI 230
R R V VDP RC ++VYG ++I++ S G + + DT + F++R
Sbjct: 128 RLR--------VCVDPLHRCAALMVYGCKLIVISFSSGTATAAPEADTQEDTEQSFTSR- 178
Query: 231 ESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
VI+L L + V D F+ GY P + ILH+ W G ++ T ++ALS+
Sbjct: 179 ----VIDLLSLPSTIGRVDDMAFLDGYDVPCLAILHQPRPAWVGHMAKTKDTAHVTALSL 234
Query: 289 S------------TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
+ P++W NLP D + L VP+P+GGV+V+G N + Y +QS
Sbjct: 235 ALDEMTARRAPTAPPPPPPPVVWHQENLPSDTFALQPVPAPLGGVVVIGVNVLFYVTQSL 294
Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
+LALN Y+ + ++ ++ S++LD AH L L + +GD+ LLT+V
Sbjct: 295 VRSLALNGYSRASTNAPIQEQTGISLDLDGAHHALLTPTQILFALPSGDIHLLTIVCTDV 354
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--K 451
V L + K SV+ SDI T+G F+ SR SLL+++ + T + SG+ +
Sbjct: 355 TVDGLRMDKLATSVIGSDICTLGRRHIFIASRHATSLLLEWAPIPLSATTHIDVSGVSGR 414
Query: 452 EEFGDIEADAPSTKRLRRSSS------------DALQDMVNGEELSLYGSASNNTESAQK 499
++ G + ST L S+S D D+V+G +G S
Sbjct: 415 DDAGLYGTSSDSTAALNTSASRDGSSTGGDDLDDVYGDVVDGGTTGAHGIGSGGR---VM 471
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE---------------- 543
T RD+L + P+K + G +A +S YELV
Sbjct: 472 TVKLMARDALPTVAPIKSTAVG--TSAQGVVPHADPRSQYELVSCIGHDKNGALANISYS 529
Query: 544 -----------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
L K W V+ +S+ +H +++ S +TMV
Sbjct: 530 LKPQVLLTEDALSSVKDCWAVHSNNSK---------------HHTHVVFSKPKKTMVFRV 574
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
A ++ + + T+ AGN+ GR+ V+QV + +LD +D F
Sbjct: 575 AGDFEQLRHPRGFDTEASTVFAGNVMGRQLVLQVTAKHVMLLDD----RDCVFDERM--- 627
Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ + VS+ADPY+ L ++D + ++
Sbjct: 628 ---KKGVRITKVSVADPYIALLLNDATTKV 654
Score = 127 bits (319), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 163/723 (22%), Positives = 288/723 (39%), Gaps = 91/723 (12%)
Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN------------ 804
C ++G L IF VP+ VF F D+ R+ + E E
Sbjct: 807 CDKNGVLSIFQVPDMREVFCCTVFSVLPNVAWDSVYRKEIGPVELEPEMPLKRAKTMDEK 866
Query: 805 -----------SSSEEGTGQGRKENIHSMKVVELAMQRWSA----HHSRPFLFAILTDGT 849
+ E G+ Q ++ ++ E+ + A SRP LF
Sbjct: 867 GQSVFVEADEEADDESGSAQAEEDEQDRLQRKEMTIVELLAIGLGRGSRPHLFLRNETQH 926
Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
++ Y+ + TS S R RLR T +D + +
Sbjct: 927 VIVYEIF-------TS------SYKRHEKYEGRLQIRLRKRHQHPTWIDERLAQSS--SI 971
Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
P F +ISG G F+ RP W M + +R H DG++ FT L +
Sbjct: 972 PPAAFRPFADISGCDGVFVCARRPSWFMCDHTHKVVRHHAMRFDGAVQCFTQLKHAMHTS 1031
Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1027
F+Y T +G++++ +G P ++ P+KA+ + + E +Y +V + +P
Sbjct: 1032 CFLYFTGKGVMRMATTAAGQVLSTPLPSRRTPIKASACYVDFDPESGVY--VVVLKHKEP 1089
Query: 1028 LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
+ E +D S L + E Y + + + WQ P++
Sbjct: 1090 CAHLPKFGPPMEEAPAVDMKFASDEPLPQR---ERYSICLFSCED----WQLVPNSPVEI 1142
Query: 1088 SENALTVRVVTLFNTTTKENET----LLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-- 1141
+ V + N +++ + T +A+GT V GE RG + L+ P
Sbjct: 1143 PADH-HVTAFKVINISSERHLTGKKPCVAVGTTPVLGERNLERGLLQLYDVLEVVPEPGK 1201
Query: 1142 ---QNLVTEVYSKELKGAISALASLQGHLLIA----SGPKIILHKWTGTE-LNGIAFYDA 1193
+N + + S + GA++AL S++G+++ A GPKI + + E L IAF +
Sbjct: 1202 PTTKNRLKLMLSSDETGAVTALNSIEGYVIGALARRDGPKIFVWRVEDDEKLQPIAFLEG 1261
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYF--LSWKEQGAQLNL-----------LAKDFGSLD 1240
++ V+L + NF+++GD + L E LNL + +D
Sbjct: 1262 -SMFTVTLKVALNFVIIGDYMGRVMLARLIKDETLKILNLSKGTTSQALLQVGRDVAPTS 1320
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH--VTKFL 1298
+A +F++ G+ L ++ D+ N+ I + + +G ++L R + H +
Sbjct: 1321 VYAADFIVRGAELHVLFLDQHANMTILAFDSD-DPTTRGGRILKRHSVYNTGHQRIVALT 1379
Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1358
RLQ + + R + L + TL+G G I + E FRRL LQ +L+ +
Sbjct: 1380 RLQNVPPRNSRNAT------VDAHFLTYQTLEGGAGYITSIPEDIFRRLMLLQLRLLPHL 1433
Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
AGL+P +F+++ S + ++ + ML L+ Q E+A Q GTT Q+
Sbjct: 1434 KFRAGLHPSAFKKYKSASLHMVHQEVRTICADVYTRLFMLDLDAQKEVARQVGTTTKQLC 1493
Query: 1419 SNL 1421
+
Sbjct: 1494 DDF 1496
>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
Length = 1401
Score = 185 bits (469), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 158/559 (28%), Positives = 262/559 (46%), Gaps = 55/559 (9%)
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG---------SRPCWCMVF 939
N R P D+ T + + + I +ISG+ F+ G SR C +
Sbjct: 861 NHVLPRIPPDSDTNISDKEPSNHRPLCILPDISGYSAVFMPGTSASFIFKTSRSC-PHIL 919
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
R R V L D FT + + GFIYV S+ +++ICQLP + YD W ++K+
Sbjct: 920 RLRGGVVRSLSD---FDFT---DPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVA 973
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT 1059
+ + Y Y L S + L D E+ + N +S + R
Sbjct: 974 IGEHVDHLAYSISSETYVLGTS------HSADFKLPEDDELHPEWRNEAISFLPELRQCC 1027
Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYV 1118
+ +++ P W + + E + V+ + L + T E + ++ +GTA
Sbjct: 1028 L-----KVVHPKT----WTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALA 1078
Query: 1119 QGEDVAARGRVLLFSTGRNADNPQNLVTE----VYSKEL-KGAISALASL--QGHLLIAS 1171
+GED+ ARG + +F + +P+ T+ + KEL KGA++AL+ + QG L+ A
Sbjct: 1079 RGEDIPARGCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQ 1138
Query: 1172 GPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGA 1227
G K ++ K G+ L +AF D YV L +K ++GD K I+F + E+
Sbjct: 1139 GQKCMVRGLKEDGSLLP-VAFMDVQ-CYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPY 1196
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
+++L KD + A +FL DG L ++V+D N+ + Y P+ S G KLL R++
Sbjct: 1197 KMSLFGKDLEYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSK 1256
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSD-----KTNRFALLFGTLDGSIGCIAPLDEL 1342
FH+G H T L L T+S +A + +L + GSIG + + E
Sbjct: 1257 FHMG-HFTSTLTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEE 1315
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
++RRL +LQ +L ++V H GLNPR++R S+G A R ++D LL + + +
Sbjct: 1316 SYRRLSALQSQLANTVEHPCGLNPRAYRAIESDGTAGR----GMLDGNLLYQWLSMSKQR 1371
Query: 1403 QLEIAHQTGTTRSQILSNL 1421
++EIA + G +I ++L
Sbjct: 1372 RMEIAARVGAHEWEIKADL 1390
Score = 119 bits (297), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 167/723 (23%), Positives = 294/723 (40%), Gaps = 127/723 (17%)
Query: 57 NLVVTAANVIEIYV---VRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
NLVV +V++I+ V EG + S D + + L L Y L G V
Sbjct: 28 NLVVVKTSVLQIFSLLNVSCSAEGEIIAAKSARP------DQLQSTKLILEREYSLSGTV 81
Query: 114 ESLA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
L + ++ G D +I+LAF +AK+S++E+D +G+ S+H +E +
Sbjct: 82 SDLCRVKLLKTKSGGD------AILLAFRNAKLSLVEWDPERYGISTISIHYYERDDITR 135
Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GD----------- 216
+ + G ++ VDP RC V +G++ + IL Q G LV GD
Sbjct: 136 SPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLVMGDYESDSQKQSHE 194
Query: 217 ---EDTFGS-----GGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
+D+ G+ G SS V+ L LD + H F++ Y EP IL+ +
Sbjct: 195 HEMDDSAGNSKSKEGAVHQTPYASSFVLPLTALDSAILHPVSLAFLYEYREPTFGILYSQ 254
Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
T + + + ++ + ++ S LP D +K++A+P P+GG L++G
Sbjct: 255 IATSNSLLHERKDAIFYTVFTLDLEQRASTMLLSVTRLPSDLFKVVALPPPVGGALLIGY 314
Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
N +H + A+ +N ++ + + +S ++ L+ L N LL+ +
Sbjct: 315 NELVHVDQAGKTNAVGVNEFSRQVSTFSMADQSELALRLEGCVVELLGNSSGDLLLALSS 374
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLV 435
G +VL+ DGR V + + + P +I ++G+ F GS +S+L+
Sbjct: 375 GTMVLVHFKLDGRSVSGISI-RPLPGHAGGNILKAAASASASLGSDKVFFGSEDAESVLL 433
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN-- 493
++ S + S + E IE D S D +D LY +A +
Sbjct: 434 GWSLSSSNARKS---RSESKRIEKDHEEGSDDSESEEDVYED-------DLYSAAPDTPA 483
Query: 494 -------TESAQKTFSFAVRDSLVNIGPLKDFSYG-------------------LRINAD 527
S ++ F V D L N PL+D + G L + A
Sbjct: 484 LGHRLSVAPSTFASYKFKVHDVLPNTAPLRDIALGQPAMPVEDTGSHLDNICSELELVAA 543
Query: 528 ASATG-----ISKQSNYELVE----LPGCKGIWT---VYHKSSRGHNADSSRMAAYDDEY 575
+ G + K+ +V+ + G+WT +++ + D + + +E+
Sbjct: 544 YGSNGNGGLVVMKRELEPVVKASLNVGPIHGVWTASIALGSAAKPMSGDQTNI----EEW 599
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFE 628
Y+I++ + +T+ E +++ ++ F +I G L R+RV+QV
Sbjct: 600 RQYVILT-KPQTIDKEESEVFIVDGLNLKPFKAPEFNPNNDISIQVGTLSNRKRVVQVLR 658
Query: 629 RGARILDGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
R D DL P E S+ LS S+ADPY+ + D ++ LL
Sbjct: 659 NEVRSYD-----SDLELAQIYPVWDE--DTSDERMALSASLADPYIAILRDDSTLLLLQA 711
Query: 686 DPS 688
D S
Sbjct: 712 DDS 714
>gi|260835073|ref|XP_002612534.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
gi|229297911|gb|EEN68543.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
Length = 318
Score = 184 bits (467), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 107/297 (36%), Positives = 168/297 (56%), Gaps = 38/297 (12%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
+LVV + +Y ++ E S++ K +ELV + ++GN+ S+
Sbjct: 29 SLVVAGTTQLHVYRLKGDMEKSRKQK------------------MELVASFSMYGNIMSV 70
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G+D RD+++L+F DAK+S++E+D H L+ SMH FE E +K G S
Sbjct: 71 ESVQLAGSD----RDALLLSFMDAKLSIVEYDPGTHDLKTASMHYFEEEE---VKDGYVS 123
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P+V+VDP+GRC +L+YG ++++L + G+ DE +G S I +++I
Sbjct: 124 NYHAPMVRVDPEGRCAVMLIYGKRLVVLPFRKEGAV---DEAEMSAGSKSS--ILPTYMI 178
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L+DLD + +V D F+HGY +P ++IL+E TW GRV+ + TC I A+S++ +
Sbjct: 179 KLQDLDERLINVVDLQFLHGYFDPTLLILYEPLQTWPGRVAVRQDTCCIVAVSLNIAQRV 238
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
HP+IWS NLP D + +AVP PIGGVLV N++ Y +QS Y VSL+S
Sbjct: 239 HPIIWSVGNLPFDCKQAVAVPKPIGGVLVFAVNSLLYLNQSVP------PYGVSLNS 289
>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1 [Clonorchis
sinensis]
Length = 1741
Score = 182 bits (463), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 187/754 (24%), Positives = 321/754 (42%), Gaps = 103/754 (13%)
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
D+ ++ + + +G LEI+ +P+F ++ V F +VD A + ++ E+N +
Sbjct: 1007 DKSRYFAFIVFTNGVLEIYSLPDFTLLYEVHHFSDLPAMLVDC---RAGQGNKVEVNLEN 1063
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
++NI V+E+ + + RP L + T I ++A S +
Sbjct: 1064 IPNCPAAEEDNIPP-TVLEITVFPIGRNRDRPVLL-VRTSQEIAFFEALC------PSHN 1115
Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET-PHGAPCQRITI--------FK 918
+ S S S + R R L PL A R T P A Q + F+
Sbjct: 1116 EAHPFASESWSQEGL---RWRRLPIP-CPLVAPRRVRTDPKIADVQSTMLTRKNLLRPFE 1171
Query: 919 NISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+I GH G F+ G+ P W +RV DG + +F L+ C GF+Y T
Sbjct: 1172 DIDGHCGVFVCGATPIWLFSSDTGHIRVFNHSIDGIMGSFAPLNTDICPSGFVYFTYSNE 1231
Query: 978 LKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
+++ L G ++ + ++ +PL+ TP+ + Y E Y L+ + +K + V L +
Sbjct: 1232 MRLATLLPGYSFKEHLGMRWVPLELTPYFLQYHIESKTYALVGTR--VKSCSSVYHLNAE 1289
Query: 1038 ----QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA---GGPWQT--RATIPMQSS 1088
+EV + L S+D Y +++ P + PWQ A I +
Sbjct: 1290 GNKEEEVLLRPPTCVLPSLDY--------YVLQMYAPSTSLAEATPWQAIPHACIDFEPW 1341
Query: 1089 ENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1147
E + L + T + LA+G GE++ RGR+++ P +T
Sbjct: 1342 EVVTCMITAQLSSEQTFHGTKDYLALGANLSYGEEIPVRGRIIILDVIDVVPEPGQPLTR 1401
Query: 1148 -----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1202
+Y E KG ++AL+S QGHL+ A G K+ + +L G+AF D+ LY+ SL
Sbjct: 1402 HKLKTIYDGEQKGPVTALSSCQGHLVSAIGQKVYIWTLKNADLVGVAFVDSE-LYIHSLL 1460
Query: 1203 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
VKN IL D+ KSI L ++ L+++++D + + + F +DG L +V+DE+
Sbjct: 1461 CVKNLILAADVLKSIQLLRFQSDLRVLSVVSRDAIPREVYTSNFFVDGRRLGFLVTDERG 1520
Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGA------HVTKFLRLQMLATSS--------- 1307
N+ I+ Y P S G++L+ RA+ + V LR +L+ S
Sbjct: 1521 NVVIYSYDPLEPSSRSGRRLVRRADMCLPTRAISSLRVANRLRHALLSVKSAGTGTQTTV 1580
Query: 1308 ------------DRTG-------AAPG------------SDKTN------RFALLFGTLD 1330
+RTG APG TN + ++ GT
Sbjct: 1581 PSAAGVGGSEVLERTGKTGVSSFVAPGRANSASAMTLSTPSATNIDPEKLKHSVYLGTQT 1640
Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
G++ I PL + + RL+ +K L+ GL P+ + + + D +
Sbjct: 1641 GAVFLIGPLRDKMYSRLRITEKNLIHHFGPTCGLLPKLCWNYRPSAPELVNPSGQVADAD 1700
Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
LL Y LP ++LEIA ++G + I+ ++ +L
Sbjct: 1701 LLWRYLTLPHSQRLEIAKKSGQSLEGIMDDIAEL 1734
Score = 172 bits (436), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 141/482 (29%), Positives = 225/482 (46%), Gaps = 67/482 (13%)
Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
R DS++L+F +AK++V+ FD + L+ S+H +E + +LK GR F+ P+++VDP
Sbjct: 27 RLDSLLLSFTEAKVAVMGFDPVQYELKTLSLHNYE---FENLKSGRTHFSHLPILRVDPL 83
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDE--------DTFGSGGGFS------ARIESSH 234
RC VLVY + +L + + GD+ +T G S A + ++
Sbjct: 84 QRCAVVLVYDRHLAVLPFRRSEALAAGDKYLAKPVTNNTARGAGSLSWERRATAPLLATF 143
Query: 235 VINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
L + +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 144 TTCLSSSTGEKINNVLDMQFLNGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQ 203
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---V 347
+ +P+IW +LP+D + +VP PIGGVL++ N+I Y Q+ SC L LN YA
Sbjct: 204 KRTNPVIWFQESLPYDCTYVHSVPEPIGGVLILATNSIIYMKQTLPSCGLPLNCYAQVTT 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT--VVYDGRVVQRLDLSK 405
+ Q++P+ + LD + + L+ T+TG + LL+ V + + V L L +
Sbjct: 264 NFPMRQDVPQCG-PLTLDGCRIVTMTDSQFLIVTRTGKMCLLSLWVEHTTQTVSSLLLHE 322
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS------------GTSMLSSGLKEE 453
SV + + F+GSRL DS+L+ T + + + + +
Sbjct: 323 IGCSVPPYSVALLDKGYVFVGSRLCDSVLLHLTASTMFVNTLGRIVDLDETTTADNFRTD 382
Query: 454 FGDIEADA---------PSTKRLRRSSSDALQD----MVNGE------ELSLYGSASNNT 494
IE DA P+ K SS +V+G ++ LYG N
Sbjct: 383 IPMIERDAESIPVDKNNPTEKEAENVSSGTPSKPSGSIVHGPYVFDEVDVELYGDTILNP 442
Query: 495 ESAQK---TFSFAVRDSLVNIGPL-----KDFSYGLRINADASATG-ISKQSNYELVELP 545
S + T+ F V D LVN GP+ + Y N D + I+ Q+ VEL
Sbjct: 443 PSDVRELNTYKFEVADRLVNFGPMGLLTSGEVPYLAPGNTDPTDEALIAAQAEMHHVELL 502
Query: 546 GC 547
C
Sbjct: 503 AC 504
>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
Length = 990
Score = 182 bits (461), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 167/640 (26%), Positives = 276/640 (43%), Gaps = 83/640 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + G+ L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F + LD T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + T
Sbjct: 333 TDATNSVKSLELKFQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------- 384
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA---SNNTESAQKTFSFAVRDS 508
++ AP +L + +A +D E+ LYG +SA++ + V D
Sbjct: 385 -----LDESAPKKIKL-NTELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDK 434
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G ++ +S R SS
Sbjct: 435 LLNVGPCKKITGGCPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSS 494
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 495 SIEGVVQYWAVGRREDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGEL 553
Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
+QV ++ Q + V S SI DPY+ + +
Sbjct: 554 ADGGLAVQVTTSSLVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQN 603
Query: 678 GSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
G + + L P + + P++S ++Y D
Sbjct: 604 GRLLMYELTSHPHVHLKEIDISKRLRHETSPITSLSIYRD 643
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/273 (25%), Positives = 123/273 (45%), Gaps = 29/273 (10%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEEGTG 812
E+G + I+ +P + V+ V K +H+ D + D E + S++ T
Sbjct: 733 ENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMTDTF 788
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
+ E + ++EL M + RP LF +L D T+ Y+ + + N
Sbjct: 789 AAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGHLAV 839
Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGFFLS 929
+ L + V+ R+ RF LD E+ A + + F+ I G F+
Sbjct: 840 RFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFIC 893
Query: 930 GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGST 988
S PC + R+HP DG I++FT +N C +GFIY+T + ++++ +LP+
Sbjct: 894 SSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLPNDMI 953
Query: 989 YDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
D +PV++I + A+ H +TY N Y ++ S
Sbjct: 954 LDTSYPVKRIDVGASVHSVTYLLHSNTYAVLTS 986
>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
Length = 1383
Score = 179 bits (453), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 133/530 (25%), Positives = 243/530 (45%), Gaps = 55/530 (10%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
++ G+ +SG+ P + + L + I + + C G +YV ++
Sbjct: 873 LSDLGGYAAVVMSGASPNLIVRTSKSLPHVYSIQSDFIRGISGFNGAGCKKGLVYVDNER 932
Query: 977 ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
+++ CQL + + D WP+++IPL + Y Y + +
Sbjct: 933 LVRTCQLYNNAQLDFSWPIRRIPLNEQVDHLAYSTASGTYVVGTT--------------- 977
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYE---------VRILEPDRAGGPWQTRATIPMQS 1087
H+ D +LH + EE ++++ P W+ +
Sbjct: 978 -----HEQDFKLPDDDELHPEWATEEISLLPKVAYGSIKLINPKT----WKVIDSYTFSP 1028
Query: 1088 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ---- 1142
+E V + L + T + + ++ +GT Y +GED+AARG V +F +P
Sbjct: 1029 AERITAVENINLEISEKTGKRKDMIVVGTTYAKGEDIAARGNVYVFDVIDVVPDPDEPGT 1088
Query: 1143 NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLY 1197
NL ++ +E ++GA++A++ + QG +++A G K ++ K G+ L +AF D Y
Sbjct: 1089 NLKLKLIGEESIRGAVTAVSGIGGQGFMIVAQGQKCMVRGLKDDGSLLP-VAFIDVQ-CY 1146
Query: 1198 VVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
V + +K L+GD K ++F + E+ ++ L KD L+ +FL DG L +
Sbjct: 1147 VSVIKELKGTGMCLIGDAFKGLWFTGYSEEPYKMTLFGKDLDELEVVTADFLPDGKKLYI 1206
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
+V+D N+ + Y P+ +S G +LL+R +FH+G + L A SS+ +
Sbjct: 1207 LVADGDCNLYVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTLTLLPRTAVSSELAVMSSD 1266
Query: 1316 SDKTNRFALLFGTL----DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
S + + L+ L GS+ I L E ++RRL +LQ +L +++ H GLNPR++R
Sbjct: 1267 SMDIDSYTPLYQALITTQSGSMALITSLSEESYRRLTALQSQLSNTLEHPCGLNPRAYRS 1326
Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
S+G R ++D +LL + L +LEIA + G +I ++L
Sbjct: 1327 VESDGVVGR----GMIDGKLLMRWLDLSRSRKLEIAGRVGADEWEIRADL 1372
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 181/751 (24%), Positives = 305/751 (40%), Gaps = 137/751 (18%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++IY + + + +N + V + A L L Y L+G V +
Sbjct: 28 NLIVVKTSLLQIYTLVAETSTTLILENDQQADDDVKNE---ATKLHLHAEYDLYGTVTDI 84
Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ + S+ G D +++L+F +AK+S++E++ G+ S+H +E
Sbjct: 85 SPVKILKSRSGGD------ALLLSFRNAKLSLIEWNPETQGISTMSIHYYE--------- 129
Query: 173 GRESFARGPLV----------KVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE---- 217
+E P V VDP RC +L +G++ I IL Q G LV DE
Sbjct: 130 -KEDITLSPWVPDLSQCDSHLTVDPSSRCA-LLNFGVRNIAILPFHQAGDDLVMDEYDPD 187
Query: 218 ------------------DTFGSGGGF--SARIESSHVINLRDLD--MKHVKDFIFVHGY 255
D+ + G +S V+ L LD + H F+H Y
Sbjct: 188 LDMDDLTDQEENKKPSHTDSKKAEGDLIHQTPYAASFVLPLTALDPTLIHPIGLTFLHEY 247
Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
EP IL+ T A + + + S ++ + + S LP D ++A+P
Sbjct: 248 REPTFGILYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALP 307
Query: 316 SPIGGVLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
+P+GG L++G+N IH + A+A+N +A + + + +S + L+ + + N
Sbjct: 308 APVGGTLLIGSNEMIHIDQSGKASAVAVNEFAKQVSAFPMVDQSDLELRLEGSVVEVINN 367
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT--------IGNSLFF 424
+ LL+ TG+LVL+ DGR V + P+V D+ + +G+ F
Sbjct: 368 ESGDILLTLSTGELVLVHFKIDGRSVSGFVVFPI-PAVSGGDVVSAVASCAVALGSGKVF 426
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGE 482
+GS +S+L+ S S S + D E + + S A ++ VN
Sbjct: 427 IGSEDAESVLLDCYLPSAVSKKSRDYDRDHFDEEMNNEEDDDMYEDDLYSSAPKEAVN-- 484
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
+ G S+N ++F V D L+++GPL+ + G + D++A Q + + +
Sbjct: 485 KTVSNGRISDN-------YTFKVIDRLLSLGPLRAVAVGKPASRDSNAE--DAQQSVDDL 535
Query: 543 ELPGCKG-----------------------------IWTVYHKSSR-GHNADSSRMAAYD 572
EL G +W + +++ GHN DS
Sbjct: 536 ELAAAYGSGRGGGVALLQRTLHLDDVFTLGAESADSVWNITTSNTKSGHN-DSG------ 588
Query: 573 DEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQV 626
+E +Y+I++ T+V + E + D G TI L G RV+QV
Sbjct: 589 EENQSYVILTKANSPENEETLVYAVNERNLEPFNAPDVNPNGDPTIDIDVLAGNSRVVQV 648
Query: 627 FERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
RI D + M Q P E G E V S S AD Y+L+ D S+ LL
Sbjct: 649 LTGEVRIYDTNLGMAQ---IYPVWDED-EGDERFAV-SASFADHYLLIIRDDSSVLLLHS 703
Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
D S + P + S +P LY D+
Sbjct: 704 DESGDLDELTKPETV--SSQPWLCGCLYTDR 732
>gi|49619061|gb|AAT68115.1| cleavage and polyadenylation specificity factor 1 [Danio rerio]
Length = 312
Score = 178 bits (452), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/253 (39%), Positives = 149/253 (58%), Gaps = 18/253 (7%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE V + L GNV S+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 66 LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P+V+VDP+ RC +LVYG +++L + + DE
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFR---NDTLADEQEG 175
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G S++I++R+LD + ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 176 IVGEGQKFSFLPSYIIDVRELDETLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
TC I A+S++ K HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVP- 294
Query: 339 ALALNNYAVSLDS 351
+ VSL+S
Sbjct: 295 -----PFGVSLNS 302
>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
Length = 1323
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 169/647 (26%), Positives = 278/647 (42%), Gaps = 96/647 (14%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAIARISQNPDC---DSLLLGFDDAKLSIVAVNPADRCLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + + L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSTQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
C ++LN+ +D + P F ++ LD A T + + LL + G L L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332
Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
D V+ L+L +V+ +T+ F+GSRL DS+ + C S L
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDS 508
+TK+++ S+ + E+ LYG + ++ + V D
Sbjct: 389 -----------ATKKMKLSTEPNANE--EDEDFELYGEVLPKVAKPDVTEELLNIRVLDK 435
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
L+N+GP K + G + K ++LV G G + +S R SS
Sbjct: 436 LLNVGPCKKITGGCPSVSAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495
Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
+ +D+ H Y I S E T+ LET + L E+ E+ + TIAAG L
Sbjct: 496 SIEGVVQYWAVGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554
Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
+QV G +I Y+ L+F V S SI DPY
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTF--------------PVRSASIVDPY 597
Query: 671 VLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
+ + +G + + L P + + P++S ++Y D
Sbjct: 598 IAICTQNGRLLMYELTNQPHVSLKEIDISKRLRHETSPITSLSIYRD 644
Score = 147 bits (371), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 151/292 (51%), Gaps = 16/292 (5%)
Query: 1145 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
+ +Y KE KG +++L S G+LL G K+ + + L GI+F D Y+ L V
Sbjct: 1039 IKTLYDKEQKGPVTSLCSCNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYIHQLIGV 1097
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQK 1262
+N L D+++S+ L ++E+ L+L ++D S A +FLID + ++SDE
Sbjct: 1098 RNLALACDMYRSLALLRYQEEYKALSLASRDMRSDVQPPMAAQFLIDNKQMGFIMSDEAA 1157
Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
NI IF Y P+ ES G+KL RAE ++G V F+R++ +S G + F
Sbjct: 1158 NIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISS--------GFVENELF 1209
Query: 1323 AL-----LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+L LF +LDGS G + PL E FRRL LQ+ + V AGLN + R
Sbjct: 1210 SLERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRP 1269
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
H ++VD ++ Y L L E+ ++A + GT+R I+ +L ++ T+
Sbjct: 1270 NHYLNTRNLVDGDVAMQYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRVTA 1321
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 121/276 (43%), Gaps = 29/276 (10%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
+ E+G + I+ +P + V+ V K +H+ D + D E + S +
Sbjct: 731 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPVTGEGIDAMSGTMT 786
Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
T + E + ++EL + + RP LF +L D T+ Y+ + + N
Sbjct: 787 DTFAVKPEEV----IMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQGH 837
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
+ L + V+ R+ RF T D E+ A + + F+ I G
Sbjct: 838 LAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 891
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQLPS 985
F+ S PC + R+HP DG I++FT +N C +GFIY+T + +++ +LPS
Sbjct: 892 FICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLPS 951
Query: 986 GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
D +PV++I + AT H + Y N Y ++ S
Sbjct: 952 DMILDASYPVKRINVGATVHSVVYLLHSNTYAVLTS 987
>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
[Schistosoma mansoni]
Length = 1825
Score = 176 bits (446), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 181/769 (23%), Positives = 312/769 (40%), Gaps = 125/769 (16%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ + + +G LEI+ +P+F ++ V F ++D + + ++ +
Sbjct: 1086 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1142
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
++I ++E+ + RP L + T I ++A L P+
Sbjct: 1143 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1190
Query: 873 TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
S L RLR R PL A R T P Q + F+NI
Sbjct: 1191 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1249
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
H+G F+ G P W +LRV P DG + +F L+ C+ GF+Y T +++
Sbjct: 1250 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1309
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
LP G +++ + ++ I L P+ + Y E Y +V + +P V L +
Sbjct: 1310 TLPPGYSFNEHLGIKWITLDPVPYYVQYHVESKTY-AVVGIHS-EPCKSVFRLNAEGNKE 1367
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG------PWQTRATIPMQSSENALTVR 1095
+ + V T++ Y +++ P+ PW IP E
Sbjct: 1368 EDVLVRPKTCV----LPTLDYYSLQMYAPNLNANHRNKQPPW---LLIPNTLIEFEPWEV 1420
Query: 1096 VVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
V L ET LA+G GE++ RGR+L+ P +T
Sbjct: 1421 VTCLITAQLASEETFHGTKDYLALGANLTYGEEIPVRGRILILDVIDVVPEPGQPLTRHK 1480
Query: 1148 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
++ E KG ++AL S QGHL+ A G KI + T+L G+AF D+ LY+ +L V
Sbjct: 1481 LKIIHDGEQKGPVTALTSCQGHLISAIGQKIYIWTLKNTDLVGVAFVDSE-LYIHNLLCV 1539
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
KN +L D+ KS+ L ++ L+++++D S + + + F +DG L +VSDE N+
Sbjct: 1540 KNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNISREVYTSNFFVDGRRLGFMVSDELGNV 1599
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVT------KFLRLQMLATSSDRTGAAPG--- 1315
I+ Y P S G++L+ A+ + + T LR +L+ T A
Sbjct: 1600 TIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRVANRLRHALLSVKPSSTTTASAMTA 1659
Query: 1316 ------SDKTN--------------------------------------------RFALL 1325
D TN R ++
Sbjct: 1660 GTSATIQDSTNTVLDNLSRVDSVNQMNNLRQSQQQSTAAQQGTTNPNSGVDPEKFRQSIY 1719
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD- 1384
FG+ +GSI I P+ + + RL+ +K L+ + + G+ P+S + +RP P+
Sbjct: 1720 FGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGMPPKSCWSY------NRPQPEL 1773
Query: 1385 -----SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
+ D +L+ Y LP ++LEIA ++G + I+ ++ +L T
Sbjct: 1774 ANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIMDDIAELIATT 1822
Score = 156 bits (395), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 146/586 (24%), Positives = 252/586 (43%), Gaps = 123/586 (20%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +K + PT + NC +TH + + NLV+T
Sbjct: 15 AVFKHISPPTAVDNCLYCHLTHPK---------------------------LKNLVITRG 47
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
IEIY V+ S SGET+ V ++ N+ + + G
Sbjct: 48 GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 87
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
S++L+F +AK++V+ F+ LR S+H +E + +LK GR +F + P++
Sbjct: 88 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 140
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
++DP RC +LVY + +L + + + D + + + R + +
Sbjct: 141 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 200
Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+ +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 201 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 260
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
+ +P+IW +LP D +++VP PIGGV+V+ AN+I Y Q+ SC L LN YA +
Sbjct: 261 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 320
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
Q++P S + +D L L+ T++G+L LL++ + + V L K
Sbjct: 321 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 379
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
+V + + + F+GSR DS+L++ + G+ ++
Sbjct: 380 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 439
Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
S L + +E D + STKR +D + D
Sbjct: 440 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 499
Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
+ ++ LYG + + S + +SF V D L+N+GP+ + G
Sbjct: 500 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 545
>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum Pd1]
gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum PHI26]
Length = 1271
Score = 176 bits (446), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 134/467 (28%), Positives = 235/467 (50%), Gaps = 43/467 (9%)
Query: 978 LKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
++ CQLPS + +D W ++K+P++ + + Y Y L S L
Sbjct: 822 IRACQLPSQTQFDYSWTLRKVPIEEQVNFLAYSTSSETYVLGTS------RQGDFKLPEG 875
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1097
E+ + N LS + E ++++ P W + P+ E V+ V
Sbjct: 876 DELHPEWRNEELSFCP-----KIPESSIKVVSPKT----WTIIDSYPLDPDEQVTAVKNV 926
Query: 1098 TL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT----EVYSKE 1152
+ + T E L+ +GTA +GED+ ARG + +F + A +P+ T ++ KE
Sbjct: 927 NIEVSENTHERMDLIVVGTAIAKGEDMPARGTIYVFDVIKVAPDPERPETGRKLKLIGKE 986
Query: 1153 -LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
+KGA++AL+ + QG +++A G K ++ K G+ L +AF D YV N+VK
Sbjct: 987 TVKGAVTALSGIGGQGFIIVAQGQKCMVRGLKEDGSLLP-VAFMDMQ-CYV---NVVKEL 1041
Query: 1208 -----ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
++LGD K ++F + E+ ++ L KD L+ A +FL DG+ L ++V+D
Sbjct: 1042 KGTGMVILGDAVKGLWFAGYSEEPYRMTLFGKDPEYLEVVAADFLPDGNKLYMLVADSDC 1101
Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKT 1319
N+ + Y P+ +S G +LLSR++F+ G + VT R + + ++ + A D+T
Sbjct: 1102 NLHVLQYDPEDPKSSNGDRLLSRSKFYTGNFASSVTLLPRTAVSSELTESSEEAMDVDET 1161
Query: 1320 -NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
++ +L + +GS+ + + E ++RRL LQ +L+++V H AGLN R+FR S+G A
Sbjct: 1162 FAKYQVLIASQNGSLALVTSVAEESYRRLSGLQSQLINTVDHPAGLNARAFRATESDGAA 1221
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
R +VD LL + + + Q EIA + G T +I ++L +
Sbjct: 1222 GR----GMVDGNLLRLWLNMGKQRQAEIAGRVGATEWEIKADLETIG 1264
Score = 114 bits (286), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 175/758 (23%), Positives = 301/758 (39%), Gaps = 153/758 (20%)
Query: 57 NLVVTAANVIEIYVV------RVQEEGSK---ESKNSGETKRRVLMDGISAASLELVCHY 107
NL+V ++++I+ + ++Q+EGS+ + ETK L L Y
Sbjct: 28 NLIVIRTSLLQIFSLVKIVSSQLQKEGSEPHGSQFSQPETK------------LVLEKEY 75
Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
L G V L+ + +N ++I++A +AK+S++E+D HG+ S+H +E +
Sbjct: 76 PLSGTVTDLSRVKI--LNNKSGGEAILIAVRNAKLSLIEWDPERHGISTISIHYYERDDL 133
Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDE 217
+ G ++ VDP RC V +G++ + IL Q G LV G+
Sbjct: 134 TRSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELEGER 192
Query: 218 DTFGSGGGFSARIE-----------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
SGGG + SS V+ L LD + H F++ Y EP IL
Sbjct: 193 PIQNSGGGAEPKKSKEGPAYQTPYCSSFVLPLTALDPSLLHPISLAFLYEYREPTFGILF 252
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
+ T + + + ++ + + S LP D +K++A+P P+GG L++
Sbjct: 253 SQVATSTALLYERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALLL 312
Query: 325 GANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G+N I H + A+ +N ++ + S +S + L+ L D LL+
Sbjct: 313 GSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLAL 372
Query: 382 KTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+GD+ L+ DGR V + D+ K+ S + +G+ F+GS
Sbjct: 373 ASGDMALIKFKLDGRSVSGITIHLLPAHAGGDMLKSAASC----SSCLGDGNVFIGSEDA 428
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEE 483
DS+L++++ S STK+ R S D E+
Sbjct: 429 DSVLLEWSRSSA--------------------STKKARLESKQTADGFDDLEDDDDQMED 468
Query: 484 LSLYGSASNNTESAQKT---------FSFAVRDSLVNIGPLKDFSYGLRIN---ADASAT 531
LY SA +T+ + ++F ++D L +IGPL+D + G + + AT
Sbjct: 469 DDLYSSAPGSTQVDNRMGTENLTTEFYNFRLKDCLPSIGPLRDITLGKVFSNTYREKQAT 528
Query: 532 GISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADS 565
+ + ELV G G+W+ K RG
Sbjct: 529 CEAVSAELELVASQGSDRGGGLVVIKREIDPLTTMSLKIDDADGVWSASVKKRRG----- 583
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLF 618
++ D+ Y+++S + E ++ +++ F + T+ G+
Sbjct: 584 --ASSTDNPSRQYVVVSRSTDSE-QELNEVFVAEEQNLKPFRAPEFNPNEDCTVDIGSFA 640
Query: 619 GRRRVIQVFERGARILDGSYMTQDLS-FGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
G R++QV R D M LS P E S+ +S S DPY+++ D
Sbjct: 641 GDTRLVQVLRNEVRSYD---MELGLSQIYPVWDE--DTSDERVAVSASFIDPYLMIIRDD 695
Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
S+ LL D + V I SS+ S LY+D
Sbjct: 696 SSVLLLQADENGDLDEVPLSTLIISSR--WRSGCLYYD 731
>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1912
Score = 175 bits (444), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/344 (31%), Positives = 185/344 (53%), Gaps = 24/344 (6%)
Query: 1111 LAIGTAYV--QGEDVAARGRVLLFST---------GRNADNPQNLVTEVYSKELKGAISA 1159
+ IGT YV GED + +GR+LL+ G + L + +GAI++
Sbjct: 1570 IVIGTGYVGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITS 1629
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIHKSIY 1218
+ L ++L + G K+I++++ +L G AFYDA +++ SL+++ K +++ D++KS+
Sbjct: 1630 VIQLGMYVLASVGSKMIVYEFKSDQLIGCAFYDAQ-MFITSLSVLRKEYVMYSDVYKSVS 1688
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
FL W+++ QL LLAKD+ L EF I + L+L+ +D ++N+ + YAP ES
Sbjct: 1689 FLRWRQKDRQLILLAKDYEPLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRG 1748
Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG-AAPGSDKTNRFALLFGTLDGSIGCIA 1337
GQ+LL ++FHVG ++ LR +++ +S + A G N + + G+ +G I +
Sbjct: 1749 GQRLLRTSDFHVGVQISSILRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALI 1808
Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS---------IVD 1388
P+ E FRRL +LQ ++ ++P LNPR FR +NG+ D+ +D
Sbjct: 1809 PVPERVFRRLFTLQNVMISALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLD 1868
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT-SFL 1431
++L + L Q E+A GT I+ NL++L T SFL
Sbjct: 1869 GQVLCRFLHLDYVAQKELARCIGTNPEVIIQNLSELQRNTMSFL 1912
Score = 166 bits (419), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 153/569 (26%), Positives = 252/569 (44%), Gaps = 124/569 (21%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+LD+K + DF F+ GY+EP ++ILHE + +GR + + T ++ LSI+
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
+ HP+IW NLP D ++++ PIGG L++ N I Y +Q+ + LN +A
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501
Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
+ + + LP +S S+ L H +L + LLS
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561
Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
L +L + RV + + T +L S +T N F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621
Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
L SGT ++S SG G D + +A ++L+ S D +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681
Query: 481 GEELSLYG------SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGL 522
++ LYG S + FSF+ D + IG + G+
Sbjct: 682 EDDAFLYGPTLSQESTGGGKLPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGV 741
Query: 523 RINADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHN 562
+ N+D++ + G SK + ++ EL GC+ +WTV SS
Sbjct: 742 QSNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPE 801
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
+ +R Y +YLI+S+ RTM+L T + + + + ++ G T+ A NLF +RR
Sbjct: 802 SQVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRR 855
Query: 623 VIQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSE 657
++QVF++G R++ + + TQ++ F + ESG +
Sbjct: 856 IVQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTA 914
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
N ++SV DPY+LL ++DGSIRLL GD
Sbjct: 915 NVGIVSVDTIDPYILLLLTDGSIRLLEGD 943
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 68/316 (21%), Positives = 127/316 (40%), Gaps = 64/316 (20%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
+CY G+L ++ VP+F + +++T E+ + S +T + S+ G
Sbjct: 1086 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1145
Query: 811 ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
T +GR H++ V ++A+ R H+S+ F L L +G +
Sbjct: 1146 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1205
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
+ Y+ P S D + + V+ +R ++ + +A T +E G+
Sbjct: 1206 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1261
Query: 910 P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
+T F N++ + G F G+ P W + + + L +
Sbjct: 1262 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1321
Query: 949 LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
D + +++FT H+ NC +GF+Y S G L++C+LPS N + +
Sbjct: 1322 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1381
Query: 996 QKIPLKATPHQITYFA 1011
QK+ AT H + Y
Sbjct: 1382 QKVRFGATIHHLLYLG 1397
>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1911
Score = 175 bits (443), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/344 (31%), Positives = 185/344 (53%), Gaps = 24/344 (6%)
Query: 1111 LAIGTAYV--QGEDVAARGRVLLFST---------GRNADNPQNLVTEVYSKELKGAISA 1159
+ IGT YV GED + +GR+LL+ G + L + +GAI++
Sbjct: 1569 IVIGTGYVGPNGEDASGKGRLLLYEVDYAQYVDKDGTTSSKLPKLRLTFIKEHHQGAITS 1628
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV-KNFILLGDIHKSIY 1218
+ L ++L + G K+I++++ +L G AFYDA +++ SL+++ K +++ D++KS+
Sbjct: 1629 VIQLGMYVLASVGSKMIVYEFKSDQLIGCAFYDAQ-MFITSLSVLRKEYVMYSDVYKSVS 1687
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
FL W+++ QL LLAKD+ L EF I + L+L+ +D ++N+ + YAP ES
Sbjct: 1688 FLRWRQKDRQLILLAKDYEPLAVTTAEFNILDTRLALIAADVEENLHVLQYAPHDIESRG 1747
Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG-AAPGSDKTNRFALLFGTLDGSIGCIA 1337
GQ+LL ++FHVG ++ LR +++ +S + A G N + + G+ +G I +
Sbjct: 1748 GQRLLRTSDFHVGVQISSILRKLVISNASHQQYIPAKGRCIGNMYLNVLGSSEGGIAALI 1807
Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS---------IVD 1388
P+ E FRRL +LQ ++ ++P LNPR FR +NG+ D+ +D
Sbjct: 1808 PVPERVFRRLFTLQNVMISALPQNCALNPREFRVMKANGRVRSGRADAWCKQKWKKGFLD 1867
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT-SFL 1431
++L + L Q E+A GT I+ NL++L T SFL
Sbjct: 1868 GQVLCRFLHLDYVAQKELARCIGTNPEVIIQNLSELQRNTMSFL 1911
Score = 166 bits (421), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 152/568 (26%), Positives = 253/568 (44%), Gaps = 123/568 (21%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+LD+K + DF F+ GY+EP ++ILHE + +GR + + T ++ LSI+
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
+ HP+IW NLP D ++++ PIGG L++ N I Y +Q+ + LN +A
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501
Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
+ + + LP +S S+ L H +L + LLS
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561
Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
L +L + RV + + T +L S +T N F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621
Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
L SGT ++S SG G D + +A ++L+ S D +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681
Query: 481 GEELSLYG-----SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGLR 523
++ LYG ++ + FSF+ D + IG + G++
Sbjct: 682 EDDAFLYGPTLSQESTGGAMPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGVQ 741
Query: 524 INADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNA 563
N+D++ + G SK + ++ EL GC+ +WTV SS +
Sbjct: 742 SNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPES 801
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+R Y +YLI+S+ RTM+L T + + + + ++ G T+ A NLF +RR+
Sbjct: 802 QVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRRI 855
Query: 624 IQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSEN 658
+QVF++G R++ + + TQ++ F + ESG + N
Sbjct: 856 VQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTAN 914
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGD 686
++SV DPY+LL ++DGSIRLL GD
Sbjct: 915 VGIVSVDTIDPYILLLLTDGSIRLLEGD 942
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 68/316 (21%), Positives = 127/316 (40%), Gaps = 64/316 (20%)
Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
+CY G+L ++ VP+F + +++T E+ + S +T + S+ G
Sbjct: 1085 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1144
Query: 811 ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
T +GR H++ V ++A+ R H+S+ F L L +G +
Sbjct: 1145 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1204
Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
+ Y+ P S D + + V+ +R ++ + +A T +E G+
Sbjct: 1205 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1260
Query: 910 P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
+T F N++ + G F G+ P W + + + L +
Sbjct: 1261 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1320
Query: 949 LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
D + +++FT H+ NC +GF+Y S G L++C+LPS N + +
Sbjct: 1321 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1380
Query: 996 QKIPLKATPHQITYFA 1011
QK+ AT H + Y
Sbjct: 1381 QKVRFGATIHHLLYLG 1396
>gi|422295485|gb|EKU22784.1| cleavage and polyadenylation specificity factor subunit 1
[Nannochloropsis gaditana CCMP526]
Length = 395
Score = 174 bits (442), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 108/334 (32%), Positives = 184/334 (55%), Gaps = 22/334 (6%)
Query: 1108 ETLLAIGTAYV--QGEDVAARGRVLLFSTGRNA-----DNPQNLVTEVYSKELKGAISAL 1160
+T LA+GT V +GEDV ++GR+L++ + P + + YS+ G +A+
Sbjct: 68 DTYLAVGTCTVRAKGEDVPSKGRLLMYRISLDPYAGLTSPPTLTLVDQYSQR-SGPPTAI 126
Query: 1161 ASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
A L H++IA+GP + ++ ++ E L IAFYDA YVVSL +VK + + D + S++
Sbjct: 127 AQLGPHIIIAAGPTLWVYAFSAREKLKPIAFYDAD-FYVVSLRVVKTLVAVTDAYHSVHL 185
Query: 1220 LSWKEQGAQ--LNLLAKDFG---SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
L W E L L+ KD+ S + F++D +L ++V D + N+Q+ Y P
Sbjct: 186 LRWHEHDPAHTLELMGKDYSPIVSAQPGGSHFVVDPPSLGMLVGDSRGNLQLLQYDPADV 245
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
ES G +L+ RA+FH+ +H FL+ +A PG+ + ++FG+++G +G
Sbjct: 246 ESRGGNRLVRRADFHL-SHRLSFLQHTRMAEVPR-----PGAYRAGVRVMVFGSVEGGVG 299
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ P++E +RRL +LQ +V+++PHV NPR FR + G A +D ELL
Sbjct: 300 ALVPVEEKVYRRLYALQAVMVNALPHVGAFNPRGFRLVEARGWAQG-RKKGTLDGELLWR 358
Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
+ L + +Q ++A GT+R +L +L ++ + T
Sbjct: 359 FAGLSVGKQEDLASAIGTSREMVLESLLEVDMMT 392
>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
Length = 1259
Score = 174 bits (440), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 110/322 (34%), Positives = 176/322 (54%), Gaps = 21/322 (6%)
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGR----NADNPQNL-VTEVYSKELKGAISALASLQ 1164
+ +GT +GEDVA +G + LF + D+ N + + +E KGA+SA+ S
Sbjct: 950 FIGVGTCINRGEDVAVKGAMYLFEIAELIPSSKDSGNNYKLKMLMREETKGAVSAITSCS 1009
Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
G+ ++A G K+++ E L +AFYDA Y+VSL ++KNFIL+GD KSI FL+++
Sbjct: 1010 GYFVVAVGQKVLIRALEINERLISVAFYDAG-TYIVSLEVLKNFILVGDQVKSITFLAFQ 1068
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E +L L++D ++ + FL +S V +D Q ++++ Y P + G+KL+
Sbjct: 1069 ESPYKLVQLSRDARQIETCVSNFLAHEDQISFVSNDIQGDLRLIDYNPFDPTAEGGEKLI 1128
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
EFH G+ T L L + P S+ LL G +DGS+ C++P+DE+T
Sbjct: 1129 RTTEFHKGSEATCSLLLP-------KPSVRPSSE------LLLGCVDGSLSCLSPVDEIT 1175
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
F+ L LQ LV +PH+A LNPR+ R N R I+D LLS Y+ + Q
Sbjct: 1176 FKALWLLQGALVRQIPHIAALNPRAHRHVR-NDYVSRSLSKGILDGLLLSAYQTIDHATQ 1234
Query: 1404 LEIAHQTGTTRSQILSNLNDLA 1425
+EIA + G +++++L L + +
Sbjct: 1235 VEIAKRIGYSKAELLGYLRNFS 1256
Score = 136 bits (343), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 164/666 (24%), Positives = 278/666 (41%), Gaps = 103/666 (15%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV--- 113
N+V TA N ++IY + + +A L L Y+LHG +
Sbjct: 30 NIVTTANNTLKIYEIDIDS-------------------NTPSAKLILRREYQLHGEIIGI 70
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
+S+ ILS +D +++AF DAKI++LE+ D I+ + S+H +E + + + +
Sbjct: 71 QSIKILST----TEDGKDRLLIAFRDAKIALLEWSDEINDIVTVSIHTYERSQQV-ISQD 125
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
F +++ DP+ RC +L+ + IL + L D D S S
Sbjct: 126 MSRFK--AILRSDPENRCSALLLPDDSLAILPVHSAHAEL-EDLDQDVSNAIKDVPYAPS 182
Query: 234 HVINLR--DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
++ L+ D D+ +V D+ F+ G+ P + +L E TW GR+S TC + L++
Sbjct: 183 FILPLKSIDSDICNVIDYTFLPGFHNPTLAVLCEPRQTWTGRLSDSQDTCQVFFLTLDLV 242
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
+ +P+I + NLP+D+ L A P IGGV ++ AN IH A N +A +L
Sbjct: 243 TQVYPIIATVDNLPYDSMSLKAAPKEIGGVAILSANAIIHVDQNGRPVGRATNGWA-TLT 301
Query: 351 SSQEL--PRSSFSVELDAAHATWLQ------NDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
S++ P V L+ A +LQ + ALL G++ + +GR + R+D
Sbjct: 302 SARNFDAPPKDLFVRLEGASIEFLQPKSKQTHPQALLFLPNGEIHAVQFYREGRTISRID 361
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
+SK I + L G L V GTS L + + D+E
Sbjct: 362 ISK---PFAKGSIPSGAYRLDIDGQGLSGGQFVFIPSMVGTSFLIR-VGKSLNDLEL--- 414
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS------FAVRDSLVNIGPLK 516
+ + + A DM + LYGS+ + ++ F + D + + GP++
Sbjct: 415 -FPKQEKVGTTAYDDMDVDVDEELYGSSDKKADEKEEEEEISSEPPFTICDYIESYGPIQ 473
Query: 517 DFSYG---------LRINADASA------TGISKQSNYE---LVELPGCKGIWTVYHKSS 558
D + G L+I A A T ++ +E +++ G G+WT ++ +
Sbjct: 474 DITIGRYMQTRNSPLQILAATGAGHVGGITAFHQEVPFESKHKLDVQGNHGLWT-FNVTG 532
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
G+ + A D + + L + + L D EV TIAA
Sbjct: 533 VGN-----VLVATDSKSKTKISKLLPSNEVALIAED--NEV-----------TIAADTAA 574
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
R++ + ++L + Q EN V SI+DPY+L S+G
Sbjct: 575 NSTRILMITSNAIKVLKEDGIEQ----------QSLQIENGEVQRASISDPYILTLQSNG 624
Query: 679 SIRLLV 684
SI L +
Sbjct: 625 SISLFI 630
>gi|291232724|ref|XP_002736306.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 304
Score = 173 bits (439), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 175/354 (49%), Gaps = 62/354 (17%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+A Y+ +H PTGI +C G EE NL++
Sbjct: 2 YALYRQIHPPTGIEHCVYGH-------------FFSKEE--------------KNLIIAG 34
Query: 63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
A + +Y + + + SK+ K+ E R + L GN+ SL
Sbjct: 35 ATDLHVYRL-LSDVDSKQKKSKLEHLRS----------------FSLFGNIMSLQTTRLA 77
Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
GA RD+++L+F+DAK+SV+E+D H L+ S+H FE LK G S P
Sbjct: 78 GAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEEA---LKEGYVSNYYIPQ 130
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
V VDP RC +L+YG ++++L + G+ D+D G S+ + S++INL+D+D
Sbjct: 131 VVVDPDNRCAVMLMYGSKLVVLPFRREGAA--EDQDGVLPGSSKSSFL-PSYIINLQDID 187
Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
K ++ D F+HGY EP + IL E TW GRV+ + TC I A+S++ + HP+IWS
Sbjct: 188 QKLINIIDIKFLHGYYEPTLFILFEPLRTWPGRVAVRKDTCCIVAISLNIEQRVHPVIWS 247
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
NLP D K + VP PIGGVLV +++ Y +QS Y VSL+ E
Sbjct: 248 LNNLPFDCIKAIPVPKPIGGVLVFAVDSLLYLNQSVP------PYGVSLNGLTE 295
>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_b [Rattus norvegicus]
Length = 241
Score = 173 bits (438), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 139/239 (58%), Gaps = 13/239 (5%)
Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
+AF D LY+ + VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F+
Sbjct: 1 MAFIDTQ-LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFM 59
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
+D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R +
Sbjct: 60 VDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWR-------T 112
Query: 1308 DRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
GAA G K N+ F TLDG IG + P+ E T+RRL LQ L +PH A
Sbjct: 113 PCRGAAEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHA 172
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
GLNPR+FR H + + + +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 173 GLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 231
>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1627
Score = 173 bits (438), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 100/287 (34%), Positives = 166/287 (57%), Gaps = 6/287 (2%)
Query: 1142 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1201
Q + +Y K+ KG ++++A L G L+++ GPK+I++ ++ L G+AFYD +++VSL
Sbjct: 1336 QKRLNLLYEKDQKGPVTSIAGLNGLLIMSIGPKMIVNNFSSGSLIGLAFYDTQ-IFIVSL 1394
Query: 1202 NIVKNFILLGDIHKSIYFLSWKE---QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
N VKN+IL+GD+ KSI F K Q + LL KD+ + ++++F++D LS+V+S
Sbjct: 1395 NTVKNYILVGDMFKSISFFKLKVCIIQKKNIILLGKDYEEVSTYSSDFIVDEKKLSMVLS 1454
Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
D +NI++F + P ES GQ LL+++ FH+G KF+R+ M T+ D ++
Sbjct: 1455 DANRNIRMFSFDPSDPESRAGQMLLAKSSFHIGELNNKFVRIPMKNTNYDNNSSSSSIIV 1514
Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ-FHSNGK 1377
++ L +GTL G I + P+++ L +L+ KL+ AGLNPR FR H N
Sbjct: 1515 NDKHLLFYGTLGGGINLLMPINKRFHEILHALETKLMHR-GQTAGLNPRGFRYGHHVNNT 1573
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+VD +LL+ ++ L ++ ++A G+T IL LN L
Sbjct: 1574 LGHLHNQYVVDGDLLTKFQSLSPDDAKQLATSIGSTTPIILDLLNQL 1620
Score = 115 bits (287), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/242 (28%), Positives = 114/242 (47%), Gaps = 37/242 (15%)
Query: 912 QRITIFKNISGHQGFFLSG-SRPCWCMVFRERLRVHPQ---------------LCDGSIV 955
+RI F NI +G F+SG S P W + R+HP I
Sbjct: 1018 RRIIPFSNIGNKRGIFVSGVSTPIWIFSEKNFPRIHPMKQQQQTTSSSSSSSSSSKRPIT 1077
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNL 1015
FT HN+NC HGFIY G+L IC+LP G+ Y+N WP++K+ ++ T H+I+Y +
Sbjct: 1078 TFTTFHNINCKHGFIYFDHTGMLCICRLPDGTNYENEWPIRKLAIRMTCHKISYHPVQKC 1137
Query: 1016 YPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAG 1074
Y L++S P ++ ++E+ L + +EE Y++++++P
Sbjct: 1138 YVLVLSYPQAPQSDEDEQEEQEREL-------------LKKPLVLEEKYQLKLIDP---A 1181
Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKEN----ETLLAIGTAYVQGEDVAARGRVL 1130
W + + E L +++ L + + + + +GTAY GED +GR+L
Sbjct: 1182 NNWNIIDSFSLAEKETVLCSKIIYLRHADESDIIPKLKPFVIVGTAYTHGEDTVCKGRIL 1241
Query: 1131 LF 1132
+F
Sbjct: 1242 IF 1243
>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
Length = 1187
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 147/491 (29%), Positives = 225/491 (45%), Gaps = 82/491 (16%)
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQIT 1008
L DG + F + NC +GF+Y S+ L+IC L TYD WPV K+PL+ T H IT
Sbjct: 767 LVDGYVKCFAPFNIANCPNGFLYFNSEEDLRICVLDQRFTYDCPWPVHKVPLRNTLHFIT 826
Query: 1009 YFAEKNLYPLIVS-VPVLKPLNQVLSL---LIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1064
+ Y +I S + V + + + + I E G + + + L + T E +E
Sbjct: 827 HHFVTKTYVIISSTMTVCEKMPHITTEDKEFIPVEKGDRFIHAPVEKFCL-QLITSETWE 885
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDV 1123
+ PD A I M E+ ++ V L + T + +A+GT V GE+V
Sbjct: 886 II---PD---------AEIQMAEWEHVTCLKSVKLKSEETVSGLKEFIAVGTTNVCGEEV 933
Query: 1124 AARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
A RGR+++F P +N + Y KE KG ++A+ ++G L+ + G KI +
Sbjct: 934 ACRGRIVIFDVIEVVPEPGKPLTKNKIKTYYDKEQKGPVTAITCVEGFLVTSIGQKIYIW 993
Query: 1179 KW-TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
++ +L G+AF D +Y+ SL D
Sbjct: 994 EFRDNKDLIGMAFIDTL-IYIHSL---------------------------------DRH 1019
Query: 1238 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1297
L+ F T F ++ + L V AP ES GQ L+ RAE G++ F
Sbjct: 1020 QLEIFNTNFYVNKNQLGFV-------------AP---ESHGGQFLVRRAEIQTGSNAHAF 1063
Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
R ++ A + + N+ FGTLDGSIG + P+DE +RRL SLQ KL
Sbjct: 1064 FRTKVRALNQRQ--------NENKHITWFGTLDGSIGLLLPVDEKEYRRLFSLQAKLSIY 1115
Query: 1358 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+ AGLN ++FR F S+ K + +I+D +LL Y L E+ ++A Q +T QI
Sbjct: 1116 LEQNAGLNQKAFRTFRSHQKKLQNSMRNILDGDLLKRYFHLGFVERRDLAKQIMSTPEQI 1175
Query: 1418 LSNLNDLALGT 1428
+++L L L T
Sbjct: 1176 INDLTKLELST 1186
Score = 144 bits (364), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 144/299 (48%), Gaps = 19/299 (6%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+ I +Y + +E S + D LE + Y +G + +
Sbjct: 29 NLLTAGPTCIRVYDIIKDQEDIDLDNRSDNADNHLNKDNKLHPELEFLASYSFYGKIYGI 88
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ RDS+ + F DAK+S++E+D L S+H FE E LK G
Sbjct: 89 ----ESVRFRHHHRDSLFICFADAKLSLVEYDADNSNLTTLSLHTFEDDE---LKNGFSR 141
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-DTFGSGGGFSARIESSHV 235
P+++VDP RC ++V + + IL G + D + G + + S+V
Sbjct: 142 NLSIPIIRVDPDNRCAAMVVSNVHLAILPFRHRGPAEQQVQIDPKNTSGKYP--LMPSYV 199
Query: 236 INLRDLDMKHVKDFI---FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+++RDL + V I F+ GY EP ++IL E TW+GRV+ + TC I A+S++T
Sbjct: 200 VDVRDLGNEKVSRLIDIRFLEGYYEPTILILCEILRTWSGRVAVRQDTCSILAVSLNTID 259
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
K HP+IWS NLP D + VP PIGGVL+ AN + + +QS YA SL+S
Sbjct: 260 KVHPVIWSLNNLPFDCLGAITVPRPIGGVLIFAANCLLHLNQSKP------PYAESLNS 312
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/391 (22%), Positives = 159/391 (40%), Gaps = 100/391 (25%)
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT--ESAQKTFSFAVRDSLVNIGPL 515
+ D P++K+LR +++ LY + ++ T ES ++++F V D ++++GP
Sbjct: 336 DTDEPTSKKLRTDDEKEDEELE-----KLYSAHTSCTAKESYLRSYTFEVCDRILHVGPC 390
Query: 516 KDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCKG 549
+ G +T + ++S+ E+V +LPGC
Sbjct: 391 ASIAIG------QISTFVQEESDVEVVICSGHDKNGALSVLNKGIKPQVVASYDLPGCVD 444
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+WTV K R ++ + + H +LIIS + TM+L T +TEV E + + Q
Sbjct: 445 MWTV--KDIRLNDENDGDFET--ENTHKFLIISRDNLTMILRTGKEITEV-EQLGFLTQT 499
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
+T+ AGNL +IQV ++ Q L S ++ S+ DP
Sbjct: 500 KTVFAGNLDNGNCIIQVTPYEVILVSKGEKIQQLEL----------ENESPIVFCSLQDP 549
Query: 670 YVLLGMSDGSIRLL---VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------- 715
Y+ L + GSI +L + D V + + S+ +++C L+ D
Sbjct: 550 YISLLLEGGSIMMLAFELSDNGEKQVKLVNTTPLNHSR--IAACCLFQDNNGRMSVSDGI 607
Query: 716 --KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI--------------------- 752
+ P P T+ A L ID + LD D
Sbjct: 608 SIRTPSP----TNEPAELMEDEKFTIDDDELLYLDVNDTNLQTNDVPVASTSYTDNLERK 663
Query: 753 ---YSVVCYESGALEIFDVPNFNCVFTVDKF 780
+ +C ++G LE++ +P+++ V+TV+ F
Sbjct: 664 VSYWLFLCLDNGKLEVYSIPSYDKVYTVNGF 694
>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana RWD-64-598
SS2]
Length = 1490
Score = 169 bits (428), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 177/350 (50%), Gaps = 19/350 (5%)
Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
W T +E V VVTL +T+ ++ +A+GT +GED+A RG +F
Sbjct: 1142 WITLDGYEFAPNEFVNAVEVVTLETLSTETGSKEFVAVGTTINRGEDLAVRGATYIFEVV 1201
Query: 1136 RNADNPQNLVTEVYSKEL------KGAISALASLQGHLLIASGPKIILHKWTGTE-LNGI 1188
+P + + Y ++ KG ++AL + G+L+ + G KI + + E L G+
Sbjct: 1202 EVVPDPSSKLDRWYKLKMRVRDDAKGPVTALCGINGYLVSSMGQKIFIRAFDLDERLVGV 1261
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF DA +YV SL +KN +L+GD KS++ ++++E +L +L+KD + +F
Sbjct: 1262 AFLDAG-VYVTSLKALKNLLLIGDAVKSVWLVAFQEDPYKLVILSKDIRRQYAASVDFFF 1320
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
LS+V DE+ ++ + Y P ES GQ+LL EFH + L + +
Sbjct: 1321 ANGELSIVTEDEEGVLRAYEYDPNDPESRSGQQLLCHTEFHGHKECSTTLTIARRTKTEH 1380
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
A L+ G DGS+ + P+DE F+RLQ LQ +L +V H+AGLNPR+
Sbjct: 1381 EIPQA---------KLISGFGDGSLSALTPVDEAAFKRLQLLQGQLTRNVQHIAGLNPRA 1431
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
FR N +P I+D +LLS +E + Q E+ Q GT R+ IL
Sbjct: 1432 FRIVR-NETVSKPLSKGILDGQLLSSFEAQGITRQGEMTRQIGTERTTIL 1480
Score = 121 bits (303), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 196/913 (21%), Positives = 349/913 (38%), Gaps = 130/913 (14%)
Query: 57 NLVVTAANVIEIYVVR-------VQEEGSKESKN---------SGETKRRVLMDGI---- 96
NLV +N+I IY VR Q E KE K+ GE + DG
Sbjct: 40 NLVTARSNIIRIYEVREDAASLSSQVEAEKERKSHVRKGTEAVEGEVEMDTGGDGWVNMG 99
Query: 97 ----------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLE 146
+ V + +HG V + + + + N R D ++++F+DAKI++LE
Sbjct: 100 SVKSTSSGPPTVTRFHFVREHVVHGIVTGMDCI-RTISSNEDRMDRLLVSFKDAKIALLE 158
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+ D+ H L S+H +E E L R L +VDP RC + + + IL
Sbjct: 159 WSDAAHDLITVSIHTYERSE--QLMSIDAPLFRSSL-RVDPLSRCAALSLPNNALAILPF 215
Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVIL 263
Q + E + G S +++L D + +V DF F+ G+ P + +L
Sbjct: 216 YQTQAEFDVIEGEGETEGMRDVPYSPSFILDLPVDVDSSLCNVIDFAFLPGFNNPTLAVL 275
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP------ 317
+ E TWAGR+ T ++ ++ P++ + LP DA+ L P
Sbjct: 276 CQSEQTWAGRLKEHRDTTLVVTFTLDLLSCTFPILSTLRGLPSDAFSLSPATLPPDFTSG 335
Query: 318 -------IGGVLVVGANTIHYHSQSASCA-------------LALNNYAVSLDSSQELPR 357
GV+V+ + + Y Q A C L+++N ++ ++++
Sbjct: 336 LSGGASNAHGVVVLTPDAVLYADQ-ARCVGAAVSGWATRTSDLSISNAYLTGGTAKDAEG 394
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDIT 416
+ L+ A L LL ++G++ ++ +V +GR V R+D+ +V+ + +
Sbjct: 395 DVKPLALEGAFPLLLTPTALLLVLRSGEMHVVRLVTEGRSVGRVDVGPCVGQTVMPATVV 454
Query: 417 TIGNSLFFLGSRLGDS--------LLVQFTCGSGTSMLSSGLKEEFGDIEADA--PSTKR 466
+ LG G+ + V G T +LS+ EE + S
Sbjct: 455 RVKAPQRALGQGQGEGEKAKERRMVFVGSIVGPAT-LLSAERVEETAAANGNGVNGSGAN 513
Query: 467 LRRSSSDALQDM--VNGEELSLYGSASNNTE----SAQKTFSFAVRDSLVNIGPLKDFSY 520
+ DA +M ++ LYG + ++ SA++ FA D++ GP+ D ++
Sbjct: 514 GHVENKDAGMEMDVDLDDDDDLYGPTTLTSQPSSGSAEEALRFAFCDAIPAHGPILDMAF 573
Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTVYHKSS-RG 560
L D ++TG + L + L G +GIW++ K S RG
Sbjct: 574 ALGKWGDRYVPELVASTGAEHLGGFTLFQRDLPIRTKRKLHVLGGARGIWSISVKQSPRG 633
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLF 618
A S+ + + ++IS +A V A T ++ + G T+ AG F
Sbjct: 634 SAASSAGAGPNPELANDTVVISTDANPSPGVSRIATRSTRTDLAIPTRIPGTTVGAGPFF 693
Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
GR ++ V R+L+ D + S ++ + + SI DP VL+ D
Sbjct: 694 GRTAILHVMTNSIRVLE-----PDGTERQSIKDTDGNMPRAKIRWCSICDPVVLIIREDD 748
Query: 679 SIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKG-----PEPWLRKTSTDAWLS 732
++ L +G+P + + + + E S + ++ C G +P S+
Sbjct: 749 TLGLFIGEPERGRIRRKDMSPMGEKSSRYIAGCFFADTSGLFEAFMDPKAAAASSKGDKD 808
Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
G + + + + V+ G LEI+ +P VF+ + D+Y
Sbjct: 809 KGATQTMQSVVNAATNSQ--WLVLVRPQGVLEIWTLPKLTLVFSTTLIATLDNVCADSYD 866
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
AL Q + V + M + + P L L G +
Sbjct: 867 PAALS-------------LPQDPPRKPQELDVENIVMAQLGESNPTPHLMVFLRSGQVAI 913
Query: 853 YQAYLFEGPENTS 865
Y+ P + S
Sbjct: 914 YETVHHPPPPDPS 926
>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1376
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 148/553 (26%), Positives = 253/553 (45%), Gaps = 86/553 (15%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVFRE------RLRVHPQLCDGSIVAFTVLHNVNCN 966
R+ ++ G++ F+ GS PC+ M RL+ P + + + H C
Sbjct: 876 RLRAIPDLCGYKTMFMPGSNPCFIMKSSTSSPHVLRLKGEP------VSSLSSFHMPACE 929
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLK 1026
GF YV ++ ++++C+LP + +DN W +KI + + YFA Y L S
Sbjct: 930 KGFAYVDAKNMVRMCRLPGNTRFDNAWAARKIHIGEQVDCVEYFARSETYVLGTS----- 984
Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQ 1086
++ L D EV + + +S + ++ V++L P W +
Sbjct: 985 -YHEDFKLPEDDEVHTEWRSEVISFMP-----QLDRGRVKLLSPRT----WSIIDCYDLG 1034
Query: 1087 SSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---TGRNADNPQ 1142
++E L ++ + + + T E + ++ +GTA V+GED+ RG + +F + D P+
Sbjct: 1035 ATERILCLKTINMEVSEITHERQDMVVVGTAIVRGEDITPRGSIYVFEIIDVAPDPDRPE 1094
Query: 1143 -NLVTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPL 1196
N ++++KE +KGA++A++ + QG L+ A G K ++ K G+ L +AF D
Sbjct: 1095 TNQKFKLFAKEDVKGAVTAISGIGGQGFLIAAQGQKCLVRGLKEDGSLLP-VAFMDMQ-C 1152
Query: 1197 YVVSLNIVKN--FILLGDIHKSIYFLSWK------------EQGAQLNLLAKDFGSLDCF 1242
YV L ++ ++GD K ++F + E+ +L L KD L
Sbjct: 1153 YVSVLKELQGTGLCIMGDALKGLWFTGYSVQLSSAVDVETCEEPYKLTLFGKDSEYLQVV 1212
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
A +FL P S KG +LL R+ FH G H L L
Sbjct: 1213 AADFL-----------------------PDDPSSSKGDRLLHRSSFHTG-HFISTLTLIP 1248
Query: 1303 LATSSDRTGAAPGSDKTNR----FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1358
TSS TGA+ + + + ++ + GS+G I PL E T+RRL +LQ +LV S+
Sbjct: 1249 QYTSSG-TGASEDNMDVDYMPAGYQVVVTSQSGSVGVITPLTEETYRRLSALQSQLVMSM 1307
Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
H GLNP+++R S+G + R +VD LL + + ++ + EIA + G I
Sbjct: 1308 EHPCGLNPKAYRAVESDGFSGR----GLVDGNLLLRWLDMGVQRKAEIAGRVGADLQSIR 1363
Query: 1419 SNLNDLALGTSFL 1431
++L + G FL
Sbjct: 1364 ADLERINGGLDFL 1376
Score = 114 bits (284), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 186/745 (24%), Positives = 298/745 (40%), Gaps = 123/745 (16%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V +V++++ + G+ S ++ + R ++ L L+ Y L G V L
Sbjct: 28 NLIVAKTSVLQVFSLVNVAYGASTSPSTDDKTR---VERQQYTRLVLLAEYDLPGTVTGL 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ +++++A +AK+S++E+D HG+ S+H +E E LH
Sbjct: 85 GRVKT--LDSKSGGEALLVATRNAKLSLVEWDHERHGISTVSIHYYER-EDLHNSPWTPD 141
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD---EDTFG---------- 221
P L+ VDP RC +L +G+ + IL Q G LV D ED G
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDDFDEDLRGEKPEDMDNAL 200
Query: 222 ---SGGGFSAR----IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ AR SS V+ L LD + H F++ Y EP IL+ T
Sbjct: 201 VESTAANDVARHKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHVATSFA 260
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHY 331
+ + + ++ + + + LP D + ++ +P PIGG L++G+N IH
Sbjct: 261 LLGERKDVVSYAVFTLDIQQRTSTTLVTVSRLPSDLWNVVPLPPPIGGSLLIGSNELIHV 320
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL---QNDVALLSTKTGDLVL 388
+ A+ +N +A +S + L+ L D+AL+ +G + +
Sbjct: 321 DQAGKTNAVGVNEFARQASEFSMADQSDLELRLEGCVIEQLGTESGDIALV-LASGRMAI 379
Query: 389 LTVVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS 441
+ DGR V +L ++ S+L + + ++G FLGS DS+LV +T S
Sbjct: 380 VRFKVDGRSVSGIFVQLVSTQAGGSILKARPSCSASLGRGKIFLGSEETDSVLVGWTRPS 439
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGSASNNT 494
S KRL+R SS D D + E LY + +N T
Sbjct: 440 Q--------------------SIKRLKRDSSGPRAGETDTDDDEDDIYEDDLYSTPTNQT 479
Query: 495 ESAQKT----------FSFAVRDSLVNIGPLKDFSYGLRINA-DASATGISKQS-NYELV 542
+ F F D L ++GP+KD + G D ++ SK S + ELV
Sbjct: 480 TVPKTVSQTNGLIKDEFVFRCHDRLWSLGPMKDITLGRTPGTRDQASKKTSKPSTDLELV 539
Query: 543 EL--PGCKGIWTVYHKSSRGHNADSSRMAAYDD-------------------EYHAYLII 581
G G T+ K + DS +M D Y YL+
Sbjct: 540 VTHGQGDAGGLTILRKELDPYIIDSMKMDNVDGVWSVQIAPSNTSNPSTTSRNYDKYLVF 599
Query: 582 SLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARIL 634
S ++R E + + T +D + T+ G L G RV+QV R
Sbjct: 600 S-KSRGHAKEQSVVYTVGGNGIDEMKAPEFNPNEDHTVDIGTLAGGTRVVQVLTSEVRSY 658
Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
D DL+ P E S+ +V S A+PY+L+ D S+ LL D S
Sbjct: 659 D-----TDLALAQIYPVWDE--DTSDELSVTGASFAEPYLLITRDDQSLLLLQPDSSGDL 711
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
V + +S K + C LY DK
Sbjct: 712 DEVNIDGLL-TSNKWLCGC-LYFDK 734
>gi|147864212|emb|CAN80950.1| hypothetical protein VITISV_016701 [Vitis vinifera]
Length = 262
Score = 167 bits (422), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 89/148 (60%), Positives = 105/148 (70%), Gaps = 26/148 (17%)
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLV 510
+++ GDIE D PS KR RRSSSDALQDM N ++L LYG A N+TE++QKTFSF+V DSL+
Sbjct: 53 RKKVGDIEGDVPSAKRSRRSSSDALQDMFNSDKLPLYGLAPNSTETSQKTFSFSVSDSLI 112
Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYEL--------------------------VEL 544
N+GPLKDF+YGLRINAD ATGI KQSNYEL VEL
Sbjct: 113 NVGPLKDFAYGLRINADLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRPERITEVEL 172
Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYD 572
PGCKGIWTVYHK++RGHNADS +M + D
Sbjct: 173 PGCKGIWTVYHKNTRGHNADSIKMVSAD 200
>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
Length = 398
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/329 (32%), Positives = 172/329 (52%), Gaps = 11/329 (3%)
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
+ LA+GTA GE+V RGR+++ P ++ + +Y KE KG +++L S
Sbjct: 72 QNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCS 131
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
G+LL G K+ + + L GI+F D YV L V+N L D+++S+ L +
Sbjct: 132 CNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYVHQLIGVRNLALACDMYRSVALLRY 190
Query: 1223 KEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
+E+ L+L ++D S A +F+ID + V+SDE NI IF Y P+ ES G+
Sbjct: 191 QEEYKALSLASRDMRSDVQPPMAAQFIIDNKQMGFVMSDEAANIAIFNYLPETLESLGGE 250
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
KL RAE ++G V F+R++ +S + R ++LF +LDGS G + PL
Sbjct: 251 KLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLE---RQSVLFASLDGSFGFLRPLT 307
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
E FRRL LQ+ + VP AGLN + R H ++VD +++ Y L L
Sbjct: 308 EKVFRRLHMLQQLMSSMVPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSL 367
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
E+ ++A + GT+R I+ +L ++ T+
Sbjct: 368 PEKNDLARKLGTSRYHIIDDLIEICRVTA 396
>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM 1558]
Length = 1408
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 138/520 (26%), Positives = 237/520 (45%), Gaps = 41/520 (7%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRER-LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
+ N+ G G F++G +P W M + LR++ L G++ H + F+ +
Sbjct: 916 YDNLEGQSGAFITGEKPYWIMSSEKHPLRLY-GLKQGAMAFGPTTHLGSMGEYFMKIDDG 974
Query: 976 GILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLY--PLIVSVPVLKPLNQVLS 1033
IC P D P + ++ T + + Y +SVP
Sbjct: 975 CF--ICYFPQSLNTDLTMPCDRYEMQRTYTNVVFDPPSGHYLGATAISVPFQA------- 1025
Query: 1034 LLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1091
D+E Q+ + NL L+ ++E + R PW+ +EN
Sbjct: 1026 --YDEEGEIQLGPEGENLVP-PLNERSSLELFS-------RGSDPWRVIDGYDFDQNENV 1075
Query: 1092 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN---LVTE 1147
L+++ V L +++ +A+GT + GED A RG V +F P +
Sbjct: 1076 LSMQSVLLESSSVPGGYRDFVAVGTGFDFGEDRATRGNVYIFEVVEVVPEPGQKSAWALK 1135
Query: 1148 VYSKE-LKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVK 1205
+ K+ + +SAL ++ G+LL ++GPK+ + E L G+AF D +Y+ S+ + K
Sbjct: 1136 LRCKDPCRNPVSALGNINGYLLHSNGPKMYVKGLDFDERLMGLAFVDVM-IYLTSIKVFK 1194
Query: 1206 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
NFIL+ D+ KSI+FLS++E + +++KD + + +FL+ ++ + D +I+
Sbjct: 1195 NFILISDMVKSIWFLSFQEDPYKFTVISKDLMPISVTSADFLVHDGHVTFLTYDRSGDIR 1254
Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
+ + P ES G++L+ R E+H G+ VT M+A R G + + ++
Sbjct: 1255 MVDFDPANPESINGERLIVRTEYHGGSPVTV---STMIAR---RRGVE--EEFAPQTQII 1306
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
DGSI FRRL + +L+ + HVAGLNPR+FR N +P
Sbjct: 1307 CAHADGSISTFVSTKPARFRRLHFVSDQLIRNAQHVAGLNPRAFRTVR-NDLVAKPLSRG 1365
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
I+D ELL + + P++ Q E+ Q GT + S+L L
Sbjct: 1366 ILDGELLGRFAIQPIDRQREMLKQIGTDGGTVASDLQALG 1405
Score = 123 bits (309), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 151/658 (22%), Positives = 275/658 (41%), Gaps = 98/658 (14%)
Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L L+C + LHG + LA L + D D ++++F+DAK+++LE+ S + S
Sbjct: 117 LHLLCQHTLHGWITGLAPLRTIESSVDG---LDRLLVSFKDAKMALLEW--SRGDIATVS 171
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
+H +E + + G F PL++ DP R + + + IL Q S L E+
Sbjct: 172 LHTYERCQ--QMVTGDLQFYT-PLLRSDPLSRLAVLTLPEDSLAILPVLQEQSDLDPLEN 228
Query: 219 TFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
F +S S V++L D+ +K+++D +F+ G+ P + +L+ TWAGR
Sbjct: 229 -FTKDAPYSP----SFVLSLADVAPTIKNLQDLLFLPGFHSPTLAVLYSPYHTWAGRYHS 283
Query: 277 KHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGV-LVVGANTIHYHSQ 334
+ T + + T +PL+ S LP D+ ++A P+ +GGV LV +H
Sbjct: 284 QRDTFCLEVRTFDITAGGSYPLLTSVSGLPSDSLYIVACPAELGGVVLVTTTGLLHIDQS 343
Query: 335 SASCALALNNY-----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
+ A ++N + + D S E S + L+ + + ++ LL + GD+ +
Sbjct: 344 GRTVATSVNAWWSHITTLPCDKSSE----SRKISLEGSKSVFVTERDMLLVLQNGDVHQV 399
Query: 390 TVVYDGRVVQRLDLSKTNPSV-LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
+GR + + + + + +V S + T GN F+G GDSLL +
Sbjct: 400 RFEMNGRAIGAIKVDEQSSNVPAPSSMVTTGNQAIFVGCAEGDSLLANVDIKRAVA---- 455
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE----SAQKTFSFA 504
+++ IEA+A D +D+ ++ L A+N + + +
Sbjct: 456 -IEDRKPAIEAEA---------EVDWDEDLYGDIDVPLTNGATNGAKYQAITGPANIVLS 505
Query: 505 VRDSLVNIGPLKDFSYGLRINADASAT--------GISKQSNY-------------ELVE 543
D L +G + D +G+ + + T G SK+S + E
Sbjct: 506 PADVLTGVGKIVDMEFGIASTDEGTRTYPQLVTIGGGSKRSTFNAFRRGIPISKRRRFNE 565
Query: 544 LPGCKGIWTVYHKSSRGHN-----ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
L + +W + + G + D + E I SL A+ +
Sbjct: 566 LFNTESVWFLPIQRPSGQHLKSIPEDRRTTMLFSSEATQTRIFSLSAKPNPEQIGR---- 621
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
+ G+++ G F R V+ V + +LD TQ G+E
Sbjct: 622 --------ISGKSLTVGPFFQRSNVLVVTQTEVLLLDSDGKTQ----------QSIGNEG 663
Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHD 715
++S SI+DPYV++ +GS + VGD +S V+ P+ +S + P + ++ D
Sbjct: 664 EEIVSASISDPYVVIRRVNGSGSMFVGDTVARQLSEVKIPS--DSLQPPYQAIEVFSD 719
>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like protein
[Galdieria sulphuraria]
Length = 1454
Score = 164 bits (414), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 156/578 (26%), Positives = 248/578 (42%), Gaps = 84/578 (14%)
Query: 906 PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD-----GSIVAFTVL 960
PH P F N+S H G FL+GS P ++ + + H + D G I++ T +
Sbjct: 877 PHLRP------FYNLSSHFGVFLTGSVPSIIVLSKGYPQKHEIMIDSGVEYGDILSITNM 930
Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGS--TYDNYWPVQKIPLKATPHQITYFAEKNLYPL 1018
+ N + S G + ++ + + WPV+ + + Y A + +
Sbjct: 931 GDPENNRKLWILDSNGRIHFGEIRETQLESINWAWPVEVFRMNGCVKNVVYHATTGTFGV 990
Query: 1019 IVSVPV----LKPLNQVLSLLIDQE---VGHQIDNHNLSSVDLHRTY-------TVEEYE 1064
+VS V L+ Q+ E +G Q ++ + VE YE
Sbjct: 991 VVSSIVSMSRLERKRQIFERQKRDERAILGSQAPPEEENNTEFEENEPKNALPIEVEAYE 1050
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENAL--TVRVVTLFNTTTKEN--------------- 1107
++I D W+ + E L T V + T +EN
Sbjct: 1051 LQIYRAD----TWELVDKFAFKEEEAVLSATFMQVDAYKITEEENNDDKSSRATQQQAEA 1106
Query: 1108 -------------ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN-------LVTE 1147
+ + IGT +++GED RGR++LF R + +
Sbjct: 1107 AISQSSRSIKFKPKECIVIGTGFIKGEDAGTRGRLMLFEVARQEAYTEESGAFSAIQLML 1166
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVKN 1206
+ KELK +S++A L+G++ A GPK+ ++K +EL +FY L+ S+N VK
Sbjct: 1167 IAEKELKSVVSSIARLEGYICCAVGPKVEIYKLVNESELVCCSFYSGFQLFSTSINTVKQ 1226
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
++ +GD++K YFL W+++ LN L KDF + +TEFLI + VVSD N+ +
Sbjct: 1227 YVFVGDMYKGGYFLFWRDRNKSLNFLGKDFDPVQTLSTEFLILNEFILFVVSDNFGNLHL 1286
Query: 1267 FYYA-PKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM---LATSSDRTGAAPGSDKTNRF 1322
YA P ES G+KLL R H+G + +RL+ S DR G+
Sbjct: 1287 LEYAGPHEIESRGGEKLLRRGVLHLGTRSSSMIRLRTDWKENNSEDRAGSH--------- 1337
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKK--LVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
++ GT DG + C+ PL + + + L KK L +VAGLNP+ FR K R
Sbjct: 1338 IVVLGTWDGGLACLLPLQQEEYEQKNELLKKVYLHSYSLYVAGLNPQEFRIPRGLSKKTR 1397
Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
+ ++D LLS + L +EIA G SQ+L
Sbjct: 1398 TFGERLLDSTLLSSLQGLEYTSIVEIAKSCGLDASQLL 1435
Score = 97.1 bits (240), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 120/528 (22%), Positives = 198/528 (37%), Gaps = 89/528 (16%)
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
KVDP+ VL+ ++++ ++ D+ + + + +++LR L
Sbjct: 166 KVDPEHGLIAVLIRKKNLLLI----AKYPILSHRDSLSAECSSNKLLSDPVILDLRRLGH 221
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
F F+ GY P + +L E+ TW+G S + ++S + + K+ IW
Sbjct: 222 FETIHFCFMFGYSLPTLALLEEKTPTWSGSFSVTRDSRLVSVVQFDLSDKKMKRIWQVEE 281
Query: 304 LPHDAYKLLAVPS-PIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFS 361
LPH+ + + +VP GG LV G N I Y + L+ N+ S L
Sbjct: 282 LPHECFMVSSVPFLQGGGFLVFGWNIILYFRDGSFVDGLSCNDLGDVYLSKWSLRSQDAP 341
Query: 362 VELDA--------AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
+ LD +H T+++N V +L + G L + G + L + S
Sbjct: 342 ISLDGCEVVSEFDSHDTFMKNPVIIL--RDGAFFELCIPKKGG-DSVISLRYCKILIQPS 398
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
++ GN L FLGS + S L++ + T + D
Sbjct: 399 TVSYCGNGLIFLGSHVSPSALLEIIWKNSTEL-----------------------HPEDD 435
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
L+ S +G +SN + S RDSL IGP++D I + +
Sbjct: 436 ELE--------SFFGKSSNKNFVVETIDS---RDSLFCIGPIQDLEVFDNIIGSSRKMEL 484
Query: 534 SK---QSNYELV---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
NY V L C+ IW V + G S +
Sbjct: 485 IAAVGSRNYGAVIIFRRTVSPSLLTSIRLEDCQQIWNVLCQRKMGERNGSVPL------- 537
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
LI+S + T+VL +D + E+ +S + RT+ + R +IQVF+ G RIL
Sbjct: 538 ---LILSTQRNTIVLSVSDTIDELVDS-QFQTSSRTLWVSRVLHDRYIIQVFDEGLRILG 593
Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
L P V + DPYV+L +S + +L
Sbjct: 594 NWDSLISLYELPP---------GDVVTQAFVCDPYVMLHLSSSYLVIL 632
>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
Length = 374
Score = 164 bits (414), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 172/329 (52%), Gaps = 11/329 (3%)
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISALAS 1162
+ LA+GTA GE+V RGR+++ P ++ + +Y KE KG +++L S
Sbjct: 48 QNYLAVGTACNYGEEVLVRGRIIISEIIEVVPEPGQPTSKHRIKTLYDKEQKGPVTSLCS 107
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
G+LL G K+ + + L GI+F D Y+ L V+N L D+++S+ L +
Sbjct: 108 CNGYLLTGMGQKVFIWLFKDNNLQGISFLDMH-FYIHQLIGVRNLALACDMYRSLALLRY 166
Query: 1223 KEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
+E+ L+L ++D S A +FLID + ++SDE NI IF Y P+ ES G+
Sbjct: 167 QEEYKALSLASRDMRSDVQPPMAAQFLIDNKQMGFIMSDEAANIAIFNYLPETLESLGGE 226
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
KL RAE ++G V F+R++ +S + R ++LF +LDGS G + PL
Sbjct: 227 KLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSLE---RQSVLFASLDGSFGYLRPLT 283
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
E FRRL LQ+ + V AGLN + R H ++VD +++ Y L L
Sbjct: 284 EKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVVMQYLHLSL 343
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
E+ ++A + GT+R I+ +LN++ T+
Sbjct: 344 PEKNDLARKLGTSRYHIIDDLNEICRVTA 372
>gi|380494933|emb|CCF32776.1| cft-1, partial [Colletotrichum higginsianum]
Length = 542
Score = 162 bits (410), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 145/545 (26%), Positives = 242/545 (44%), Gaps = 48/545 (8%)
Query: 889 NLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
N +++P++ E E P P + NI+G+ FL G+ P + +
Sbjct: 1 NPTIAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIVKSAKSSPKV 57
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PVQKIPLKATPH 1005
L + + H C GFIY S+G ++ QLP+ S + V+KIP+
Sbjct: 58 VGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSVRKIPIGDAVG 117
Query: 1006 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1065
I Y Y + S+ ++ L D + + +S L E V
Sbjct: 118 LIAYHPPMETYAVACSI------SEHFELPKDDDYHKEWAKETTTSYPL-----TERGIV 166
Query: 1066 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1124
+++ P W T+ ++ E A+ ++ + L + TKE L+ IGTA +GED+
Sbjct: 167 KLMSPTT----WSVIDTVELEPHEVAMCMKTLHLEVSEETKERRMLITIGTAINRGEDLP 222
Query: 1125 ARGRVLLFSTGRNADNP----QNLVTEVYSKE--LKGAISALASL--QGHLLIASGPKII 1176
RGR+L++ P N ++ +KE +GA++ L + QG +L+A G K +
Sbjct: 223 IRGRILVYDVVPVVPQPGRPETNKKLKLVAKEEIPRGAVTGLCEVGSQGLMLVAQGQKCM 282
Query: 1177 LH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLL 1232
+ K GT L +AF D YV ++ V+ + L+ D K ++F+ + E+ ++ L
Sbjct: 283 VRGLKEDGTLLP-VAFMDMN-CYVTAVREVRGTGYCLMTDAFKGVWFVGYAEEPYKMMLF 340
Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
K G+ + +F++ G L +VV D+ I + + P+ +S +G LL+RA F
Sbjct: 341 GKSMGNFEVLTADFVVAGDELHIVVCDKDGVIHVMQFDPEHPKSLQGHLLLNRASFSAAP 400
Query: 1293 -HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1351
H T L L S T + K LL + G++ + PL E +RRL SL
Sbjct: 401 NHPTITLSLPRTPISPSATSVS----KNPPTTLLLASPTGALASLTPLSEQAYRRLTSLA 456
Query: 1352 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLSHYEMLPLEEQLEI 1406
+ ++PH A NP+ R + A PG D SIVD LL+ + L + E+
Sbjct: 457 NSIAGALPHAAATNPKGHRLQPLD--ARTPGVDTSAGRSIVDGALLARWNELGAGRRSEV 514
Query: 1407 AHQTG 1411
A + G
Sbjct: 515 AGKGG 519
>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1678
Score = 162 bits (409), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 203/839 (24%), Positives = 328/839 (39%), Gaps = 202/839 (24%)
Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
+G G P +Y VC +SG LEI+ VP ++D R K S
Sbjct: 895 EGDGGNPA----LYIAVCRQSGQLEIYLVP-----------------LIDNIPRCCWKSS 933
Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFE 859
+ SS G + + + KV H+R F FE
Sbjct: 934 GCGLGVSSLTGQNESKVPLPKTYKV-----------HAREIRF---------------FE 967
Query: 860 GPENTSKSDDPVSTSRSLSVS------NVSASRL----------RNLRFSRTPLDAYTRE 903
SK+ D +RSL ++ ++S RL R +F + ++E
Sbjct: 968 CGPIPSKTLDTAKKNRSLCLAVDCSSGDLSVYRLAISQDHGFPPRFEKFRMKSVFRRSQE 1027
Query: 904 ETPH------------------GAPCQRITIFKNISGHQGFFLSGSRPCW-C-------M 937
+ H G R+ F ISG G F + RP W C M
Sbjct: 1028 QARHRTKLIRKRMVVDVNDGTGGFVYNRLYRFSGISGQAGMFAAVPRPFWLCAERGKPSM 1087
Query: 938 VFRERLRVHP---QLCDGSIVAFTVLHNVNCNHGFI-------YVTSQGILKICQLPSGS 987
+F P +L S V+++ + N GFI + SQ + L
Sbjct: 1088 LFHRTRHASPAGGKLRPVSGFCSAVINDKSGNGGFITLHERVGRIGSQRLTLFHGLAPAF 1147
Query: 988 TYDNYWP-----VQKIPLKATPHQITYF-------AEKNLYPLIVSVPVLKPLNQVLSLL 1035
P V+KI T I + +E LY L+VS K L S L
Sbjct: 1148 GAHGLLPGGGMCVEKILFGMTVRHIQFINDPFVSTSEHPLYALLVS----KKLEVDQSDL 1203
Query: 1036 IDQ-----------------EVGHQIDNHNLSSVDLHRTYT--VEEYEVRILEPDRAGGP 1076
D ++ Q++ +L DL + +E + +E G P
Sbjct: 1204 NDDGLTAQERKETEEEKENAKIKRQVE-ADLGGFDLENEWVEEIERDDCFAVEMQLGGAP 1262
Query: 1077 -----------------WQTRATIPMQSSENALTVRVVTLF---------NTTTKENETL 1110
W + + E+ +T+ ++ L N T + L
Sbjct: 1263 PIPKEAFAVWIVDAANNWMVVDSFKLDEYEHGMTLSIMELTEFPEEPGSSNDTDVSGDEL 1322
Query: 1111 -----LAIGTAYV--QGEDVAARGRVLLFSTGRNADNPQNLVTEV------YSKEL-KGA 1156
+A+GT + GEDVA+RGR +L R + + +V Y KE+ GA
Sbjct: 1323 SKRMFVAVGTGVLDHNGEDVASRGRAILLELKRTNSSAKAAGRQVVELSFCYEKEIFHGA 1382
Query: 1157 ISALASL----QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
+++L L + LLI +G I + +W +L + F+ A + V+ K+F+LL D
Sbjct: 1383 VTSLVCLSSEGKNRLLIGAGADINVEQWGNAKLTQVGFFRAT-MQVLHTIPFKSFLLLSD 1441
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ S+YFL W+E L LLAKD+ + +A + G ++ + D+++N+Q F YAP
Sbjct: 1442 AYDSLYFLIWRESDKSLTLLAKDYDPIPVYAAGVMSRGPAMTFLCHDDRQNLQFFQYAPG 1501
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTK----FLRLQMLATSSDRTGAAPG----------SDK 1318
+ + G +L+ RA++H+G T F R ++ S+ T S++
Sbjct: 1502 EAAARGGNRLVCRADYHLGTQTTSFASHFCRSSLMIHSATPTSTLAALKQQDSYFGRSEE 1561
Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
R FGT DG +G + PL E + RL +LQ + +++ L PR++R + + +
Sbjct: 1562 DQRLGAYFGTADGGMGAVVPLSEPVYWRLTALQSIVANALESDCALAPRAWRLYRRSTR- 1620
Query: 1379 HRPGPDS------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
R G S ++D +L+ Y L + +Q +IA G+T IL NL +L G+ L
Sbjct: 1621 -RGGCRSNDRKKGVIDGDLVLQYADLSISKQEDIASAIGSTVDLILDNLLELQCGSLVL 1678
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 135/333 (40%), Gaps = 79/333 (23%)
Query: 251 FVHGYIEPVMVILHE--RELTWAGRVSWKHHTC-----MISALSISTTLKQHPLIWSAMN 303
F+ GY+EPV+V+LH W+GR+ + ++ALSIS + ++WS +
Sbjct: 243 FLSGYLEPVLVLLHSDVEGPVWSGRLGRERGVAGAPPLFVTALSISVVHGRTAVLWSQV- 301
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSS------QELP 356
+ DA K+L+ G LVVGANT + +A+N +A S + Q P
Sbjct: 302 VSADATKILSFGKT--GCLVVGANTLVILEIGKVQQVIAMNGWARSTCPAALQTALQANP 359
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--------------------- 395
+++LD TWL A+++ +TG L +L D
Sbjct: 360 VVKLAIQLDGCCVTWLSEHSAIMALRTGQLYVLQRTDDRWAVMPLGQTLGAVGEVAHLAS 419
Query: 396 ------RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
R ++++ + + S + + F GSR GDSL + + +M +
Sbjct: 420 LPIGGLRWLEKMKMDENKASEMQMGV-------LFAGSRTGDSLFLGYAL-EIVTMPWAA 471
Query: 450 LKEE---FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS----------ASNNTES 496
+K E F + E S ++ L ++ EE +LYG+ S E+
Sbjct: 472 IKSEGQTFINFEGSELSKVATTAPIANGLDRILQLEEEALYGTDRSTPLHIVRDSEEEET 531
Query: 497 AQ--------KTFSFAV------RDSLVNIGPL 515
A + +F V D LVN+GPL
Sbjct: 532 ADIPSDAKRLRPVAFTVVRTIVPLDVLVNLGPL 564
>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1820
Score = 161 bits (407), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 162/672 (24%), Positives = 278/672 (41%), Gaps = 141/672 (20%)
Query: 888 RNLRFSRTPLDAYTR---EETPHGAPCQRITI----------------FKNISGHQGFFL 928
R+L FSR PL R E H +R I F +ISG G F
Sbjct: 1162 RSLEFSRVPLSCVARSSEEAARHFIKLRRKGIVTSTTHSDFRPNRLHRFCDISGEDGLFA 1221
Query: 929 SGSRPCWCMVFRERLRV------HPQLCDGSIVAF----TVLHNV--NCNHGFIYVTSQ- 975
+ +RP W + R V H G V T + ++ N N GF+ + +
Sbjct: 1222 ATARPLWFVSERGAPTVVSHKSRHVSPAGGRPVPVSGFCTTMPHIFQNANKGFVTLHERI 1281
Query: 976 GILKICQLPSGSTYDNYWPV--------------QKIPLKATPHQITYFAEKNLYPLIVS 1021
G + +L + ++ W V QK+P T I + + ++
Sbjct: 1282 GRIGSQRL---TLFNGLWDVFSTHGLLPGGGICVQKVPFGVTVRAIEFIDDASIS----- 1333
Query: 1022 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH-RTYTVEEYEV----RILEPDRAG-- 1074
P V +LLI +E+ + N +D R + EE E + +E D G
Sbjct: 1334 ----SPSRPVYALLISREIEADQSDLNDDGMDAEERKHIKEEKEAASIRKQVEADLGGFD 1389
Query: 1075 ----------------------------------------GPWQTRATIPMQSSENALTV 1094
W + E+A +
Sbjct: 1390 VEQEWVEEIEREECFDVDTSIGLAPSIPTRQFEVWLVDAASQWTVLDKYQLCDFEHATAL 1449
Query: 1095 RVVTLFNTTTKENET-----LLAIGTAYVQ--GEDVAARGRVLLFS--TGRNADNPQNLV 1145
+V+ L + +E +A+GT ++ GED+A++GR+LLF+ ++ + +++
Sbjct: 1450 KVLFLTDVVEDSDEPPKKSLFVAVGTGRIERDGEDIASKGRILLFNLKKKKHQKDKRSMT 1509
Query: 1146 TEVYSKELK----GAISALASLQGH----LLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
E++ K K G +++L+SL+ + + +G ++ + +W +L + FY A +
Sbjct: 1510 LELHLKHEKDITIGPVTSLSSLRSEDIFRVAVGAGAEVTVEQWGSGKLVQVGFYHAH-MQ 1568
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
V ++++ K F LL D + +++FL W+E L LLAKD+ FA + G +S V
Sbjct: 1569 VQNISLFKTFFLLSDAYDALHFLVWRESDKSLTLLAKDYEPTQVFAAGMISRGGAMSFVC 1628
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF--------------LRLQML 1303
D+++NIQ YAP + G KL+ RA+FH+G+ T L
Sbjct: 1629 HDDRQNIQFLQYAPTDVAARGGNKLVCRADFHLGSQTTSLNSHWAQSSLLFNSCTVSSTL 1688
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
A+ + D RFA+ FGT DGS I PL E T+ RL +LQ + +++ A
Sbjct: 1689 ASLKQQDSLFGRLDDDQRFAVNFGTTDGSFVSIIPLSEPTYWRLTALQSVMSNALESNAA 1748
Query: 1364 LNPRSFRQFHSN----GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
L+ R++R + + G ++D +L+ + LPL EQ ++ G+T ++
Sbjct: 1749 LSHRAWRLYRRSTRRGGCRTNDRKKGVIDADLVMKFVDLPLPEQEDLTSSIGSTVGLVMD 1808
Query: 1420 NLNDLALGTSFL 1431
NL +L+ S +
Sbjct: 1809 NLLELSCAGSVV 1820
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 91/410 (22%), Positives = 145/410 (35%), Gaps = 141/410 (34%)
Query: 246 VKDFIFVHGYIEPVMVILHERE-----LTWAGRVSWKHHTCM------------------ 282
+ D F+ GYIEP +++LH WAGR+ +
Sbjct: 398 IVDIAFLSGYIEPTLLVLHSNPKRGGGRAWAGRLGRTEEVPLSNNGGSGESKDDYGEDID 457
Query: 283 -----------------------ISALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPSPI 318
++A+S++ ++ ++WS ++ LP DA+KL VP P
Sbjct: 458 LEGGDAAKKGPDLVSTGTKYGLSLTAISLAIHQRRSVVLWSLLDALPADAWKL--VPHPS 515
Query: 319 GGVLVVGANTIHYHSQSA--SCALALNNY------------------AVSLDSSQELPRS 358
GV+V G NT Y S SCALA N + AV L+ + P
Sbjct: 516 DGVIVWGVNTAVYVSMGGKISCALAANGFAKIGCPIGLIPPSGRIGSAVYLEPNPS-PLP 574
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV--------------------------- 391
+++LD A ++ DVA++ G L L +
Sbjct: 575 MLALQLDGARVGFVTEDVAIVCLGNGSLYSLELHRAKSMVSPSMFLSMSPLGHRVGGLGV 634
Query: 392 -------------------VYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRL 429
+ D V+ D +K + SV I + G L F GSR+
Sbjct: 635 ASCLSVLAMACHSNSVGHFLVDNEGVKDEDHAKETISKESVSGPKIRSRG--LIFAGSRM 692
Query: 430 GDSLLVQFT---------------CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SD 473
GD L+ F+ G+G L E+ + P+ K+L++ S
Sbjct: 693 GDCSLLAFSLNVPIHLVITDVDSETGAGKRKLGGSRPEQLSSMP--EPAQKQLKKEEISP 750
Query: 474 ALQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ D +GEE + S + + + + DSL +GPL YG
Sbjct: 751 SRTDSEDGEEDIVCAMSSPRRSVRTLSMFRTVSALDSLTGLGPLGQGCYG 800
>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification [Scheffersomyces
stipitis CBS 6054]
gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
polyadenylation factor II complex, subunit CFT1 (CPSF
subunit) RNA processing and modification [Scheffersomyces
stipitis CBS 6054]
Length = 1341
Score = 159 bits (403), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 136/587 (23%), Positives = 257/587 (43%), Gaps = 68/587 (11%)
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
H +L + G +L Y+ Y F+G N + ++L +
Sbjct: 786 HKEEYLTILTIGGEVLLYKLY-FDG-------------------ENYEFKKEKDLAITGA 825
Query: 896 PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
P +AY P G +R + F N++G+ F++G P + + Q
Sbjct: 826 PENAY-----PIGTAVERRLAYFPNLNGYTCIFVTGVTPYLILKSLHSIPRIYQFSKIPA 880
Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN 1014
V+ + H+ +G I++ +Q +ICQLP Y+N WP++ I + + ITY +
Sbjct: 881 VSISPFHDSKVANGLIFLDNQQNARICQLPLDFNYENTWPMKLIHIGESIRAITYHESSH 940
Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY---TVEEYEVRILEPD 1071
Y V+ + D+E G I V LH+ + + ++++ P
Sbjct: 941 TY-------VVSTFKDIDYECFDEE-GKPI-------VGLHKDKPPSSAYKGSIKLISPF 985
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIGTAYVQGEDVAARG 1127
W TI + +E +TV+ + L ++TK+ + + IG+ + ED++A G
Sbjct: 986 N----WSVIDTIELADNELGMTVKSMILDVGSSTKKFKHKKEFIVIGSGKYRMEDLSANG 1041
Query: 1128 RVLLFSTGR---NADNPQ--NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
++ D P+ + EV+ ++ KGA++++ + G L++ G K+I+
Sbjct: 1042 SFRIYEIIDIIPEPDRPETNHKFKEVFKEDTKGAVTSVCEVSGRFLVSQGQKVIVRDLQD 1101
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
+ +AF D +YV N ++LGD KS++ + + + ++ +L KD LD
Sbjct: 1102 DGVVPVAFLDTA-VYVSEAKSFGNMMILGDSLKSVWLVGFDAEPFRMIMLGKDLQGLDVN 1160
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
+F+ + ++++D + + Y P+ + GQ+LLS++ F + + VT L
Sbjct: 1161 CADFITKDEEVFILIADNNNVLHLVQYDPEDPTALNGQRLLSKSSFSINSFVTCLKSLPK 1220
Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
D T F + T+DGS + P++E ++RR+ LQ++L D H
Sbjct: 1221 TEEKYD----------TGNFQTIGSTIDGSFFSVVPINEASYRRMYILQQQLTDKEYHYC 1270
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
GLNPR R + A+ I+D +++ Y L E + +A +
Sbjct: 1271 GLNPRLNRFGGLSMTANDTNTKPILDYDVIRAYGKLNEERKKNLASK 1317
Score = 97.1 bits (240), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 98/415 (23%), Positives = 183/415 (44%), Gaps = 55/415 (13%)
Query: 55 VPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
V +LVV A +++I+ V VQ + S SK L+L+ ++LHG +
Sbjct: 26 VKHLVVGKATLLQIFEV-VQLKSSTPSK--------------PQHRLKLIDQFKLHGLIT 70
Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
+ + + N D ++++ + AK SV+++D +H + S+H +E+
Sbjct: 71 DIKPIRTVESPNF---DYLLVSTKSAKFSVIKWDHHLHTISTVSLHYYENAIQ---NSTY 124
Query: 175 ESFARGPLVKVDPQGRCGGVLVYGL----------QMIILKASQGGSGLVGDEDTFGSGG 224
E ++ L+ ++P G C + L ++ A +V E G
Sbjct: 125 EKLSKSELL-LEPYGSCSCLRFKNLLCFLPFETAEELDDDDADSENEDMVKSEKKEHENG 183
Query: 225 GFSARI--------ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
+ + ++S +I+ + LD + + D F+ Y EP IL +R+ WAG +
Sbjct: 184 TVNVPVTDQPGSFFDTSFLIDGQSLDSSIGSIIDMQFLFKYREPTFGILSQRQQAWAGNL 243
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
L++ T K + NLP+D +++ +PSP+ G L++G N IH +
Sbjct: 244 PKIKDNVQFCILTLDLTTKSTVSVLKIDNLPYDVDRIVPLPSPLNGCLLLGCNEIIHVDN 303
Query: 334 QSASCALALNNYAVSLDSSQEL--PRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLT 390
+A+N + + +S + ++ +++L+ L ND ALL TG+ L
Sbjct: 304 GGIVRRIAVNQFTSLITASTKAYQDQTHLNLKLEDCSVVALPNDHRALLVLSTGEFYYLN 363
Query: 391 VVYDGRVVQRLDLSKTNPSVLTSD--------ITTIGNSLFFLGSRLGDSLLVQF 437
DG+ +++ + + +L SD I T+ N+L F + G+S LVQF
Sbjct: 364 FEVDGKSIKKFTIESVD-KLLYSDIKLTFPGQIATLDNNLLFFANHNGNSPLVQF 417
>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 1450
Score = 159 bits (403), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 144/574 (25%), Positives = 250/574 (43%), Gaps = 46/574 (8%)
Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-----GAPCQRITIFK--N 919
S+D ++ ++ S S LRF + P A + + AP +R+ + N
Sbjct: 871 SNDDLTIYEPFKIAESSQSLSGTLRFRKLPNPAVAKSQDTKVSDDAPAPMRRMPLRACGN 930
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
I+G+ FL G P + + + L + A + H C+ GFIY +G+ +
Sbjct: 931 IAGYSCVFLPGHSPSFLIKSSKSTPRVIGLQGPGVRAMSPFHTKGCDRGFIYADYEGVAR 990
Query: 980 ICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
+ Q+P+ ++ V+K+PL I Y +Y +V+ +P L D
Sbjct: 991 VAQIPNDCSFAELGLSVKKVPLNMDADGIAYHTPSGVY--VVTCSFWEPFE----LPSDD 1044
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E + N++ E ++++ P W T +E A+ ++ +
Sbjct: 1045 ESHREWAKENITF-----KPQTEHSVLKVINPVN----WSEIWTEEFDKNEVAMCIKSLN 1095
Query: 1099 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF---STGRNADNPQ---NLVTEVYSK 1151
L + +T E L+ +GTA +GED+ RG V ++ S D P+ L +
Sbjct: 1096 LEVSQSTNERRHLITVGTAICKGEDLPVRGCVYVYDLASVVPQKDRPETDKKLKLMAKDE 1155
Query: 1152 ELKGAISALASL--QGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVSLNIVKN-- 1206
+GA++AL+ + QG +L+A G K ++ L +AF D YV +
Sbjct: 1156 VPRGAVTALSEIGTQGLMLVAQGQKCLVRGLGEDGRLLPVAFMDMN-CYVSCAKELPGTG 1214
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
F + D K ++F + E ++ + K +L+ +FL DG L LV +D + N+ I
Sbjct: 1215 FCAMADAFKGVWFTGYTEGPYKMMIFGKSSTNLEVINVDFLPDGRNLLLVAADAEGNLHI 1274
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--- 1323
F + P+ +S +G LL+R F GAH + L M TSS+ + A D + A
Sbjct: 1275 FQFDPEHPKSLQGHLLLNRTTFSTGAHHPQ-KSLLMPTTSSNPSQPATNGDASAAAAGPQ 1333
Query: 1324 -LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
+L G + + PL + + RL +L L SVPH A LNP+++R + +
Sbjct: 1334 HILMAAPTGVLAAVQPLGQGVYTRLSALASNLAASVPHHAALNPKAYRMPPAPARNQVAA 1393
Query: 1383 PD-----SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
D ++VD LL+ + L + E+A + G
Sbjct: 1394 VDISVGRAVVDGALLARWAELGSGRRAEVAGRAG 1427
Score = 94.0 bits (232), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 157/686 (22%), Positives = 262/686 (38%), Gaps = 126/686 (18%)
Query: 91 VLMDGISAASLELVCHYRLHGNVESLA------ILSQGGADNSRRRDSIILAFEDAKISV 144
V D S + L+ + L G V LA + GG S D +++AF+DAK+S+
Sbjct: 86 VRSDRASHTKIVLIAEFPLSGTVTGLARVKPPNVSKTGGG--SGVGDLLLIAFKDAKLSL 143
Query: 145 LEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGL 199
+ +D L S+H +E E W +F + DP RC +
Sbjct: 144 VAWDSERRSLETFSIHYYEQDELQGNPWECPLSDYANF-----LVADPGSRCAALKFGPR 198
Query: 200 QMIILKASQGGSGL-VGDEDTFGSGG------------GFSARIES-----SHVINLRDL 241
+ IL Q + +GD D G ++ IE S V+ L +L
Sbjct: 199 SLAILPFKQADEDIGMGDWDEALDGPRPAQSQSAAVAINGTSTIEDTPYSPSFVLRLPNL 258
Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
D + H F++ Y EP IL +T + + K H + ++ K I
Sbjct: 259 DPALLHPVHLAFLYEYREPTFGILSS-SITPSNCLDRKDH-LTYTVFTLDLQQKASTTIL 316
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
S LP D +++A+P+P+GG L+VGAN IH + +A+N + S S
Sbjct: 317 SVGGLPKDLTRVIALPAPVGGALLVGANELIHIDQSGKANGVAVNPFTKQCTSFGLADHS 376
Query: 359 SFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT 412
++ L+ L ++ L+ G L +T DGR V L + P ++L
Sbjct: 377 DLNLRLEGCTIEVLSAEHGELLVVLDDGRLATITFHIDGRTVSGLKVRIIPPEAGGNILP 436
Query: 413 SDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
+ ++ IG + F GS GDS+++ G + SS + + ++ L
Sbjct: 437 TSVSCLSRIGRNAMFAGSERGDSIVI------GWNRKSSQVSRKKSRVQ-----DPDLDL 485
Query: 470 SSSDALQDMVNGEELSLYGSASNNT--------ESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ ++ LYG T ++ + F D L++I P++D +YG
Sbjct: 486 DIDFDDLEDDEDDDDDLYGDTEKTTTVAGLASGQAKLEDLVFRCHDRLISIAPIRDMAYG 545
Query: 522 LRINADASATGISK----QSNYELV--------------------------ELPGCKGIW 551
TG QS +LV + P +G+W
Sbjct: 546 KPPPPAEGETGSRNSTPIQSELQLVAVVGRDRASSLAIMNREMTPVSIGRFDFPEARGLW 605
Query: 552 TVY-----------HKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETAD 594
T+ K ++ D YD +++++ E+ + + TA
Sbjct: 606 TLACQKPLPKVLQGEKGTKPVGGDFGVPVQYDK----FMVVAKEDDDNFESSNIYVLTAA 661
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
++ + G TI AG + ++IQV + R DG +TQ + P E
Sbjct: 662 GFEKLVGTEFEPAAGFTIEAGTMGNHTKIIQVLKSEVRCYDGDLGLTQII---PMLDEET 718
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGS 679
+ +T S SIADPY+L+ D S
Sbjct: 719 NHEPRAT--SASIADPYLLIIRDDSS 742
>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1407
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 143/537 (26%), Positives = 243/537 (45%), Gaps = 69/537 (12%)
Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPC 934
S S S++ LR ++ S+ + Y + + + ++ G+ F G+ P
Sbjct: 831 SRSASDLWTKNLRWVKLSQQHVPRYIEDNGSEDPGFESTLVALDDVCGYSTVFQRGTTPA 890
Query: 935 WCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
+ +F+E P+ L + + T H C GF Y+ S L+ICQLP + Y +
Sbjct: 891 F--IFKEASSA-PRVIGLSGKPVKSLTSFHTSKCQRGFAYLDSTDTLRICQLPPQTHYGH 947
Query: 992 Y-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID----- 1045
W +++P+ + H +TY LY IV Q +Q+D
Sbjct: 948 LGWATRRMPMDSEVHALTYHP-SGLY--IVGT--------------GQTEDYQLDPTETY 990
Query: 1046 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1104
+++L DL ++E V++L+ W T + E L+++ + L + T
Sbjct: 991 HYDLPKEDLTFKPSIERGVVKLLDEKS----WTIIDTHILDPQEIVLSIKTLNLEVSEIT 1046
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISA 1159
+ + L+A+GT+ V GED+A +G + +F P T + E+KGA+SA
Sbjct: 1047 HQRKDLIAVGTSVVHGEDLATKGCIRIFEVITVVPQPDRPETNKRLKLIVKDEVKGAVSA 1106
Query: 1160 LASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLGDI 1213
++ L QG L++A G K ++ K GT L +AF D YV L + + +GD
Sbjct: 1107 ISELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CYVSDLKNLPGTGMLAMGDA 1164
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
++ ++F + E+ +++L A+ +L+ A +FL L LVV+D N+QI + P
Sbjct: 1165 YRGVWFTGYTEEPYKMSLFARSKHNLETIAVDFLPFDQQLHLVVADADMNLQILQFDPDN 1224
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFAL------- 1324
+S G +LL +A FH G H+ L L L S AA S+ + FA+
Sbjct: 1225 PKSEAGSRLLHKATFHTG-HLPTSLHLIHSHLKLPSATDFAATNSNPADAFAMDTSPNTT 1283
Query: 1325 -----------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
L T G++ + PL E ++RRL +L L +++ LNPR+FR
Sbjct: 1284 TDTPQQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLTAYLANTLDSACSLNPRAFR 1340
Score = 139 bits (350), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 168/697 (24%), Positives = 292/697 (41%), Gaps = 86/697 (12%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V + S+N+ E L + A L LV
Sbjct: 28 NLVVAKNSLLQIFELKSTTTEVTPGAGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+ L G V SLA + A N++ + +++++AF DAK+S++E+D + L S+H +E+P
Sbjct: 88 FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144
Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ----------- 208
+ W + +F + DP RC + + IL Q
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLVEDDYDSD 199
Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
G + G+ G SS V+ L +LD + H F+H Y EP I+
Sbjct: 200 ADGPKETKADQANGTNGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 259
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T ++ + S ++ K + S LP+D K++ +PSPIGG L+VG
Sbjct: 260 SRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLVG 319
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
N IH + +A+N +A + S +S ++ L+ L + L+
Sbjct: 320 GNEIIHVDQGGKTNGVAVNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVLN 379
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
G L++LT DGR V + + S + +G F+GS G+S+++
Sbjct: 380 NGRLLILTFTLDGRTVSGMTIQTVAADHGGHLLKSAASCTSNLGRGRLFIGSEDGESVML 439
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
+T L++ L+ + + + D D D+ N +++ +A+ +
Sbjct: 440 GWTG------LTNQLRRKLSNADLDG-EDDSEEEEIDDMEDDLYNDTAPTMHKITAAVSE 492
Query: 495 ESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL- 541
+A T++F + D L +I P+KD + G + ++ A G + EL
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRELH 552
Query: 542 ------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EART 587
ELP G+W V+ + + D+ A D +Y YL++S E T
Sbjct: 553 PISVATKELPSAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDGT 612
Query: 588 MVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
+V E D LTE + +G T+ G L +V+QV RI D +
Sbjct: 613 VVYEVNGDKLTETDKGDFEREEGTTLLVGILAAGTKVVQVMRTEVRIYDSELNLVHIQSM 672
Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
E GS E + +++ S ADPY+L+ D S+++
Sbjct: 673 EEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708
>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
Length = 1370
Score = 157 bits (397), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 136/534 (25%), Positives = 242/534 (45%), Gaps = 60/534 (11%)
Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
C+ + ++ G++ F+SG PC+ ++ R H G V + + H C GF
Sbjct: 846 CKLLRALPDVCGYRTVFMSGHSPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 904
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
YV + I L I Y + Y + S
Sbjct: 905 AYVD----------------------EDITLGEQVDSIVYSSASECYVIGTSA------K 936
Query: 1030 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1089
+ L D E + N ++ + +E +++LEP W T + ++ +E
Sbjct: 937 EDFKLPEDDESHTEWRNEFITFLP-----QLERGTIKLLEPRN----WSTIDSHELEPAE 987
Query: 1090 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP----QNL 1144
+ V+ L + T E + ++ +G++ V+GED+ +G + +F P +N
Sbjct: 988 RITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVFEVIDVVPEPDQPEKNK 1047
Query: 1145 VTEVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1199
++++KE +KGA++AL+ + QG L++A G K ++ K G+ L +AF D YV
Sbjct: 1048 KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGLKEDGSLLP-VAFKDTQ-CYVN 1105
Query: 1200 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
L +K ++GD K ++F+ + E+ +L+L K+ +L +FL DG+ L ++V
Sbjct: 1106 VLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENENLAVVDADFLPDGNKLYILV 1165
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRTGAA 1313
+D+ N+ + Y P+ S KG +LL R+ FH G + L ++ D
Sbjct: 1166 ADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTLLPHGGHTPSSPVDEDAMD 1225
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
S +++ +L GSI I PL E ++RRL +LQ +LV+++ H LNPR +R
Sbjct: 1226 TDSPPPSKYQILMTFQTGSIAIITPLGEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVE 1285
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
S+G + G ++D LL + + + + EIA + G I +L L G
Sbjct: 1286 SDGMGGQRG---MIDGNLLLRWLDMGAQRKAEIAGRVGADVGAIRVDLEKLHGG 1336
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 181/736 (24%), Positives = 287/736 (38%), Gaps = 100/736 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+ NS+ D+I+++ +AK+S++E+D HG+ S+H +E E +
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVPD 141
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
+ + VDP G C + +G+ + IL Q G LV D+ D
Sbjct: 142 LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDLVSDA 200
Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
S G +A + S V+ + LD + H F+H Y EP IL+ +
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
+ S ++ K + + LP D +K++ +P P+GG L++G N +H
Sbjct: 261 IDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
+ A+ +N +A + S + L+ L + LL G + +L+
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADHSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380
Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
DGR V + L +K PS S +G + F GS GDS+L+ ++
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLLGWSRP 436
Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
S T+ S K G E A D D + ++L AS E
Sbjct: 437 SSTTKRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASTRQEKQVVN 494
Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
+ F+F D L ++GP +D + G + + S +EL +G
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQQDSVPEIAAPLELVAARGFGKSG 554
Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
TV + + DS +M DD Y + I L+ + T + + D LL +
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVLDPKSKDTGLSRSYDKYLLLAKAKGED 611
Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
ESV Y V + TI G L RV+QV R D Y
Sbjct: 612 KEESVVYSVGSSGLDSIDTPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD--YNLGL 669
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
P E SE TV+ S A+PY+L D S+ +L D + V VQ AA
Sbjct: 670 AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726
Query: 701 ESSKKPVSSCTLYHDK 716
S K +S C LY DK
Sbjct: 727 --SGKWISGC-LYEDK 739
>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
mansoni]
Length = 1958
Score = 156 bits (394), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 144/576 (25%), Positives = 245/576 (42%), Gaps = 54/576 (9%)
Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
++ + + +G LEI+ +P+F ++ V F ++D + + ++ +
Sbjct: 1103 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1159
Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
++I ++E+ + RP L + T I ++A L P+
Sbjct: 1160 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1207
Query: 873 TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
S L RLR R PL A R T P Q + F+NI
Sbjct: 1208 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1266
Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
H+G F+ G P W +LRV P DG + +F L+ C+ GF+Y T +++
Sbjct: 1267 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1326
Query: 982 QLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1041
LP G +++ + ++ I L P+ + Y E Y +V + +P V L +
Sbjct: 1327 TLPPGYSFNEHLGIKWITLDPVPYYVQYHVESKTY-AVVGIHS-EPCKSVFRLNAEGNKE 1384
Query: 1042 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG------PWQTRATIPMQSSENALTVR 1095
+ + V T++ Y +++ P+ PW IP E
Sbjct: 1385 EDVLVRPKTCV----LPTLDYYSLQMYAPNLNANHRNKQPPW---LLIPNTLIEFEPWEV 1437
Query: 1096 VVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-- 1147
V L ET LA+G GE++ RGR+L+ P +T
Sbjct: 1438 VTCLITAQLASEETFHGTKDYLALGANLTYGEEIPVRGRILILDVIDVVPEPGQPLTRHK 1497
Query: 1148 ---VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
++ E KG ++AL S QGHL+ A G KI + T+L G+AF D+ LY+ +L V
Sbjct: 1498 LKIIHDGEQKGPVTALTSCQGHLISAIGQKIYIWTLKNTDLVGVAFVDSE-LYIHNLLCV 1556
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
KN +L D+ KS+ L ++ L+++++D S + + + F +DG L +VSDE N+
Sbjct: 1557 KNLVLAADVLKSVQLLRFQSDLRVLSVVSRDNISREVYTSNFFVDGRRLGFMVSDELGNV 1616
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
I+ Y P S G++L+ A+ + + T LR+
Sbjct: 1617 TIYSYDPLDPSSRSGRRLVRCADMRLPSRATCSLRV 1652
Score = 155 bits (391), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 147/586 (25%), Positives = 255/586 (43%), Gaps = 106/586 (18%)
Query: 4 AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
A +K + PT + NC + H + +D+ L + NLV+T
Sbjct: 15 AVFKHISPPTAVDNCLYCHLKH----------ISPPTAVDNCLYCHLTHPKLKNLVITRG 64
Query: 64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
IEIY V+ S SGET+ V ++ N+ + + G
Sbjct: 65 GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 104
Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
S++L+F +AK++V+ F+ LR S+H +E + +LK GR +F + P++
Sbjct: 105 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 157
Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
++DP RC +LVY + +L + + + D + + + R + +
Sbjct: 158 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 217
Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
+ +V D F++G+ EP +++L+E TWAGRVS + TC I ALS +
Sbjct: 218 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 277
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
+ +P+IW +LP D +++VP PIGGV+V+ AN+I Y Q+ SC L LN YA +
Sbjct: 278 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 337
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
Q++P S + +D L L+ T++G+L LL++ + + V L K
Sbjct: 338 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 396
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
+V + + + F+GSR DS+L++ + G+ ++
Sbjct: 397 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 456
Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
S L + +E D + STKR +D + D
Sbjct: 457 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 516
Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
+ ++ LYG + + S + +SF V D L+N+GP+ + G
Sbjct: 517 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 562
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 67/130 (51%), Gaps = 15/130 (11%)
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
T++ +G P + R ++ FG+ +GSI I P+ + + RL+ +K L+ + + G+
Sbjct: 1835 TTNPNSGVDP---EKFRQSIYFGSQNGSIYRIGPIRDKMYSRLRITEKNLIHHLGPICGM 1891
Query: 1365 NPRSFRQFHSNGKAHRPGPD------SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
P+S + +RP P+ + D +L+ Y LP ++LEIA ++G + I+
Sbjct: 1892 PPKSCWSY------NRPQPELANPCGKVADGDLIWRYLTLPHCQRLEIAKKSGQSLESIM 1945
Query: 1419 SNLNDLALGT 1428
++ +L T
Sbjct: 1946 DDIAELIATT 1955
>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
Length = 1320
Score = 156 bits (394), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 130/549 (23%), Positives = 250/549 (45%), Gaps = 52/549 (9%)
Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMV 938
SN + ++L + P +AY P G +R ++ +SG F++G P +
Sbjct: 787 SNFKLKKEKDLLITGAPDNAY-----PAGTSIERRLVYIPLVSGFSSIFVTGVVPYFITR 841
Query: 939 FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKI 998
R + + + +F + ++G I++ + +IC+LP YDN PV+K+
Sbjct: 842 TRHSIPRIFKFTKIAAQSFASFSDSKVSNGLIFLDNAKNARICELPRDFNYDNNLPVKKV 901
Query: 999 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDL 1054
P+ T +TY N Y +VS P N +D+E G + D + +S
Sbjct: 902 PIGETVKSVTYHELSNTY--VVSTYREIPYNA-----LDEEGNPIAGLKKDKPSANSY-- 952
Query: 1055 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETL 1110
+ ++++ P W T+ ++ +E A+TV+ + L ++TK + L
Sbjct: 953 -------KGSLKLISPYN----WTVIETVELRDNEIAMTVKSMVLDIGSSTKRFKHRKEL 1001
Query: 1111 LAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASL 1163
L +GT + ED+ A G ++ G+ N + E +++ KGA++++ +
Sbjct: 1002 LVVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHK--FKEYNTEDTKGAVTSMCEV 1059
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
G L+A G KII+ + +AF D +YV N ++LGD KS++ +
Sbjct: 1060 SGRFLVAQGQKIIVRDVQDDGVVPVAFLDTS-VYVSEAKSFGNLVILGDTLKSVWLAGFD 1118
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
+ ++ +L KD S+D EF+ + ++++ + + + P+ S GQ+L+
Sbjct: 1119 AEPFRMIMLGKDLQSVDVSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLV 1178
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
RA F+V + T ++M+ + + + ++ F + T+DGS + P++E T
Sbjct: 1179 HRASFNVSSSTTC---MRMVPKNEE-----INTQYSDVFQTVGSTIDGSFFTVFPVNEFT 1230
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+RR+ +Q++L D H GLNPR R + G I+D +++ Y L + +
Sbjct: 1231 YRRMYIIQQQLTDKEYHYCGLNPRLNRFGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRK 1290
Query: 1404 LEIAHQTGT 1412
IA + +
Sbjct: 1291 QTIAQKVSS 1299
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 127/611 (20%), Positives = 237/611 (38%), Gaps = 68/611 (11%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L+ ++L+G V +L + +S D I++A + AK+S++ +D H + S+H
Sbjct: 52 LRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESLH 108
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-----G 215
+E E+ L+ V+P + + + L S G
Sbjct: 109 YYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDEG 164
Query: 216 DEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ GF + + SS +++ + L+ + + D F+H Y EP + IL + TW G
Sbjct: 165 ETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSSKAATWTG 224
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHY 331
+ ++I K + NLP D +L+ + P+ G L++G N IH
Sbjct: 225 LLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIHV 284
Query: 332 HSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVL 388
+ LA+N Y + +S + ++ ++ L+ L ND LLS TG L
Sbjct: 285 DNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLSTGSLFH 344
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
+ D + ++R L + +S D+T G F DS L+ +G S L
Sbjct: 345 INFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL- 397
Query: 448 SGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
+E + + + S+ +DM EEL +A Q +
Sbjct: 398 ---------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFKH 448
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT-------- 552
D L+N GP+ DF+ G A I+ N + G C I+
Sbjct: 449 DELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPIIR 508
Query: 553 ---VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+ + +R N ++ + DD I +E L++ D + +
Sbjct: 509 SSLTFSQVNRMWNINNKYLITSDDVNSKSEIFQIEKSYSRLKSKDFIND----------E 558
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TIA L + ++Q+ + + + + + +SF E + ++S ++ D
Sbjct: 559 MTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHDD 614
Query: 670 YVLLGMSDGSI 680
Y+++ + G +
Sbjct: 615 YLMIFFASGEV 625
>gi|224135031|ref|XP_002321966.1| predicted protein [Populus trichocarpa]
gi|222868962|gb|EEF06093.1| predicted protein [Populus trichocarpa]
Length = 180
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/152 (55%), Positives = 100/152 (65%), Gaps = 22/152 (14%)
Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
TG+ EAIDGADGG DQGDIY V+CYE+GALEIFDVPNFN VF VDKFVSG+TH+VD++M
Sbjct: 4 TGISEAIDGADGGAHDQGDIYRVICYETGALEIFDVPNFNSVFIVDKFVSGKTHLVDSFM 63
Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
E +D +N EE G GRKE +V L F ILT GTILC
Sbjct: 64 GEPPRDLTKGMN---EEVAGAGRKE------IVLL-------------FFGILTYGTILC 101
Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA 884
Y A LFEGP+ SK +DPVS S+ S++SA
Sbjct: 102 YHACLFEGPDGNSKLEDPVSAQNSVGDSSISA 133
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/50 (62%), Positives = 41/50 (82%)
Query: 1019 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1068
I + V +P+NQVLS + DQE GHQI+N NLSSV++HRT +V+E+EVRIL
Sbjct: 131 ISAFAVQRPVNQVLSSMADQEFGHQIENPNLSSVEIHRTDSVDEFEVRIL 180
>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
Length = 1342
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 121/500 (24%), Positives = 231/500 (46%), Gaps = 46/500 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
N + ++L + P +AY+ T +R+ F N++G F++G P +
Sbjct: 810 NFKLVKEKDLIITGAPDNAYSLGTTIE----RRLVYFPNVNGFTSIFVTGITPYYISKTT 865
Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL 1000
+ + V+F + +G IY+ + +IC++P Y+N WP++KIP+
Sbjct: 866 HSVPRIFKFTKLPAVSFAPYSDDKIKNGLIYLDNSKNARICEIPVDFNYENNWPIKKIPI 925
Query: 1001 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS--SVDLHRTY 1058
K + +TY N + V+ ++ +D+E G I + S S + ++ Y
Sbjct: 926 KESIKSVTYHELSNTF-------VISTYEEIPYDCLDEE-GKPIVGVDKSKPSANSYKGY 977
Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIG 1114
++++ P W TI + E + V+ + L ++TK+ + L+ IG
Sbjct: 978 ------IKLISPYN----WSVIDTIELVDGEIGMNVQSMVLDVGSSTKKFKNKKELIVIG 1027
Query: 1115 TAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T + ED++A G +F G+ N + E++ ++ KGA++++ + G
Sbjct: 1028 TGKYRMEDLSANGSFKIFEIIDIIPEPGKPETNHK--FKEIHQEDTKGAVTSICEISGRF 1085
Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
L++ G KII+ + +AF D +YV N ++LGD KSI+ + +
Sbjct: 1086 LVSQGQKIIIRDLQDDGVVPVAFLDTS-VYVSEAKSFGNLLILGDSLKSIWLAGFDAEPF 1144
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
++ +L KD SLD +F+I + ++++D + + Y P+ S GQ+L+ +A
Sbjct: 1145 RMVMLGKDLQSLDVNCADFIIKDEEIFILIADNNSTLHLVKYDPEDPTSSNGQRLIHKAS 1204
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
F++ + T + + P S T F + T+DGS + P++E ++RR+
Sbjct: 1205 FNINSTPT------CIRSIPKNEEINPSS--TEVFQSIGSTIDGSFYTVFPINEASYRRM 1256
Query: 1348 QSLQKKLVDSVPHVAGLNPR 1367
LQ+++ D H GLNPR
Sbjct: 1257 YILQQQITDKEYHFCGLNPR 1276
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/414 (21%), Positives = 178/414 (42%), Gaps = 64/414 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L+V A V++++ + E +++ K L+LV ++LHG + +
Sbjct: 29 LIVGKATVLQVFEIITTETKTQQYK------------------LKLVEQFKLHGLITDIK 70
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+ +NS+ D ++++ + AK+S++++D ++ + S+H +E+ E
Sbjct: 71 AIRT--VENSQL-DYLLVSSKGAKMSLIKWDHHLNSISTVSLHYYENSIQ---SSTYEKL 124
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMII-----------------LKASQGGSGLVGDEDTF 220
LV V+P C + L + + S G +++
Sbjct: 125 TTTDLV-VEPNNNCTCLRFKNLLTFLPFETLDEEEEDDDDDEEMNGSSGSDKKATNKENG 183
Query: 221 GSGGG-FSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
S G S ESS +I+ R LD + + D F++ Y EP + I+ + WAG +
Sbjct: 184 NSNGEEVSELFESSFMIDGRTLDSRIGDIIDMQFLYNYREPTIAIIFSKAHAWAGNLPKV 243
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSA 336
LS+ K + NLP D K++ +P P+ G L++G N IH +
Sbjct: 244 KDNINFIVLSLDLVTKASTTVLKIDNLPFDIDKIIPLPQPLNGSLLMGCNEIIHVDNGGI 303
Query: 337 SCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVY 393
+ LALN + S+ +S + +S +++L+ + ND L+ GD +
Sbjct: 304 TRRLALNQFTSSITTSLKNYHDQSDLNLKLENCSVKPIPNDNKVLMILNNGDFYYINFKI 363
Query: 394 DGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
DG+ +++ D+ T P +I T+ N+L F+ ++ G++ L++
Sbjct: 364 DGKTIKKFFVEKVSDLNYDDIQLTYP----GEIATLDNNLMFISNKNGNNPLLE 413
>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
Length = 1388
Score = 154 bits (390), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 131/495 (26%), Positives = 222/495 (44%), Gaps = 62/495 (12%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
+ +I G+ F G+ P + + L + + T H +C GF Y+
Sbjct: 870 LVALDDICGYSTVFQRGTTPAFILKEASSAPRVIGLSGKPVKSLTSFHTSSCQRGFAYLD 929
Query: 974 SQGILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVL 1032
S L+ICQLP + Y + W +++P+ + H +TY LY IV
Sbjct: 930 STDTLRICQLPPQTHYGHLGWATRRMPMDSEVHTLTYHP-PGLY--IVGT---------- 976
Query: 1033 SLLIDQEVGHQID-----NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
Q +Q+D +++L DL ++E +++L+ W T +
Sbjct: 977 ----GQAEDYQLDPTETYHYDLPKEDLTFKPSIERGVIKLLDEKS----WTIIDTHVLDP 1028
Query: 1088 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT 1146
E L+++ + L + T + + L+A+GT+ V GED+A +G + +F P T
Sbjct: 1029 QEVVLSIKTLNLEVSEITHQRKDLIAVGTSVVHGEDLATKGCIRIFEVITVVPQPDRPET 1088
Query: 1147 E-----VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLY 1197
+ E+KGA+SA++ L QG L++A G K ++ K GT L +AF D Y
Sbjct: 1089 NRRLKLIVKDEVKGAVSAISELGTQGFLIMAQGQKCMVRGLKEDGTLLP-VAFMDMQ-CY 1146
Query: 1198 VVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
V L + + +GD ++ ++F + E+ +++L A+ +L+ A +FL L L
Sbjct: 1147 VSDLKNLPGTGMLAMGDAYRGVWFTGYTEEPYKMSLFARSKHNLETIAVDFLPFDQQLHL 1206
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAA 1313
VV+D N+QI + P + G +LL +A FH G H L L L S AA
Sbjct: 1207 VVADADMNLQILQFDPDNPKGEAGSRLLHKATFHTG-HFPTSLHLIHSHLKLPSATDFAA 1265
Query: 1314 PGSDKTNRFAL------------------LFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1355
++ + FA+ L T G++ + PL E ++RRL +L L
Sbjct: 1266 TNNNPADAFAMDTSPNTTTDTPQQPFHQILHTTQSGTLALLTPLSEDSYRRLSNLSAYLA 1325
Query: 1356 DSVPHVAGLNPRSFR 1370
+++ LNPR+FR
Sbjct: 1326 NTLDSACSLNPRAFR 1340
Score = 135 bits (341), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 168/698 (24%), Positives = 290/698 (41%), Gaps = 88/698 (12%)
Query: 57 NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NLVV ++++I+ ++ V + S+N+ E L + A L LV
Sbjct: 28 NLVVAKNSLLQIFELKSTTTEVTPGSGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + + +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE--- 217
W + +F + DP RC + + IL Q LV D+
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQ--RDLVEDDYDS 198
Query: 218 -----------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
+ G SS V+ L +LD + H F+H Y EP I+
Sbjct: 199 DAEVPKETKADQANDTSGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVA 258
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
T ++ + S ++ K + S LP+D K++ +PSPIGG L+V
Sbjct: 259 ASRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLV 318
Query: 325 GAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
G N IH + +ALN +A + S +S ++ L+ L + L+
Sbjct: 319 GRNEIIHVDQGGKTNGVALNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVL 378
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLL 434
G L++LT DGR V + + S + +G F+GS G+S++
Sbjct: 379 NNGRLLILTFTLDGRTVSGMTIQTVAADHGGHLVKSAASCTSNLGRGRLFIGSEDGESVM 438
Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNN 493
+ +T L++ L+ + + + D D D+ N +++ +A+ +
Sbjct: 439 LGWTG------LTNQLRRKLSNADLDG-EDDSDEEEIDDMEDDLYNDTAPTMHKITAAVS 491
Query: 494 TESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL 541
+A T++F + D L +I P+KD + G + ++ A G + EL
Sbjct: 492 EPTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDREL 551
Query: 542 -------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EAR 586
ELP G+W V+ + + D+ A D +Y YL++S E
Sbjct: 552 HPISVATKELPLAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDG 611
Query: 587 TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
T+V E D LTE + +G T+ G L +V+QV RI D +
Sbjct: 612 TVVYEVNGDQLTETDKGDFEREEGTTLLVGVLAAGTKVVQVMRTEVRIYDSELNLVHIQS 671
Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
E GS E + +++ S ADPY+L+ D S+++
Sbjct: 672 MEEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708
>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC 6260]
Length = 1320
Score = 154 bits (389), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 129/548 (23%), Positives = 249/548 (45%), Gaps = 52/548 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +AY P G +R ++ +SG F++G P +
Sbjct: 788 NFKLKKEKDLLITGAPDNAY-----PAGTSIERRLVYIPLVSGFSSIFVTGVVPYFITRT 842
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
R + + + +F + ++G I++ + +IC+LP YDN PV+K+P
Sbjct: 843 RHSIPRIFKFTKIAAQSFASFSDSKVSNGLIFLDNAKNARICELPRDFNYDNNLPVKKVP 902
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLH 1055
+ T +TY N Y +VS P N +D+E G + D + +S
Sbjct: 903 IGETVKSVTYHELSNTY--VVSTYREIPYNA-----LDEEGNPIAGLKKDKPSANSY--- 952
Query: 1056 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLL 1111
+ ++++ P W T+ ++ +E A+TV+ + L ++TK + LL
Sbjct: 953 ------KGSLKLISPYN----WTVIETVELRDNEIAMTVKSMVLDIGSSTKRFKHRKELL 1002
Query: 1112 AIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQ 1164
+GT + ED+ A G ++ G+ N + E +++ KGA++++ +
Sbjct: 1003 VVGTGRYRMEDLGANGAFKIYEIIDIIPEPGKPETNHK--FKEYNTEDTKGAVTSMCEVS 1060
Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
G L+A G KII+ + +AF D +YV N ++LGD KS++ +
Sbjct: 1061 GRFLVAQGQKIIVRDVQDDGVVPVAFLDTS-VYVSEAKSFGNLVILGDTLKSVWLAGFDA 1119
Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
+ ++ +L KD S+D EF+ + ++++ + + + P+ S GQ+L+
Sbjct: 1120 EPFRMIMLGKDLQSVDVSCAEFISKDEEIYILIAGNNNVMHLVQFDPEDPTSSNGQRLVH 1179
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
RA F+V + T ++M+ + + + ++ F + T+DGS + P++E T+
Sbjct: 1180 RASFNVSSSTTC---MRMVPKNEE-----INTQYSDVFQTVGSTIDGSFFTVFPVNEFTY 1231
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
RR+ +Q++L D H GLNPR R + G I+D +++ Y L + +
Sbjct: 1232 RRMYIIQQQLTDKEYHYCGLNPRLNRFGGEAFDDSQTGVKPILDHQVIKRYAKLNEDRKQ 1291
Query: 1405 EIAHQTGT 1412
IA + +
Sbjct: 1292 TIAQKVSS 1299
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 127/611 (20%), Positives = 237/611 (38%), Gaps = 68/611 (11%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L+ ++L+G V +L + +S D I++A + AK+S++ +D H + S+H
Sbjct: 52 LRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESLH 108
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-----G 215
+E E+ L+ V+P + + + L S G
Sbjct: 109 YYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDEG 164
Query: 216 DEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ GF + + SS +++ + L+ + + D F+H Y EP + IL + TW G
Sbjct: 165 ETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSLKAATWTG 224
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHY 331
+ ++I K + NLP D +L+ + P+ G L++G N IH
Sbjct: 225 LLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIHV 284
Query: 332 HSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVL 388
+ LA+N Y + +S + ++ ++ L+ L ND LLS TG L
Sbjct: 285 DNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLLTGSLFH 344
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
+ D + ++R L + +S D+T G F DS L+ +G S L
Sbjct: 345 INFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL- 397
Query: 448 SGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
+E + + + S+ +DM EEL +A Q +
Sbjct: 398 ---------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFKH 448
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT-------- 552
D L+N GP+ DF+ G A I+ N + G C I+
Sbjct: 449 DELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPIIR 508
Query: 553 ---VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
+ + +R N ++ + DD I +E L++ D + +
Sbjct: 509 SSLTFSQVNRMWNINNKYLITSDDVNLKSEIFQIEKSYSRLKSKDFIND----------E 558
Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TIA L + ++Q+ + + + + + +SF E + ++S ++ D
Sbjct: 559 MTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHDD 614
Query: 670 YVLLGMSDGSI 680
Y+++ + G +
Sbjct: 615 YLMIFFASGEV 625
>gi|403178252|ref|XP_003336695.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375164075|gb|EFP92276.2| hypothetical protein PGTG_18491 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1149
Score = 154 bits (389), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 170/357 (47%), Gaps = 18/357 (5%)
Query: 1075 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
G W T Q +E ++ V L +T + +GT + ED+AARG + +F
Sbjct: 800 GKWVTIDGYEFQQNEWVTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFE 859
Query: 1133 ----STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNG 1187
+ +N + Y +E K ++A+ ++ G+ L G K+ + E L
Sbjct: 860 IVVVNPAQNHRTYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLA 919
Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
+ F D P Y + I KNFILLGD K I ++++E+ +L L + L C +FL
Sbjct: 920 VGFLDIKP-YTTCMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL 978
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
+ L++V +D I+IF Y P ES GQKLL R+EF+ + +T ++
Sbjct: 979 VIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQF------G 1032
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
R A D+ F +LDGSI + P E ++RLQ +Q +L + H AGLNP+
Sbjct: 1033 KRLSA---KDEAKVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPK 1089
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
R N R I+D ELL + +L + +Q EIA G+ R +L NL +L
Sbjct: 1090 GHRTVR-NDLVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNL 1145
>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1513
Score = 153 bits (387), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 110/357 (30%), Positives = 170/357 (47%), Gaps = 18/357 (5%)
Query: 1075 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
G W T Q +E ++ V L +T + +GT + ED+AARG + +F
Sbjct: 1164 GKWVTIDGYEFQQNEWVTSMANVELDSRSTVSGRRQFVGVGTTCNRAEDLAARGGIYVFE 1223
Query: 1133 ----STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNG 1187
+ +N + Y +E K ++A+ ++ G+ L G K+ + E L
Sbjct: 1224 IVVVNPAQNHRTYNRALRLRYYEETKACVTAVDAINGYFLHTMGQKLYAKCFEQDERLLA 1283
Query: 1188 IAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFL 1247
+ F D P Y + I KNFILLGD K I ++++E+ +L L + L C +FL
Sbjct: 1284 VGFLDIKP-YTTCMRIFKNFILLGDAVKGITLVAFQEEPYKLIELGHTYVDLKCSTIDFL 1342
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
+ L++V +D I+IF Y P ES GQKLL R+EF+ + +T ++
Sbjct: 1343 VIDGKLAIVATDLNGVIRIFEYNPTNIESQGGQKLLCRSEFNTSSEMTCSMQF------G 1396
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
R A D+ F +LDGSI + P E ++RLQ +Q +L + H AGLNP+
Sbjct: 1397 KRLSA---KDEAKVMGTFFASLDGSISSLVPAKEAVYKRLQLVQTRLTRHIQHFAGLNPK 1453
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
R N R I+D ELL + +L + +Q EIA G+ R +L NL +L
Sbjct: 1454 GHRTVR-NDLVSRAINRGILDGELLIKFHLLSVTQQAEIAGLAGSDRETVLVNLLNL 1509
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 173/771 (22%), Positives = 308/771 (39%), Gaps = 167/771 (21%)
Query: 48 SKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKE---SKNSGETKRRVLMDGISAASLELV 104
SK P+ NL+V + +++++ + + E+ E ++N E K + L +
Sbjct: 37 SKTRPRPITNLIVARSTLLQVFELCLVEDDQAENNHTRNHHELKNK-------NYKLFHL 89
Query: 105 CHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
C +RLHG V L L+ D ++++F+DAK+++LE+ +S L S+H FE
Sbjct: 90 CEHRLHGRVTGLQRLTTLDTQEDGL-DRLLVSFQDAKMTLLEWSNSAADLVPISLHTFEK 148
Query: 165 -PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
P+ R+ + ++VDP RC +L+ + +L Q L D+ G
Sbjct: 149 LPQITQGDLPRDFQGQ---LEVDPLSRCAVLLLPQATLAVLPFFQDQLDL----DSLGLS 201
Query: 224 GGFSARI------------ESSHVINLRD----------LDMKH--VKDFI---FVHGYI 256
GG + + SS +++ LD +H +K I F+ G+
Sbjct: 202 GGLKSALGSEQQRFQTFPYASSFILDFNQQLLNHLPPSSLDSQHRPIKSVIALKFLPGFS 261
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
EP + +L++ + TW+ R+ +T + L++ P+I NLP+DA+ L+A P
Sbjct: 262 EPTLAVLYQSQYTWSARLENHANTAALIVLTLDLGSNHFPIISHTTNLPYDAHGLVACPK 321
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS------------------ 358
+ GVLV+ A+ I + QS+ N V S ++PR
Sbjct: 322 ELAGVLVLCADMILHVDQSSKIIGLATNGWVKHTSELQIPRQDTVRLITPTNKISGHRST 381
Query: 359 ---------------------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
V L+ A + + D A + +TG++ L
Sbjct: 382 TNKSDERPEDLEDGEEQDESGVPEGHEKLLVRLENAKIVFSRADRAFVFLRTGEVFSLQF 441
Query: 392 VYDGRVVQRLDLSKTN-PSVLTSDITTIGNSLFFLGSRLGDSLL-----------VQFTC 439
+ DGR + +L L K + S++ S + + N F+GS GDS L
Sbjct: 442 LRDGRTLTKLVLEKLDLLSIIPSTVLKVNNECLFVGSMAGDSALYILDHLRPRSSSDDDN 501
Query: 440 GSGTSMLSSGLKE-----------EFG-DIEADAPSTKRLRRSSSDALQD--MVNGEELS 485
G + SS + + +F DI D T +RR+ L D NG +
Sbjct: 502 DDGHQLPSSSIIQPDKAAKNQSSLDFDEDIYGDRTETDPVRRTDHSQLYDDRPSNGADDG 561
Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
G+ ++ E + + D + GP++DF+ +ATG+ +EL
Sbjct: 562 RPGAGAHLAEPFLR-----LGDVIQAHGPIRDFTM--------AATGVENMP----LELL 604
Query: 546 GCKGI-----WTVYH-----KSSRGHNADSSRMAAYDDEYHAYLII-----SLEARTMVL 590
C G TV+H + R + +S + + + L++ S E + + +
Sbjct: 605 ACTGTGDLGGLTVFHREIPLRKRRKLSFESPSASHINALFFTSLVVESGGLSEERKVVWM 664
Query: 591 ETADLLTEVT---ES-----VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
+ TE+ ES ++ F + +T+A FG++ V+QV ++ S
Sbjct: 665 GRSGPRTEIATYGESGELSLINTFPE-KTLAVSPFFGKQFVVQVTNTAIKLFTSSL---- 719
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
++ +L SI D YV+L G + GD + T+S
Sbjct: 720 -----EEAQVIQPEPAVKILRASIVDDYVMLETHCGLKLIYQGDHDSKTLS 765
>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
Length = 1305
Score = 153 bits (387), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 169/673 (25%), Positives = 300/673 (44%), Gaps = 98/673 (14%)
Query: 55 VPNLVVTAANVIEIY-----VVRVQE---EGSKESKNSGETKRRVLMDGISAASLELVCH 106
V NLVV +N +++Y V VQ +GS S +T+ L+L+
Sbjct: 37 VRNLVVARSNFLQVYEVLEEPVPVQSSVTDGSSASMREDQTR------------LQLLAE 84
Query: 107 YRLHGNVESLAILSQGGADNSRR--RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
+ HG V LA LS ++R+ R ++++F DAK++V+E+ D +H L SMH FE
Sbjct: 85 HVCHGIVTGLARLS---TLDTRQDGRHRLVISFRDAKMTVMEWSDQLHDLAPVSMHSFE- 140
Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGG 224
L +G + A +++VD RC +L+ + IL Q S L ED G
Sbjct: 141 -RLPQLSQG-DLGAFQAVLRVDQASRCVALLLPDNTLGILPFFQDLSEL---EDMTREGL 195
Query: 225 GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCM 282
S S I+L ++ +++V DF F+ G+ EP + IL +R+ TW GR+ +
Sbjct: 196 Q-SLPYAPSLTIDLSEIGPGIRNVVDFAFLPGFSEPTIAILFQRKPTWTGRIDFAKDITS 254
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALA 341
+ +++ + +P+I+ A LP+DA L P +GGV+++ AN+ +H S +A
Sbjct: 255 LVMVTLDIGSRNYPVIFEADGLPYDALSLSVCPRELGGVVILCANSLVHIDQSSKMTGIA 314
Query: 342 LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
+N + +L ++ R + + L+ A ++ VA+L T+TG+ L + DGR V +
Sbjct: 315 VNGWTSTLTDARLDSRPTLRLVLEGAQCAFVGQQVAVLCTRTGETFSLHLEKDGRNVSSM 374
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
D + + + I T+G + F+GS G S+L+++ SG DI
Sbjct: 375 DCRPRAVTCIPACIETVGAAYVFVGSAQGQSVLLRWASQSGAG----------ADILDIT 424
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSA-SNNTESAQ-----KTFSFAVRDSLVNIGPL 515
S L + SDA+ D LY +A ++N Q K + D+L G +
Sbjct: 425 ESGTGLVQ--SDAMDD-------DLYATAGAHNGNGHQIAPTGKDVQLELCDTLPGYGTI 475
Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
+ + D ++ + + S + G+ T++ D A D +
Sbjct: 476 RHIAV-----LDHTSASLDEPSLVACTGVQAMAGLTTIHRHVPSVRQVDLDLPTARDIRH 530
Query: 576 HAYLIISLEAR----------TMVLETADLLTEVTESVDYFVQGRT---------IAAGN 616
+ LE R ++ T + + ++D Q T +AAG+
Sbjct: 531 --IWTVGLEQRQKMGRGPITHQIICSTGS--SSMVYTLDQDTQAATLARKSAEVPLAAGS 586
Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
F R +V++V E R+ G +E+ G ++ + V+++DP+V + +
Sbjct: 587 FFSRSQVLEVTEDMLRLYSPD--------GQITTEAPHGQADA--IDVTVSDPFVAVLSA 636
Query: 677 DGSIRLLVGDPST 689
++ + GDP+T
Sbjct: 637 ARNVTVFFGDPTT 649
Score = 125 bits (314), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 127/528 (24%), Positives = 230/528 (43%), Gaps = 48/528 (9%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRE---RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
+R+ F + +G G F++GS P + + R RL P G AF +
Sbjct: 806 RRLVSFISTTGRSGVFITGSAPFYLLTDRAGIARLYRAPY---GRASAFGAFDPPSSTP- 861
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
+ V + G + L ++ PV + AT T A + +V+ V+
Sbjct: 862 -LLVLADGAMHTYDLSDQASLARELPVTHV---ATSKCFTSTAYHDSSHTLVAARVV--- 914
Query: 1029 NQVLSLLIDQEVG-HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
N L D+ ++ + ++ S + R+ + +L P G W
Sbjct: 915 NAPFELFDDEGAPVYRAPSEDMISPTVFRSC------LELLVP----GSWDCIDGHEFPQ 964
Query: 1088 SENALTVRVVTLFNTTTKENETLLAIG-TAYVQGEDVAARGRVLLF--------STGRNA 1138
+E+ L + TL + T I T +GED+ RG + +F + A
Sbjct: 965 NESILQLICATLPSATDPSGRARFVIASTCNNRGEDLQTRGGLYVFRISTTESTAASDQA 1024
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLY 1197
++ V++ +L+ + A+ + GH++ + G K+ + + + L + F D L
Sbjct: 1025 QARSAKLSLVHADDLRHPVGAICEVNGHIIHSLGQKVFIKAFDSDQRLITVGFLDVG-LD 1083
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
V ++ +KN +++GD YF++++E +L LL K+ D + +FL+ + L L+
Sbjct: 1084 VSAMRSIKNLLIIGDSLTGTYFVAFQEDPFKLVLLGKEARKTDVYCVDFLVQENRLGLLS 1143
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL-QMLATSSDRTGAAPGS 1316
+ ++ Y P +ES G++LL R E+H+G + L + L+T D +
Sbjct: 1144 VSRKGLLRQLEYNPGNAESRAGERLLDRTEYHLGKQIIDSLSFAKRLSTDEDLRQSG--- 1200
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
+L G DGS+ + P+ E+ +RRL L+++L +PH AGLNPR+FR N
Sbjct: 1201 ------VMLVGA-DGSLTWVTPVREVVYRRLALLERQLHRQLPHFAGLNPRAFRTAR-ND 1252
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
RP ++D +LL+ Y L Q +A + + NL +L
Sbjct: 1253 YYSRPLARGMLDGDLLAIYANLHASRQQSLASHINSDPDTLSVNLGNL 1300
>gi|12697776|dbj|BAB21613.1| polyadenylation specificity factor [Homo sapiens]
Length = 216
Score = 153 bits (386), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 81/212 (38%), Positives = 122/212 (57%), Gaps = 12/212 (5%)
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+
Sbjct: 2 KSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAK 61
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTL 1329
ES+ G +LL RA+FHVGAHV F R + GA G K N+ F TL
Sbjct: 62 ESFGGMRLLRRADFHVGAHVNTFWR-------TPCRGATEGLSKKSVVWENKHITWFATL 114
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
DG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D
Sbjct: 115 DGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDG 174
Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 175 ELLNRYLYLSTMERSELAKKIGTTPDIILDDL 206
>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
WM276]
gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
gattii WM276]
Length = 1431
Score = 152 bits (384), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 130/529 (24%), Positives = 228/529 (43%), Gaps = 48/529 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
I F NI G G F++G +P W + HP L ++ H F
Sbjct: 931 IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
I + IC LP D P + ++ T IT+ Y S+ V P
Sbjct: 987 IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERTYTNITFDPTSAHYVGAASIEV--PFQ 1042
Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
D+E Q+ D +L R+ T+E + + PW+
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFDQ 1089
Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
+E +++ V L + +A+GT + GED A RG +F +
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149
Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
+ + ++A+ + G+LL +GPK+ + + +L G+AF D L
Sbjct: 1150 GSVPGWKLVRRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGFDYDAQLMGLAFLDIQ-L 1208
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
Y ++ + KNF+L+GD+ KS +F+S +E + ++KD + +FL+ ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
SD ++++ + P +S G++L+ R E+H G+ T + T+ + AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTTEEE--FAPQT 1326
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
+++ T DG++ + + + F+RLQ + +LV + HVAGLNPR+FR N
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
RP I+D +LL+ + + P+ Q E+ Q GT + S+L L
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428
Score = 110 bits (274), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 160/733 (21%), Positives = 298/733 (40%), Gaps = 121/733 (16%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
+ P + IG NLVV A + ++ +R + E K ++ E K+ V M+ +
Sbjct: 39 DTPDVKVIG---NLVVAGAEALRVFEIREESVPIIEKVKLEEDVAEGKKDVQMEEVGDGF 95
Query: 97 --------------SAASLELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDA 140
+ L L+ + L+G V LA + D D +I++F+DA
Sbjct: 96 FDDGHAERAPLKYQTTRRLYLLAQHELNGTVTGLAATRTLESAIDG---LDRLIVSFKDA 152
Query: 141 KISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ 200
K+++LE+ S + S+H +E ++ +S+ PL++ DP R + +
Sbjct: 153 KMALLEW--SRGDIATVSLHTYERCPQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDS 207
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEP 258
+ +L Q S L D G A S V++L D+ +K+++D +FV G+ P
Sbjct: 208 LAVLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFVPGFHSP 263
Query: 259 VMVILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+ +L TW+GR+ + K C+ I +S+ +PL+ S LP D+ L+A PS
Sbjct: 264 TIALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPS 322
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQN 374
+GG+++V + I + Q A A N S +S + +S S + L+ + ++
Sbjct: 323 ELGGIVLVTSTGIVHIDQGGRVAAACVNAWWSRITSLKCSMASVSQKLTLEGSRCVFVTP 382
Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLD-LSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
LL + G + + +GR V ++ L K SD+ G+ F+GS GDS
Sbjct: 383 HDMLLILQNGAVHQVRFSMEGRAVGLIEVLDKGCVVPPPSDLIVTGDGAVFVGSAEGDSW 442
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
L + +R+ R+ + V+ +E LYG ++
Sbjct: 443 LAKVNV-----------------------VRQRVERAEEKKDEMEVDWDE-DLYGDINDA 478
Query: 494 T--ESAQKTF-----SFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSN 538
E AQ+ F + + D L +G + D +G+ + + +G S+ S
Sbjct: 479 ALDEKAQEQFGPAAITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNST 538
Query: 539 YELV-------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
+ + EL +G+W + G + + A +++S E
Sbjct: 539 FNVFRRGIPITKRRRFNELLNAEGVWFLSIDRQTGQ-----KFKDIPEAERATILLSSEG 593
Query: 586 ---RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
R L + ++ + G+T++A F R ++ V +LD +
Sbjct: 594 NATRVFALSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN----- 643
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
G + +++ SI+DP+ ++ +D S+ VGD TV+ + P E
Sbjct: 644 ---GKIIQTVCPRGDGPKIVNASISDPFAIIRRADDSVTFFVGDTVARTVA-EAPIVSEG 699
Query: 703 SKKPVSSCTLYHD 715
+ ++ D
Sbjct: 700 ESPVCQAVEVFTD 712
>gi|257215708|emb|CAX83006.1| Cleavage and polyadenylation specificity factor subunit 1
[Schistosoma japonicum]
Length = 462
Score = 152 bits (384), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 115/402 (28%), Positives = 197/402 (49%), Gaps = 55/402 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLV+T + IEIY ++ S SGET+ + + S+ + N+ +
Sbjct: 41 NLVITRSGFIEIYNIK--------SSVSGETR----FNWVYGTSV--------YENIADI 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ G S++L+F +AK++V+ F+ LR S+H +E + +LK GR +
Sbjct: 81 VSVRFAGDLLD----SLLLSFSEAKVAVMNFNPITFELRTLSLHNYE---FENLKSGRMN 133
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG------------- 223
F + P++++DP RC +LVY + +L + + + D G
Sbjct: 134 FTKLPILRLDPYQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHIGVRNFLLWQQRATA 193
Query: 224 ---GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
F+ + +S + +V D F+HG+ EP +++L+E TWAGRVS + T
Sbjct: 194 PLLATFTTCLSTS-----TGEKINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDT 248
Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCA 339
C I ALS + + +P+IW +LP D ++ VP PIGGV+++ AN+I Y Q+ SC+
Sbjct: 249 CCIVALSFNLQKRTNPVIWFQESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCS 308
Query: 340 LALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD-- 394
L LN YA + Q++P S + +D L L+ T++G+L LL++ +
Sbjct: 309 LPLNCYAQISTNFPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQA 367
Query: 395 GRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
+ V L K +V + + + F+GSR DS+L++
Sbjct: 368 TQTVTSLLFHKVGHAVPPHCMVLLESKYLFIGSRFCDSVLMK 409
>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1447
Score = 152 bits (383), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 137/588 (23%), Positives = 258/588 (43%), Gaps = 49/588 (8%)
Query: 872 STSRSLSVSNVSASRLRNLRFSRTP-LDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
STS +L S + ++ N ++ P + A + + + + N+ G+ F+ G
Sbjct: 879 STSPNLLSSTLQFLKIHNTHLAQAPDVSAEEQADETQQTSDKPMRAVSNLGGYSVVFMPG 938
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
P + + + L L + + H C+ GFIY ++GI+++ Q P +T+
Sbjct: 939 GSPSFIVKSSKTLPKVLSLQGTGVRGLSSFHTEGCDRGFIYADTEGIVRVAQFPPTTTFA 998
Query: 990 DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1049
D ++K+ + H + Y + Y + S D E+ D+H
Sbjct: 999 DIGMALRKVEIGEDVHAVAYHSPLQTYVIGTST------------FTDFELPKD-DDHRR 1045
Query: 1050 S--SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF-NTTTKE 1106
S D+ ++E+ ++++ P W TI ++ E ++ + L + T E
Sbjct: 1046 SWQEEDIAFKPSIEKSSLKLISPVN----WSVIDTIELEPCEVITCIKTMNLVVSEVTNE 1101
Query: 1107 NETLLAIGTAYVQGEDVAARGRVLLFSTG---RNADNPQN------LVTEVYSKELKGAI 1157
+ LL +GTA +GED+A GR+ ++ D P+ + E ++ G +
Sbjct: 1102 RKPLLVVGTAITKGEDLATTGRLYVYDVVIVVPEPDRPETNKKLKLISAETITRGAGGPV 1161
Query: 1158 SALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIV--KNFILLG 1211
+ L+ + QG +L+A G K ++ K GT L +AF D YV S+ + ++
Sbjct: 1162 TGLSEIGTQGFMLVAQGQKCMVRGLKEDGTNLP-VAFMDTN-CYVTSIKELPGTGLCVIA 1219
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D K ++F + E+ ++ L K ++ + L DG L +V +D N+ I Y P
Sbjct: 1220 DALKGVWFAGYTEEPYKMLLFGKSATRMEVLCADLLPDGKDLFIVAADADGNLHIMQYDP 1279
Query: 1272 KMSESWKGQKLLSRAEFHVGAH-------VTKFLRLQMLATSSDRTGAAPGSDK--TNRF 1322
+ +S +G LL R F +GAH + L L T+S + + + +
Sbjct: 1280 EHPKSLQGHLLLHRTTFSLGAHHPTTMTLLPAIPSLHPLTTASSSSLSPSPQEDSPSPSQ 1339
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
+LL + G+ ++PL E +RR +L L +++ H GLNPR++R + G
Sbjct: 1340 SLLLTSRTGTFALLSPLTESQYRRFGTLVSHLTNTLYHPCGLNPRAYR-VDKDANEGIVG 1398
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+I+D +L + L + + E+A + G ++ L++L G F
Sbjct: 1399 GRTIIDGGVLGRWMELGSQRRGEVAGRVGVDVLELRDELSELRRGLEF 1446
Score = 129 bits (324), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 184/745 (24%), Positives = 307/745 (41%), Gaps = 153/745 (20%)
Query: 57 NLVVTAANVIEIYV-----VRVQEEGSKES---KNSGETKRRVL-MDGISAA-------- 99
NLVV A++++I+ V + E K+S K+ T R DG+ A+
Sbjct: 28 NLVVAKASLLQIFTTKTVSVDLDELSGKDSSTVKDVTSTDPRAHDEDGVEASFLGADSIL 87
Query: 100 ---------SLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
L L+ Y L G V SL I S+ G + ++++ F+DAK+S++E
Sbjct: 88 PRSELARTTKLVLIAEYNLSGTVTSLVRVKTISSKTGGE------ALLVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQGGSGLVGDED--------------TFGSGGGFSARIESSHVINLRDLDMKHV--KDFI 250
Q + D D + G + SS V+ L LD +
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKSAAENGILATPYGSSFVLRLSSLDPSLIFPIHLE 261
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F++ Y EP IL + + + H T M+ L I K I S LP+D
Sbjct: 262 FLYEYREPTFGILSSTMAPSSALLQERKDHLTYMVFTLDIHQ--KASTTILSVGGLPYDL 319
Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
+ ++ + P+GG L+VGAN IH + +A+N +A + L +S ++ L+
Sbjct: 320 FMIVPLAPPVGGALLVGANELIHIDQAGKANGVAVNMFAKQCTNFSLLDQSDLALRLEGC 379
Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLT---SDITTI 418
L +N L+ +GD+ +L+ DGR V L + + + +LT S ++++
Sbjct: 380 KIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGDILTGAASCVSSL 439
Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
G F+GS + DS+++ ++ SG PS ++ R SS A+ D+
Sbjct: 440 GAGALFVGSEVSDSVILGWSRKSGQ------------------PSRRKSRLDSS-AIADV 480
Query: 479 ----------------VNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
+ G+ ++ +A+N T S K ++F++ DS+VNI P+ + ++
Sbjct: 481 DEAMLDEEDLEDDDDDLYGDGPTISPTAANVTASNSKAGDYTFSIHDSMVNIAPITNITF 540
Query: 521 G-----------LRINADAS------ATGISKQSNYELV------------ELPGCKGIW 551
G L++N S A G K + ++ ELP +GIW
Sbjct: 541 GEVALSSDKEEELKLNGVQSELQLLAAVGREKGGSLAVINRNIQPNVIGRFELPEARGIW 600
Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
T+ K + +G + + D +Y +I+S EA + E+A +
Sbjct: 601 TMSAKKPAPKGLQVNKEKTVIGGDYGVDAQYDRLMIVSKASEAEDAIDESAVYALTNAGF 660
Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
E ++ G TI AG L RVIQV + R DG + Q L + E+G+
Sbjct: 661 EALSGTEFEPAAGSTIEAGTLGNGMRVIQVLKSEVRSYDGDLGLAQILPM--LDDETGA- 717
Query: 656 SENSTVLSVSIADPYVLLGMSDGSI 680
++S S ADP++LL D SI
Sbjct: 718 --EPKIISASFADPFLLLIRDDASI 740
>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
Length = 1324
Score = 150 bits (379), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 174/734 (23%), Positives = 304/734 (41%), Gaps = 114/734 (15%)
Query: 739 IDGADGGP-LDQGDIYSVVCY------ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
D A+ GP + D Y+ Y ESGAL I VP+ F F G + + D
Sbjct: 661 FDAAELGPSVATSDDYAAPTYWALITTESGALYICTVPDLKIAFHCPSFGDGHSLVWDRL 720
Query: 792 MREALK---DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
+A+ D + + + G +E I ++ L + RP L A +D
Sbjct: 721 PNQAIPQAGDGADAPDEARNDDDAHGSEEYIVETLLIGLGQGQ------RPHLLARTSDH 774
Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
+L Y+ + PV V +V+ + +R
Sbjct: 775 HLLMYEVF-------------PV-------VPSVTEASVR-------------------- 794
Query: 909 APCQRITIFKNISGHQGFFLSGSRPCW--CMVFRERLRVHPQLCDGSIVAFTVLHNVNCN 966
R+ F+NI+G G ++G RP C + + + P + ++ F LH +
Sbjct: 795 ----RLKPFQNIAGCDGVCVTGPRPLLVACGHQLKAITIVPLALEDAVKTFHPLHMDDVE 850
Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLK 1026
+GFIY T G L P G + ++ L T +I + + L L++ P +
Sbjct: 851 NGFIYFTKAGTLCCATAPDGLMLNRGVLARRAVLGRTIQKIAFDLDSRLAALLLMEP--R 908
Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHR-TYTVEE-------YEVRILEPDRAGGPWQ 1078
P E+ N++ S +L +Y +E +++++L P
Sbjct: 909 P-----------ELKPSRGNNDPPSNELPNISYRPDEPKALTPFFQLQLLSPKSMKLLPD 957
Query: 1079 TRATIPMQSSENALT-VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
TR + + VR+ + N+T K+N +A+G ++G+ G V ++ +
Sbjct: 958 TRIEYDLHHHVTSFAAVRLSSSLNSTGKQN--YIAVGVTLLEGQRATTTGFVDFYTVDVH 1015
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIAS-----GPKIILHKWT-GTELNGIAFY 1191
D + + + S + G +SA+ + L+A+ G KI + + G EL +A++
Sbjct: 1016 -DGKETRLEKRASCKQPGCVSAMDCTEDGFLVAAVGQRLGSKIYVWNFQDGQELQPLAYF 1074
Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSW-KEQGAQ--------------LNLLAKDF 1236
+A +Y + ++KN ++GD + L + +++G Q L + D
Sbjct: 1075 EAG-IYTSCIRVIKNLAIVGDYESGVQLLRFSRQKGLQQMPVFRGTKHRFYSLVKVGADP 1133
Query: 1237 GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
+C+ +F++ S L+++ D N+ Y ++ G+ L+ A FH+G ++
Sbjct: 1134 HKSNCYCADFVVRESDLAMIYGDADGNLVALDYDADSPDTRGGRILVRSANFHLGTRLSA 1193
Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
LRLQ A R + FG ++G G + PL E +RRL+ LQKKLV
Sbjct: 1194 MLRLQ--AAPVVRAPGGLAEAQKCHVVHTFG-IEGQQGVVIPLHEAEYRRLEMLQKKLV- 1249
Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1416
S +AGL+P FR F S+ R I+D LL Y L EQL++A Q G + Q
Sbjct: 1250 SHSSLAGLHPFQFRAFKSSIWRPRSFAQGILDGALLRQYFCLGRREQLDVAEQLGVSAQQ 1309
Query: 1417 ILSNLNDLALGTSF 1430
+ ++ AL +F
Sbjct: 1310 LERDMAH-ALDAAF 1322
Score = 126 bits (316), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 142/291 (48%), Gaps = 29/291 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL +RL+G ++ ++ + RD+++L+F DAKIS ++F+ S L +
Sbjct: 34 LELAASFRLNGVATAMVAITL----PKQLRDTVVLSFADAKISAIQFEPSTRTLITQKLI 89
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
E E ++ + P+++ DP RC G LVYG +++I+ A
Sbjct: 90 NLEI-EAVYGSKVNADLP--PVLQADPLHRCIGALVYGCRLVIIPAH------------- 133
Query: 221 GSGGGFSARIESS-HVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
R VI+L L + K F F+ GY P ++LHE W GR +
Sbjct: 134 ----ALQPRTNVQFRVIDLEKLSSPLGQAKSFCFLTGYTTPTALLLHEPRPVWVGRHAVG 189
Query: 278 HHTCMISALS--ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
+C++SALS + TT P +W+ +LP D + L+ P P+GG L+V N + + +Q+
Sbjct: 190 RDSCVLSALSCELDTTDDFAPTVWAKDSLPSDCFALVPTPQPLGGALIVSPNMVLHTNQA 249
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
+S A+A+N A ++ S+ LD A T++ + A+ S ++G L
Sbjct: 250 SSSAVAVNAIAARATGYPHTTQAGLSLNLDNARVTFITSVDAIFSLQSGQL 300
>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. neoformans JEC21]
gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 1431
Score = 150 bits (379), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 129/529 (24%), Positives = 228/529 (43%), Gaps = 48/529 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
I F NI G G F++G +P W + HP L ++ H F
Sbjct: 931 IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
I + IC LP D P + ++ IT+ Y S+ V P
Sbjct: 987 IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV--PFQ 1042
Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
D+E Q+ D +L R+ T+E + + PW+
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWKVIDGYEFDQ 1089
Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
+E +++ V L + +A+GT + GED A RG +F +
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149
Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
+ + + ++A+ + G+LL +GPK+ + ++L G+AF D L
Sbjct: 1150 GSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDSQLMGLAFLDIQ-L 1208
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
Y ++ + KNF+L+GD+ KS +F+S +E + ++KD + +FL+ ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
SD ++++ + P +S G++L+ R E+H G+ T + T+ + AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLRTEYHAGSAATVSKVIARRKTAEEE--FAPQT 1326
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
+++ T DG++ + + + F+RLQ + +LV + HVAGLNPR+FR N
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
RP I+D +LL+ + + P+ Q E+ Q GT + S+L L
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428
Score = 112 bits (279), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)
Query: 57 NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
NLVV A V+ ++ +R + E K ++ E ++ V M+ +
Sbjct: 48 NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107
Query: 97 --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
+ L L+ + L+G + LA ++ D +I++F+DAK+++LE+ S +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +E ++ +S+ PL++ DP R + + + +L Q S L
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
D G A S V++L D+ + K+++D +F+ G+ P + +L TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277
Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
R+ + K C+ I +S+ +PL+ S LP D+ L+A PS +GG+++V + I
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336
Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
+ Q +A+C A + SL S + S + L+ + ++ LL + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394
Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ +GR V++ LD P SD+T G+ F+GS GDS L +
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
S K+E +++ D + L +DA D E L+G A+ +
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495
Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
+ D L +G + D +G+ IN I+K+ +
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
EL +G+W + G + + A +++S E L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
+ G+T++A F R +++V +LD + + Q + G G +
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658
Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
+++ SI+DP+V++ +D S+ VGD TV+ + P E + ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712
>gi|298715584|emb|CBJ28137.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 255
Score = 150 bits (379), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 83/244 (34%), Positives = 140/244 (57%), Gaps = 12/244 (4%)
Query: 1188 IAFYDAPPLYVVSLNIVKN-FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1246
I F+D P +YV+SL+++K+ FIL+GD + S+ + W+E+ L L+KD F+ E+
Sbjct: 23 IGFHD-PRVYVMSLSVIKHKFILVGDAYGSVQLVVWREEDHSLTALSKDHEDCQVFSAEY 81
Query: 1247 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS 1306
LID +++VV+D ++N+++ YAP + S G KLL +++F++G+ V K R +
Sbjct: 82 LIDEPGMAIVVADGRRNVKVLQYAPNATNSRGGTKLLCQSDFYLGSRVGKLTRRRTRGNL 141
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1366
D GA R+ LL GTLDG +G + P+DE FRRL +LQ + +++ H NP
Sbjct: 142 RD--GA--------RYCLLAGTLDGGLGAVLPVDERVFRRLYALQGIMSNALGHNGAANP 191
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
R++R F +++D LL + L + Q ++ GTT ++++NL D+ L
Sbjct: 192 RAYRLFDHGPTFRYETKQNMLDGSLLWRFVGLDAKTQHDLTRAIGTTVDRVMANLLDIDL 251
Query: 1427 GTSF 1430
+ F
Sbjct: 252 ASLF 255
>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
Length = 429
Score = 150 bits (378), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 200/392 (51%), Gaps = 33/392 (8%)
Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYV 1118
+E V++LEP W T + ++ +E + V+ L + T E + ++ +G++ V
Sbjct: 51 LERGTVKLLEPRN----WSTIDSHELEPAERITCIEVIRLEISELTHERKDMVVVGSSIV 106
Query: 1119 QGEDVAARGRVLLFSTGRNADNP----QNLVTEVYSKE-LKGAISALASL--QGHLLIAS 1171
+GED+ +G + +F P ++ ++++KE +KGA++AL+ + QG L++A
Sbjct: 107 KGEDIVPKGFIRVFEVIDVVPEPDQPEKSKKLKLFAKEEVKGAVTALSGIGGQGFLIVAQ 166
Query: 1172 GPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQGA 1227
G K ++ K G+ L +AF D YV L +K ++GD K ++F+ + E+
Sbjct: 167 GQKCMVRGLKEDGSLLP-VAFKDTQ-CYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPY 224
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
+L+L K+ +L +FL DG+ L ++V+D+ N+ + Y P+ S KG +LL R+
Sbjct: 225 KLDLFGKENENLAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSV 284
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAP--------GSDKTNRFALLFGTLDGSIGCIAPL 1339
FH G F L RT ++P S +++ +L GS+ I PL
Sbjct: 285 FHTG----HFASTMTLLPHGARTPSSPVDEDAMDTDSPPPSKYQILMTFQTGSVAVITPL 340
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
E ++RRL +LQ +LV+++ H LNPR +R S+G + G ++D LL + +
Sbjct: 341 GEDSYRRLLALQSQLVNALEHPCSLNPRGYRAVESDGMGGQRG---MIDGNLLLRWLDMG 397
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
+ + EIA + G I +L L G ++L
Sbjct: 398 AQRKAEIAGRVGADVGAIRVDLEKLHGGLAYL 429
>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
neoformans var. grubii H99]
Length = 1431
Score = 148 bits (374), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 128/529 (24%), Positives = 227/529 (42%), Gaps = 48/529 (9%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP----QLCDGSIVAFTVLHNVNCNHGF 969
I F NI G G F++G +P W + HP L ++ H F
Sbjct: 931 IVPFNNIEGLTGAFITGEKPHWII----SSEAHPLRAFALKQAAMAFGKTTHLGGKGEYF 986
Query: 970 IYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1029
I + IC LP D P + ++ IT+ Y S+ V P
Sbjct: 987 IRIEDGSF--ICYLPPTLNTDFAIPCDRYQMERAYTNITFDPTSAHYVGAASIEV--PFQ 1042
Query: 1030 QVLSLLIDQEVGHQI--DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1087
D+E Q+ D +L R+ T+E + + PW+
Sbjct: 1043 AY-----DEEGEIQLGPDGPDLIPPTNQRS-TLELFS-------QGSDPWRVIDGYEFDQ 1089
Query: 1088 SENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL-- 1144
+E +++ V L + +A+GT + GED A RG +F +
Sbjct: 1090 NEEVMSMESVNLESPGAPGGYRDFIAVGTGFNFGEDRATRGNTYIFEILQTVGPQGGGGP 1149
Query: 1145 -------VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPL 1196
+ + + ++A+ + G+LL +GPK+ + +L G+AF D L
Sbjct: 1150 GSVPGWKLVKRTKDPARHPVNAVNHINGYLLNTNGPKLYVKGLDYDAQLMGLAFLDIQ-L 1208
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
Y ++ + KNF+L+GD+ KS +F+S +E + ++KD + +FL+ ++ +
Sbjct: 1209 YATTVKVFKNFMLIGDLCKSFWFVSLQEDPYKFTTISKDLQHVSVVTADFLVHDGQVTFI 1268
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
SD ++++ + P +S G++L+ + E+H G+ T + T+ + AP +
Sbjct: 1269 SSDRNGDMRMLDFDPTDPDSLNGERLMLKTEYHAGSAATVSKVIARRKTAEEE--FAPQT 1326
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG 1376
+++ T DG++ + + + F+RLQ + +LV + HVAGLNPR+FR N
Sbjct: 1327 Q------IIYATADGALTTVVSVKDARFKRLQLVSDQLVRNAQHVAGLNPRAFRTVR-ND 1379
Query: 1377 KAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
RP I+D +LL+ + + P+ Q E+ Q GT + S+L L
Sbjct: 1380 LLPRPLSKGILDGQLLNQFALQPIGRQKEMMRQIGTDAVTVASDLQALG 1428
Score = 111 bits (278), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 160/726 (22%), Positives = 301/726 (41%), Gaps = 107/726 (14%)
Query: 45 ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
+ P + IG NLVV A V+ ++ +R + E +K ++ E ++ V M+ +
Sbjct: 39 DTPDVKVIG---NLVVAGAEVLRVFEIREESVPIIEKAKLEEDVAEGEKDVQMEEVGDGF 95
Query: 97 --------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKI 142
+ L L+ + L+G V LA ++ D +I++F+DAK+
Sbjct: 96 FDDGHAERAPLKYQTTRRLHLLTQHELNGTVTGLAA-TRTLESTIDGLDRLIVSFKDAKM 154
Query: 143 SVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
++LE+ S + S+H +E ++ +S+ PL++ DP R + + +
Sbjct: 155 ALLEW--SRGDIATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLWRLAVLTLPEDSLA 209
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+L Q S L D G A S V++L D+ +K+++D +F+ G+ P +
Sbjct: 210 VLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFLPGFHSPTI 265
Query: 261 VILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
+L TW+GR+ + K C+ I +S+ +PL+ S LP D+ L+A PS +
Sbjct: 266 ALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSEL 324
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQNDV 376
GG+++V + I + Q A A N S +S + +S S + L+ + ++
Sbjct: 325 GGIVIVTSTGIVHVDQGGRVAAACVNAWWSRITSLKCSTASVSQKLTLEGSRCVFVTPHD 384
Query: 377 ALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
LL + G + + +GR V++ LD P SD+T G+ F+GS GDS
Sbjct: 385 MLLVLQNGAVHQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSW 442
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
L + + K+E +++ D + L +DA D E+ +G A+
Sbjct: 443 LAKVNVVRQVVERAEKKKDEM-EVDWD----EDLYGDINDAALDEKAQEQ---FGPAA-- 492
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSNYELV--- 542
+ + D L +G + D +G+ + + +G S+ S + +
Sbjct: 493 -------ITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTFNVFRRG 545
Query: 543 ----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMV 589
EL G+W + G + + A +++S E R
Sbjct: 546 IPITKRRRFNELLNADGVWFLPIDRQTGQ-----KFKDIPEAERATMLLSSEGNATRVFA 600
Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
L + ++ + G+T++A F R ++ V +LD + G
Sbjct: 601 LSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN--------GKII 647
Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
+ +++ SI+DP+V++ +D S+ VGD TV + P E +
Sbjct: 648 QTVCPRGDGPKIVNASISDPFVIIRRADDSVTFFVGDTVARTVG-EAPIVSEGESPVCQA 706
Query: 710 CTLYHD 715
++ D
Sbjct: 707 VEIFTD 712
>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
NRRL Y-27907]
Length = 1348
Score = 148 bits (374), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 133/591 (22%), Positives = 256/591 (43%), Gaps = 66/591 (11%)
Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
H +L + G ++ Y+ Y F+G N + ++LR +
Sbjct: 783 HKEEYLTILTIGGEVIMYKLY-FDG-------------------ENYIFKKEKDLRITGA 822
Query: 896 PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
P +AY P G +R + F N++G+ F++G P M + Q
Sbjct: 823 PENAY-----PLGTTIERRLVYFPNLNGYTSIFVTGIIPYLIMKPMHSIPRIFQFSKIPA 877
Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKN 1014
++ + + +G I++ + +IC+L TY+ WP+++I + + ITY N
Sbjct: 878 LSISAFSDSKIKNGLIFLDNSKNARICELSLDFTYEFNWPMRQIHIGDSIKSITYHETSN 937
Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDR 1072
Y +VS P + G D + +T Y+ ++++ P
Sbjct: 938 TY--VVSTFREIPYD-----------GLDEDGKLIVGTLPDKTPRPVAYKGSIKMISPLN 984
Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTLFNTTT----KENETLLAIGTAYVQGEDVAARGR 1128
W TI + +E A+ V+ + L ++ K + + IG+ + ED+ A G
Sbjct: 985 ----WTVIDTIELDDTEVAMNVQSMMLDVGSSMKKFKNKKEFIVIGSGKYRNEDLVANGS 1040
Query: 1129 VLLFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
+F P + EV+ ++ +GA++++ L G LLIA G K+I+
Sbjct: 1041 FKIFEIVDIVPEPGKPETNHKFKEVFQEDTRGAVTSICGLSGRLLIAQGQKVIVRDVQDD 1100
Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
+ +AF D +YV + N ++LGD KS + + + + ++ +L KD L+
Sbjct: 1101 GVVPVAFLDTA-VYVSESKSLGNLLMLGDPLKSCWLVGFDAEPFRMIMLGKDLHHLNVSC 1159
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
+F+ + ++++D + + Y P +S GQ+L+S++ F + + V+ +L +
Sbjct: 1160 GDFITKDEDIYMLIADNNNILHLIQYDPDDPQSLNGQRLISKSAFEIESTVSCMRKLPKI 1219
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
+S +++ + F ++ T DGS + P+DE ++RR+ LQ++L D H G
Sbjct: 1220 ESSFEKSEIK--FSPIDEFQIIGSTSDGSFFNVFPVDESSYRRMYILQQQLTDKEYHYCG 1277
Query: 1364 LNPRSFR-----QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
LNPR R + N +P I+D L+ Y L + + +A +
Sbjct: 1278 LNPRLNRFGGAIELRDNETNTKP----ILDFGLIKRYAQLNEDRKRNLASK 1324
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 100/500 (20%), Positives = 196/500 (39%), Gaps = 77/500 (15%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
L+V N+++I+ + ++ S +K L+++ ++L+G + L
Sbjct: 29 LIVAKGNLLQIFEPVLIKQQSTPTK--------------PKYKLQIIGQFKLNGLITDLH 74
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR--E 175
L +N D +I++ + AK S+++++ +H + S+H +E H R E
Sbjct: 75 PLRT--VENPHL-DYLIVSTKYAKFSIIKWNHHLHTISTVSLHYYE-----HAIRNSTFE 126
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE----------------DT 219
L+ V+P L + + L + D+ D
Sbjct: 127 KLGISELI-VEPTFNSCSCLRFKNLLCFLPFAVSDEEEEEDDEEDMDLDNKKEKKEKLDI 185
Query: 220 FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G + +SS +I+ + LD ++ V D F+H Y EP + IL + WAG +
Sbjct: 186 NGKPADAVSFYDSSFIIDAQTLDSSIETVVDIQFMHNYREPTIAILSSKSNVWAGNLLKV 245
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSA 336
+++ K ++ NLP++ +++ +PSP+ G L++G N I H +
Sbjct: 246 KDNVSFQVMTLDLVSKSTVSVFKIDNLPYEIDRIIPLPSPLNGCLLLGCNEIFHVDNGGI 305
Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
+A+N++ S S Q+ S S+E D + L+ TG +
Sbjct: 306 IKRIAVNSFTSLVTASTKSYQDQTDLSLSLE-DCCIIPIPGDHRVLMVLTTGQFFYINFE 364
Query: 393 YDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
DG+ ++++ + + ++ + ++ + ++L F + G+S LVQF
Sbjct: 365 LDGKAIKKVHIDTVDQALYSQIKLCYPGEVAVLDHNLLFFANENGNSPLVQF-------- 416
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ----KTF 501
+ D+ D + + +E LY N E Q
Sbjct: 417 -------RYTDV--DQKRITQEAAKEEKKEEKDDEEDEDDLYMDEENEEEQKQIISNSPI 467
Query: 502 SFAVRDSLVNIGPLKDFSYG 521
F D L+N GP+ F+ G
Sbjct: 468 EFIHHDELINNGPISSFTLG 487
>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
Length = 1300
Score = 148 bits (374), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 128/527 (24%), Positives = 234/527 (44%), Gaps = 60/527 (11%)
Query: 918 KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+N+SG G F+SG P + + + + I++F N+ I++ +
Sbjct: 814 ENLSGLTGIFVSGDVPYYIVKTNHSIPRIFKFARIPIMSFGKF----ANNQLIFLDDKKN 869
Query: 978 LKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1037
+IC++PS Y+N WP ++I + T + Y N + ++S P N +D
Sbjct: 870 TRICEIPSEFNYENNWPARQINIGETIKDVAYHETSNTF--VISTYKEIPYN-----CLD 922
Query: 1038 QE----VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1093
+E VG D + S + ++++ P W + +E
Sbjct: 923 EENVPIVGIMEDKPSALSY---------KGSIKLVSP----ISWTVIDEFELDDNEVGTK 969
Query: 1094 VRVVTL-FNTTT---KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNL 1144
V + L ++T K + IGT ++ ED+AA G + P +
Sbjct: 970 VSSMVLDVGSSTRRFKSKREFVVIGTGKLRMEDLAANGSFKVLEIIDVIPEPGHPETNHK 1029
Query: 1145 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV 1204
E Y +E KGA++A++ + G L++ G KII+ + +AF D +YV
Sbjct: 1030 FKEFYKEETKGAVTAVSDVSGRFLVSQGQKIIVRDLQDDGVVPVAFLDCS-VYVSESKSY 1088
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
NF+LLGD KS++ + + ++ +L KD S+D +F++ L ++V D +
Sbjct: 1089 GNFVLLGDTLKSVWLAGFDAEPYRMIMLGKDLKSIDVNCADFIVKDEELYIIVGDNNNIL 1148
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
+ Y P+ S GQ+L+ +A F++ A VT +L+ L D + + GS
Sbjct: 1149 HLLKYDPEDPNSSNGQRLVEKAAFNLNAKVT---QLKQLPNLMDNSTSCIGS-------- 1197
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHR 1380
T++GS + P++E ++RR+ LQ++L D H GLNPR R + +N ++
Sbjct: 1198 ---TIEGSFFTVFPINESSYRRMYILQQQLTDKAYHHCGLNPRLNRFGGLKLTANESNNK 1254
Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
P I+D +++ Y L + + I + S+I ++ + G
Sbjct: 1255 P----ILDYDVIKLYAKLNEDRRRNIGAKVSREGSEIWRDMLEFEAG 1297
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/463 (22%), Positives = 201/463 (43%), Gaps = 45/463 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV Y+L G + S+ + + + D +++A + AKIS++ +D + H +R S+H
Sbjct: 51 LNLVDQYKLFGTITSIKPIR---TIENPKLDYLLVATQLAKISLVRWDHASHSIRTVSLH 107
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+E+ + + L+ V+P+ C V L + ++
Sbjct: 108 YYEN---VIQTSTFDKLNSAELI-VEPKNACLCVRYKNLLTFLPFTRLKTEEDEYADEED 163
Query: 221 GS-GGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
G+ + +SS +IN ++LD + + D F+H Y +P + +L ++ WAG + +K
Sbjct: 164 GAVTNSYDGIYDSSFLINGQNLDSRIGTIVDADFLHNYRQPTVALLSSKDQVWAGNLFFK 223
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
LS+ K+ + +LP+D +L+++PSP+ G L+VGAN IH +
Sbjct: 224 KDNISYIVLSLDLNTKKSTTVLKIDDLPYDIDRLISLPSPLNGSLLVGANQLIHIDNGGI 283
Query: 337 SCALALNNYA--VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVY 393
+ +++N + + +S + S ++ L+ L N+ +L TG+ +LT
Sbjct: 284 TRKISVNPFTDLTTKNSKNYINYSHMNLRLENCSVVPLPNENKVLVILSTGEFYMLTFEI 343
Query: 394 DGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
DG+ ++RL S + N+L F+G++ G+S L+Q+
Sbjct: 344 DGKTIKRLTFEVVETSRYNGINVTFPGQFAALDNNLLFVGNKNGNSPLIQYKY------- 396
Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
G KE+ ++ DA + D + S ++ F +
Sbjct: 397 -EGAKEK--AVKEDAKDEEDNDGDEELYEDDEEKVKSFS------------KEKLDFTLC 441
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
D L+N GP+ F++G N + I+ NY+ V + G
Sbjct: 442 DELINHGPISAFTFGFYSNEKFKSNLIN--PNYQEVSIFSNSG 482
>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
Length = 1349
Score = 148 bits (373), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 137/558 (24%), Positives = 234/558 (41%), Gaps = 109/558 (19%)
Query: 886 RLRNLRFSRTPLDAYTREETPHGAPCQRITIFK---NISGHQGFFLSGSRPCWCMVFRER 942
++ N +++P++A E R+ + NI G+ F+ G+ P + + +
Sbjct: 846 KVSNSHLAKSPVEAAEDEAVQE----NRVIPLRACDNIGGYSTVFVPGASPSFILKSSKS 901
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST---------YDNYW 993
L + + H C GFIY S+G ++ Q P + Y W
Sbjct: 902 TPKVIGLQGLGVNGMSSFHTEGCERGFIYADSKGCARVTQFPDAANVAELGVDDDYHKEW 961
Query: 994 PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
++ P+ P + + K P+ +V ID+
Sbjct: 962 AKEECPM---PPMKEHGSIKLYSPITWNV-------------IDE--------------- 990
Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1113
+ +E+YEV + T+ ++ SE TKE L A+
Sbjct: 991 ----FELEQYEVAM-----------CMKTLLLEVSEE-------------TKERRMLFAV 1022
Query: 1114 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKEL-KGAISALASLQGHL 1167
GTA ++GED+ RGR+L+F P T+ + +E+ +GA+++L
Sbjct: 1023 GTAILRGEDLPVRGRILVFDVVHVIPQPDRPETDRKLKLIAKEEIPRGAVTSLC----EK 1078
Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGDIHKSIYFLSWKEQ 1225
+ G L +W +A D YVV+++ ++N + L+ D + ++F+ + E+
Sbjct: 1079 CMVRG----LRRWHAA---AVALPDLS-TYVVAVHELRNTGYCLMADANMGVWFVGYSEE 1130
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
++ L K L C +FL+ G+ LS+V SDE + I + P+ S +G LL+R
Sbjct: 1131 PYRMTLFGKSGTQLKCLTADFLVAGNDLSIVASDEDGVLHILQFDPEHPRSLQGHLLLNR 1190
Query: 1286 AEFHVGA-HVTKFLRLQMLAT------SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
A F V H L L T S T AA ++T LL + G+I + P
Sbjct: 1191 ASFSVAPNHAWVTLALPRTTTRPYLPQSEPATNAAGSQNRTQ--TLLLASASGAIASLNP 1248
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD-----SIVDCELLS 1393
+ E +RRL SL L +++PH AG+NP++ R +G A P D +IVD LL+
Sbjct: 1249 ITEHAYRRLTSLTTSLANALPHAAGMNPKAHRLPPQDGAARPPAVDVSAGRTIVDGALLA 1308
Query: 1394 HYEMLPLEEQLEIAHQTG 1411
+ L ++ E A + G
Sbjct: 1309 RWNELGARQRAEAAGKGG 1326
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 117/507 (23%), Positives = 199/507 (39%), Gaps = 75/507 (14%)
Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
S V+ L LD + H F F+H Y EP + I+ H T + ++
Sbjct: 197 SFVLALPQLDPEILHPVHFAFLHEYREPTLGIISSSNRRLKMEPQMDHFTFKV--FTVDL 254
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
K I + NLP K++A+ P+GG L++G N IH + +A+N YA +
Sbjct: 255 LQKASTAILTVSNLPQSLKKVVALSKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKM 314
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL---- 403
+S + L+ + D LL T+ G++ ++T DGR V + +
Sbjct: 315 TKFPLADQSELKLRLEHCEVELMSPDNGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVA 374
Query: 404 SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
++ +L + +T + + F G+ GDS ++ + S + ++ K D +
Sbjct: 375 TENGGDILPFRAACLTKVSKNSMFYGTIGGDSQVIGW---SRQHVQTARKKARLLD---E 428
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
+ D D + GE ++ + F V DSL+++ P+ D +Y
Sbjct: 429 SLDYDLDEDELDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMAY 488
Query: 521 G----------------LRINAD-ASATGISKQSNYELV------------ELPGCKGIW 551
G +R D A G + + L+ E P +G W
Sbjct: 489 GKTAFFPGSEEAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFEFPEARGFW 548
Query: 552 TV-----YHKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTE 601
T KS +G + +A +D +Y ++I++ + ET+D+
Sbjct: 549 TTRVQKTIAKSLQGEKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGA 605
Query: 602 SVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
+ G TI AG + R+IQV R DG ++Q L + E+G
Sbjct: 606 GFEALSGTEFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETG 663
Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSI 680
+ V+S SI DPY+LL D SI
Sbjct: 664 A---EPRVISASIVDPYLLLLREDSSI 687
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 30/138 (21%)
Query: 57 NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
NL+V+ ++++I+ V+ + +K S +GET R + D G+ +A
Sbjct: 28 NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSSSKAGETYDRRINDDDGLESAFLGGDGML 87
Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
L LV Y +HG + LA + +SR +++++ A++S+L++D
Sbjct: 88 MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144
Query: 150 SIHGLRITSMHCFESPEW 167
HG+ S+H +E EW
Sbjct: 145 EKHGVEDVSIHFYEKEEW 162
>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
98AG31]
Length = 1427
Score = 147 bits (371), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 173/384 (45%), Gaps = 45/384 (11%)
Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLFS 1133
G W T Q +E ++++V+L + + + + GT + ED+AARG V +F
Sbjct: 1051 GSWDTIDGHEFQQNEWVTSMKLVSLDSKSKRSGRRDFIGAGTTCNRAEDLAARGGVYVFE 1110
Query: 1134 TGRNADNPQNL-----VTEVYSKELKGAISALASLQGHLLIASG-------PKIILHKWT 1181
+P++ + Y + K ++A+ L G+ + G P+ K++
Sbjct: 1111 VIEIVPDPKHPERNRGLRLRYHETTKACVTAVDGLNGYFIHTMGQKVDPGYPRSPTRKYS 1170
Query: 1182 GT---------------------ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
L + F D P Y +L ++KNFI+LGD K I +
Sbjct: 1171 DILADQIIAFYSKLYAKCFEQDERLLAVGFLDIRP-YTTTLKVLKNFIVLGDAVKGITLV 1229
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
+++E+ +L L F L C +FL+ + LS+V SD I+IF Y P ES G
Sbjct: 1230 AFQEEPYKLIELGHTFVDLRCSTIDFLVLENKLSIVTSDLGGTIRIFEYNPTNIESQGGL 1289
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
KLL R EF + L +S + + LF LDGSI + P+
Sbjct: 1290 KLLCRTEFGTAGEMGSSLGFGKRLSSKEEAKS---------IGTLFAGLDGSISSLVPVK 1340
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
E F+RLQ +Q +L+ + H AGLNPR FR N R I+D E++ + L L
Sbjct: 1341 EAVFKRLQIVQTRLIRHLDHFAGLNPRGFRTVR-NDLVSRAMNRGIIDGEIIERFGALKL 1399
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDL 1424
+EQ I G+ R+ IL NLN+L
Sbjct: 1400 DEQDSIGKLAGSDRNTILINLNNL 1423
Score = 104 bits (260), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 193/933 (20%), Positives = 350/933 (37%), Gaps = 171/933 (18%)
Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
V ++LHG V L ++ + D ++++F+DAKI++LE+ L S+H FE
Sbjct: 73 VLEHQLHGIVTGLQPITTIDT-HVDGLDRLLVSFKDAKITLLEWSHQQSDLVPISLHTFE 131
Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
+ F + ++ DPQ RC + + + +L Q + D +T S
Sbjct: 132 KLPQITQGDFPTIFDQ---LETDPQSRCAILKLPQSTIAVLPFFQENN---LDLETLFSN 185
Query: 224 GGFSA---RIES-----SHVINLRD------------------LDMKHVKDFIFVHGYIE 257
SA RI+S S +I+L +K + F F+ G+ +
Sbjct: 186 SNPSANNQRIQSFPYAPSFIIDLNQSQSFKSQTQTHSQTQTQQKSIKSIISFKFLPGFSQ 245
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + IL+ + TWAGR+ +C + +++ + +I+ NLP+ A+ ++A P
Sbjct: 246 PTLAILYTYQHTWAGRLENTTDSCSLIFITLDLSSNHFTIIFQIDNLPYHAHSIMACPKE 305
Query: 318 IGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP---------------RSSFSV 362
+GGVLV+ A+ I + QS+ N L + ++P V
Sbjct: 306 VGGVLVICADMILHIDQSSKLIGIATNGWSKLSTHLDVPTQQMVKIVTEDGQDQEERLKV 365
Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN-PSVLTSDITTIGNS 421
L+ + ++ D AL+ G + L + DGR + +L L K SV+ S I +
Sbjct: 366 RLENSKLVFVTIDRALMFLTDGQIFRLCLYQDGRTLIKLCLEKFPVVSVIPSVAVKISDH 425
Query: 422 LFFLGSRLGDSLLVQFTC------------------------GSGTSMLSSGLKEEFGDI 457
F+GS LGDS+++ G+ + + E +G
Sbjct: 426 SVFVGSMLGDSIVMGIEFEGEKEVEVVEEVEVEVEAEVVHQNGNEMEIDQAEEDEIYGKE 485
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
E D TK D + ++ + N + ++ S + DS+ GP++D
Sbjct: 486 EPDDKKTK-----DQDGIDSIIK----------ATNKKIHREIRSLRLHDSISGHGPIRD 530
Query: 518 FSYGLRINADASATGISKQSNYE-LVELPGCKGI-----WTVYHKS---SRGHNADSSRM 568
F+ +SK +E +E+ GC G T+++K + DS+
Sbjct: 531 FT-------------MSKIGGFEDSLEMVGCTGSGETGGLTIFYKEMPLMKRKKLDSTNE 577
Query: 569 A---------AYDDEYHA----YLIISLEARTMVLETADLLTEVTESVDY----FVQGRT 611
+ A++D + IS+ RT + E + D + T
Sbjct: 578 SMKITNLNSIAFNDPTGSPGCELAWISIHDRTKIFSMIKNPEEGNRTSDLKFMKTLNAST 637
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
I F + +Q+ ++L + P +E ++ + ++ + Y+
Sbjct: 638 IYVAMFFDQTCFLQITSYEIKLLKVVGFGEVQVIRPIETE----NKKNKIIRAKVVQDYI 693
Query: 672 LLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
LL SD + L G + T+ +Q P KPV+ +L+ P + T+
Sbjct: 694 LLETSDHRVMLYKGQVDSLTIDRIQLPQL----SKPVTYASLFSAHLP-LYDHDDQTN-- 746
Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF-----VSGRT 785
G+G D P + V G L I +P VFTV +
Sbjct: 747 ---GIGLDNDEDAEKP------WLFVTDLGGVLHILSLPELEIVFTVKGIENLPDLLDED 797
Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
+ + A++ + + EE KEN + A +RP L+ L
Sbjct: 798 EDEEQQQQPAIEYEHEDGDVKMEEDEKVEPKENSSIQMIYGFVT---GAKVARPHLYVEL 854
Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF-SRTPLDAYTREE 904
+G + YQ + K DP ++ ++ +++ +F S P+ R
Sbjct: 855 NNGALAVYQISI----AYDRKPGDPSTSKPRRQALSIRLNKVLGYQFESSEPISNLDR-- 908
Query: 905 TPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
++ + K + G LSG P W +
Sbjct: 909 --------KVKVVKKNATFSGIHLSGLEPIWIV 933
>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
Length = 1344
Score = 145 bits (365), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 133/607 (21%), Positives = 264/607 (43%), Gaps = 69/607 (11%)
Query: 824 VVELAMQRWSAHHSRPFLFAILT-DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
+ + HS+ ILT G ++ Y+ + F+G N
Sbjct: 774 IKNIVFNELGDEHSKDEYLTILTIGGEVIIYKLF-FDG-------------------DNF 813
Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVFRE 941
+ ++L+ + P +AY P G +R ++ N++G+ F++G P +
Sbjct: 814 KFIKEKDLKITGAPDNAY-----PLGTTLERRLVYVPNVNGYSSIFVTGIIPYFITKTVH 868
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
+ + V+F+ + N +GFIY+ + ++C++P Y+N WP++KI +
Sbjct: 869 SVPRIFRFTKLPAVSFSSYSDSNIKNGFIYLDNSKNARMCEIPLDFNYENNWPIKKIQMP 928
Query: 1002 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1061
T I Y N + V+ ++ +D+E G I +D + E
Sbjct: 929 ETVKAIAYHELSNTF-------VVSSYEEIPYDCLDEE-GKPI-----VGIDKSKP-PAE 974
Query: 1062 EYE--VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL----FNTTTKENETLLAIGT 1115
Y+ +R++ P W TI + +E + V + L K + L+ +G+
Sbjct: 975 SYKGYLRLISPYN----WSVIDTIVLADNEIGMNVLSMVLDVGSSTKKFKSKKELIVLGS 1030
Query: 1116 AYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
+ ED+++ G +F G+ N + EV+ ++ +GA++++ + G LL
Sbjct: 1031 GKYRIEDLSSNGSFKIFEIIDIIPEPGKPETNHK--FKEVHIEDTRGAVTSICEVSGRLL 1088
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
+ G KII+ + +AF D +YV N ILLGD KS++ + + +
Sbjct: 1089 VTQGQKIIIRDLQDDGVVPVAFLDTA-VYVSEAKSFGNLILLGDSLKSVWLAGFDAEPFR 1147
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
+ LL+KD +LD +F++ + ++ +D + + + P+ S GQ+L+ + F
Sbjct: 1148 MILLSKDIQTLDVSCADFIVKDEEIFILFADNNNVLHVVKFDPEDPLSSNGQRLVHKTSF 1207
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1348
++ + T F + + + + + T F + T+DGS + P++E T+RR+
Sbjct: 1208 NINSAATCF---RTIPKNEENYPSL-----TTSFQSIGSTIDGSFFTVFPINESTYRRMY 1259
Query: 1349 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1408
LQ++L D H+ GLNPR R N +++ +++ + L + + A
Sbjct: 1260 ILQQQLTDKEFHICGLNPRLNRFGGLNETNSDANSKPMLEYDVIKKFVNLNSDRKKNFAS 1319
Query: 1409 QTGTTRS 1415
+ G+ S
Sbjct: 1320 KIGSKNS 1326
Score = 93.2 bits (230), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 105/502 (20%), Positives = 196/502 (39%), Gaps = 83/502 (16%)
Query: 58 LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL- 116
LVV + +++++ + + SKE K L+LV ++LHG + L
Sbjct: 29 LVVGKSTLLQVFDIVQSNKKSKEYK------------------LKLVEQFKLHGLITDLK 70
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A+ + D D ++++ + AK+S++++D + + S+H +E+ E
Sbjct: 71 AVRTVENPD----LDYLLVSTKSAKMSLVKWDHHENSISTVSLHYYENSIQ---SSTYEK 123
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMII-------------LKASQGGSGLVGDEDTFGSG 223
L+ ++P C + L + + G SG G D +
Sbjct: 124 LTTTELI-MEPNNTCACLRFKNLLTFLPFEMPDEDDEEDGYENVDGASGSRGKHDNKATQ 182
Query: 224 GGFS-ARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
+ A SS VI+ ++LD + +V D F++ Y EP + I+ + TW G +
Sbjct: 183 QDENQALFYSSFVIDAQNLDSRIGNVIDMKFLYNYKEPTLAIISSKNHTWTGLLPLTKDN 242
Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSASCA 339
LS+ K + NLP D ++ +P P+ G L++G N IH +
Sbjct: 243 ISFIVLSLDLVTKTSTTVLKIDNLPFDIDTIVPLPKPLNGTLLIGCNEIIHVDHGGITRR 302
Query: 340 LALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGR 396
LA+N + S+ SS + R S +++L+ + ND L K GD + DG+
Sbjct: 303 LAVNQFTSSITSSIKNYRDQSELNLKLENCCVKPIPNDHRVFLILKNGDFYYINFAIDGK 362
Query: 397 VVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
++ L K N D+ + N+L F+ ++ G+S L++
Sbjct: 363 TIKNFYLEKVNSINQNEIGISYPEDVVHLDNNLMFICNKNGNSPLIELKF---------- 412
Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES----------AQK 499
+++ + + +QD NG ++
Sbjct: 413 ---------SESKDNQNAEQQKDTEMQDTENGTTDKNDNDDDDDIYEDDEDNEKVLIKNS 463
Query: 500 TFSFAVRDSLVNIGPLKDFSYG 521
F D L+N GP+ F++G
Sbjct: 464 VIEFTKHDELINNGPVSSFTFG 485
>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
Length = 1375
Score = 145 bits (365), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 133/521 (25%), Positives = 226/521 (43%), Gaps = 44/521 (8%)
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
P +S L N+ +L R D E + NI+G+
Sbjct: 836 PSRSSSDLWTHNLRWVKLSQQHVPRYMEDGAQEEAADEPGFESTLLALDNINGYSTVIQR 895
Query: 930 GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
G P + + L + + T H +C GF Y+ S L+I QLP + Y
Sbjct: 896 GRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHY 955
Query: 990 DNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQID 1045
+ W +++P+ A H + Y LY + P L P + L +E +
Sbjct: 956 GHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFHYELPKEETSFKPK 1014
Query: 1046 -NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT 1103
H + V +T+TV + +L+P E L ++ + L + T
Sbjct: 1015 VEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVILCIKTLNLEVSET 1055
Query: 1104 TKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAIS 1158
T + + ++A+GTA V GED+A +G + +F P + T + E+KG +S
Sbjct: 1056 THQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPDHPETNKRLKLIVKDEVKGTVS 1115
Query: 1159 ALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKN--FILLGD 1212
A++ L QG L++A G K ++ K GT L +AF D YV +L + N +L+GD
Sbjct: 1116 AISDLGTQGFLIMAQGQKSMVRGLKEDGTLLP-VAFMDMQ-CYVTTLKTLPNTGMLLMGD 1173
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+K +F + E+ ++ L + L+C +FL L ++V+D N+Q+ + P
Sbjct: 1174 AYKGAWFTGYTEEPYKMMLFGRSKHHLECITADFLPFEEQLHIIVADADMNLQVLQFDPD 1233
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGAAPGSDKTNRFALLFGTL 1329
+S G +LL ++ FH G + LQ + T+S+ T + S ++ +L +
Sbjct: 1234 HPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSSLPLHQ--ILCTSQ 1291
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
G++ I PL E ++RRL L L + GLN ++FR
Sbjct: 1292 SGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFR 1332
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)
Query: 57 NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NL+V ++++++ V V G E+ N+ E L + A L LV
Sbjct: 28 NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
W + +F + DP RC + + IL
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+A+Q G E G+ G + + SS V+ L +LD + H F+H Y EP
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
++ + T A ++ + + ++ K + S LP+D +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316
Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
L+VG+N IH + +A+N A + S ++ ++ L+ L D
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
++ G + +LT DGR V + + + ++L + T +G F+GS G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+S+L+ +T ++ +LRR S+ Q + E++S
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472
Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
N+T + K T++F V D L +I P++D
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532
Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
+ G + ++ A G N EL ELP G+W V+ K G
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592
Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
+A+++ A+ D +Y YL++S T+V E TE D+ +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
L +V+QV R D + + P E N +++ S ADPY+L+
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706
Query: 676 SDGSIRL 682
D S+++
Sbjct: 707 EDSSVKI 713
>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
Length = 1338
Score = 144 bits (362), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 125/474 (26%), Positives = 214/474 (45%), Gaps = 44/474 (9%)
Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
NI+G+ G P + + L + + T H +C GF Y+ S
Sbjct: 846 LDNINGYSTVIQRGRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTD 905
Query: 977 ILKICQLPSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVL 1032
L+I QLP + Y + W +++P+ A H + Y LY + P L P +
Sbjct: 906 TLRISQLPPSTHYGHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFH 964
Query: 1033 SLLIDQEVGHQID-NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1091
L +E + H + V +T+TV + +L+P E
Sbjct: 965 YELPKEETSFKPKVEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVI 1005
Query: 1092 LTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE--- 1147
L ++ + L + TT + + ++A+GTA V GED+A +G + +F P + T
Sbjct: 1006 LCIKTLNLEVSETTHQRKDVIAVGTAIVLGEDLATKGNIRIFEVITVVPEPDHPETNKRL 1065
Query: 1148 --VYSKELKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSL 1201
+ E+KG +SA++ L QG L++A G K ++ K GT L +AF D YV +L
Sbjct: 1066 KLIVKDEVKGTVSAISDLGTQGFLIMAQGQKSMVRGLKEDGTLLP-VAFMDMQ-CYVTTL 1123
Query: 1202 NIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
+ N +L+GD +K +F + E+ ++ L + L+C +FL L ++V+D
Sbjct: 1124 KTLPNTGMLLMGDAYKGAWFTGYTEEPYKMMLFGRSKHHLECITADFLPFEEQLHIIVAD 1183
Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ---MLATSSDRTGAAPGS 1316
N+Q+ + P +S G +LL ++ FH G + LQ + T+S+ T + S
Sbjct: 1184 ADMNLQVLQFDPDHPKSMGGTRLLQKSTFHTGHFPSTMHLLQSRLHMPTASEFTTSTTSS 1243
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
++ +L + G++ I PL E ++RRL L L + GLN ++FR
Sbjct: 1244 LPLHQ--ILCTSQSGTLALITPLSESSYRRLSGLATHLQQFLDSPCGLNGKAFR 1295
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)
Query: 57 NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
NL+V ++++++ V V G E+ N+ E L + A L LV
Sbjct: 28 NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ L G V SLA + + R +++++AF DAK+S++E+D + L S+H +E+P+
Sbjct: 88 FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145
Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
W + +F + DP RC + + IL
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
+A+Q G E G+ G + + SS V+ L +LD + H F+H Y EP
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
++ + T A ++ + + ++ K + S LP+D +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316
Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
L+VG+N IH + +A+N A + S ++ ++ L+ L D
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
++ G + +LT DGR V + + + ++L + T +G F+GS G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+S+L+ +T ++ +LRR S+ Q + E++S
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472
Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
N+T + K T++F V D L +I P++D
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532
Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
+ G + ++ A G N EL ELP G+W V+ K G
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592
Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
+A+++ A+ D +Y YL++S T+V E TE D+ +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650
Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
L +V+QV R D + + P E N +++ S ADPY+L+
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706
Query: 676 SDGSIRL 682
D S+++
Sbjct: 707 EDSSVKI 713
>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
Length = 1620
Score = 142 bits (359), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 171/343 (49%), Gaps = 29/343 (8%)
Query: 1094 VRVVTLFNTTTKENET----LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVY 1149
+ VV N T +E+ +A+GT+ +GED+A RG +F P + +
Sbjct: 1291 INVVKSVNLETLSSESGFKDYIAVGTSTFRGEDLAVRGATYIFEVIEVVSYPDDPLPPYR 1350
Query: 1150 SK-----ELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNI 1203
K E K ++A+ L G+L+ + G K+ + + E L G+AF DA + V SL
Sbjct: 1351 LKLLCRDEAKAPVNAICGLNGYLVSSQGFKVFVRAFEQDERLVGVAFMDAG-VCVTSLTR 1409
Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE--FLIDGSTLSLVVSDEQ 1261
+KN +L+GD +S+ F++++E +L + T+ FL D S++ +D++
Sbjct: 1410 LKNLLLIGDAKRSVSFVAFQEDPFKLR---------PTYVTDAAFLFDEGDFSILAADDE 1460
Query: 1262 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
+++F + P ++ + G L+ EF+ + T L + + R P +
Sbjct: 1461 GTLRLFEFDPNLTGATHGNPLICETEFNGQSEHTHILAI------AGRGREDPEEMQIPE 1514
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
L+FGT+DG++G I+P+ + F+RLQ L +L+ SV H AGLNPR+FR N RP
Sbjct: 1515 AQLIFGTIDGTLGTISPVPDECFKRLQLLSGQLMRSVQHFAGLNPRAFRTVR-NDLLSRP 1573
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +LL + L + Q I Q GT IL ++ L
Sbjct: 1574 LNKGMLDYDLLHAFRELDIRRQATITKQIGTDTITILRDIRSL 1616
Score = 122 bits (306), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 162/667 (24%), Positives = 267/667 (40%), Gaps = 112/667 (16%)
Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
L LV +R+HG V L + G D D ++++F+DAKI++LE+ D+I+ L S
Sbjct: 136 LHLVREHRMHGFVTGLEKVRTLASGEDGM---DRLLVSFKDAKIALLEWSDAIYDLSTVS 192
Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-- 216
+H +E + E PL++ DP+ RC +L+ + IL Q + D
Sbjct: 193 LHTYERSSQVSTSEASE---HRPLLRADPESRCAALLLPKDALAILPFVQRTGLDLADPA 249
Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
D ++ S+V L D D ++HV DF F+ + P + IL++ W GR+
Sbjct: 250 RDKEREHQPYTP----SYVFPLSDADDTLRHVLDFCFLPSFHTPTLAILYQPAQNWTGRL 305
Query: 275 SWKHHTCMISALSISTTLK----------QHPLIWSAMNLPHDAYKLLAVP--SPIGGVL 322
S ++ +++ K +I LP+DA+ LL S GGV+
Sbjct: 306 SQTKDNTSLAIVTLDLVGKGAAAGGGAGGGGAVISRTHGLPYDAFSLLPAREGSTFGGVV 365
Query: 323 VVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--------LDAAHATWLQ 373
V+ N++ H LA + + S+ P +F+ E L+ + W
Sbjct: 366 VLAGNSVLHVDPAGRIVGLAASGWHAQ-SSALRFPLWAFTAEEGETEERKLEGSRLCWAG 424
Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLD----LSKTN-PSVLT-------SDITTIGNS 421
+L G L V +GR V L L +T+ P+VL + G
Sbjct: 425 EQQLILVGAQGWARELKVGVEGRNVSSLSAGRRLGRTSAPAVLCPVGEQSGRALKPTGRD 484
Query: 422 LFFLGSRLGDSLLVQFTCGSG--TSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQD 477
L +L S G S+L+Q G + +G ++E D+E DA S K +D L D
Sbjct: 485 LVWLASEAGQSVLLQVHKGEPRVEEVKPNGEEKEIEGEDMEIDADSDK------NDDLAD 538
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ L ++ A + V D+L G + D S+ L +G + +
Sbjct: 539 IYGDSGLPAAAASGVTAGPALPWLTLEVLDALQGHGQIADMSFALSFR-----SGPDRPT 593
Query: 538 NYELVELP-GCKGIWTVYHK--------------SSRG---------------HNADSSR 567
+ P G +G WTVY +RG +
Sbjct: 594 PKLVCSTPEGERGAWTVYENGLPIRVKRRVPAVAGTRGIWSLRVRRGDRARRGGRRERGE 653
Query: 568 MAAYDDEYHAYLIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
D E LI+S +A RT+ +++ L ++ + T+AAG F
Sbjct: 654 REWADGEERDNLIVSTDATPSPGISRTITVDSRGELQIISR-----LPALTLAAGVFFSH 708
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
V+QV +LDG ++L N E S ++ + DP+V++ +GS+
Sbjct: 709 TCVMQVTPDSLHLLDGD--GKELQVLKDNE---GNKEASPIIKACVEDPWVVVTRENGSV 763
Query: 681 RLLVGDP 687
L +GDP
Sbjct: 764 ALYLGDP 770
>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
Length = 1363
Score = 139 bits (349), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 135/555 (24%), Positives = 248/555 (44%), Gaps = 70/555 (12%)
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
+L + P +AY+ HG +R I F ++SG ++G P M+ R R H
Sbjct: 837 DLPITGAPFNAYS-----HGTSIERRMIYFPDVSGTTCIMVTGVIPY--MITRSR---HS 886
Query: 948 QL-----CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKA 1002
Q+ IV+F +G IY+ ++ +I +LPS +YD WP++K+ +
Sbjct: 887 QVKVFKFSKIPIVSFVPFSTDKIKNGLIYLDTKKNARIVELPSEFSYDYNWPIRKVSIGE 946
Query: 1003 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRTY 1058
T + + N L+VS P N ID+E VG I + S++ +
Sbjct: 947 TVKSVAFHEGSN--TLVVSTLKEIPYN-----CIDEEGNPIVG--IKPNKPSAISYKGS- 996
Query: 1059 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIG 1114
++++ P W I + +E L V+ + L + TK + + +G
Sbjct: 997 ------IKLISPVN----WSVIDNIELADNEVGLHVKSMPLDVGSETKRFKSKKEFVLVG 1046
Query: 1115 TAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T + ED+A G L G+ N + E ++ +GA++++ + G
Sbjct: 1047 TGKYRLEDLACNGSYKLLEIIDIIPEPGKPETNHK--FKEFTQEDTRGAVTSICEVSGRF 1104
Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
L+A G KII+ +AF D ++V N ++LGD KS++ + +
Sbjct: 1105 LVAQGQKIIVRDIKDNSAVSVAFLDTS-VFVSESKSFGNLVVLGDTLKSVWLAGFDAEPF 1163
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
++ +L KD LD + +FL+ + ++V+D +++ + Y P+ S GQ+LL ++
Sbjct: 1164 RMIMLGKDLQGLDVSSADFLVKDEEIYILVADNNRSLHVLQYNPEDPASSNGQRLLHKSS 1223
Query: 1288 FHVGAHVT---KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
F T + + L+T D A P F + T++G++ + P+ E T+
Sbjct: 1224 FTTNYLTTCTKSVPKHEQLSTWFD-PQAIP-------FQTVGSTVEGAMYVVFPISEPTY 1275
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFR--QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
RR+ +Q++L+D H GLNPR R + S A+ +++DCEL+ + L +
Sbjct: 1276 RRMYIMQQQLIDKEYHHCGLNPRLNRIGRIESVNYANL---RAMLDCELIRRFSKLNEDR 1332
Query: 1403 QLEIAHQTGTTRSQI 1417
+ ++ + T Q+
Sbjct: 1333 KRTLSSKISTKNVQV 1347
Score = 86.7 bits (213), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 110/220 (50%), Gaps = 15/220 (6%)
Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
SS ++ LD K + D F+H Y +P + +L +++ TWAG + + S LS+
Sbjct: 224 SSFILEASALDNKIGDIIDLQFLHHYRQPTIAVLSQQKSTWAGLLPQTKDNVIFSVLSLD 283
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVS 348
+ + NLP+D K++A+PSP+ G L++G N IH + + +A+N Y
Sbjct: 284 MQTRLTTTVLQIENLPYDLEKIIALPSPLNGSLLIGCNELIHVDTGGITRRIAVNQYTED 343
Query: 349 LDSSQE--LPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVYDGRVVQRLDLSK 405
+ +S + ++S ++L+ + ND LL +TG++ + DG+ ++R+ + +
Sbjct: 344 ITASLKNYADQTSLDLKLEDCSILPIPNDNKLLMVLRTGEMYFIVFEVDGKTIKRMSVEE 403
Query: 406 TNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLVQF 437
PS S I ++ N+L FL R +S LV+
Sbjct: 404 I-PSETYSQIKLMDPSSFASLDNNLLFLTGRSSNSHLVEL 442
>gi|33411764|emb|CAD58787.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
Length = 180
Score = 137 bits (346), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 72/184 (39%), Positives = 105/184 (57%), Gaps = 12/184 (6%)
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES+ G +LL RA+FHVGAHV
Sbjct: 1 DAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHV 60
Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
F R + GAA G K N+ F TLDG IG + P+ E T+RRL
Sbjct: 61 NTFWR-------TPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLM 113
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
LQ L +PH AGLNPR+FR H + + + +++D ELL+ Y L E+ E+A +
Sbjct: 114 LQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKK 173
Query: 1410 TGTT 1413
GTT
Sbjct: 174 IGTT 177
>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Ovis aries]
Length = 819
Score = 137 bits (345), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 117/398 (29%), Positives = 179/398 (44%), Gaps = 82/398 (20%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ + +KN T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAK--------------ISVLEFDDSIHGLRITSMHCF 162
A + GA +RD+++L+F+DAK + FD + + TSM
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 140
Query: 163 ESPEWLHL-----------------------------------KRGRESFARGPLVKVD- 186
E P +L L K+ R G +V
Sbjct: 141 E-PGYLFLGSRLGNSLLLKYTEKLQEPPASTTREAADKEEPPSKKKRVDATTGWAGRVRE 199
Query: 187 ---PQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSH---VINLRD 240
PQ + VYG + +Q G+ L T+ F R S +I++R
Sbjct: 200 GELPQDEVDEIEVYGSE------AQSGTQLA----TYS----FEVRWGSEWLPGIIDVRA 245
Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K HP+I
Sbjct: 246 LDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVI 305
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR 357
WS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ + +
Sbjct: 306 WSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQ 365
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
+ LD A A ++ D ++S K G++ +LT++ DG
Sbjct: 366 EGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDG 403
Score = 112 bits (280), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/339 (27%), Positives = 148/339 (43%), Gaps = 31/339 (9%)
Query: 907 HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
+G R+ + K H F+ G P W +V R LR+HP DG I +F HN+NC
Sbjct: 486 YGGRHHRLALHKPPLHH--VFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINC 543
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
GF+Y QG L+I LP+ +YD WPV+KIPL+ T H + Y E +Y + S
Sbjct: 544 PRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTST- 602
Query: 1026 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1085
P +V + G + + + + + E + ++++ P W+ +
Sbjct: 603 -PCTRVPRM-----TGEEKEFETIERDERYVHPQQEAFCIQLISPVS----WEAIPNARI 652
Query: 1086 QSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1145
+ E ++ + + G+ +GE+V RGR+L+ P +
Sbjct: 653 ELEEXXXXX------XXGSRGHVYSVPAGSCLKEGEEVTCRGRILIMDVIEVVPEPGQPL 706
Query: 1146 TE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTG-TELNGIAFYDAPPLYVV 1199
T+ +Y KE KG ++AL GHL+ A G K LN AF V
Sbjct: 707 TKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKXXXXXLPPHAGLNPRAFRMLHVDRRV 766
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
N V+N +L G++ +LS E+G LAK G+
Sbjct: 767 LQNAVRN-VLDGELLNRYLYLSTMERGE----LAKKIGT 800
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 76/160 (47%), Gaps = 22/160 (13%)
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDL-VLLTVVYDG-RVVQRLDLSKTNPSVLTSDI 415
S SV+L A + D LLS K +LT++ DG R V+ K SVLT+ +
Sbjct: 83 SMASVQLAGA-----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSM 137
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL-------- 467
T+ FLGSRLG+SLL+++T + E D E KR+
Sbjct: 138 VTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTTREAADKEEPPSKKKRVDATTGWAG 195
Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR 506
R + QD V+ E+ +YGS A + T+ A T+SF VR
Sbjct: 196 RVREGELPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVR 231
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 40/71 (56%)
Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
QK +P AGLNPR+FR H + + + +++D ELL+ Y L E+ E+A +
Sbjct: 739 QKXXXXXLPPHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKI 798
Query: 1411 GTTRSQILSNL 1421
GTT IL +L
Sbjct: 799 GTTPDIILDDL 809
>gi|68471460|ref|XP_720278.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
gi|46442138|gb|EAL01430.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
Length = 758
Score = 136 bits (343), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 125/562 (22%), Positives = 243/562 (43%), Gaps = 53/562 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 193 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 247
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
+ Q + ++ + + +G I++ +Q +IC+LP Y+ P++ +
Sbjct: 248 VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 307
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI--------DNHNLS- 1050
+ + I Y + VL Q+ +D+E G I D +S
Sbjct: 308 IGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEE-GKPIAGIIKDIKDTPAMSF 359
Query: 1051 --SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1108
S+ L Y E LE + G ++ S + L +L K+
Sbjct: 360 KGSIKLVSPYNWTVIETIELEDNEVGMTLKSMILDVGSESGSTLGSDPNSLIKKYNKKKR 419
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALA 1161
+ IG + ED+AA G ++ G+ N + E++ +E +GAI+++
Sbjct: 420 EYIPIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEETRGAITSIC 477
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
L G L++ G K+I+ +AF D P +YV N ++LGD+ K + +
Sbjct: 478 ELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDLLKGCWLVG 536
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+ + ++ +L KD + +F+I+ + ++V+D + + Y P +S G K
Sbjct: 537 FDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTK 596
Query: 1282 LLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNRFALLFGTLDG 1331
LL++A F + + ++ L ++ S +T A P + +N F ++ T DG
Sbjct: 597 LLTKASFELNSTISCLRSLPLIDIEESVQTDALTNIAVPPPLPPNTTSNYFQVIGSTQDG 656
Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSIV 1387
S + P++E +RR+ LQ++L+D H GLNPR R + +N +P I+
Sbjct: 657 SFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----IL 712
Query: 1388 DCELLSHYEMLPLEEQLEIAHQ 1409
D +L+ + L + + +A++
Sbjct: 713 DYDLIRSFTKLSDDRKRNLANK 734
>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 1349
Score = 135 bits (341), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 122/486 (25%), Positives = 226/486 (46%), Gaps = 46/486 (9%)
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLI 1019
+H C HGF+ + + ++ QLP + Y W +Q++ + I Y AE+ +Y
Sbjct: 875 VHRPGCEHGFLTINADEEVQENQLPEKTWYGTGWSIQQVDIGEDVRHIAYHAEREVY--- 931
Query: 1020 VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1079
V+ + ++ H + D+ V +Y + ++ + Q
Sbjct: 932 ----VVATCRDIDFYFAGEDGRHPEQD------DIELRPQVPQYTIHLV----SAKSHQR 977
Query: 1080 RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS---TG 1135
++ + E ++V++L + T E + L+ + TA +GED+ A+G V+L+
Sbjct: 978 LQSVELGYLETVTALKVMSLEVSENTHEQKDLVVVSTAAQRGEDMPAKGAVILYDIIDVV 1037
Query: 1136 RNADNPQN--LVTEVYSKELKGAISALAS-LQGHLL-IASGPKIILH--KWTGTELNGIA 1189
+ D P++ + ++ ++ +GAI+++A L G L A G K+++ K GT L +A
Sbjct: 1038 PDPDVPESGFQLHQLAREQARGAITSIAGPLPGGFLGTAQGLKLMVRGLKEDGTCLP-VA 1096
Query: 1190 FYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS-LDCFATEF 1246
F DA Y +L ++ + L GD K ++F + E+ +L ++ K S ++ EF
Sbjct: 1097 FLDAQS-YTHTLKVLPGRGMWLAGDAWKGLWFGGFTEEPYKLTVMGKSPKSKMEVMTAEF 1155
Query: 1247 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLA 1304
L L +++ D ++ + Y P+ +S G +LL R+ FH+G VT L + +
Sbjct: 1156 LPFDGALYILIMDADNDLHVLQYDPENPKSVGGMRLLHRSTFHIGHLVTNMLLVPSSLKP 1215
Query: 1305 TSSDRTGAAPGSDKTNRFA---------LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1355
S A G++ N A +L + GS+G I PLDE +RRL +LQ L
Sbjct: 1216 FESQDRDMANGTNGNNEEATRAPPSLHHILATSRSGSVGLITPLDEAAYRRLSALQTHLT 1275
Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
+ H AGLNPR++R + G +VD L++ L ++ ++ + G+
Sbjct: 1276 AILEHAAGLNPRAYRAVEAESFG---GARGVVDGSLVNRIGELGAAKRADVLGRAGSDGW 1332
Query: 1416 QILSNL 1421
+ S+L
Sbjct: 1333 VVRSDL 1338
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 144/680 (21%), Positives = 266/680 (39%), Gaps = 93/680 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV ++++++ G K + N G ++ VL V Y L G V S+
Sbjct: 28 NLVVAKTSLLQVF-------GVKAAGNDGGNEKLVL-----------VGEYSLAGTVTSI 69
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + D ++++L+F+DAK+S++E+D + + S+H +E + G
Sbjct: 70 ARVKT--LDTKSGGEAVLLSFKDAKLSLVEWDPENYRISTISLHFYEGDNVISAPFGPPL 127
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQ---------------GGSGLVGDEDT-- 219
++ VDP RC + Q+ IL Q L + T
Sbjct: 128 ADCDSILTVDPSSRCAALKFGARQLAILPFRQFGDELAGEEEEGEFDADHALATSKRTES 187
Query: 220 --FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+G ++S + L LD + H F+H Y EP IL +
Sbjct: 188 VPHANGDTEHTPYKASFTLALTALDPSVSHAVHLAFLHEYREPTFGILSATVEPSYSLLE 247
Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
+ + L++ + + S LP ++++ +P P+GG L++G N + + QS
Sbjct: 248 ERKDILTYTVLTLDLEQRASTNLISVPKLPSTLWEVVPLPLPVGGALLLGTNELVHVDQS 307
Query: 336 ASC-ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVV 392
A A+N +A +S +++L+ L + L+ T G L +L+
Sbjct: 308 GKANATAVNEFAKLESDFGMADQSHLNLKLEDCRVEVLDSKTGELLIVTNDGSLAILSFQ 367
Query: 393 YDGRVVQRLDLSKTNPSVLTSDITT-------IGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
GR + L++ + ++ I T + S F+GS G S L+ ++ TS
Sbjct: 368 MHGRSISALNVKRATSENGSTTIHTAPSCMARLEGSKIFIGSEDGASSLLGWS--RPTSA 425
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
L+ K + + D E S + T +AQ TFS +
Sbjct: 426 LNR--KRSHAQMLDKEADDEDEEMEEDDDDLYDAAPEPKKRASSETAVTSTAQYTFS--I 481
Query: 506 RDSLVNIGPLKDFSYG--------LRINADASATGISKQS--NYELV-------ELPGCK 548
D L++ GP+ + G L I A A S+ + + ++V +L +
Sbjct: 482 IDELLSTGPIHEVCLGRSGPWKDRLEIAAGAGRKQASRLTLMHRDIVPTVRRKCKLGAAR 541
Query: 549 GIWTVYHKSSRGHNADSSRMA-AYD-DEYHAYLIISL--EARTMVLETADLLTEVTESVD 604
W + K + + +D D+ Y I S + + +A E++D
Sbjct: 542 ATWALRPKQRNAALPEYDNLLFVFDGDDTKVYDIPSQDEDGSSYTERSAPEFESAGETLD 601
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARI-LDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
T+A G + + R ++ A++ LD P E E+ +++
Sbjct: 602 M----ATVADGTIVVQTRRTELRTYNAKLGLD--------QIIPMTDE--ETDEDLSIVH 647
Query: 664 VSIADPYVLLGMSDGSIRLL 683
++++DPYVL+ D S+++L
Sbjct: 648 IAVSDPYVLVIRGDNSVQVL 667
>gi|147772179|emb|CAN73417.1| hypothetical protein VITISV_017053 [Vitis vinifera]
Length = 609
Score = 134 bits (337), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 69/122 (56%), Positives = 80/122 (65%), Gaps = 26/122 (21%)
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
F V DSL+N+GPLK F+Y LRINAD ATGI KQSN+EL
Sbjct: 430 FEVNDSLINVGPLKVFAYALRINADLKATGIVKQSNFELMCCSGHGKNGALCILQQSIRP 489
Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
VEL GC+ IWTVYHK++RGHNADS++M DDEY AYLIIS E+RTMVLET +LL
Sbjct: 490 EMITEVELSGCERIWTVYHKNTRGHNADSTKMVTKDDEYCAYLIISPESRTMVLETVELL 549
Query: 597 TE 598
E
Sbjct: 550 GE 551
>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1423
Score = 133 bits (334), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 125/562 (22%), Positives = 242/562 (43%), Gaps = 53/562 (9%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 858 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 912
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
+ Q + ++ + + +G I++ +Q +IC+LP Y+ P++ +
Sbjct: 913 VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 972
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI--------DNHNLS- 1050
+ + I Y + VL Q+ +D+E G I D +S
Sbjct: 973 IGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEE-GKPIAGIIKDIKDTPAMSF 1024
Query: 1051 --SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1108
S+ L Y E LE + G ++ S + L +L K+
Sbjct: 1025 KGSIKLVSPYNWTVIETIELEDNEVGMTLKSMILDVGSESGSTLGSDPNSLIKKYNKKKR 1084
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALA 1161
+ IG + ED+AA G ++ G+ N + E++ +E +GAI+++
Sbjct: 1085 EYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEETRGAITSIC 1142
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
L G L++ G K+I+ +AF D P +YV N ++LGD K + +
Sbjct: 1143 ELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDPLKGCWLVG 1201
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+ + ++ +L KD + +F+I+ + ++V+D + + Y P +S G K
Sbjct: 1202 FDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTK 1261
Query: 1282 LLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNRFALLFGTLDG 1331
LL++A F + + ++ L ++ S +T A P + +N F ++ T DG
Sbjct: 1262 LLTKASFELNSTISCLRSLPLIDIEESVQTDAFTNIVVPPTLPPNTTSNYFQVIGSTQDG 1321
Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSIV 1387
S + P++E +RR+ LQ++L+D H GLNPR R + +N +P I+
Sbjct: 1322 SFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----IL 1377
Query: 1388 DCELLSHYEMLPLEEQLEIAHQ 1409
D +L+ + L + + +A++
Sbjct: 1378 DYDLIRRFTKLSDDRKRNLANK 1399
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 110/238 (46%), Gaps = 19/238 (7%)
Query: 216 DEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
+ED G+ R+ +SS +I+ LD + V D F+H Y EP + +L ++ WA
Sbjct: 203 EEDKNGTTTNQEPRLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWA 262
Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
G + L++ K ++ NLP++ +++ +PSP+ G L+VG N IH
Sbjct: 263 GNLIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIH 322
Query: 331 YHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGD 385
+ +A+N + S S Q+ +S +++L+ + +D LL +TG+
Sbjct: 323 VDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGE 380
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ DG+ ++R+ + + ++ + ++ F+ + G+S L+Q
Sbjct: 381 FYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 438
>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1380
Score = 133 bits (334), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 120/521 (23%), Positives = 236/521 (45%), Gaps = 67/521 (12%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS---IVAFTVLHNVNCNHG 968
+R+ F N++G+ F++G P + + L P++ S V+ + + +G
Sbjct: 876 RRLVYFPNLNGYTCIFVTGVIP---FIIIKSLHSIPRIFQFSKIPAVSISAFSDSKIKNG 932
Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1028
I + + +IC+L TY+ P++++ L + Y + + ++ S P
Sbjct: 933 LICLDNNKNARICELSLDYTYEFNLPIKRVDLGELVRSLAYHEQSD--TVVASTFKEVPY 990
Query: 1029 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1088
N +D+E G+ I + T+ + ++++ P W + ++ +
Sbjct: 991 N-----CVDEE-GNIIPGVYKEKLPHALTF---KSSIKLISPHN----WTVIDSFDLEDN 1037
Query: 1089 ENALTVRVVTLFN-------TTTKENETLLAIGTAYVQGEDVAARGRVLLFST------- 1134
E +TV+ + L K + IGT ++ ED+AA G ++
Sbjct: 1038 EVGMTVKSMILDRGSGAASLKKFKSKREYIVIGTGKLRMEDLAANGSFKIYEIIDIIPEP 1097
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
G+ N + EV+ ++ +GA++A+ L G L++ G KII+ + +AF D
Sbjct: 1098 GKPETNHK--FKEVFQEDARGAVTAICDLSGRLMVGQGQKIIVRDIEDDGVVPVAFLDTS 1155
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
+Y+ N ++LGD KS++ + ++ + ++ +L KD LD +F+I +
Sbjct: 1156 -VYISEAKSFGNLLILGDPLKSVWLVGFEAEPYRMVMLGKDRQHLDVECADFIIKDEDIF 1214
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
++V+D I + Y P +S G LL++A F + + AT+ R + P
Sbjct: 1215 ILVADNNNCIHLIQYDPDDPKSINGTILLNKASFELNS-----------ATTCLR--SIP 1261
Query: 1315 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR------- 1367
+K + + ++ TLDG++ + P++E T+RR+ LQ+++ D V H GLNPR
Sbjct: 1262 KGEKGD-YQIIGSTLDGALYNVFPVNEFTYRRMYILQQQISDKVYHFCGLNPRLNRFGGS 1320
Query: 1368 -SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+ R +N K I+D L+ + L L+ Q ++A
Sbjct: 1321 VTLRDRETNTKP-------ILDYGLIRRFSKLNLDRQQQLA 1354
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 71/328 (21%), Positives = 139/328 (42%), Gaps = 60/328 (18%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I +LD + + D F+H Y +P + +L R +WAG + + +S+
Sbjct: 223 DSSFIIEAGNLDSSIDTIIDLQFLHNYRDPTIALLSSRSHSWAGSLLKSKDNVHLEVMSL 282
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNY-- 345
K I+ NLP++ +++ + +P+ G L+VG N I H + + +++N++
Sbjct: 283 DLLTKLSTSIFKIENLPYEVDRIVPLSAPLNGCLLVGCNEIMHVDNGGIAKRISVNDFTS 342
Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
S+ S+Q+ +S+ ++L+ + +D L+ T+ G DG+ ++R+
Sbjct: 343 LTTASVKSNQD--QSNLGLKLENCSVVQIPDDHRVLIVTEQGSFYFANFELDGKSIKRVF 400
Query: 403 LSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
+ + ++ +I + +L F+ + GDS LVQ +K
Sbjct: 401 IDVVDKNMYDKIKFTFPGEIAVLSKNLLFMSNLNGDSPLVQ-------------VKYRNS 447
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA----------------QK 499
I D T+R+ + G E + +SN + QK
Sbjct: 448 KILEDTRGTRRVEKGK---------GAEKNKNNVSSNEVDDDDDDDDDLYKEEEEEEQQK 498
Query: 500 TFS-----FAVRDSLVNIGPLKDFSYGL 522
S F ++D L+N P+ F+ GL
Sbjct: 499 VLSKSHIEFILQDRLINNSPISTFTLGL 526
>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
Length = 1212
Score = 132 bits (333), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/287 (33%), Positives = 143/287 (49%), Gaps = 23/287 (8%)
Query: 1120 GEDVAARGRVLLF-------STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
GED +++G + +F S G A + L + ++E++ ++AL L G L+ A G
Sbjct: 902 GEDRSSKGHMYVFDVVECVPSEGMAASDALRL-QLLCTEEMRAPVTALHDLNGFLVAAVG 960
Query: 1173 PKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
K+++ W E L +AF D +Y S+ VKNF+LL D ++S YF++++E A+L L
Sbjct: 961 QKLLIRSWEYCEWLVTVAFLDMG-MYTTSIQRVKNFLLLTDYYQSAYFVAFQEDPARLVL 1019
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1291
L +D+ FLID + LS+V D +++ Y P S GQ+LL+R E+H
Sbjct: 1020 LGRDYIPTSVTCGAFLIDRARLSIVTCDMNGCLRLMDYHPSNPTSLGGQRLLARCEYHAP 1079
Query: 1292 AHVT--KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
V + L LATS G T+ L +G++ + P+ E F LQ
Sbjct: 1080 GEVVRARMLHGPYLATS--------GECLTSEIVL--AKRNGAVDVLVPVTEKIFPTLQL 1129
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
Q +LV V H AGLNPR FR N RP I+D LL E
Sbjct: 1130 FQSQLVRMVRHTAGLNPRGFRAVF-NQHISRPLAKGILDGTLLHTAE 1175
Score = 120 bits (301), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 152/591 (25%), Positives = 264/591 (44%), Gaps = 75/591 (12%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+RL G V + + Q A RD ++++F DAK++++E+DD L S+H FE
Sbjct: 22 HRLFGQVTGIQSV-QTLASQVDGRDRLLVSFRDAKLALMEWDDVYGDLNSISIHTFERAP 80
Query: 167 WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF 226
L + SF P + VDP RC +L+ + IL Q S L G +D +
Sbjct: 81 QL-VDGLPPSFV--PRLLVDPASRCAALLLPQDALAILPFVQEASEL-GADDPRDAALLD 136
Query: 227 SARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
A S +++ D +++V+D +F+ G+ +P++ +L+E ELTW G +S T +
Sbjct: 137 QAPYAPSFILSFSEDVDASIRNVRDCVFLPGFQKPMLAVLYEPELTWTGSLSRARLTTRV 196
Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALAL 342
+++ T+ ++P+ ++ LP+D L+A P +GGVLVV + + + Q+A L++
Sbjct: 197 CFITLDLTVTKYPVTVTSEALPYDTLYLVACPDSLGGVLVVTPSALLHLDQTARLVGLSV 256
Query: 343 NNYAVSLDSSQELPRSSFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
+ + S LP ++ ++ +L ++ T+ + + LL + G ++ +GR V
Sbjct: 257 SRWTDFTSSELMLPNATATLGDCDLQSSVLTFTEANGGLLVLRDGRMLTFQCALEGRTVT 316
Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
L L+ VL + G + F L + L++ + T + + L E +I A
Sbjct: 317 SLSLN----VVLVPERQ--GGASFV--QALPERLILCASFQDDTYLYAMNLLEAPTEIAA 368
Query: 460 D-APSTKRLR-RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
P + L + DA + G+ + S + A V D L +GPL D
Sbjct: 369 STGPDQQSLEPDADVDADALDLYGDSFKPDVATSKQAQPA----GLDVLDVLPTLGPLND 424
Query: 518 FSYGLRINADASA---TGISKQSNYELVE----------LPGCKGIWTVYHKSSRGHNAD 564
+YG+ NA A + Q + ++E + IWTV N
Sbjct: 425 MTYGVVRNAHGKAHPHMVATMQHHLAVIEPRLRCDVVQNIAPAHAIWTV------SINGK 478
Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
+ A+D+E L+ SLE+ + T + +Q RTIA G+ + VI
Sbjct: 479 WLLLTAWDEE---CLVYSLESNS------------THFLSQHLQ-RTIACGS--TQAGVI 520
Query: 625 QVFERGARILD--GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
+V + A +LD G MT +F ++ + G SI D YV L
Sbjct: 521 RVTSKRAEVLDEHGRIMT---TFAECDANASYG-------DASIQDSYVAL 561
>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
[Candida albicans SC5314]
Length = 1420
Score = 132 bits (332), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 120/572 (20%), Positives = 246/572 (43%), Gaps = 73/572 (12%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 855 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 909
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
+ Q + ++ + + +G I++ +Q +IC+LP Y+ P++ +
Sbjct: 910 VHSIPRIFQFSKIAAMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 969
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLH 1055
+ + I Y + VL Q+ +D+E G D + ++
Sbjct: 970 IGESIKSIAYHETSD-------TVVLSTFKQIPYDCLDEEGKPIAGIIKDIKDTPAMSFK 1022
Query: 1056 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV-----------------T 1098
+ ++++ P W TI + +E +T++ + +
Sbjct: 1023 GS-------IKLVSPYN----WTVIETIELGDNEVGMTLKSMILDVGSESGSTLGSDPNS 1071
Query: 1099 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSK 1151
L K+ + IG + ED+AA G ++ G+ N + E++ +
Sbjct: 1072 LIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKE 1129
Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
E +GAI+++ L G L++ G K+I+ +AF D P +YV N ++LG
Sbjct: 1130 ETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILG 1188
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ K + + + + ++ +L KD + +F+I+ + ++V+D + + Y P
Sbjct: 1189 DLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDP 1248
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQML-ATSSDRTGA---------APGSDKTNR 1321
+S G KLL++A F + + ++ L ++ S +T A P + +N
Sbjct: 1249 DDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALTNIAVPPPLPPNTTSNY 1308
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGK 1377
F ++ T DGS + P++E +RR+ LQ++L+D H GLNPR R + +N
Sbjct: 1309 FQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNET 1368
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+P I+D +L+ + L + + +A++
Sbjct: 1369 NTKP----ILDYDLIRSFTKLSDDRKRNLANK 1396
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 104/221 (47%), Gaps = 17/221 (7%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I+ LD + V D F+H Y EP + +L ++ WAG + L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY-- 345
LK ++ NLP++ +++ +PSP+ G L+VG N IH + +A+N +
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336
Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
S S Q+ +S +++L+ + +D LL +TG+ + DG+ ++R+
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394
Query: 403 LSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ + ++ + ++ F+ + G+S L+Q
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 435
>gi|414587800|tpg|DAA38371.1| TPA: hypothetical protein ZEAMMB73_571351 [Zea mays]
Length = 108
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 68/98 (69%), Positives = 77/98 (78%)
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
MVL+T D L EVTE+VDY VQG TIAAGNLFGR RVIQV+ +GAR+LDGS+MTQ+L+F
Sbjct: 1 MVLQTGDDLGEVTETVDYNVQGSTIAAGNLFGRCRVIQVYAKGARVLDGSFMTQELNFSM 60
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
SES SE S SIADPYVLL MSDGSIRLL+G
Sbjct: 61 HTSESSLNSEPLAAASASIADPYVLLKMSDGSIRLLIG 98
>gi|115490949|ref|XP_001210102.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196962|gb|EAU38662.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 908
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 176/731 (24%), Positives = 297/731 (40%), Gaps = 148/731 (20%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
++++LAF +AK+S++E+D HG+ S+H +E + + + G ++ V+P R
Sbjct: 62 EAVLLAFRNAKLSLIEWDPERHGISTISIHYYERDDLTCSPWVPDLSSCGSILDVEPSSR 121
Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGD-------------EDTFGSGGGFSARIESSHVI 236
C V +G++ + I+ Q G LV D ++T ++ SS V+
Sbjct: 122 CA-VFNFGIRNLAIIPFHQPGDDLVMDDYDSDLDERKHVDQETTRESPAYATPYASSFVL 180
Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
L D + H F+H Y EP IL+ + T + + S ++ +
Sbjct: 181 PLTAFDPSILHPISLAFLHEYREPTFGILYSQVATSNALLHERKDVVFYSVFTLDLEQRA 240
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQ 353
+ S LP D + ++A+P P+GG L++G+N +H + A+ +N ++ + +
Sbjct: 241 STTLLSVARLPSDLFHVVALPPPVGGSLLIGSNELVHVDQAGKTNAVGVNEFSRQVSAFS 300
Query: 354 ELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRL---------- 401
+S ++ L+ L ++ +L TG++VL+ DGR V +
Sbjct: 301 MTDQSDLALRLEGCRVERLADNSGDMILILSTGNMVLIKFKLDGRSVSGISVHPVPVHAG 360
Query: 402 -DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
DL K+ S +GN FLGS DSLL+ G S LSSG
Sbjct: 361 GDLMKS----AASSSAFLGNGEVFLGSEDADSLLL------GWSDLSSG----------- 399
Query: 461 APSTKRLR------RSSSDALQDMVNGEEL---SLYGSASNNTESAQKT---------FS 502
TKRLR S D D ++ +++ LY ++ + T ++ ++
Sbjct: 400 ---TKRLRSHKNDANDSGDVSDDNMSDDDVYEDDLYSTSPDATADGRRVSADPSSFGLYN 456
Query: 503 FAVRDSLVNIGPLKDFSYG--------LRINADASATGISKQS---NYELVEL------- 544
F + D L+NI PL+D + G + N A ++ Q N L+ +
Sbjct: 457 FRINDRLLNIAPLRDITLGKPSTFDKDRKDNVSAELELVASQGSDRNGGLIAMRREIDPE 516
Query: 545 -------PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA---- 593
+WT +SS G ++ H Y+I+S + ET
Sbjct: 517 VLASFTIDSANCVWTACVESSGGKDS------------HQYVIVSKQTNIDKEETEIFRV 564
Query: 594 ---DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---P 647
DL V+ + TI G L + RV+QV + R D DL P
Sbjct: 565 DGLDLKPIKAPEVNP-NEEVTIDVGTLAKQSRVVQVLKNEVRCYDA-----DLGLAQIYP 618
Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
E S+ +S S+ DPYV + D ++ LL D S V+ P + ++ K +
Sbjct: 619 VWDE--DTSDEHPAVSASVTDPYVAILRDDSTLLLLHVDDSGDVDEVEMPDNM-AAHKWL 675
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
SSC LY DK TGV + G Q D++ + + L I+
Sbjct: 676 SSC-LYLDK----------------TGVFASNTDTKGS--RQNDMFLFLLGQDCRLFIYR 716
Query: 768 VPNFNCVFTVD 778
+P+ V T+D
Sbjct: 717 LPDLLLVSTID 727
Score = 46.6 bits (109), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 7/76 (9%)
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
R+R P C + AF N +GFI++ S+ L++CQLP + +D WP+++IP+
Sbjct: 820 RMRGAPIQC---LDAF----NSPSGNGFIFLDSENALRMCQLPRETHFDYQWPMRRIPIG 872
Query: 1002 ATPHQITYFAEKNLYP 1017
+ Y A + P
Sbjct: 873 EQIDHLAYSAAEWKLP 888
>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
[Candida dubliniensis CD36]
Length = 1420
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 122/560 (21%), Positives = 242/560 (43%), Gaps = 50/560 (8%)
Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVF 939
N + ++L + P +A+ P+G +R + F N++G F++G P +
Sbjct: 856 NYFFKKEKDLTITGAPDNAF-----PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKT 910
Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIP 999
+ Q ++++ + + +G I++ +Q +IC+LP Y+ P++ +
Sbjct: 911 IHSIPRIFQFSKIAVMSISAFSDSKIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVD 970
Query: 1000 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE-------VGHQIDNHNLS-- 1050
+ + I Y + VL Q+ +D+E + + D +S
Sbjct: 971 IGESIKSIAYHETSD-------TVVLSTFKQIPYECLDEEGKPIAGIIKNIKDTPAISFK 1023
Query: 1051 -SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET 1109
SV L Y E L + G ++ S++ + +L K+
Sbjct: 1024 GSVKLVSPYNWTVIENIELGDNEVGMTIKSMILDVGSESKSTVGTDPNSLIKKYNKKKRE 1083
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALAS 1162
+ IG + ED+AA G ++ G+ N + E++ ++ +GAI+++
Sbjct: 1084 YIVIGIGKYRMEDLAANGIFKIYEIIDIIPEPGKPETNHK--FKEIFKEDTRGAITSICE 1141
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
L G L++ G K+I+ +AF D P +YV N ++LGD K + + +
Sbjct: 1142 LSGRFLVSQGQKVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLVILGDPLKGCWLVGF 1200
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
+ ++ +L KD + +F+I+ + ++V+D + + Y P +S G KL
Sbjct: 1201 DAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVADNNNVLHLLNYDPDDPQSINGTKL 1260
Query: 1283 LSRAEFHVGAHVTKFLRL------QMLATSSDRTGAA--PGSDKT-NRFALLFGTLDGSI 1333
L++A F + + ++ L + + +D AA P + T N F ++ T DGS
Sbjct: 1261 LTKASFELNSTISCLRSLPLKDIDEKVQNETDAAAAATIPLPNNTQNNFQVIGSTQDGSF 1320
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR----QFHSNGKAHRPGPDSIVDC 1389
+ P++E +RR+ LQ++L+D H GLNPR R + +N +P I+D
Sbjct: 1321 FNVFPINEAAYRRMYILQQQLIDKEFHYCGLNPRLNRIGSIKLQNNETNTKP----ILDY 1376
Query: 1390 ELLSHYEMLPLEEQLEIAHQ 1409
+L+ + L + + A++
Sbjct: 1377 DLIRRFTKLSDDRKRNFANK 1396
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 55/236 (23%), Positives = 110/236 (46%), Gaps = 17/236 (7%)
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
+EDT G+ +SS +I+ LD + V D F+H Y EP + +L ++ WAG
Sbjct: 197 EEDTNGTNKESHLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGN 256
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
+ L++ K ++ NLP++ +++ +PSP+ G L+VG N IH
Sbjct: 257 LIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIVPLPSPLNGTLLVGCNELIHVD 316
Query: 333 SQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
+ +A+N + S+ S Q+ +S +++L+ + +D LL +TG+
Sbjct: 317 NGGVLKRIAVNKFTRLITASIKSFQD--QSDLNLKLENCSIVPIPDDHRVLLILQTGEFY 374
Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
+ DG+ ++R+ + + ++ + ++ F+ + G+S L+Q
Sbjct: 375 FINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAVLDKNMLFIANSNGNSPLIQ 430
>gi|380488833|emb|CCF37111.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
Length = 1062
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 234/1047 (22%), Positives = 402/1047 (38%), Gaps = 174/1047 (16%)
Query: 69 YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
Y R+ ++ ES G V D L LV Y + G V LA + S+ G
Sbjct: 66 YDHRLNDDDGLESSFLGGDGMLVRADRAINTKLVLVAEYPIFGIVTGLAKIKLQYSKSGG 125
Query: 125 DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
+ ++++A A++S++++D H L S+H +E E S GPL
Sbjct: 126 E------ALLIATRVARLSLVQWDPEKHALEDISIHYYEKEEL------EGSPFDGPLNN 173
Query: 183 ----VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES------ 232
+ DP RC + + L Q + D+ G A+ S
Sbjct: 174 YRTHLAADPGSRCAALRFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPSATAATN 233
Query: 233 ------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+V+ L LD + H F+H Y EP I+ + H
Sbjct: 234 GTSNIADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSTQRRSNTLPRKDH 293
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSAS 337
+ + L + + I S NLP D +K++A+P P+GG L+VG N IH
Sbjct: 294 FSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPVGGALLVGTNELIHIDQSGKP 351
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDG 395
+A+N + + +S + L+ + + +N L+ G L ++T DG
Sbjct: 352 NGVAVNPFTKETTNFPLADQSDLDLRLEHCYIELMSAENGELLMILSDGRLAIITFKIDG 411
Query: 396 RVVQ----RLDLSKTNPSVLTSDITTIGN---SLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
R V +L ++ ++ ++TI ++FF+G+ DSL++ +T +
Sbjct: 412 RTVSGVGVKLVPTEVGGGIVQCSVSTISRLSRNVFFVGTTGSDSLVLGWTRKQAQNARK- 470
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVR 506
K D D+ D D + GE + + A+ N S +F V
Sbjct: 471 --KTRLVD---DSFEYDLEDEDMEDDDDDDLYGETTTTMIQPGATANGVSKGGDLTFRVH 525
Query: 507 DSLVNIGPLKDFSYGLRI-------------------------NADASATGISKQSNYEL 541
DSL++I P+KD + G + +A A I Q+
Sbjct: 526 DSLLSIAPVKDMTSGKQAFIPDSEEEKNSVGVVADLQLACVVGRGNAGAVAIVNQNIQPK 585
Query: 542 V----ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMV 589
V E P +G WT+ KS +G ++ +A+ D +Y ++I+S +
Sbjct: 586 VIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVASEFDASSKYDKFMIVS-KVDLDG 644
Query: 590 LETADLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQ 641
ET+D+ + G T+ AG + R+IQV + R DG ++Q
Sbjct: 645 YETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQ 704
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
L + E+G+ V+S SI DPY+LL D SI + D + V+
Sbjct: 705 ILPM--LDEETGA---EPRVISASITDPYLLLVRDDSSIMVAQIDNNCELEEVEKQDDTI 759
Query: 702 SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
S K ++ C LY D +TG+ + G P Q + + + +G
Sbjct: 760 LSTKWLAGC-LYTD----------------TTGLFAPMQTDKGTPEGQ-NTFMFLLSAAG 801
Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
AL I+ +PN + V +G T+ V ++ + + GT Q E +
Sbjct: 802 ALYIYALPNLSKPVYV---AAGLTY-VPPFL---------SADYAVRRGTVQ---ETLTE 845
Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
+ V +L + P+L + + Y+ E + T + +++L
Sbjct: 846 LLVADLG----DTTATSPYLIVRHANDDLTIYEPIRLESQDKT------LGLAKTLHFQK 895
Query: 882 VSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
++ N +++P++ E E P P + NI+G+ FL G+ P +
Sbjct: 896 IT-----NPALAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIV-- 945
Query: 940 RERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW-PV 995
+ + P+ L + + H C GFIY S+G ++ QLP+ S + V
Sbjct: 946 -KSAKSSPKVVGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSV 1004
Query: 996 QKIPLKATPHQITYFAEKNLYPLIVSV 1022
+KIP+ I Y Y + S+
Sbjct: 1005 RKIPIGDAVGLIAYHPPMETYAVACSI 1031
>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 1351
Score = 129 bits (324), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 112/497 (22%), Positives = 216/497 (43%), Gaps = 53/497 (10%)
Query: 888 RNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
++L + P +A+ P+G +R + F N++G FL+G P + +
Sbjct: 831 KDLTITGAPENAF-----PYGTSIERRLVYFPNLNGFTCIFLTGVIPYLILKTIHAIPRI 885
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQ 1006
Q V+ + + +G I++ ++ +IC+LP Y+ P++ +P+ +
Sbjct: 886 FQFTKIPAVSISAFSDSKIKNGLIFLDNEQNARICELPLDYNYEFNLPMKHVPIGESIKA 945
Query: 1007 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVR 1066
+ Y + ++VS P N +D+E G I + ++ T + ++
Sbjct: 946 MAYHEASDC--VVVSTFKEIPYN-----CVDEE-GKLI----VGVMEDKPAATSFKGSIK 993
Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL------FNTTTKENETLLAIGTAYVQG 1120
++ P W TI + +E ++++ + L K + +GT +
Sbjct: 994 LISPYN----WSVIDTIELDDNEVGMSLKSMVLDIGSSSLIKKFKNKREYIVVGTGKYRM 1049
Query: 1121 EDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1173
ED+AA G +F G+ N + E + + +KGA++++ L G L++ G
Sbjct: 1050 EDLAANGAFKIFEIIDIIPEPGKPETNHK--FKETFQENIKGAVTSVCELSGRFLVSQGQ 1107
Query: 1174 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1233
K+I+ +AF D P +YV N ++LGD K + + + + ++ +L
Sbjct: 1108 KVIVRDLQDDGTVPVAFLDTP-VYVSESKSFGNLLILGDPLKGCWLIGFDAEPFRMIMLG 1166
Query: 1234 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1293
KD L +F+I + ++V+D + + Y P +S G KLL++A F + +
Sbjct: 1167 KDTQHLSVECADFIIKDDEVYILVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELASP 1226
Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
++ L P D N F ++ DGS + P++E T+RR+ LQ++
Sbjct: 1227 ISCLRTL-------------PIDD--NNFQIIGSCQDGSFFNVFPINESTYRRMYILQQQ 1271
Query: 1354 LVDSVPHVAGLNPRSFR 1370
L + H GLNPR R
Sbjct: 1272 LTEKEYHYCGLNPRLNR 1288
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 29/318 (9%)
Query: 222 SGGGFSAR--IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
+G F R +SS +I+ LD + V D F+H Y EP + +L + WAG +
Sbjct: 198 NGNSFEPRQFYDSSFIIDATTLDSTVGTVIDMQFLHNYREPTIGVLSSKSEVWAGNLLKS 257
Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
L++ K ++ NLP++ +++ +PSP+ GV++VG N IH +
Sbjct: 258 KDNIQFQVLTLDLNSKSTVSVFKIDNLPYEIDRVIPLPSPLNGVILVGCNELIHVDNGGV 317
Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTV 391
+A+N + S+ S Q+ +S +++L+ + + ND LL KTG+ +
Sbjct: 318 MKRIAVNKFTGLTTASIKSFQD--QSDLNLKLEDSTIVPIPNDHRVLLVLKTGEFYYINF 375
Query: 392 VYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
DG+ ++R+ + + + ++ + +L F + G+S LVQ S
Sbjct: 376 ELDGKSIKRVHIDVIDKKLYEKVKLTYPGEVAVLDKNLLFFANSSGNSPLVQVKYRDSLS 435
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
G E D E + ++ E+ +L ++ F
Sbjct: 436 DAKIGAPIEESDEEDETQKADEDDDEDDLYKEEEEEEEQKNL----------SKTHIEFV 485
Query: 505 VRDSLVNIGPLKDFSYGL 522
D L+N GP F+ G+
Sbjct: 486 YHDELINNGPSSSFTLGV 503
>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides brasiliensis Pb18]
Length = 1343
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 182/745 (24%), Positives = 303/745 (40%), Gaps = 109/745 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++Y + GS ++ +T+ + + L LV Y L G V L
Sbjct: 28 NLIVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGTVTDL 80
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ D+ ++I++A +AK+S++E+D H + TS+H +E + +H+ +
Sbjct: 81 GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137
Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GDEDTFGSGGGFSARIESS 233
A P + VDP RC VL +G + + IL Q G LV GD F S +I+++
Sbjct: 138 LAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGD---FDSDHDEERQIDTN 193
Query: 234 HVINLRD--------------------------LDMKHVKDFIFVHGYIEPVMVILHERE 267
H RD M H F++ Y EP IL+ +
Sbjct: 194 HTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQV 253
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
+ + + S ++ + + S LP+D +K++ +P P+GG L+VG+N
Sbjct: 254 AASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSN 313
Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
+H + A+ +N +A S +S + L+ L +N LL G
Sbjct: 314 ELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNG 373
Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ +++ DGR V + L +T PS +G F GS GDS+
Sbjct: 374 VMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEEGDSM 429
Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
L+ ++ S + + E D A+ + DA +D + ++ G S N
Sbjct: 430 LIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGINSRN 487
Query: 494 TESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELVELPG 546
T S T + F + D L N+GP++D + G + D + S + ELV G
Sbjct: 488 TTSVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELVTTQG 547
Query: 547 --------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
G+ +V+ K + S Y YL+
Sbjct: 548 YGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYDHYLL 607
Query: 581 ISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARIL 634
+S + +++V + + E T + ++ + RTI G L G RV+QV + R
Sbjct: 608 LSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGEVRSY 667
Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
D + + L P E SE +V+ S ADPYVL+ D SI LL D S
Sbjct: 668 DSANLHLGLGLAQIYPVWDE--DTSEERSVVHASFADPYVLIIRDDSSILLLQADESGDL 725
Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
++T IES+ S +LY DK
Sbjct: 726 DEIETDGIIESTT--WISGSLYQDK 748
Score = 124 bits (311), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 122/523 (23%), Positives = 221/523 (42%), Gaps = 94/523 (17%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
++ G++ F+ G+ PC+ + + L ++ + + + C GF+YV + ++
Sbjct: 902 DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 961
Query: 979 KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
++C+ P + +D W +KI L + Y + Y L S L D
Sbjct: 962 RMCRFPRNTHFDGSWAARKIGLGEQVDSVEYSSSSETYVLGTS------QKADFKLPEDD 1015
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E+ + N +S +++ V++L P W + ++++E + V+ +
Sbjct: 1016 EIHPEWRNEVISFFP-----QIDKGSVKLLNPRT----WSIIDSYQLRTAERVMCVKCLN 1066
Query: 1099 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR---NADNPQN--LVTEVYSKE 1152
L + T E + ++A+GTA +GED+AARG + +F + D P+ + + +E
Sbjct: 1067 LEASEITHERKEMIAVGTALTRGEDIAARGCIYVFEVIKVVPEVDRPETNRKLKLIAKEE 1126
Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
+KGAI T L+GI + F++
Sbjct: 1127 VKGAI-------------------------TSLSGIGG--------------QGFLIAAQ 1147
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
K I KE G+ LL F + C+ + V E K
Sbjct: 1148 GQKCI-VRGLKEDGS---LLPVAFMDMQCYVS------------VLKELKGTD------- 1184
Query: 1273 MSESWKGQKLLSRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRF-ALLFGT 1328
S KG +LL R+ FH G + +T R +L+ + A D + +L +
Sbjct: 1185 -PGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLSQGPEAEANAMDLDSSGPLHQVLVTS 1243
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
GSI I P+ E+ +RRL +LQ ++++++ H GLNPR+FR S+G R +VD
Sbjct: 1244 ETGSIALITPVSEMAYRRLSALQSQMINTLEHPCGLNPRAFRAVESDGIGGR----GMVD 1299
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
+L+ + L + + EIA + G +I ++L A+G + L
Sbjct: 1300 GDLVQKWLDLGTQRKAEIASRVGADVWEIRADLE--AIGKAGL 1340
>gi|312077399|ref|XP_003141287.1| hypothetical protein LOAG_05705 [Loa loa]
Length = 316
Score = 124 bits (312), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 132/265 (49%), Gaps = 34/265 (12%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LE + RL V+S AI + DS++L F+DAK+S++ + + L+ S+H
Sbjct: 62 LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
CFE LK G P+++VDP RC +LV+G + +L + G+ L
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168
Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
S+ + L +D + +V D +F+ GY EP ++ L+E T GR ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217
Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
T + L +S +K+ L +W NLP D ++LA+P P+GG+L+V N + Y +QS
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275
Query: 337 -SCALALNNYAVSLDSSQELPRSSF 360
C ++LN+ +D + P F
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDF 297
>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Ogataea parapolymorpha DL-1]
Length = 1309
Score = 124 bits (312), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 129/548 (23%), Positives = 227/548 (41%), Gaps = 61/548 (11%)
Query: 837 SRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP 896
S+ +L A+ G +L Y+ + DP+ + L N + P
Sbjct: 750 SKDYLVALTFGGEVLIYETFF-----------DPIERTYKLMKIN---------EMCQFP 789
Query: 897 LDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV 955
+ H +R I N G++ ++G+ + + Q S +
Sbjct: 790 IVGAPDNSYAHATKIERYLISVDNFQGYKAVLVTGASAFVILKEYNSIPRMLQFTKRSSL 849
Query: 956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL-KATPHQITYFAEKN 1014
F + C +G I + +ICQL S TY N P+ K + T ++I Y + N
Sbjct: 850 YFAEYNTDRCPNGVISIDETKACRICQLDSSYTYSNRLPIAKYKIGDKTINKIRYHSLSN 909
Query: 1015 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQI----DNHNLSSVDLHRTYTVEEYEVRILEP 1070
Y I+S P N V E G + D+ L S L T V ++ P
Sbjct: 910 TY--IISTLEEGPYNPV------DEDGEPLPGLRDDRKLKSTSLKGT-------VHLVSP 954
Query: 1071 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL 1130
W TI ++ +E ++ V+ L + T +T++ IGTA + ED+A G
Sbjct: 955 ----ANWTIIDTIELEDNEYVTSIEVIELKVSETIATKTVVLIGTARCRNEDLATHGSWK 1010
Query: 1131 LFSTGRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE- 1184
++ P +N + + S+ +G + ++ ++ G I G ++++ +
Sbjct: 1011 IYEVIDIVPEPGRPEAKNRLKMITSETARGPVLSICNVSGRFAIVQGQRMLVRTLQKDDN 1070
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
+ +AF D +Y + KN +L+GD +S+ + ++ KD +++ A
Sbjct: 1071 VAPVAFTDTS-IYSKEVKTFKNLVLIGDSFQSVSLYGFDAAPYRMLHFGKDEQNVELRAA 1129
Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1304
+FL+ L L+V+DE + Y P S KG KLL R+ A TK M++
Sbjct: 1130 DFLVHDGNLHLLVADEDSVFHLLQYDPYDGNSMKGLKLLRRSLLRSNALTTK-----MIS 1184
Query: 1305 TSSDRTGAAPGSDKTNR----FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
+ DR+ + S + + ++ +DGS + P++E +RRL S+Q L D H
Sbjct: 1185 VARDRSLFSMVSTLNHEDDLGYEIIGSNIDGSFYKVMPVNEYQYRRLYSIQNYLYDKELH 1244
Query: 1361 VAGLNPRS 1368
GLNP+S
Sbjct: 1245 WLGLNPKS 1252
Score = 105 bits (261), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/464 (23%), Positives = 212/464 (45%), Gaps = 46/464 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+L+ YRL+G + ++ + ++ + D +I++ + AK+SV+++D +H + S+H
Sbjct: 51 LQLIGEYRLNGQIINI---DKFRSNENESLDYLIVSTKLAKLSVIKWDSQLHAISTVSLH 107
Query: 161 CFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII--LKASQGGSGLVGDE 217
+++ + L +++ ++ + + DP C + + L + K L D
Sbjct: 108 YYDTALDALTVEKLEKTSVQH---RTDPNSLCTCLRLNELFTFLPFYKEYLDEEELKDDA 164
Query: 218 DTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERE-LTWAGRV 274
+ S ++N L D+K++ D+ F+H Y +P M IL+ E +TWAG +
Sbjct: 165 EEAKDIKKRKKLFTESFILNASSLYPDIKNIVDYQFLHSYRDPTMAILYAPETMTWAGHL 224
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
T + LS+ K+ I NLP+D + + SP G L+VG+N IH +S
Sbjct: 225 PKAKDTLKVIVLSLDLENKKASAIMELTNLPYDVDYIYPLESPTNGFLLVGSNEIIHVNS 284
Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
+ + N Y + + + +SS + L+ + ++ D L+ T++G+ L
Sbjct: 285 LGSVRGIYTNEYFTDISNLKLKDQSSLGLMLENSRVGLVKEDQVLIITESGEFYQLNFEK 344
Query: 394 DG-----RVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
G +Q+++ S N ++ + + ++ LFF+ + GDS L++ + SG
Sbjct: 345 IGGNSTITGLQKVETSNYKGIIVNHPIMITSVPSL--DLFFVCCQGGDSSLIRISSKSG- 401
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
+L KE+ GD + L + E+ + S+ N++ F
Sbjct: 402 -VLPQETKEQNGDTKETKDDDDWL-----------YDEEDQKSHKSSLVNSQ-------F 442
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
D+++N GPL DF+ G R++ + G+ + E V + C
Sbjct: 443 KKMDNILNCGPLVDFTLG-RVSIEQKIMGLPNPNYNEDVLVAAC 485
>gi|343962533|dbj|BAK62854.1| cleavage and polyadenylation specificity factor 160 kDa subunit [Pan
troglodytes]
Length = 269
Score = 123 bits (308), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 111/198 (56%), Gaps = 7/198 (3%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1139
A I +Q E+ ++ V+L + T + +A GT +QGE+V RGR+L+
Sbjct: 41 ARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVP 100
Query: 1140 NPQNLVTE-----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
P +T+ +Y KE KG ++AL GHL+ A G KI L +EL G+AF D
Sbjct: 101 EPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ 160
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
LY+ + VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L
Sbjct: 161 -LYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLG 219
Query: 1255 LVVSDEQKNIQIFYYAPK 1272
+VSD +N+ ++ Y P+
Sbjct: 220 FLVSDRDRNLMVYMYLPE 237
>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
90-125]
gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
Length = 1327
Score = 122 bits (307), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/512 (21%), Positives = 228/512 (44%), Gaps = 53/512 (10%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
+R+ F N++G+ F++G P + + Q V+ + + +G I+
Sbjct: 827 RRLVYFPNLNGYTSIFVAGVIPFLIIKSCHSIPRIFQFSKIPAVSISAFSDSKIKNGLIF 886
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV 1031
+ + +IC+L Y+ P++++ + + + Y + + +++S P N
Sbjct: 887 LDNNQNARICELSLDYNYEFNLPIRRVHIGESIRSVAYHEQSD--TVVISTFKEIPYN-- 942
Query: 1032 LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1091
+D+E G I + T + ++++ P W+ TI +Q +E
Sbjct: 943 ---CVDEE-GKPI----AGVLKDKPPATSFKGSIKLVSPFN----WKVIDTIELQDNEVG 990
Query: 1092 LTVRVVTL-FNTTTKENET---LLAIGTAYVQGEDVAARGRVLLFST-------GRNADN 1140
+ ++ + L ++ K+ +T + +GT ++ ED+AA G ++ G+ N
Sbjct: 991 MAIKSMVLDVGSSMKKFKTKREYIVVGTGKLRMEDLAANGSFKIYDIIDIIPEPGKPETN 1050
Query: 1141 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1200
+ E++ ++ +GA++++ L G L+ G K+I+ + +AF D P +YV
Sbjct: 1051 HK--FKEIFQEDTRGAVTSVCDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTP-VYVSE 1107
Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
N LLGD KSI+ + ++ ++ +L KD L +F++ + ++V+D
Sbjct: 1108 AKSFGNLFLLGDPLKSIWLVGFEADPFRMVMLGKDRQHLRVECADFIVKDEEIFILVADV 1167
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
++ + + P +S G L+++A F + T LR + P + T
Sbjct: 1168 NNSLHLIQFDPDDPKSINGTILINKASFETNSQTT-CLR------------SVPKGE-TG 1213
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSN 1375
+ + T+DG+ + P++E T+RR+ +Q+++ D H GLNPR R Q N
Sbjct: 1214 DYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEYHYCGLNPRLNRFGGAVQIRDN 1273
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+P I+D L+ + L L+ Q I
Sbjct: 1274 DTNAKP----ILDYNLIKEFAKLNLDRQKNIT 1301
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 83/375 (22%), Positives = 157/375 (41%), Gaps = 57/375 (15%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+LV ++L G V L L + D ++++ + AK S++ ++ +H + S+H
Sbjct: 57 LKLVEQFKLQGTVSGLKALRTSECPH---LDYVVVSTKYAKFSIIRWNHQLHNISTVSLH 113
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
+E+ E A L V+P L Y + L
Sbjct: 114 YYEN---CIQHSTFEKLAISDLT-VEPTYSSVSCLRYKNLLCFLPFEGVHEEDDEDDTDD 169
Query: 205 ----KASQGGS----GLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHG 254
+GGS GL + F ++S +I+ LD + V D F+H
Sbjct: 170 EDIDNDKKGGSITKNGLSYENQPF---------YDASFIIDAGILDSTIDTVLDVQFLHN 220
Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
Y EP + IL + +WAG + +++ K +++ NLP+D +++ +
Sbjct: 221 YQEPTIAILSAKSNSWAGNLIKNKDNVQFQVMTLDVQSKSTLPVFNIDNLPYDIDRVIPL 280
Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHA 369
P+P+ G L++G N IH + + +A+N + S+ S Q+ S +++L+
Sbjct: 281 PNPLNGCLLIGCNELIHVDNGGIAKRIAVNAFTSLITASVKSYQD--ESDLNLKLENCAI 338
Query: 370 TWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNS 421
+ +D LL TG+ L DG+ ++++ L + + S + ++ +
Sbjct: 339 VPIPDDHRVLLILATGEFYYLNFDLDGKSIKKIHLELVDQKMYDSIRLTYPGQVASLDKN 398
Query: 422 LFFLGSRLGDSLLVQ 436
L F + GDS LV+
Sbjct: 399 LLFFANLNGDSSLVE 413
>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
Length = 1287
Score = 122 bits (307), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/457 (24%), Positives = 215/457 (47%), Gaps = 49/457 (10%)
Query: 969 FIYVTSQGILKICQLPSGSTYDNY---WPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL 1025
F+Y+ +IC LP G + NY P++ + L TP+++TY L+ ++
Sbjct: 844 FMYIDIDKTARICSLPIGENF-NYSQNLPIEIVSLGQTPNKVTYHETSGLF-------IV 895
Query: 1026 KPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1081
++ ID++ VG S + + + + ++++ P W
Sbjct: 896 STFEEISYNAIDEDGVPIVG--------SESEKPKAKNFKGF-LKLINPIN----WTIID 942
Query: 1082 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR---VLLFSTGRN 1137
I M+ +E VR + L ++ +K+ + + G + ED++ G + + S +
Sbjct: 943 EIEMEENEIINDVRSINLTISSRSKKKKEFIIFGIGKYRLEDLSVFGEFKIIDIISIVPD 1002
Query: 1138 ADNPQNL--VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAP 1194
P+ + E++ + +KGA++ + + G L + G KII+ +AF D
Sbjct: 1003 PTKPEAIYKFKEIFQEVVKGAVTTINEISGRFLTSQGQKIIIRDLQQDNSTVPVAFMDCA 1062
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
Y+ N +L+ D KSI+FL + + +L LL KD + T+F++D +
Sbjct: 1063 T-YLSDSKSFGNLLLISDSMKSIWFLGFDAEPYRLLLLGKDQQRFNAITTDFIVDDGEIY 1121
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
+V+D+++++ + Y P +S GQKLL ++ F + +T L+L D+
Sbjct: 1122 FLVADDEESLHLLTYQPDDPKSLSGQKLLQKSTFTTNS-ITTCLKLVPKFNEFDQGSIT- 1179
Query: 1315 GSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
+ + +DGSI + P+DE+++RRL LQ++L D + H GLNPRS R F +
Sbjct: 1180 ------SYQNIGVNVDGSIFKMIPIDEISYRRLYILQQQLSDKIAHYVGLNPRSNR-FSA 1232
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
N + +P I++ LL + L ++++ + + + G
Sbjct: 1233 NEQGQKP----IIEFGLLKWFINLNVDKRKQFSAKVG 1265
Score = 112 bits (279), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 129/613 (21%), Positives = 261/613 (42%), Gaps = 62/613 (10%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
I + + +L+ ++ N + + I S D+ + +I+ + AK+S++ FD ++ ++
Sbjct: 42 IDSKNDKLILNHEFKLNGKIIGIKSIKLPDSQYDQLAILTSL--AKLSIVSFDHDLNTIQ 99
Query: 156 ITSMHCFESPEWLH-LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +ES + + + ES +K+DP + ++VY + L Q ++
Sbjct: 100 TNSLHYYESEFYTKSISKINES-----QLKIDPNNQTS-LVVYNDLLAFLPFKQDDDEII 153
Query: 215 GDEDTFGSGGGFSARIESSH--VI---NLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
D+ S IE H +I N + + ++ D F+H Y +P + ILH +E T
Sbjct: 154 DDDHHTQSNDQQQQNIELFHNSIILPANKLESTVSNIIDCDFLHSYRDPTLAILHNKEQT 213
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-T 328
WA +S K T LS+ I NLP+D + + +P PI G L++G N
Sbjct: 214 WASDLSIKKDTVNFVVLSLDLLNDSSTAILLVENLPYDLWFVKPLPDPINGTLLIGCNEI 273
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
IH + + + LN Y + + +S ++ L+ + L + L+ + G+
Sbjct: 274 IHIDNSGNTKGIGLNKYYQDITDFKLKDQSDLNIFLEHSKVEILNDKNILIIDQFGESYN 333
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L DG+ V+ L ++K + IT I F+G + DS+L+++
Sbjct: 334 LQFFIDGKSVKDLLITKFEKDLQIRSPISITNIDEQNIFIGCQSSDSILIKY-------- 385
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
LK+E + + P+ + D + N F+ +
Sbjct: 386 --EKLKQETNEAKPTTPAATKTNNDDDDEDLYEDEDLNNNNDDELIN--------FNLQI 435
Query: 506 RDSLVNIGPLKDFSYGLRINADASATGIS--KQSNYELVEL--PGCKGIWTVYHKSSRGH 561
+D L N GPL F+ G +IN ++ G++ Q++ +V G +G T++++S +
Sbjct: 436 KDKLFNAGPLSSFTLG-KINPNSLIQGLTNPNQNDVSIVGTSGEGKQGKLTLFNQSIQPK 494
Query: 562 NADSSRMAAYDDEY---HAYLIIS-LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
S + + + + YLI + L+ + + + +S D+ TI +
Sbjct: 495 IHSSLKFNNINKTWNILNKYLITTDLQNFKSEIFLINENFKNFQSFDFKNNNITINIDTI 554
Query: 618 FGRRRVIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
++R++Q+ + D ++ + + F +++ I DP++++
Sbjct: 555 QSQKRILQITSNNVYLFDLNFKKLLQINFDF--------------EIINGKIFDPFIIIT 600
Query: 675 MSDGSIRLLVGDP 687
S G +++ DP
Sbjct: 601 SSKGEVKIFEMDP 613
>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
Length = 1334
Score = 122 bits (307), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 110/512 (21%), Positives = 229/512 (44%), Gaps = 53/512 (10%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
+R+ F N++G+ F++G P + + Q V+ + + +G I+
Sbjct: 834 RRLVYFPNLNGYTTIFVTGVIPFLIIKSCHSIPRIYQFSKIPAVSVSAFSDSKIKNGLIF 893
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV 1031
+ + +IC+L +Y+ P++K+ + + + Y + + V+ ++
Sbjct: 894 LDNNQNARICELSWDYSYEFNLPIRKVHIGESIKSVAYHEQSD-------TVVISTFKEI 946
Query: 1032 LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1091
+D+E G I ++ T + ++++ P W+ T+ + +E
Sbjct: 947 PYDCVDEE-GKPI----AGALKDKPPATSFKGSIKLVSPYN----WKVIDTVELSDNEVG 997
Query: 1092 LTVRVVTL-FNTTTKENET---LLAIGTAYVQGEDVAARGRVLLFST-------GRNADN 1140
++++ + L ++ K+ +T + IGT+ ++ ED+AA G ++ G+ N
Sbjct: 998 MSIKSMVLDVGSSLKKFKTKREYIVIGTSKLRMEDLAANGSFKIYDIIDIIPEPGKPETN 1057
Query: 1141 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1200
+ E++ ++ KGA++++ L G L+ G K+I+ + +AF D P +YV
Sbjct: 1058 HK--FKEIFQEDTKGAVTSICDLSGRFLVGQGQKVIVRDLEDDGVVPVAFLDTP-VYVSE 1114
Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
N LLGD KSI+ + ++ ++ +L KD L +F++ + ++V+D
Sbjct: 1115 AKSFGNIFLLGDALKSIWLVGFEADPFRMVMLGKDRQHLHVECADFIVKDEEIFILVADI 1174
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
+ + + P +S G L+++A F + T LR + P D+
Sbjct: 1175 NNGLHLIQFDPDDPKSINGTILVNKASFETNSQTT-CLR------------SVP-KDEAG 1220
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-----QFHSN 1375
+ + T+DG+ + P++E T+RR+ +Q+++ D H GLNPR R Q +
Sbjct: 1221 DYQTIGSTIDGAFFNVFPVNESTYRRMYIVQQQISDKEFHHCGLNPRLNRFGGAIQIRDS 1280
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+P I+D L+ + L L+ Q IA
Sbjct: 1281 DTNAKP----ILDYNLIREFAKLNLDRQRNIA 1308
Score = 87.4 bits (215), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 98/458 (21%), Positives = 181/458 (39%), Gaps = 64/458 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+LV ++L G V L L + + D II++ + AK S+++++ +H + S+H
Sbjct: 57 LKLVEQFKLQGTVTGLKPLR---TSENPQLDYIIVSTKYAKFSIIKWNHQLHSISTVSLH 113
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
+E+ E A L+ V+P L Y + L
Sbjct: 114 YYEN---CIQHSTFEKLAISDLI-VEPTYSSVSCLRYKNLLCFLPFEGVNDHDDDDDDDD 169
Query: 205 -----KASQG-GSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
+G + G + + G+ +SS +I+ L+ + V D F+H Y
Sbjct: 170 DDDDTDDEKGVAENVAGVDKSNGASNDNQPFYDSSFIIDAGTLESSVDSVLDLQFLHHYQ 229
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
E + IL + +WAG + +++ K +++ NLP+D +++ +
Sbjct: 230 ETTIAILSSKSNSWAGNLIKNKDNVQFQVMTLDIQSKSTLPVFTIDNLPYDIDRIIPLSK 289
Query: 317 PIGGVLVVGAN-TIHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATW 371
P+ G L++G N IH + + +A+N + S+ S Q+ + +E D +
Sbjct: 290 PLNGCLLLGCNEIIHVDNGGIAKRIAVNAFTSLITASVKSYQDESELNLKLE-DCSIVPI 348
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFF 424
++ LL TG+ L DG+ ++R+ L + ++ T+ N+L F
Sbjct: 349 PEDHRVLLILATGEFYFLNFELDGKSIKRIHLEAVEQKAYDAIKLTYSGEVATLDNNLLF 408
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+ GDS LV+ S + +E K D + GEE
Sbjct: 409 FANMNGDSPLVEIKYSSSAKV-----------VEKQVLDKKEEDSDEEDLYNEDEEGEEQ 457
Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
+ + F + DSL+N GP+ F+ GL
Sbjct: 458 KVMRKSH---------IEFKLHDSLINNGPVSSFTLGL 486
>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania infantum JPCM5]
Length = 1542
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 130/576 (22%), Positives = 252/576 (43%), Gaps = 59/576 (10%)
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
SA + L IL+ G ++ Y+ + GP K + + + V +R +
Sbjct: 930 SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKR 989
Query: 890 LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVH 946
L+ R L + T ++ H + +R+ F+ + H+G ++ G P + + +L
Sbjct: 990 LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCT 1046
Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYDNY-WPVQKIPLKATP 1004
++ F H+ + + GF+Y +G + + P G + W ++++ L TP
Sbjct: 1047 RHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGSSGWWLERVRLGCTP 1105
Query: 1005 HQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVD 1053
HQ+ Y + ++ S P P + L ++ D+E V H I+ +L S+
Sbjct: 1106 HQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSATS 1165
Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------N 1107
T E YEV+ + WQ + + +E L+ ++ + TT +
Sbjct: 1166 GSPVPTNERYEVQFF----STLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRSTT 1221
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGH 1166
+ A+ TAY GEDV RGR+LL +T + + ++ + +KG ++A+ + +
Sbjct: 1222 APVCALATAYPLGEDVTTRGRILLLTTSQQGGQGMQQLRTLHEEPMKGPVTAITRVGEDC 1281
Query: 1167 LLIASGPKIILHKW----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
+ +A G + ++++ + E I + A YV L +N++++GD+ S+ F +
Sbjct: 1282 VAVAVGGTVRVYRYDTNKSTMETMAILYAGA---YVTCLQAFRNYLVIGDLFNSVLFARY 1338
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQK 1281
E+ + +L +D ++ + + L + L+V+D+ +N+ Y P+ + E K K
Sbjct: 1339 SEEIHTITILGRDTNAISVVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPPK 1398
Query: 1282 LLSR-----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
+L E+ + V L++ L +S R N ++ T G IG +
Sbjct: 1399 ILESLLTVTGEYRLAGGV--LLKMMRLRAASTR----------NSSVAIYVTNMGEIGYL 1446
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
PL + T R Q + ++L V H GL PR F F
Sbjct: 1447 VPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMFLGF 1482
Score = 41.2 bits (95), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
+ A+ G G + GG S + V + R D+K +++D FV E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321
Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
P++ L E++ TWAGRV W+ TC I ++++ + H L S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381
Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
P+D + +P+ P V +H ++S A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441
Query: 351 ------SSQELPRSSFSVELDAAHAT 370
+SQ L V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467
>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania donovani]
Length = 1542
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 131/577 (22%), Positives = 254/577 (44%), Gaps = 61/577 (10%)
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
SA + L IL+ G ++ Y+ + GP K + + + V +R +
Sbjct: 930 SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKR 989
Query: 890 LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRV 945
L+ R L + T ++ H + +R+ F+ + H+G ++ G P + +V+ +L
Sbjct: 990 LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRHKGIYVCGQTPVF-LVYHAATNQLVC 1045
Query: 946 HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYDNY-WPVQKIPLKAT 1003
++ F H+ + + GF+Y +G + + P G + W ++++ L T
Sbjct: 1046 TRHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGSSGWWLERVRLGCT 1104
Query: 1004 PHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSV 1052
PHQ+ Y + ++ S P P + L ++ D+E V H I+ +L S+
Sbjct: 1105 PHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSAT 1164
Query: 1053 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------ 1106
T E YEV+ + WQ + + +E L+ ++ + TT +
Sbjct: 1165 SGSPVPTNERYEVQFF----STLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRST 1220
Query: 1107 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QG 1165
+ A+ TAY GEDV RGR+LL +T + + ++ + +KG ++A+ + +
Sbjct: 1221 TAPVCALATAYPLGEDVTTRGRILLLTTSQQGGQGMQQLRTLHEEPMKGPVTAITRVGED 1280
Query: 1166 HLLIASGPKIILHKW----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
+ +A G + ++++ + E I + A YV L +N++++GD+ S+ F
Sbjct: 1281 CVAVAVGGTVRVYRYDTNKSTMETMAILYAGA---YVTCLQAFRNYLVIGDLFNSVLFAR 1337
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQ 1280
+ E+ + +L +D ++ + + L + L+V+D+ +N+ Y P+ + E K
Sbjct: 1338 YSEEIHTITILGRDTNAISVVSNDMLYHDTRFGLLVTDDARNLVCMSYKPRVLEEPGKPP 1397
Query: 1281 KLLSR-----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
K+L E+ + V L++ L +S R N ++ T G IG
Sbjct: 1398 KILESLLTVTGEYRLAGGV--LLKMMRLRAASTR----------NSSVAIYVTNMGEIGY 1445
Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
+ PL + T R Q + ++L V H GL PR F F
Sbjct: 1446 LVPLGDQTSRTGQWVGRRLQSEVAHAGGLPPRMFLGF 1482
Score = 41.2 bits (95), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
+ A+ G G + GG S + V + R D+K +++D FV E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321
Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
P++ L E++ TWAGRV W+ TC I ++++ + H L S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381
Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
P+D + +P+ P V +H ++S A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441
Query: 351 ------SSQELPRSSFSVELDAAHAT 370
+SQ L V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467
>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
cruzi]
gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 120 bits (302), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 125/543 (23%), Positives = 228/543 (41%), Gaps = 67/543 (12%)
Query: 912 QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
+RI F I G+ G ++ G P W RE L + G + F +N
Sbjct: 910 RRIVPFDAIGGNTGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968
Query: 965 -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV- 1022
C GF+ S C+ P+G W ++I L TPH + Y ++ S
Sbjct: 969 YCCEGFVDFASMDTY--CR-PTGQG----WLTRRIHLGVTPHFVVYHPPARSCFVVTSKK 1021
Query: 1023 ----PVLKPLNQVLSLLIDQEVG------HQIDNHNLSSVDLH---RTYTVEEYEVRILE 1069
P P + L+++ D+E G + N+ + + R + +E+R++
Sbjct: 1022 EPFRPQRAPFDVQLNIVYDEESGGVQSITTEAPVSNMPPIAPNAGIRVPMADRFEIRLM- 1080
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE---NETLLAIGTAYVQGEDVAAR 1126
+ W T+ ++ +E L +++ + E + + TA+ GED+ R
Sbjct: 1081 ---STTDWACTDTLLLEENERVLGAQMMEIQCERDAEGLHTAPVCVVSTAFPLGEDITCR 1137
Query: 1127 GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WTGTE 1184
GR+LL +T + + +S+ L G +A+ ++ H+ +A G I L + W+ +
Sbjct: 1138 GRILLLAT--ICTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWSNRK 1195
Query: 1185 LN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
L G Y YV ++ +N+++ GD+ +S + E+ L++L KD ++
Sbjct: 1196 LVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVSVVH 1253
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GAHVT 1295
+ + L+ SD+++N+ + Y P++ E+ G K+L E+ + G +
Sbjct: 1254 CDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGGCLV 1313
Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1355
K LR + LA +S T L+ T G IG I P+ E R L ++L
Sbjct: 1314 KSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMRRLQ 1359
Query: 1356 DSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
+PH AGL PR F + + + +V LL+ + L + + IA T
Sbjct: 1360 IDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSRKTIASAAYTQL 1419
Query: 1415 SQI 1417
++
Sbjct: 1420 ERV 1422
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
NN + D E + F V L A+ T + N++ +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
C + M S + + F IE+
Sbjct: 480 CIAKVDMFHSDVAKRFQIIES 500
>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 1436
Score = 120 bits (301), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 128/545 (23%), Positives = 228/545 (41%), Gaps = 71/545 (13%)
Query: 912 QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
+RI F I G+ G ++ G P W RE L + G + F +N
Sbjct: 910 RRIVPFDAIGGNAGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968
Query: 965 -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV- 1022
C GF+ S C+ P+G W ++I L TPH + Y ++ S
Sbjct: 969 YCCEGFVDFASMDTY--CR-PTGQG----WLTRRIHLGVTPHFVVYHPPARSCFVVTSKK 1021
Query: 1023 ----PVLKPLNQVLSLLIDQEVG--HQIDNH----NLSSVDLH---RTYTVEEYEVRILE 1069
P P + L ++ D+E G I N+ + + R + +E+R++
Sbjct: 1022 EPFRPQRSPFDVQLKIVYDEESGGVQSITTEAPVCNMPPIAPNAGIRVPMADRFEIRLM- 1080
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL-----LAIGTAYVQGEDVA 1124
+ W T+ ++ +E L +++ + K+ E L + TA+ GED+
Sbjct: 1081 ---STTDWACTDTLLLEENERVLGAQMMEI--QCEKDAEGLHTAPVCVVSTAFPLGEDIT 1135
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WTG 1182
RGR+LL +T + + +S+ L G +A+ ++ H+ +A G I L + W
Sbjct: 1136 CRGRILLLAT--MCTKKKRKIVLFHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWNN 1193
Query: 1183 TELN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
+L G Y YV ++ +N+++ GD+ +S + E+ L++L KD ++
Sbjct: 1194 RKLVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVSV 1251
Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GAH 1293
+ + L+ SD+++N+ + Y P++ E+ G K+L E+ + G
Sbjct: 1252 VHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGGC 1311
Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1353
+ K LR + LA +S T L+ T G IG I P+ E R L ++
Sbjct: 1312 LVKSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMRR 1357
Query: 1354 LVDSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
L +PH AGL PR F + + + +V LL+ + L + + IA T
Sbjct: 1358 LQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEMLVSASLLNEFFFLDIHSRKTIASAAYT 1417
Query: 1413 TRSQI 1417
++
Sbjct: 1418 QLERV 1422
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 61/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
NN + D E + F V L A+ T + N++ +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I +G+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTDCYCSGIARLGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
C + M S + + F IE+
Sbjct: 480 CIAKVDMFHSDVAKRFRIIES 500
>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
Length = 1361
Score = 120 bits (301), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 157/337 (46%), Gaps = 23/337 (6%)
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKG 1155
+T TK + +G Y EDV G ++ P T E++ ++++G
Sbjct: 1026 DTRTKRKREYIIVGIGYATMEDVPPTGEFHIYDITEVVPEPGKPNTNFKLKEIFKEDIRG 1085
Query: 1156 AISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
+S + + G LI+ KI++ + +AF D P ++V SL N I++GD
Sbjct: 1086 IVSVVNGISGRFLISQSQKIMVRDVQQDNSVIPVAFLDVP-VFVTSLKTFGNLIVIGDAM 1144
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ I F+ + + ++ L + + EFL++ + +V+D + + YAP
Sbjct: 1145 QGIQFVGFDAEPYRMITLGSSITKFEVISVEFLVNNGDIYFLVTDRDSIMHVLKYAPDQP 1204
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR-FALLFGTLDGSI 1333
+ GQ+L+ + F++ + ML +D P + +R F + +DGSI
Sbjct: 1205 NTLSGQRLVHCSSFNLHS----LNNCTMLLPKNDE---FPRDQRYSRSFQTITAQVDGSI 1257
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ---FHSNGKAHRPGPDSIVDCE 1390
I P+ E T+RRL +Q++++D P +AGLNPR RQ ++ G + RP ++D
Sbjct: 1258 SKIVPVKEETYRRLYFIQQQIIDKEPQLAGLNPRMERQDNKYYHLGHSLRP----MLDFN 1313
Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRS-QILSNLNDLAL 1426
++ ++ + + + I + G + ++ +L DL
Sbjct: 1314 IIKRFKDMSMNRRSHIVQKLGKNSNLEVWRDLIDLEF 1350
Score = 57.4 bits (137), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 132/674 (19%), Positives = 265/674 (39%), Gaps = 122/674 (18%)
Query: 96 ISAASLELVCHYRLHGNVESLAIL---SQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
I + L L+ ++L G + +A++ S G N ++L+ AK+S+L +++
Sbjct: 43 IRSGRLYLMEEHKLSGRINDVALIPKHSNGSNGNGINLSYLLLSTGVAKLSLLMYNNMTS 102
Query: 153 GLRITSMHC----FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
+ S+H FES L L AR ++++P G +++ ++ +
Sbjct: 103 SIETISLHFYEDKFESATMLDL-------ARNSQLRIEPNGNYA--MLFNNDVLAILPFY 153
Query: 209 GGSGLVGDED----------------TFGSGGGFSARIESSH---VINLRDL--DMKHVK 247
G DED F G + + +H +IN +L +K++K
Sbjct: 154 TGINEDEDEDYINNDKSKINDNSKKSLFKRKKGKTQNNKVTHPSIIINCSELGPQIKNIK 213
Query: 248 DFIFVHGYIEPVMVILHERELTWAGR---VSWKHHTCMIS---ALSISTTLKQHPLIWSA 301
D F+ G+ + + +L++ +L W G V + +IS SI T +I
Sbjct: 214 DIQFLCGFTKSTIGVLYQPQLAWCGNSQLVPLPTNYAIISLDMKFSIDATTFDKAIISEI 273
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVS-LDSSQELPRS 358
LP D + + + G L++G N I + + L LN+Y+ L + + +S
Sbjct: 274 SQLPSDWH---TIAPTLSGSLILGVNEIAFLDNTGVLQSILTLNSYSDKVLPKVRVIDKS 330
Query: 359 SFSVELDAAHATWL----QNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLS------ 404
S V + L +N+ + LL + G + + + +GR++ + +++
Sbjct: 331 SHEVFFNTGSKFALIPSNENERSVENILLFDENGCIFNVDLKSEGRLLTQFNITKLPLGE 390
Query: 405 -----KTNP---SVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFG 455
K+NP S++ +D + F+G + GD+ +++ S + +++
Sbjct: 391 DVLSQKSNPSSVSIIWAD-GRLDTYTIFIGFQSGDATMLKLNHLHSAIEVEEPTFMKDYV 449
Query: 456 DIEADAPSTKRLRRSS-------SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
+ +A A SD D VN + +G+ SN +AQ+
Sbjct: 450 NKQASAAYNNEDDDDDDDDFNLYSDEENDQVNNKNDRTFGTNESNEPFTAQELM------ 503
Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
L NIGP+ G + + + G+ + E+ + T + NA +
Sbjct: 504 ELRNIGPINSMCVGKVSSIEDNVKGLPNPNKQEI------SIVCTSGYGDGSHLNAILAS 557
Query: 568 MAAYDDEYHAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR----- 610
+ ++ ++ I+ ++ + L T D + E+ + QGR
Sbjct: 558 VQPRVEKALKFISITKIWNLHIKGKDKFLITTDSTQSQSNIYEIDNNFSQHKQGRLRRDA 617
Query: 611 -TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
TI + +R++QV + D ++ + + V+ VS+ DP
Sbjct: 618 TTIHIATIGDNKRIVQVTTNHLYLYDLTF-----------RRFSTIKFDYEVVHVSVMDP 666
Query: 670 YVLLGMSDGSIRLL 683
YVL+ +S G I++
Sbjct: 667 YVLITLSRGDIKVF 680
>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
[Cyanidioschyzon merolae strain 10D]
Length = 1880
Score = 120 bits (301), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/324 (27%), Positives = 162/324 (50%), Gaps = 12/324 (3%)
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFST----GRNADNPQNLVTEVYS---KELKGAISALAS 1162
+L +GT +++GED + RGR+L+F GR Q + ++ + E+KGA+SA+A
Sbjct: 1546 VLVVGTCFLRGEDTSIRGRLLVFEISRQEGRQHHQHQRTLYQMQTLAATEVKGAVSAVAP 1605
Query: 1163 LQGHLLIAS-GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
++G + S GP++ ++K E++ I+FY L+ + +K +IL D+ + FL
Sbjct: 1606 VKGGFVCCSAGPRLEVYKLIEDEMSCISFYPGINLFFSHVGTLKQYILASDMRYGVSFLF 1665
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN-IQIFYYAPKMSESWKGQ 1280
W+ + N L +D + A+E+L+ G+ +++ +D N I++ +P ES G
Sbjct: 1666 WRSRNVSQNFLCRDEAQRELVASEWLMHGTKANVLSADMLGNIIELSIPSPVDPESAGGT 1725
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
++ A FHVG+ R+++ S++ S N +L GT+DG I ++PL
Sbjct: 1726 RMTFEAGFHVGSRPNAVRRVRIDDPSAETPPPNEPSSLWNTHVILLGTVDGMITMVSPLL 1785
Query: 1341 ELTFRRLQ-SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH--RPGPDSIVDCELLSHYEM 1397
++L+ + Q +++ L RS+R S A R SI+D ++L Y
Sbjct: 1786 RGVAKKLELAAQDLMLEPELRKWCLYARSWRVMRSLTVAAGLRKPKRSILDGDVLQLYGS 1845
Query: 1398 LPLEEQLEIAHQTGTTRSQILSNL 1421
L + EIA + G + + +
Sbjct: 1846 LDTPRRKEIARRIGMPQEALFEAI 1869
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 112/499 (22%), Positives = 194/499 (38%), Gaps = 128/499 (25%)
Query: 243 MKHVK--DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI----STTLKQHP 296
+ HV+ D F+ G P MV+L+E TWAGRV ++C ++A+ + + + P
Sbjct: 345 LGHVRILDCCFLTGTALPTMVMLYEERPTWAGRVEAVSNSCALAAIVLPPLPAGAAGEEP 404
Query: 297 LI-WSAMNLPHDAYKLLAVPS------PIGGVLVVGANTIHYHSQSASCALAL--NNYA- 346
L+ W LP DA K++ +PS G+L++ AN + + + +L N++
Sbjct: 405 LVAWRIQGLPFDAEKVVPLPSVEWDRAAEQGLLLIAANVLFWIRGNGQIGASLSGNHFGD 464
Query: 347 --VSLDSSQELP---------------RSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
+ LD Q LP R+S + A ++ L G++ L
Sbjct: 465 TFMELDGCQ-LPGALYGGTDSDIISRCRTSQVLHFRGACIAPVRLHRYGLFLADGNVYQL 523
Query: 390 TVVYDGRVVQRLDL------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
+ D RL+ S+ P+ L D + L F+ + LG S+L + T
Sbjct: 524 ALHADAEYPLRLEALRVRGESRLAPAPL--DAKLLSRDLLFVAAHLGSSVLYRMT----- 576
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
P +R R S+++ G+ N + + +
Sbjct: 577 ---------------QVHPHGRRTRTSAAE-------------NGTLHKNATTKEAQWEL 608
Query: 504 AVRDSLVNIGPLKDF---------SYGLRINADA--SATGISKQS------------NYE 540
RD++ +GP+ D G ++ +ATG QS ++
Sbjct: 609 QQRDTIFQLGPIVDLVVIPPRYSPPAGTLLDPGEILAATGHQHQSCLARCTYQVQTREWQ 668
Query: 541 LVELPGCKGIWTVY--HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
+ GC+ +W++Y H + H + A+ + + L+ R + AD T
Sbjct: 669 RIPSAGCRRVWSLYADHDGTGMHQEEQ----AFLLLSLSKSSVILDIRRGFEQAAD--TR 722
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQDL---SFGPSNSES 652
V + TIAAGNL RR + QV G R+LD + +D+ + P + S
Sbjct: 723 V------LLPSPTIAAGNLAQRRLIAQVHRTGIRLLDANLDVVYEEDMLLAALEPGTAVS 776
Query: 653 GSGSENSTVLSVSIADPYV 671
G+ S+ DPY+
Sbjct: 777 GA----------SVVDPYI 785
>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
Length = 1307
Score = 120 bits (300), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 148/320 (46%), Gaps = 21/320 (6%)
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELKG 1155
N+ T+ L +G +V+ ED+ G L+ P T +++ +E +G
Sbjct: 978 NSRTRRKREYLVVGNTFVRDEDIGTMGSFCLYDITEVVPEPGKPDTNYKLKQIFYEEFRG 1037
Query: 1156 AISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
A+S++ + G LI+ K+++ + +AF D P ++V N +++GD
Sbjct: 1038 AVSSVCEISGRFLISQSQKVLVRDVQEDNSVVPVAFLDVP-VFVTDSKSCGNLLIIGDAM 1096
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ F+ + + ++ L K + + EFL++ ++ +VSD + I YAP
Sbjct: 1097 QGFQFVGFDAEPYRMIPLGKSVSKFEVMSLEFLVNNGSIYFLVSDRSNILHILKYAPDEP 1156
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
S GQKL+ F++ H T +L T P + F + DGS+
Sbjct: 1157 NSLSGQKLVHCTSFNL--HSTNTCMKLLLKNDEFPTLGEPPA-----FQAIGAQTDGSLF 1209
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCEL 1391
+ PL E ++RRL +Q++L++ H+ GLNP+ R F+ G RP ++D +
Sbjct: 1210 NVVPLSESSYRRLYMVQQQLIEKDVHLCGLNPKMERLQNDFYQLGHLMRP----MLDFTV 1265
Query: 1392 LSHYEMLPLEEQLEIAHQTG 1411
+ + LPL ++ +IA + G
Sbjct: 1266 IKSFATLPLNKRKQIAAKAG 1285
Score = 84.0 bits (206), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 127/651 (19%), Positives = 262/651 (40%), Gaps = 131/651 (20%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI-HGLRITSM 159
L L+ ++LHG + +A++ Q D ++++ AK+S++ FD S+ L S+
Sbjct: 47 LVLLHEFKLHGQITGMALVPQMEGP----LDCLVVSTGKAKLSLVRFDPSMPMCLETLSL 102
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG---LQMIILKASQGGSGLVGD 216
H +E+ ++ A+ +++DP+ RC VL++ L ++ L ++ D
Sbjct: 103 HYYEAE---FTRKNLIELAKTSKLRLDPERRC--VLLFNSDVLALLPLNINEEDE----D 153
Query: 217 EDTFGSGGGFSARIES---------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
++ + ++E+ S V+++ DL ++K+V D F++ + +P + +L++
Sbjct: 154 DNQEPTHQAKKRKVENGDARRLAKQSSVLHVSDLSAELKNVVDIQFLNSFSQPTLAVLYQ 213
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
L W+G M ++I+ K++ I+ LPHD + ++ + + ++VG
Sbjct: 214 PRLAWSGNDKVAGKGSM-RLMAITPHEKKNTTIYQVKELPHDVHTIIPLAN---SCVLVG 269
Query: 326 ANTIHY--HSQSASCALALNNYAVSLDSSQELPRSSFSVELD-----AAHATWLQNDVAL 378
N I ++ + + LN+++ S+++ SS V A+ ++ +
Sbjct: 270 VNEIVSVDNTGAIQSTIQLNSFSPKFTGSKQIDNSSLEVMFTEPIVWASAMVSKDREILI 329
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV--------LTSDITTIGNSL------FF 424
L D+ +T+ +GR++ L + P V L + I + + FF
Sbjct: 330 LMDHKADMYSITLQSEGRLLIDFTLVRL-PIVNDIFKDQNLPTCIVALSGGIRLKTCQFF 388
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
+G GD+++V+ S+ L+ F +A DAL G++
Sbjct: 389 IGFSSGDAVVVK----------SNNLRSAFESQYREAIELPNDEDEDYDALY----GDDE 434
Query: 485 SLYGSASNNTESAQKTFSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYEL- 541
L ++N + + F + DSL+N+GP+ G + +A+ G+ + EL
Sbjct: 435 DLARPVNDNKATVETAVPFEIELMDSLINVGPITSICTGRVSSINATIEGLPNPNRNELA 494
Query: 542 ---------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
++ IW + + + + A D
Sbjct: 495 IVSTSGHDSGTYLNVMEPSVRPLVQQALKFTSVTKIWNLKIRKKDKYLVTTDSGAEKSDV 554
Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
Y + A+ ++ VT T+ L G +R++QV + +
Sbjct: 555 YE------IGAKIASIKPKHFKRNVT----------TVEIAILGGGKRIVQVTTKAVYLF 598
Query: 635 DGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ + MT F V+ VSI DP++LL S G I++
Sbjct: 599 NLGFKKLMTISFDF--------------EVVHVSILDPFILLTNSKGEIKI 635
>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 119 bits (299), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 115/220 (52%), Gaps = 2/220 (0%)
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
KN+ L+GDI + I L + + ++ +++ + + A L+DG+ + LV +D Q+N+
Sbjct: 4 KNYALVGDIQQGITLLRHQGERNCISQISRARRAGEVTAVGILLDGNQVGLVSTDMQRNL 63
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
Q++ Y P ES G++L+ +A+ ++G V L +D ++ R
Sbjct: 64 QVYMYKPDQKESNGGKQLVRQADINLGKRVISIW--NSLGRQNDTFTKVALTENDARHVT 121
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
+ LDGSIG I P+ E FRRL+ LQ + +PH GLNPR +R + +
Sbjct: 122 FYAGLDGSIGDIVPVSEKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAK 181
Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+I+D +LL + L EQ +++ + G TR +L ++ D+
Sbjct: 182 NIIDGDLLERFNGLSFTEQTDLSRKIGVTREALLDDMMDV 221
>gi|389602597|ref|XP_001567507.2| cleavage and polyadenylation specificity factor-like protein
[Leishmania braziliensis MHOM/BR/75/M2904]
gi|322505515|emb|CAM42945.2| cleavage and polyadenylation specificity factor-like protein
[Leishmania braziliensis MHOM/BR/75/M2904]
Length = 1536
Score = 119 bits (297), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 122/528 (23%), Positives = 236/528 (44%), Gaps = 65/528 (12%)
Query: 912 QRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRVHPQLCDGSIVAFTVLHNVNCNH 967
+R+ F + G H+G ++ G P + +V+ +L ++ F H+ + +
Sbjct: 1003 ERLVPFCALQGRHKGIYVCGQTPVF-LVYHYATNQLVCTRHHATSAVRGFAPFHSRHVHG 1061
Query: 968 GFIYVTSQGILKICQL-PSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVS---- 1021
GF+Y +G + + P G + W ++++ L TPHQ+ Y + ++VS
Sbjct: 1062 GFVYC-GEGFVHFATMQPFGELLGSSGWWLERVRLGCTPHQVIYSPAAHGCFVVVSRPQP 1120
Query: 1022 -VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVDLHRTYTVEEYEVRILEPDRAG 1074
P P + L ++ D+E V H ++ +L S+ T YEV++ +
Sbjct: 1121 FSPKRAPFDVQLRMVEDEEGNRVPHVVEPVSLPPLSATSGSPVPTNGRYEVQLF----ST 1176
Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKE------NETLLAIGTAYVQGEDVAARGR 1128
WQ + + +E L+ ++ + TT + + A+ TAY GEDV RGR
Sbjct: 1177 LDWQRVDCLALDVNEKVLSATLMQVSRDTTMDVAYRSATAPVCALATAYPLGEDVTTRGR 1236
Query: 1129 VLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGT---- 1183
VLL +T + + ++ + +KG ++A+ + + + +A G + ++++ +
Sbjct: 1237 VLLLATSQQGGQGMQKLRILHEEPMKGPVTAITRIDEDCIAVAVGGTVRVYRYDASKGVM 1296
Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
E I + A YV L +++++++GD+ S+ F + E+ + +L +D ++ +
Sbjct: 1297 ETTAILYAGA---YVTCLQALRDYLVIGDLFHSVLFARYSEEIHTITILGRDTNAISVVS 1353
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQKLLSR-----AEFHV-GAHVTK 1296
++ L + L+V+D+ +N+ Y P+ + E K K+L E+ + G + K
Sbjct: 1354 SDMLYHDTRFGLLVADDARNLMCMSYKPRLLEEPGKPPKVLESLLSVTGEYRLAGGVLLK 1413
Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
+RL+ A S ++ T G IG + PL + T R Q + ++L
Sbjct: 1414 MMRLRASAARSSSV-------------AIYVTNMGEIGYLVPLGDQTSRTGQWVVRRLQS 1460
Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
V H GL PR F F + S+ E + H+ PL EQL
Sbjct: 1461 EVAHAGGLPPRMFLGFPQDDPLR-----SLKGDEWMLHF---PLLEQL 1500
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 68/138 (49%), Gaps = 20/138 (14%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSIST 290
+++++D FV EP++ L E++ TWAGRV W+ TC I ++++
Sbjct: 298 LRNIRDVQFVASAGEPLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLAN 357
Query: 291 TLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSASCALALNNY 345
T H L S ++ LP+DA + +P+ P VL V N + + S + + +N
Sbjct: 358 TAAPHMLSLSEVDGLPYDATSMTPLPAFQDVP-SAVLCVSRNMMVHVSTKSGYGVYVN-- 414
Query: 346 AVSLDSSQELPRSSFSVE 363
A+ + ++ L S+ S E
Sbjct: 415 AMGEEQARSLKSSAVSCE 432
>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi marinkellei]
Length = 1436
Score = 119 bits (297), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 127/543 (23%), Positives = 229/543 (42%), Gaps = 67/543 (12%)
Query: 912 QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
+RI F +I G+ G ++ G P W RE L + G + F +N
Sbjct: 910 RRIVPFDSIGGNAGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968
Query: 965 -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV- 1022
C GF+ S C+ P+G W ++I L TPH + Y ++ S
Sbjct: 969 YCCEGFVDFASMDTY--CR-PTGQG----WLTRRIHLGVTPHFVVYHPPARSCFVVTSKK 1021
Query: 1023 ----PVLKPLNQVLSLLIDQEVG--HQIDNH----NLSSVDLH---RTYTVEEYEVRILE 1069
P P + L+++ D+E G I N+ + + R + +E+ ++
Sbjct: 1022 EPFRPQRAPFDVQLNIVYDEESGGVQSITTEAPVCNMPPIPPNAGIRVPMADRFEICLM- 1080
Query: 1070 PDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE---NETLLAIGTAYVQGEDVAAR 1126
+ W T+ ++ +E L +++ + E + + TA+ GED+ +R
Sbjct: 1081 ---STTDWACTDTLLLEENERVLGAQMMEIHCEKDAEGLHTAPVCVVSTAFPLGEDITSR 1137
Query: 1127 GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK--WTGTE 1184
GR+LL ST + L+ +S+ L G +A+ ++ H+ +A G I L + W +
Sbjct: 1138 GRILLLSTMCTKKKRKILL--FHSEPLNGPATAVVGIRHHIAVAVGGTIKLFRFDWENRK 1195
Query: 1185 LN-GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
L G Y YV ++ +N+++ GD+ +S + E+ L++L KD ++
Sbjct: 1196 LVVGALLYAG--TYVTRMSSFRNYLIYGDLSRSCAIARFNEENHTLSVLGKDRNAVSVVH 1253
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG--QKLLSR-----AEFHV-GAHVT 1295
+ + L+ SD+++N+ + Y P++ E+ G K+L E+ + G +
Sbjct: 1254 CDMMYHDRAFGLLCSDDERNLLVMGYTPRVQETEAGSPNKVLESVLSLDGEYRLSGGCLV 1313
Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1355
K LR + LA +S T L+ T G IG I P+ E R L ++L
Sbjct: 1314 KSLRFRSLAGNSSVT--------------LYVTNYGEIGFIVPIGEQANRTASWLMRRLQ 1359
Query: 1356 DSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
+PH AGL PR F + + + +V LL+ + L + + IA T
Sbjct: 1360 MDLPHNAGLTPRMFLGLSQGSPRTALRAKEMLVSASLLNEFFFLDIHSRKTIASAAYTQL 1419
Query: 1415 SQI 1417
++
Sbjct: 1420 ERV 1422
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 105/260 (40%), Gaps = 54/260 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
NN + D E + F V L A T + N+ + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVSWYDPKMETSTALFKVNLSLASCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S IT IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQDIRIAILGTGCYCSGITRIGDQIVFLGSAFGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIE 458
C + M S + F IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499
>gi|401426989|ref|XP_003877978.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322494225|emb|CBZ29522.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 1542
Score = 117 bits (293), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 112/495 (22%), Positives = 220/495 (44%), Gaps = 55/495 (11%)
Query: 912 QRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
+R+ F+ + H+G ++ G P + + +L ++ F H+ + + G
Sbjct: 1009 ERLVPFRGLQDRHKGMYVCGQTPVFLVYHAATNQLVCTRHHATNAVRGFAPFHSRHVHGG 1068
Query: 969 FIYVTSQGILKICQL-PSGSTYDNY-WPVQKIPLKATPHQITYFAEKNLYPLIVS----- 1021
F+Y +G + + P G + W ++++ L TPHQI Y + ++ S
Sbjct: 1069 FVYC-GEGFVHFATMQPFGELLGSSGWWLERVRLGCTPHQIIYSPAAHGCFVVASRPQPF 1127
Query: 1022 VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVDLHRTYTVEEYEVRILEPDRAGG 1075
P P + L ++ D+E V H I+ +L S+ T E YEV+ +
Sbjct: 1128 SPKRAPFDVQLRMVEDEEGNRVPHVIEAVSLPPLSAASGSPVPTNERYEVQFF----STL 1183
Query: 1076 PWQTRATIPMQSSENALTVRVVTLFNTTTKE------NETLLAIGTAYVQGEDVAARGRV 1129
WQ + + ++E L+ ++ + TT + + A+ TAY GEDV RGR+
Sbjct: 1184 DWQCMGRLVLDANEKVLSATLMQVTRDTTMDAANRSTTAPVCALATAYPLGEDVTTRGRI 1243
Query: 1130 LLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS-GPKIILHKW----TGTE 1184
LL +T + + + ++ + +KG ++A+ + + A+ G + ++++ + E
Sbjct: 1244 LLLTTTQQGGHGMQHLRTLHEEPMKGPVTAITRVGEDCVAAAVGGTVRVYRYDTYKSTME 1303
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
I + A YV L ++++++GD+ S+ F + E+ + +L +D ++ +
Sbjct: 1304 TMAILYAGA---YVTCLQAFRDYLVIGDLFNSVLFARYSEEIHTITILGRDTNAISVVSN 1360
Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQKLLSR-----AEFHV-GAHVTKF 1297
+ L + L+V+D+ +N+ Y P+ + E K K+L E+ + G + K
Sbjct: 1361 DMLYHDTRFGLLVTDDARNLMCMSYKPRVLEEPGKPPKVLESLLTVTGEYRLAGGVLLKM 1420
Query: 1298 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1357
+RL+ + S ++ T G IG + PL + T R Q + ++L
Sbjct: 1421 MRLRAASAHSSS-------------VAIYVTNMGEIGYLVPLGDQTSRTGQWVVRRLQSE 1467
Query: 1358 VPHVAGLNPRSFRQF 1372
V H GL PR F F
Sbjct: 1468 VAHAGGLPPRMFLGF 1482
Score = 40.4 bits (93), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 84/185 (45%), Gaps = 37/185 (20%)
Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
GGG S + V + R D+K +++D FV EP++ L E++ TWAGRV
Sbjct: 283 GGGTSLLLRIGTVTHWRLQDVKTALRNIRDIQFVESAGEPLLAFLFEKQPTWAGRVKLLE 342
Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLA------VPSPIG 319
W+ TC I ++++ + H L S ++ LP+D + VPS +
Sbjct: 343 WRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGLPYDVTSMTPLTAFQDVPSAVF 402
Query: 320 GV---LVVGANT-----IHYHSQSASCALALNNYAVSLD------SSQELPRSSFSVELD 365
V ++V +T ++ ++ A +L + AVS + +SQ L V L+
Sbjct: 403 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSFEAVQWRSASQALSTDLVKVNLN 462
Query: 366 AAHAT 370
++AT
Sbjct: 463 FSNAT 467
>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Ustilago hordei]
Length = 1568
Score = 117 bits (293), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 148/312 (47%), Gaps = 29/312 (9%)
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFS-----TGRNADNPQNLVTEVYSKELKGA-ISALA 1161
+ +A+GT GED +G V LF + R ++L ++ ++ A ++ALA
Sbjct: 1153 KQFIAVGTTTYHGEDRTCKGSVYLFEIIQVVSSRRFQVGRDLRLKLICRDGSNAPVTALA 1212
Query: 1162 SLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
L G LL SG K+ + E L +AF D P Y+ S+ +VKNF+LL D K ++FL
Sbjct: 1213 ELHGFLLSTSGQKLYVRALEKEEWLISVAFLDCP-FYITSIRVVKNFVLLSDAKKGLWFL 1271
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS------------DEQKNIQIFY 1268
+++E + L EFL+ LSLV + + I+++
Sbjct: 1272 AFQEDPYRFVDLGSALDGHCANLGEFLVYNDKLSLVSTSGVALGGFSGFGQDSGVIRLYE 1331
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLATSSDRTGAAPGSDKTNRFALLF 1326
Y P S GQ+LL R E+ + T L + L+ S R G ++ R LL
Sbjct: 1332 YNPSSPTSLGGQRLLLRTEYSTPSSTTCSLSAPGRWLSDSELR-----GREQL-RNKLLL 1385
Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1386
+GS+ +A ++E +RL LQ +LV SV H A LNPR+FRQ N RP +
Sbjct: 1386 SKSNGSLDSLASVEEKVAKRLHLLQGQLVRSVLHTAALNPRAFRQVR-NDFVSRPLYKGV 1444
Query: 1387 VDCELLSHYEML 1398
+D LL ++ L
Sbjct: 1445 LDARLLDAFKGL 1456
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/358 (25%), Positives = 164/358 (45%), Gaps = 61/358 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L+ + L G V L + Q + + RD ++++F DAK+++LE++ + L S+H
Sbjct: 84 LVLIRKHSLFGTVTGLQRI-QTLSTSKDSRDRLLVSFTDAKLALLEWNHTTDDLETVSIH 142
Query: 161 CFE-SPEWL----HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS---- 211
+E +P+ L HL + P + +DP RC +L+ + IL + +
Sbjct: 143 TYERAPQLLNGIPHLFQ--------PNLNIDPLSRCAALLLPHDALAILPFYRDAAEFEF 194
Query: 212 --GLVGDEDTFGSGGGFSA-----RIES-----SHVINLRDLD--MKHVKDFIFVHGYIE 257
GL D + +G +A +IES S V+ +R++D ++++KDF F+ G+ +
Sbjct: 195 DHGLHLDLNLDFAGEDKAAMQAAVQIESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQK 254
Query: 258 PVMVILHERELTWAGRVSWKHHT----------------CMISALSIS----TTLKQHPL 297
P + +L T G ++ + M+ + S S T HP+
Sbjct: 255 PTVALLFAHSPTCTGLLAERKDNFSVYLFTLDLAASLDGAMLGSASYSFDDATLRSMHPV 314
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY-----AVSLDS 351
+ ++ +LP+D +L P +GGVLVV ++I + QS A ALN + A+ +S
Sbjct: 315 LTTSSSLPYDCLYMLPCPQTLGGVLVVCMSSILHVDQSGRVVATALNGWFNLVSAIQPES 374
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+LP + +L + + +L+ GD+ T DGR +Q L + S
Sbjct: 375 LLDLPEIA---DLQGSQLVFTAETEGVLTLVHGDVYTFTCQMDGRNIQGFRLERMQQS 429
>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
[Komagataella pastoris GS115]
gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
Length = 1388
Score = 117 bits (292), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 121/561 (21%), Positives = 242/561 (43%), Gaps = 62/561 (11%)
Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF---FLSGSRPCWCMV-FRERLR 944
N+ + P +AY P G +R I N G F+ G + W +
Sbjct: 862 NITITGAPDNAY-----PQGTKLERRLIKLNNIGDSKLSTLFVVGVKSFWITKRHSSSIN 916
Query: 945 VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATP 1004
+H Q S ++ + C +G + + + ++ ++PS P++++P+ T
Sbjct: 917 IH-QFTKLSTISCARFNTSRCKNGLMIIDTNKAARMVEIPSNLELSQRLPIRRVPVGCTI 975
Query: 1005 HQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDLHRTYTV 1060
+ + K +VS P N +D+E VG +DN +++ +
Sbjct: 976 KCVAF--HKASRTFVVSTVEETPYN-----CVDEEGNPIVG--VDN------TINKPASS 1020
Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT----KENETLLAIGTA 1116
+ ++++ P W + ++ ++++ +TL NT+ K + L +G +
Sbjct: 1021 FKSSIKLISP----ISWTVIDSFDLEDEHVCMSLKSMTL-NTSRIPMFKNLKEYLVLGIS 1075
Query: 1117 YVQGEDVAARGRVLLFST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+ ED+A+ G++ + G+ N + +++ KGA+++++ + G +I
Sbjct: 1076 NYRMEDLASNGQIRIVDVVDIIPEPGKPETNHK--FKDIFQDATKGAVTSVSDISGRFVI 1133
Query: 1170 ASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
G KII+ + T L + F D P YV +N +L+GD S+ + + +
Sbjct: 1134 GQGQKIIVRDLQEDNTAL-PVGFVDTP-FYVSETKSFQNLLLVGDSMHSVILVGFDAEPY 1191
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
++ L KD +D A +F++ L ++++DE + + Y P+ S +GQ+LL R+
Sbjct: 1192 RMISLGKDVAHVDVCAADFVVFEGNLFIIIADEDGMLHLIQYDPEDPASMQGQRLLRRSI 1251
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN---RFALLFGTLDGSIGCIAPLDELTF 1344
F + T M P + TN F ++ DGS + P+ E T+
Sbjct: 1252 FKTNQYTT-----CMKMRERKYVIKPPKNQFTNFSEAFEVVAANSDGSFYKVTPISEATY 1306
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
RRL +Q+++ D H GLNPR R + + + P I+D + + + ++
Sbjct: 1307 RRLYVIQQQIFDQENHKCGLNPRENR--YLSDQYSIPNQRLILDFDNIRRFLEFDEIKKR 1364
Query: 1405 EIAHQTG-TTRSQILSNLNDL 1424
++ H+ G T S+ +L +L
Sbjct: 1365 DLVHKLGRNTYSEFYRDLLNL 1385
Score = 90.1 bits (222), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 105/444 (23%), Positives = 177/444 (39%), Gaps = 51/444 (11%)
Query: 93 MDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
+D L LV Y+L G V L + D +++A + K S++++D S +
Sbjct: 80 IDFSQNVKLSLVAEYKLDGLVTDLCKIR---TIEDSHHDYVLVATKGVKFSMIKWDQSSN 136
Query: 153 GLRITSMHCFESPEWLHLKRGRES-----FARGPLVKVDPQGRCGGVLVYGLQMIILKAS 207
+ S+H H K+ E+ F + DP C +L + + L
Sbjct: 137 SISTVSLH--------HYKKIVENSLIDKFNVDTKLIADPNNHCSCLLANEI-LFFLPFL 187
Query: 208 QGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
Q DE+ G ++ + DL ++K + D F+HGY EP + +L+
Sbjct: 188 QHEV----DEELDGKFVENKKLYSNTFLQFSNDLQPNIKTIIDIEFLHGYSEPTLAVLYT 243
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
T G + T + S++ K I NLP+D ++L + SP+ G L++G
Sbjct: 244 SFPTCTGALPKAKDTVSLQVFSLNLQNKASTSIIEVNNLPYDTDRILPLSSPLNGCLLIG 303
Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
AN IH +S + ++ N +A + + +S+ + L+ + ND +L T+ G
Sbjct: 304 ANQIIHLNSMGTAKGISCNLFAAKCSNFKLSDQSNLDLRLEKCVLGQVYNDKVILITEKG 363
Query: 385 DLVLLTVVYDGRV-----VQRLDLSKTNPSVLT--SDITTIGNSLFFLGSRLGDSLLVQF 437
+ G V +Q++ K VL+ + T I FF+G + DS+L
Sbjct: 364 AFYAFSFDIVGGVSSINEIQKIAAEKYQGLVLSLPTMFTNIDGKTFFIGCQGSDSVLF-- 421
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
G K D ++ + DAL E LY N
Sbjct: 422 -----------GSKARLNTQNVDVNGKSKV-ITEEDALY------EEDLYADDIQNVAQG 463
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYG 521
F DSL+NIGP+ +F+ G
Sbjct: 464 IDHIDFVKLDSLLNIGPITNFTTG 487
>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei gambiense DAL972]
Length = 1452
Score = 116 bits (291), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 134/573 (23%), Positives = 250/573 (43%), Gaps = 63/573 (10%)
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ ++ A ++R L+ RT ++ T + H + +RI F ++G G ++ G P + M
Sbjct: 895 IESIEAKKMR-LQSERTMIENDT-QSVRHCS--RRIIPFAAVAGQSGAYVCGQHPLFLM- 949
Query: 939 FRERLR---VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS-GSTYDNYWP 994
+ R R + G + F +++ GFIY +G + + + S N W
Sbjct: 950 WDNRTRQLVAYRHQAPGPVRGFVPFTSMS--GGFIYCC-EGFVDFAVMNTYCSPGGNGWL 1006
Query: 995 VQKIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
++I + ATPH I Y ++ S V +P Q S + ++ + D++ + SV
Sbjct: 1007 RRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRP--QRASFDVQLKIQYDEDSNTVQSVT 1064
Query: 1054 LH---------------RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
R E +EVR+ + G W + + +E L ++V
Sbjct: 1065 TEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKG--WDCTDKLMLDENEKVLGAQMVE 1122
Query: 1099 LF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
+ N + + TA+ GEDV RGR++L ++ RN +++V +++S+ L G
Sbjct: 1123 IHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLAS-RNIKGRRSIV-QLHSEPLNG 1180
Query: 1156 AISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
+A+A + + +A G KI + W +L AF A +Y L++ +N+I+ GD+
Sbjct: 1181 PATAVAGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-MYATRLSVFRNYIIYGDL 1239
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+S + E+ L +L +D ++ + + ++ SD+++N+ I Y P++
Sbjct: 1240 CRSCSMARFNEENHTLTVLGRDRSAVSVVHCDMMYHDRAFGILCSDDERNVLIMGYTPRV 1299
Query: 1274 SESWKG------QKLLS-RAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
E+ G + +LS E+ + + K LR + A +S T L
Sbjct: 1300 QETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSSVT--------------L 1345
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH-SNGKAHRPGPD 1384
+ + G IG I P+ E R + ++L +P AGL PR F + S+ + G +
Sbjct: 1346 YVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQSSPRNSLRGKE 1405
Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+V LL L L + IA T ++
Sbjct: 1406 MLVPAPLLRGLFSLDLRSRKAIARAAYTQLDRV 1438
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ + LS IS T
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
P L+ +P++ +L V S + GV+ G NTI +
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373
Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
S+S++ + N+ L+SS L R + S+ A ++D +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
G +++L + G + + ++ S IT I L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487
>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Pseudozyma antarctica T-34]
Length = 1543
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 148/316 (46%), Gaps = 28/316 (8%)
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-----ELKGAISALASLQ 1164
+A GT+ GED ++G V LF L ++ K + + ++A+A L
Sbjct: 1169 FIAAGTSTFHGEDRTSKGSVYLFEVIEVVSGKYQLGRDLRLKLVCRDDARAPVTAIAELN 1228
Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
G LL G K+ + E L +AF D P Y+ SL ++KNF+L+ D KS+ L+++
Sbjct: 1229 GFLLSTCGQKLYVRALEKEEWLISVAFLDGP-FYMTSLRVLKNFVLVSDAKKSLCLLAFQ 1287
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE------------QKNIQIFYYAP 1271
E+ + L ++ + +FL+ LSLV + + I+++ YAP
Sbjct: 1288 EEPYRFVDLGREINDHNASMAQFLVYNDRLSLVSTSDVPLGGISGFGASAGVIRLYEYAP 1347
Query: 1272 KMSESWKGQKLLSRAEFHVGAHV--TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
++ + G +LL R+EF A + R + L+ S R G G K L+
Sbjct: 1348 HVATTLGGHRLLLRSEFQTPAAAVGSTVCRGRWLSDSELR-GREEGRSK-----LVLAKA 1401
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
+G++ ++ LD+ +RL LQ +LV SV H A LNPR+FR N R I+D
Sbjct: 1402 NGALDSLSALDDKVAKRLHLLQGQLVRSVQHTAALNPRAFRAVR-NDFVPRSLAKGILDA 1460
Query: 1390 ELLSHYEMLPLEEQLE 1405
LL + L + LE
Sbjct: 1461 RLLDRFVWLSRPKMLE 1476
Score = 94.7 bits (234), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 101/406 (24%), Positives = 170/406 (41%), Gaps = 58/406 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
LV +++ IY V + + S T D +L + + L G V L
Sbjct: 46 QLVTARDDLLTIYDVYDRSSSQSAASTSNGTANGTAGDAKPRHTLIVTRRHSLFGTVTGL 105
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ +D R ++++F DAK+++LE++D+ L S+H +E L G
Sbjct: 106 QRVDTLASDKDARH-RLLVSFADAKLALLEWNDTTDDLETVSIHTYERAT--QLLNGTPP 162
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS--------- 227
R P + VDP RC +L+ + IL + + E F G GF
Sbjct: 163 LFR-PNLNVDPLSRCAALLLPHDALAILPFYRDNA-----EFDFDDGLGFDLANDALDAS 216
Query: 228 --------ARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
A +ES S V+ +R++D ++++KDF F+ G+ +P + +L + TW G
Sbjct: 217 DAAAMAAAAHMESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFDHSPTWTG 276
Query: 273 RVSWKHHTCMIS--ALSISTTL------------------KQHPLIWSAMNLPHDAYKLL 312
++ + + + L +S +L HP++ ++ LP+D +L
Sbjct: 277 LLTHRKDSFAVYLFTLDLSASLDGATLGSAAALLDDGNMRSAHPVVTTSSQLPYDCLYML 336
Query: 313 AVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV----ELDAAH 368
P +GGVLVV + I + QS + N S+ E P S V +L A+
Sbjct: 337 PCPQSLGGVLVVCMSAILHVDQSGRVVVTALNRWFKTTSAIE-PESVLDVPGLADLQASQ 395
Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+ + A+LS GDL L DGR V+ L + + SD
Sbjct: 396 LVFTTDTDAVLSLSNGDLYRLRCHMDGRSVEGFRLERIDQLTAGSD 441
>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
subunit [Sporisorium reilianum SRZ2]
Length = 1567
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 138/561 (24%), Positives = 226/561 (40%), Gaps = 57/561 (10%)
Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP--CQRITIFKNISGHQGFF 927
P +RSL ++ + R P + TR E HGA C + + Q
Sbjct: 965 PAGGARSLDADGIATTGYRAGAVKLEPFEIGTRAEK-HGADGDCNALAVLGGGREAQASV 1023
Query: 928 LSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG-FIYVTSQGILKICQLPSG 986
L CW RL P+ G + ++ N F Y G L + + P G
Sbjct: 1024 L-----CWTEQGGYRLLDWPE---GDLCCIASIYTPRANDADFAYCDRAGQLWLARAPHG 1075
Query: 987 STYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1046
+ W + T + T + +V+ + Q ++ E G I +
Sbjct: 1076 LYAETSWMSSVV---RTGREYTRVVAHDATHTVVAASI-----QPCRFVLFDEDGEPIAD 1127
Query: 1047 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1105
T E+ I E DR +++E + +VTL +T
Sbjct: 1128 PGADEALPSTTAQRGALELFISE-DRTTAA----DGYEFEANETVTALEIVTLDAPSTAS 1182
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLF------STGRNADNPQNLVTEVYSKELKGAISA 1159
+ +A GT GED A+G V LF ++ R + V + +G ++A
Sbjct: 1183 GRKQFVAAGTTTFHGEDRTAKGCVYLFEVIEVVASARYQVGRDLRLKLVCRDDSRGPVTA 1242
Query: 1160 LASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
+A L G L+ G K+ + E L IAF D P LYV + +VKNF+LL D KS++
Sbjct: 1243 IAQLNGFLVSTCGQKLYVRALEKEEWLISIAFLDCP-LYVTGIRVVKNFVLLSDARKSLW 1301
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD------------EQKNIQI 1266
L+++E+ + L +D ++L+ L+LV + + +++
Sbjct: 1302 LLAFQEEPYRFVDLGRDIHDHHATLGQYLVYNERLALVSTSGAALGGSTAFGRDAGVVRL 1361
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTGAAPGSDKTNRFAL 1324
+ YAP ++ + +L+ R EF + T + R + L+ S R G G +K L
Sbjct: 1362 YEYAPHVASA--NTRLVLRTEFQTASPATASVACRGRWLSDSELR-GREHGRNK-----L 1413
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
+ +G++ +A D+ +RL LQ +LV SV H A LNPR+FR N R
Sbjct: 1414 VLAKANGALETLAAADDRVAKRLHVLQGQLVRSVLHTAALNPRAFRAVR-NDFVSRALGK 1472
Query: 1385 SIVDCELLSHYEMLPLEEQLE 1405
++D LL + L + LE
Sbjct: 1473 GVLDARLLDSFVYLSRPKMLE 1493
Score = 89.0 bits (219), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 158/349 (45%), Gaps = 48/349 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + L G V L + Q A + RD ++++F+DAK+++LE++D L S+H
Sbjct: 92 LVLVRRHTLFGVVTGLQRV-QTLATDKDARDCLLVSFKDAKLALLEWNDLTDDLETVSIH 150
Query: 161 CFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-------KASQGGSG 212
+E +P+ L+ G + P++ VDP RC +L+ + +L
Sbjct: 151 TYERAPQLLN---GTPNLFH-PILNVDPLSRCAALLLPHDALAVLPFYRDAADFDFDLDD 206
Query: 213 LVGDEDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
+ + +A +E+ S V+ +R++D ++++KDF F+ G+ +P + +L
Sbjct: 207 RLDLAKDDAAAVAAAAEMETLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFS 266
Query: 266 RELTWAGRVSWKHHTCMI-------------------SALSISTTLKQHPLIWSAMNLPH 306
TW G ++ + T + AL T HP++ ++ LP+
Sbjct: 267 HTPTWTGLLAERKDTFSVYLFTLDLSASLDGTLSSAADALDDGTVRSAHPVVTTSTALPY 326
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSF 360
D +++ P +GGVLVV +++ + QS + ALN + A+ +S +LP
Sbjct: 327 DCLYMVSCPQTLGGVLVVCMSSVLHVDQSGRVVVTALNGWFKTISAIEPESVLDLPEIP- 385
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+L + + +L+ GDL DGR V+ L + + S
Sbjct: 386 --DLQGSQLVFTAETAGVLALVDGDLYRFRCQMDGRSVEGFRLERMDQS 432
>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 1452
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 136/578 (23%), Positives = 247/578 (42%), Gaps = 73/578 (12%)
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ ++ A ++R L+ RT ++ T + H + +RI F ++G G ++ G P + M
Sbjct: 895 IESIEAKKMR-LQSERTMIENDT-QSVRHCS--RRIIPFAAVAGQSGAYVCGQHPLFLM- 949
Query: 939 FRERLRV-------HPQLCDGSIVAFTVLHN--VNCNHGFIYVTSQGILKICQLPSGSTY 989
+ R R P L G V FT + + C GF+ ++ P G
Sbjct: 950 WDNRTRQLVAYRHQAPGLVRG-FVPFTSMPGGFIYCCEGFV---DFAVMNTYCSPGG--- 1002
Query: 990 DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNHN 1048
N W ++I + ATPH I Y ++ S V +P Q S + ++ + D++
Sbjct: 1003 -NGWLRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRP--QRASFDVQLKIQYDEDSNT 1059
Query: 1049 LSSVDLH---------------RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1093
+ SV R E +EVR+ + G W + + +E L
Sbjct: 1060 VQSVTTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKG--WDCTDKLMLDENEKVLG 1117
Query: 1094 VRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYS 1150
++V + N + + TA+ GEDV RGR++L ++ RN +++V +++S
Sbjct: 1118 AQMVEIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLAS-RNIKGRRSIV-QLHS 1175
Query: 1151 KELKGAISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1208
+ L G +A+A + + +A G KI + W +L AF A +Y L++ +N+I
Sbjct: 1176 EPLNGPATAVAGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-MYATRLSVFRNYI 1234
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+ GD+ +S + E+ L +L +D ++ + + ++ SD+++N+ I
Sbjct: 1235 IYGDLCRSCSMARFNEENHTLTVLGRDRSAVSVVHCDMMYHDRAFGILCSDDERNVLIMG 1294
Query: 1269 YAPKMSESWKG------QKLLS-RAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
Y P++ E+ G + +LS E+ + + K LR + A +S T
Sbjct: 1295 YTPRVQETDAGTHPKVLESVLSLDGEYRLPSGSLVKSLRFRSTAGNSSVT---------- 1344
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNG-KAH 1379
L+ + G IG I P+ E R + ++L +P AGL PR F + +
Sbjct: 1345 ----LYVSNYGEIGFIVPIGEQANRTALWVTRRLQIDLPCEAGLTPRMFLSLNQRSPRNS 1400
Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
G + +V LL L L + IA T ++
Sbjct: 1401 LRGKEMLVPAPLLRGLFSLDLRSRKAIARAAYTQLDRV 1438
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ + LS IS T
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
P L+ +P++ +L V S + GV+ G NTI +
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373
Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
S+S++ + N+ L+SS L R + S+ A ++D +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
G +++L + G + + ++ S IT I L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487
>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
Length = 1340
Score = 115 bits (288), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 155/326 (47%), Gaps = 23/326 (7%)
Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYS 1150
++ L + + K+ E ++A G ++ ED+ + G ++ P T E++
Sbjct: 1003 LIQLDSKSRKKREYIVA-GITFIGTEDLPSTGAFHIYDLTEVIPEPGKPDTNFKLKEIFK 1061
Query: 1151 KELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1209
++++G+++++ + G LI KI++ + +AFYD P ++V NF++
Sbjct: 1062 EDIRGSVNSVCDISGRFLINQSQKIMVRDVQEDNSVVPVAFYDTP-IFVSDAKSFGNFLI 1120
Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
LGD + FL + + ++ L + S + + EFLI+ ++ ++D + + + Y
Sbjct: 1121 LGDSMQGFQFLGFDAEPYRMIPLGRSVSSFETVSVEFLINAGEINFAITDREDILHVLKY 1180
Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGT 1328
AP + GQKL+ + F++ + T L L R SDK +F + G
Sbjct: 1181 APDEPNTLSGQKLVHCSSFNLYSSNTCMLMLP-------RNDEFETSDKAPPKFQAIGGQ 1233
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDS 1385
+DG I I PL E T+RRL +Q++++D + GLNPR R F+ RP
Sbjct: 1234 VDGGIFKIIPLKEDTYRRLYVVQQQIIDKEVQLGGLNPRMERLDNDFYQLTHVMRP---- 1289
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTG 1411
++D ++ + L +E + A + G
Sbjct: 1290 MIDFNIIRRFSELSIERRTHFAQKAG 1315
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 83/386 (21%), Positives = 165/386 (42%), Gaps = 64/386 (16%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + L+ + +A++ Q + S +++A AKIS++ FD + L S+H
Sbjct: 48 LNLVEEFNLNAKITDIALIPQEKSPLS----CLVIASGVAKISIVRFDAVTNSLETLSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
+E + A+ ++VDP R +L++ I L G+
Sbjct: 104 YYEDKLS---DISLVTLAKTSKLRVDPMNR--ALLLFNNDSIALLPLFSGNHEDEDEDDE 158
Query: 212 ----GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
+ E T + S + ++++L ++++V D F++ + +P + +L++
Sbjct: 159 EDDYDVTRGEVTTKRSKKNEKHVGQSKIFHVKELHQELQNVLDIQFLNDFTKPTLAVLYQ 218
Query: 266 RELTWAGRVSWKHH--TCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+LTW G + MI L++ T T +I + +L D ++LL +
Sbjct: 219 PKLTWVGNTELNPQPTSFMIFTLNLRTNELETAFDVVIIATLHDLSWDWFQLLPISR--- 275
Query: 320 GVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSFSVELDA---AHATW--- 371
G +V+G N + Y + + LN++A D S + R EL+ T+
Sbjct: 276 GCVVMGNNEMAYIDNTGVLQSIIHLNSFA---DKSLQRARIIDETELEVFFNEKVTYFWS 332
Query: 372 -------LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----------TNPS-VLT 412
+ ++ L+ + +L + + +GR++ + DL K +NP+ V
Sbjct: 333 ASTDKKNIDDETLLIIDASANLYYVRLEAEGRLLTKFDLIKLPIVNDALKDTSNPTCVAR 392
Query: 413 SDITTIGNSL-FFLGSRLGDSLLVQF 437
D + +S+ F+G GDSL+V+
Sbjct: 393 VDPNSSNSSMDLFIGYLSGDSLVVRL 418
>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 1456
Score = 114 bits (286), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 123/523 (23%), Positives = 229/523 (43%), Gaps = 60/523 (11%)
Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ ++ A + R L+ RT ++ T + T H A +RI F ++G G ++ G P + +
Sbjct: 899 IESIEARKCR-LQRERTMIENDT-QSTRHCA--RRIIPFACVAGQSGAYVCGQHPVFLLW 954
Query: 939 FRERLRV--HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-----PSGSTYDN 991
+ + R+ + G++ F + GFIY +G + ++ P+G
Sbjct: 955 DKRKRRIAAYRHQSPGAVRGFVSFPQMA--GGFIYC-CEGFVDFARMNTYCAPNGQG--- 1008
Query: 992 YWPVQKIPLKATPH-----------------QITYFAEKNLYPLIVSVPVLKPLNQVLSL 1034
W ++I + ATPH + T+ ++ + + + + + LN V S+
Sbjct: 1009 -WLTRRIAIGATPHFLVYDPPGKSCFVVTSEKKTFRPQRAFFDVQLKIHYDEELNTVQSV 1067
Query: 1035 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ V H + + V R VE++EVR+L G W+ ++ +E L
Sbjct: 1068 TAEPPVCHMPPINPGAGV---RVPMVEQFEVRLL--STTGEQWECTHKFALEENEKVLGA 1122
Query: 1095 RVVTLFNTTT---KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK 1151
+ V L + + + TA+ GEDV RGR++L ++ + + +++S+
Sbjct: 1123 QAVELRQDEAIAGAPSAPVCVLCTAFPLGEDVTCRGRIILLAS--KTVKKKRAIVQLHSE 1180
Query: 1152 ELKGAISALASLQGHLLIASGP--KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1209
L G +A+ + + +A G KI + W +L AF A +Y L+ +N+I+
Sbjct: 1181 PLNGPATAVTGICSQIAVAVGGTIKIFRYDWETKKLVVSAFLYAG-VYATRLSAFRNYII 1239
Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
GD+ +S + EQ L +L KD ++ + + T ++ S++Q+++ + Y
Sbjct: 1240 YGDLCRSCAMARFNEQNHTLTVLGKDHNAVSVVHCDMMYHDRTFGILCSNDQRDLLLMGY 1299
Query: 1270 APKMSESWKGQKLLSR---AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF 1326
P++ ES G+ SR + F + L LA S AA S T ++
Sbjct: 1300 TPRVQES--GEHTPSRVLESPFSLDGEYR--LPSGCLAKSLRFRSAAGNSSVT-----VY 1350
Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
+ G +G I PL E R + ++L +P AGL PR F
Sbjct: 1351 ISNYGEVGFIVPLGEQANRTALWITRRLQVDLPCDAGLTPRMF 1393
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 130/334 (38%), Gaps = 77/334 (23%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
+++V+D FV EP++ +L ER TWAGRV W+ + LS IS L
Sbjct: 254 LRYVRDLQFVGSSGEPLLGVLCERRPTWAGRVKLVEWRTKAVDTNTLSMQVAWVQISGAL 313
Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPSP---IGGVLVVGANTIHYHSQSASCALALNNYA 346
HP L+ ++P++ ++ V S GV+ G NT+ + + + N+
Sbjct: 314 TTHPKLLLVGEVDSVPYNVTHMIPVESSSQTPSGVICFGINTVMHITTKRGYGVYFNSTG 373
Query: 347 V---------------------SLDSSQELPRSSFSVELDAAHATWLQN------DVALL 379
+ L+SS L R +FS L AT + +
Sbjct: 374 MEECGSNKSSAMSYGKMSWCDAKLESSTALFRVNFS--LANCTATIFSPRSSDSLQILAV 431
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
S + G + +L + G V + +S S +T I ++LFFLGS V F+C
Sbjct: 432 SEEDGVVAVLEFLSQGANVHDIQISVLASGCYCSSLTPISDNLFFLGS------AVSFSC 485
Query: 440 GSGTSMLSSGLKEEFGDIE--------------------ADAPSTKRLRRSSSDALQDMV 479
+ + +SG +F +E AD S R +S+S L+D
Sbjct: 486 IASITPTNSGAIGKFKVVESIEAIGSIRDVDVVDCSNDAADCISGPRGNQSNSSWLEDTP 545
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIG 513
E A N T S A R +++++
Sbjct: 546 FAE------LAGNTTLDPMPNLSVAQRRAIMDLA 573
>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
Length = 1357
Score = 114 bits (286), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 155/325 (47%), Gaps = 19/325 (5%)
Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYS 1150
++ N+ T + L+ +G + V ED+ G +++T +P T +V+
Sbjct: 1018 MIIQLNSKTNFKKELIVVGISNVGTEDLPPTGSFYIYNTNEVVPDPSKPDTNYRFKDVFH 1077
Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFIL 1209
+++KG I+ + + G ++ K+++ E + +AF+D P ++V + N +
Sbjct: 1078 EQVKGTINNVCEISGRFMVNQSQKLLVRDIQEDESVVPVAFHDVP-VFVADIKSFGNLFI 1136
Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
+GD + F+ + + ++ +L + A +F++ + VVSD + I Y
Sbjct: 1137 VGDSMQGFQFVGFDAEPYRMIMLGRSVSKFKTMALDFVVRNGEIYFVVSDTDDILHILKY 1196
Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
+P S GQ+L + F++ + T L ++D G + ++ F + L
Sbjct: 1197 SPDEPNSLSGQRLAHYSSFNIHSTNTSM----HLLPANDEFIENKG-NGSSIFQTIGANL 1251
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSI 1386
DGSI I PL E +FRRL +Q++++D+ H AGLNPR R +++ RP +
Sbjct: 1252 DGSIFKILPLSEDSFRRLYVIQQQIIDTEVHAAGLNPRMERLSNEYYQLTNVTRP----L 1307
Query: 1387 VDCELLSHYEMLPLEEQLEIAHQTG 1411
+D L+ Y L ++++ IA + G
Sbjct: 1308 LDFNLIRRYSNLSIKKRKSIAQKAG 1332
Score = 44.3 bits (103), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 70/365 (19%), Positives = 153/365 (41%), Gaps = 71/365 (19%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
S L L ++L+G V +A++ Q + + D +I+ AK+S++ F+ + L
Sbjct: 45 STNKLHLNYEFKLNGRVSDIALIKQVDS----KLDYLIILTATAKLSLVNFNVFTNSLET 100
Query: 157 TSMHCFESPEWLH--LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
S+H +E + LK +ES R +D C VL++ I + +
Sbjct: 101 ISLHYYEDKFRQNSILKLAKESKLR-----IDQAKNC--VLLFNNDNIAILPISSTTDEF 153
Query: 215 GDED-----------------TFGSGGGFSARIESSHVINLRDLDM----KHVKDFIFVH 253
DED F S +I +S +I L+ ++ +++ D F+
Sbjct: 154 EDEDLGQESSAKTVKRGNMSIKFPSQSQKKNKITNSSII-LKSTELNSKIQNIIDIQFLS 212
Query: 254 GYIEPVMVILHERELTWAGR-----VSWKHHTCMISAL-------------SISTTLKQH 295
+ +P + +L++ +L W G + ++ ++ L S++ L +
Sbjct: 213 NFSKPTLSVLYQPKLAWIGNSNLVTLPTQYMILTLNILERENIKSQENGENSLNQDLIET 272
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--HSQSASCALALNNY--AVSLDS 351
+I LP++ + ++ + + G +VG+N I Y H+ + +N + +L
Sbjct: 273 TIIGQVSELPYELHTIIPLNN---GSTLVGSNEIIYIDHTGVLQSLIIINQFQDKETLKK 329
Query: 352 SQELPRSSFSVELDA------AHATWLQNDV-----ALLSTKTGDLVLLTVVYDGRVVQR 400
+ + +S ++ L+ A + N+V L+ + ++ L+ + +GR++
Sbjct: 330 GRVIDKSKQNIILNKPIKFINAGSRVESNNVDDKNNVLIFDENNNIYLVNITLEGRLLIN 389
Query: 401 LDLSK 405
D++K
Sbjct: 390 FDINK 394
>gi|443919095|gb|ELU39366.1| cleavage factor protein [Rhizoctonia solani AG-1 IA]
Length = 788
Score = 114 bits (284), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 198/869 (22%), Positives = 335/869 (38%), Gaps = 166/869 (19%)
Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-IA 667
G T+ AG F + V QV R+L+ + L G++ + V+ +A
Sbjct: 33 GLTMVAGAFFQQTCVAQVTTNSIRLLEPDGAERQLYL------DAEGNKPRPKIKVAHVA 86
Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
DP++++ D + L VGD + V + + + +S + + D +RK
Sbjct: 87 DPFIVVLREDDTFGLFVGDTAKGRVRRKDVSHFGENGTICASASFFTDHTDLFQIRKPGD 146
Query: 728 DAWLSTGVGE--------------AIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
+G G A + GGP + ++V E G + V N
Sbjct: 147 VVATHSGPGSHRRRPGESSSSKRRASRKSRGGPAASTTLENIVDAEHGT-QWLVVLRKNG 205
Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
F V S R H + + L D ++S + +I + + + + R
Sbjct: 206 FFEVSTRTSTRQHQL-----QGLPD--VLVDSGQAHTVCTDGETDIEHVIIAPIGITR-- 256
Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
+P L I T+ Y+ P ++S++ PV ++ V FS
Sbjct: 257 ---PKPHLVVITKSRTLAIYEPVPAPPPPDSSENSAPVRDQLTVQFVKV---------FS 304
Query: 894 RT-PLDAYT------REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRV 945
R PLD + R P +P N+SG F++G P W + LR+
Sbjct: 305 RALPLDMHDTKRVAGRSLVPFKSP--------NLSG---IFVTGDHPFWLLRTDASALRI 353
Query: 946 HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPH 1005
+P H YV S G + LP +IP +
Sbjct: 354 YP-------------------HAAQYVNSFGTTVVEWLPDVDIS------HEIPCR---- 384
Query: 1006 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD-LHRTYTVEEYE 1064
+Y ++ V+ V S L + D++ L + D H +
Sbjct: 385 --SYASDDGRVYTSVAYDVSTRHILAASALRTTFAYYDEDSNELYTPDATHPNPEIHCSA 442
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDV 1123
+ ++ PD W T +E V + L +T+ + + +GT +GED+
Sbjct: 443 LELITPDT----WTTVDGYEFAQNEFVNAVESIPLETLSTERGLKDYVVVGTTISRGEDL 498
Query: 1124 AARGRVLLFSTGR-----NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
A +G +F + Q + + ++ KGA++AL + G+L+ + G KI +
Sbjct: 499 AVKGATYVFEVVEVVPEPGSKTRQYRLRLLCREDSKGAVTALCGMNGYLVSSMGQKIFVR 558
Query: 1179 KWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
+ E L GIAF D + V SL +KN +L+GD+ KS++F++++E+ +L L KD
Sbjct: 559 AFDLDEKLTGIAFMDVG-VCVTSLRPLKNLLLVGDMVKSVWFVAFQEEPFKLVPLGKDRQ 617
Query: 1238 SLDCFATEFLIDG-STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT- 1295
L +F + LS V D + +F G +L+ +EFH HVT
Sbjct: 618 QLSVTHADFFFGSQAQLSFAVLD---DFGVF-----------GLRLICSSEFH--THVTH 661
Query: 1296 --------------KFLRLQMLATSSD----RTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
+ +Q L T S T P + + G+ DG+I +
Sbjct: 662 RGVLSVSRKADFDSDVMSIQSLGTESSLIFGETKPYPFHQHNSILTMCGGSSDGTIASLT 721
Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
PL+E F RLQ LQ +L+ R H NG ++D LL+ +E
Sbjct: 722 PLNESEFGRLQLLQGQLI--------------RNVH-NG---------VLDGNLLAAFEE 757
Query: 1398 LPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
LP+ +Q+E+ Q G R +IL++L L +
Sbjct: 758 LPVSKQVEMTQQIGAEREKILNDLLKLRI 786
>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
Length = 1300
Score = 112 bits (281), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 164/338 (48%), Gaps = 30/338 (8%)
Query: 1088 SENALTVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP--- 1141
SEN++ + T+ N+ T+ L+ IG+++V+ ED + G +L+ P
Sbjct: 954 SENSMVNDIKTMLIQLNSKTRRKRELVIIGSSFVKEEDQPSTGCLLVLDITEVVAEPGKP 1013
Query: 1142 -QNL-VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNG---IAFYDAPPL 1196
N +++ +E++G+++A+ + G +I K ++ E N +AF D P +
Sbjct: 1014 DSNFKFKQLFEEEIRGSVNAVCEISGRFMIGQSSKALVRDMQ--EDNSAVPVAFLDMP-V 1070
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
++ N +++GD + F+ + + ++ +L K EFL++ ++ +
Sbjct: 1071 FITDAKSFSNLMIIGDSMQGFTFVGFDAEPYRMIVLGKSTSKFQVMNLEFLVNNGNINFI 1130
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
V+D Q ++ + YAP + S GQ+L+ F++ +++L R GS
Sbjct: 1131 VTDRQNHLHVLRYAPDEANSLSGQRLVHCNSFNMFT-TNNYMKLV-------RKHVEFGS 1182
Query: 1317 DKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFH 1373
+N AL T DGSI + PL+E ++RR +Q++L+D +AG N + R +++
Sbjct: 1183 KTSNYIALGCQT-DGSIFRMIPLNEASYRRFYLVQQQLLDHEIPLAGFNTKMERLDNEYY 1241
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
G + RP DS ++L Y LP+ ++ I ++ G
Sbjct: 1242 HKGHSLRPTLDS----QVLKKYIHLPITKRTTIENRVG 1275
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 136/639 (21%), Positives = 257/639 (40%), Gaps = 111/639 (17%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
A L L ++L G + + +L Q G S + IL+ +K+S++ FD L
Sbjct: 45 AQKLVLAYEWKLAGKIIDMQLLPQIG---SPLKMLAILS-SKSKVSLVRFDPVAESLETL 100
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD 216
S+H + ++++L S ++ VDP RC +LV+ ++ IL + D
Sbjct: 101 SLHYYHD-KFVNL--STSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEID 155
Query: 217 EDTFGSGGGFSARIESSHVINLRDLDM---------KHVKDFIFVHGYIEPVMVILHERE 267
ED G + R++ + I + M KHV D +++ + +P + IL++
Sbjct: 156 EDENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPV 215
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
L W G +T LS+ ++ +I +LP+D + L VP G VL +G N
Sbjct: 216 LAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTL--VPLKRGYVL-IGVN 272
Query: 328 TIHYHSQSA---SCALALNNYAVSLDSSQELPRSSFSVELDAA----HATWLQNDVALLS 380
+ Y S S SC + LN +A S +++ S ++ L + + ++D+ +L
Sbjct: 273 ELLYISASGALQSC-IRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILI 331
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL-----GD---- 431
+ + + +G ++ + D + I N + F SRL GD
Sbjct: 332 DENCRMYNIITESEGNLLTKFDCVQ----------VPIVNEI-FKNSRLPLSVCGDLNLE 380
Query: 432 --SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
+L+ F G + LK F + ++L + D E +LYG
Sbjct: 381 TGRVLIGFLSGDAMFLQLKNLKVAFA-------AKRQLVETVDDDDD-----EYSALYGE 428
Query: 490 ASNNTES----AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
+ NNT + Q+ F ++ DS+ NIGPL + G + + + + + E +
Sbjct: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEF-SIV 487
Query: 546 GCKGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
G+ T H + + H + + + ++ + ++ + L T D E +
Sbjct: 488 ATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWN----LKIKGKDKYLVTTDADKEKS 543
Query: 601 E------------SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDL 643
+ + D+ RTI + +R++QV G + D + +T D+
Sbjct: 544 DVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI 603
Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
++ I DPY+L + G+I++
Sbjct: 604 E----------------IVHACIIDPYILFTDARGNIKI 626
>gi|401624207|gb|EJS42273.1| cft1p [Saccharomyces arboricola H-6]
Length = 1356
Score = 112 bits (280), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 154/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T K+ E ++A G A ED G ++
Sbjct: 1000 WKVIDKIDFPNNSVVNEMRSSMIQVNSKTKKKREYIIA-GVANATTEDTPPTGAFHIYDV 1058
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1059 TEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1118
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +L+GD + F+ + + ++ LL + + EFL+
Sbjct: 1119 AFLDIP-VFVTDSKSFGNLLLIGDAMQGFQFIGFDAEPYRMILLGRSISKFQTMSLEFLV 1177
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + + + L L
Sbjct: 1178 NGGDMYFSATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTLHSINSCMLLLP------- 1230
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGSI I PL E T+RRL +Q++++D + GLNPR
Sbjct: 1231 -KNEEFGSSQVPSFQNVGGQVDGSIFKIVPLSEETYRRLYVIQQQIIDREIQLGGLNPRM 1289
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + A + G
Sbjct: 1290 ERLANDFYQMGHSMRP----MLDFNVIRRFSELAIDRRKNTAQKAG 1331
>gi|156847699|ref|XP_001646733.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
70294]
gi|156117413|gb|EDO18875.1| hypothetical protein Kpol_1023p44 [Vanderwaltozyma polyspora DSM
70294]
Length = 1337
Score = 112 bits (279), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 156/330 (47%), Gaps = 24/330 (7%)
Query: 1091 ALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT---- 1146
++T++V N+ TK+ LL +G A + ED+ + G + P T
Sbjct: 998 SMTIQV----NSKTKKKRELLVVGVASIGTEDLPSAGSFHVIDINEVVPEPGKPDTNYKF 1053
Query: 1147 -EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIV 1204
E++ + ++G ++++ + G +I K+++ E + +AF D P +YV
Sbjct: 1054 KEIFQETVRGNVNSVCEISGRFMINQSQKLLVRDIQEDESVVPVAFLDVP-VYVTDTKSF 1112
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
N +++GD + F+ + + ++ L + A EFL++ + +VSD +
Sbjct: 1113 SNLMIVGDSMQGFQFVGFDAEPYRMIPLGRSVSKFKTVALEFLVNNGDIFFIVSDRNDIL 1172
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
+ YAP S GQ+L + F++ + T + L S++ ++P T F
Sbjct: 1173 HVLKYAPDEPNSLSGQRLAHYSSFNIHSTNTSMI----LLPSNNEFQSSPNGQAT--FQS 1226
Query: 1325 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRP 1381
+ +DGSI + PLDE +FRRL +Q++++D+ GLNPR R +++ RP
Sbjct: 1227 VGSCVDGSIFKVIPLDEDSFRRLYVIQQQVIDTEIQAGGLNPRMERLSNEYYQLVHLMRP 1286
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
++D ++ + L + ++ +IA + G
Sbjct: 1287 ----MLDFNIIRRFSNLSITKRTKIAQKAG 1312
>gi|157873900|ref|XP_001685450.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania major strain Friedlin]
gi|68128522|emb|CAJ08654.1| cleavage and polyadenylation specificity factor-like protein
[Leishmania major strain Friedlin]
Length = 1541
Score = 112 bits (279), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 135/610 (22%), Positives = 264/610 (43%), Gaps = 71/610 (11%)
Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
SA + L IL+ G ++ Y+ + GP K + + + V +R +
Sbjct: 929 SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVMESIKARKKR 988
Query: 890 LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRV 945
L+ R L + T ++ H + +R+ F+ + ++G ++ G P + +V+ +L
Sbjct: 989 LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRYKGIYVCGQTPVF-LVYHAATNQLVC 1044
Query: 946 HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYD-NYWPVQKIPLKAT 1003
++ F H+ + + GF+Y +G + + P G + W ++++ L T
Sbjct: 1045 TRHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGCSGWWLERVRLGCT 1103
Query: 1004 PHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSV 1052
PHQ+ Y + ++ S P P + L ++ D+E V H I+ +L S+
Sbjct: 1104 PHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSAT 1163
Query: 1053 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------ 1106
T E YEV+ WQ + + +E L+ ++ + TT +
Sbjct: 1164 SGSPVPTNERYEVQFFSTLN----WQCMGRLVLDGNEKVLSATLMQVTRDTTMDAANRST 1219
Query: 1107 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QG 1165
+ A+ TAY GEDV RGR+LL +T + + + ++ + ++G ++A+ + +
Sbjct: 1220 TAPVCALATAYPLGEDVTTRGRILLLTTSQQSGQGMQQLRTLHEEPMEGPVTAITRVGED 1279
Query: 1166 HLLIASGPKIILHKWTGT----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
+ +A G + ++++ E I + A YV L + ++++GD+ S+ F
Sbjct: 1280 CVAVAVGGTVRVYRYDANKSTMETMAILYAGA---YVTCLQAFREYLVIGDLFNSVLFAR 1336
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK-MSESWKGQ 1280
+ E+ + +L +D ++ + + L + L+V+D+ +N+ Y P+ + E K
Sbjct: 1337 YSEEIHTITILGRDTSAISVVSNDMLYHDTRFGLLVTDDARNLMCMSYKPRVLEEHGKPP 1396
Query: 1281 KLLSR-----AEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
K+L E+ + G + K +RL+ + S ++ T G IG
Sbjct: 1397 KVLESLLTVTGEYRLAGGVLLKMMRLRAASARSSSV-------------AIYVTNMGEIG 1443
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ PL + T R Q + ++L V H GL PR F F + S+ E + H
Sbjct: 1444 YLVPLGDQTSRTGQWVVRRLQSEVAHAGGLPPRMFLGFPQDDPLR-----SLKGEEWMLH 1498
Query: 1395 YEMLPLEEQL 1404
+ PL EQL
Sbjct: 1499 F---PLLEQL 1505
Score = 43.9 bits (102), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 81/185 (43%), Gaps = 37/185 (20%)
Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
GGG S + V + R D+K +++D FV EP++ L E++ TWAGRV
Sbjct: 282 GGGTSLLLRVGTVTHWRLQDVKSALRNIRDVQFVQSAGEPLLAFLFEKQPTWAGRVKLLE 341
Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGV 321
W+ TC I ++++ + H L S ++ LP+D + +P+ P
Sbjct: 342 WRSKTVESHMLTCSIEWMKVTLANSATPHMLSLSEVDGLPYDVTSMTPLPAFQDLPSAVF 401
Query: 322 LVVGANTIHYHSQSA----------SCALALNNYAVSLD------SSQELPRSSFSVELD 365
V +H ++S A +L + AVSL+ +SQ L V L+
Sbjct: 402 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLEAVQWRSASQALSTDLVKVNLN 461
Query: 366 AAHAT 370
A+AT
Sbjct: 462 FANAT 466
>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
Length = 1331
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 151/321 (47%), Gaps = 20/321 (6%)
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1154
++ T+ + + +G A+V+ ED+ G + +F P T EV+ + ++
Sbjct: 999 LDSRTRRRKEYVIVGVAHVETEDLPPSGSLSVFDITEVVPEPGKPDTNFKLGEVFKENIR 1058
Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
G +S++ + G LI K+I+ + +AF D P ++V + NF+++GD
Sbjct: 1059 GTVSSVCDISGRFLINQSQKVIVRDVQEDNSVVPVAFLDVP-VFVTDVKSFGNFLIIGDS 1117
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+ F+ + + ++ L + L+ A EFL++G + V+D + IF YAP
Sbjct: 1118 MQGFQFIGFDAEPYRMIPLGRSVSKLETVALEFLVNGGDIFFAVTDTSNILHIFKYAPDE 1177
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
S GQ+L+ F++ + T + L + G + ++ G DGS+
Sbjct: 1178 PNSLSGQRLVHCTSFNLHSTNTCMVLL------PKNEEFSVGEKSLSPVQVVGGQTDGSL 1231
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1390
+ PL E T+RRL LQ++L + + GLNPR R +++ A RP +++
Sbjct: 1232 FKLVPLREDTYRRLYVLQQQLTEKEVQLGGLNPRMERLSNEYYHLTHAVRP----MLEFN 1287
Query: 1391 LLSHYEMLPLEEQLEIAHQTG 1411
++ + L +E++ + A + G
Sbjct: 1288 VIRRFNTLSVEKRKQTAQKAG 1308
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 140/656 (21%), Positives = 270/656 (41%), Gaps = 124/656 (18%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L ++ G + LA++ Q + D ++L AKISV+ +D++ + + S+H
Sbjct: 48 LILTHEFKFEGRITDLAVVPQKDSP----LDCLLLCTSIAKISVVRYDEASNSIETLSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
+E R A+ ++VDP RC L++ +I L Q S
Sbjct: 104 YYEDS---FKDRSILELAKESTMRVDPGKRCA--LLFNNDVIALLPLQTTSLNDGEEEDE 158
Query: 212 ---GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
D+ + G +A S + N ++L DM +V D F+ + P + ++ E
Sbjct: 159 DMDDERPDKRQKNNKGRITA---PSAIFNAKELHQDMNNVIDVTFLRNFTRPTLAVIFEN 215
Query: 267 ELTWAGR-------VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
+ WAG V++ T +++ ST +K +I + L D + ++ + +
Sbjct: 216 KPVWAGTSQVLPLPVTYMAFTLEVTSNEQSTDIKS-TVIATVKELSWDFHTMIPIAN--- 271
Query: 320 GVLVVGANTIHYHSQSASCA--LALNNYA-VSLDSSQELPRSSFSVELDAAHA-TWLQND 375
G ++VG+N + Y + S + LN+YA ++ ++ + RS + L W +D
Sbjct: 272 GCIIVGSNEMAYIDNTGSLQSIIFLNSYANKNMKKARIVDRSKSKILLHKPTTYNWSVSD 331
Query: 376 VALLSTKTGDLVLLT----------VVYDGRVVQRLDL-----------SKTNPSVLTSD 414
++TG+ +L+ + Y+GR++ + D+ + +N + ++
Sbjct: 332 ---QKSETGETLLIMDHQAAFYYIQLEYEGRLLTKFDIINLPIVNDTLKNNSNATCISRL 388
Query: 415 ITTI-GNSL-FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+T+ GN + F+G R GD+ +++ + L + ++ E +P + +
Sbjct: 389 NSTLSGNYVDLFVGFRSGDASVLRL------NNLKAAIESRDEHKEITSPPENDIEKFED 442
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADAS 529
+D + EE S N E +T F V SL NI P+ + G + D
Sbjct: 443 ---EDDLYSEEASDADKEKENKEVVVETVLPFDIEVLSSLRNIAPITSLTPGKICSVDKF 499
Query: 530 ATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-EARTM 588
G+S + E V L G T G + +M+ + A IS+ + +
Sbjct: 500 VEGLSNPNRNE-VSLVATSGNGT-------GSHLTEIQMSVRPEVQLALKFISITQMWNL 551
Query: 589 VLETADLLTEVTES---------VD----YFVQGR-----TIAAGNLFGR-RRVIQVFER 629
++ D T+S +D + +GR T + ++FG +R++QV
Sbjct: 552 KIKNKDKYLITTDSNKNKSDIYLIDKNFALYKEGRFRRDATTVSISMFGSDKRIVQVTTN 611
Query: 630 GARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ D ++ T F V+ VS+ DPY+L+ +S G I++
Sbjct: 612 HLYLYDTNFKRLTTMKFEF--------------EVVHVSVMDPYILITVSRGDIKV 653
>gi|430810872|emb|CCJ31592.1| unnamed protein product [Pneumocystis jirovecii]
gi|430814599|emb|CCJ28188.1| unnamed protein product [Pneumocystis jirovecii]
Length = 203
Score = 111 bits (278), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/192 (34%), Positives = 101/192 (52%), Gaps = 2/192 (1%)
Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
L KD SL + +FL+D L V+ D+ NI +F Y P+ +S+ GQKLL R +FHV
Sbjct: 3 LFGKDHSSLSVSSADFLVDDEHLYFVIGDDDGNIHVFNYDPENPQSFSGQKLLKRGDFHV 62
Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL-LFGTLDGSIGCIAPLDELTFRRLQS 1349
G+H+ L L A + N+ +L L + DGS+G + L E T+RRL
Sbjct: 63 GSHIKSILMLPKEAFPQNVNDKEETRASKNQDSLCLCASQDGSMGVLISLPEKTYRRLYF 122
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+Q +L+++ VAGLNP S+R K P I+D +LL Y L +Q ++A +
Sbjct: 123 IQGQLINTEDKVAGLNPISYRTSTYVSKTSNPAR-GILDGKLLYQYNNLERNKQKDMARK 181
Query: 1410 TGTTRSQILSNL 1421
+G I+ +L
Sbjct: 182 SGMPVETIIYDL 193
>gi|401841121|gb|EJT43641.1| CFT1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 1355
Score = 111 bits (277), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 151/346 (43%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + N+ TK + G A ED G ++
Sbjct: 999 WKVIDKIDFPKNSVVNEMRSSMIQI-NSKTKRKREYIVAGVANATTEDTPPTGSFYIYDV 1057
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1058 IEVVPEPGKPDTNYKLKEIFQEEVNGTVSTVCEISGRFMISQSQKVLVRDIQEDNSVIPV 1117
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ LL + + EFL+
Sbjct: 1118 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMILLGRSVSKFQTMSLEFLV 1176
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ I YAP S GQ+L+ + F V + + + L
Sbjct: 1177 NGGDMYFAATDADRNVHILKYAPDEPNSLSGQRLVHCSSFTVHSINSCMMLLP------- 1229
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E T+RRL +Q++++D + GLNPR
Sbjct: 1230 -KNQEFGSSQVPSFQNVGGQVDGSVFKIVPLSEETYRRLYLIQQQIIDRELQLGGLNPRM 1288
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + A + G
Sbjct: 1289 ERLANDFYQMGHSMRP----MLDFNVIRRFSGLSIDRRKNTAQKAG 1330
>gi|356527660|ref|XP_003532426.1| PREDICTED: disease resistance response protein 206-like [Glycine
max]
Length = 281
Score = 110 bits (275), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 76/106 (71%), Gaps = 8/106 (7%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
MSFAAYKMM PTGI NC GF+THSR+D+VP +Q +++D E PS+ +G +PNL
Sbjct: 1 MSFAAYKMMQCPTGIDNCAVGFLTHSRSDFVP----LQPDDIDVEWPSRPCHHVGSLPNL 56
Query: 59 VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELV 104
+VT ANV+E+Y VR+QE+ S K + +++ L+DGI ASLELV
Sbjct: 57 IVTVANVLEVYAVRLQEDQSP--KAAIDSRSDTLLDGIVGASLELV 100
>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
Length = 1597
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/341 (28%), Positives = 158/341 (46%), Gaps = 30/341 (8%)
Query: 1085 MQSSENALTVRVVTLFN-TTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1143
+++E ++ +VTL + +T + +A GT GED A+G V LF
Sbjct: 1190 FEANETVTSLEIVTLDSPSTVSGRKQFVAAGTTTFHGEDRTAKGSVYLFEIISVVSAASE 1249
Query: 1144 LVTEVYSK-----ELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLY 1197
L +++ K + + ++A++ + G+L+ G K+ + E L IAF D P Y
Sbjct: 1250 LGSDLRLKLVCRDDSRAPVTAISHINGYLISTCGQKLYVRALEKQEWLISIAFLDCP-FY 1308
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD---------FGSLDCFATEFLI 1248
+ S+ +VKN +LLGD + + +++E + LAK F D + I
Sbjct: 1309 ITSIEVVKNLVLLGDCKRGLGLWAFQEDPYKFVELAKAEDGCVGVGAFLVRDEKVSMLSI 1368
Query: 1249 DGSTLSLVVSDEQKN--IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL--QMLA 1304
GS L S E I+++ YAP ++ G+KL+ R+EF + + + L+
Sbjct: 1369 SGSRLGGDASMEASAGVIRLYEYAPHLAVG--GKKLVLRSEFQTTSEAVARVECSGRWLS 1426
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
S R +T R ++F +GS+ +A +DE +RL LQ +LV SV H A L
Sbjct: 1427 DSELR------GRETLRNKVVFAKANGSVESVAAVDEKVGKRLHLLQGQLVRSVMHTAAL 1480
Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
NPRSFR N R ++D LL + L + LE
Sbjct: 1481 NPRSFRMVR-NDYVPRALVKGVLDARLLDEFMRLSRPKMLE 1520
Score = 87.4 bits (215), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 95/399 (23%), Positives = 177/399 (44%), Gaps = 54/399 (13%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRLHG 111
LV +V+ IY V Q S S+++ + S +L + ++ L G
Sbjct: 46 QLVTARDDVLTIYDVYGQPHASASTIPGISRHTATSSVSSNTSACSHKNLVISRNHTLFG 105
Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHL 170
V L + Q A + RD ++++F+DAK+++LE++D+I L S+H +E +P+ L+L
Sbjct: 106 AVTGLQRV-QTLASDKDNRDRLLVSFKDAKLALLEWNDAIDDLETISIHTYERAPQLLNL 164
Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-------EDTFGSG 223
P++ VDP RC +L+ + IL + + D +
Sbjct: 165 A----PHLFHPILNVDPLSRCAALLLPHDSLAILPFYRDAADFDFDLDDHLEIAKDDVAA 220
Query: 224 GGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
+A ++S S V+ +R++D ++++K F F+ G+ +P + +L TW G +S
Sbjct: 221 VVAAADLQSLPYSPSFVLTMREVDPKIRNLKHFCFLPGFQKPTVAVLFSHNPTWTGLLSE 280
Query: 277 KHHTCMI--------------------SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+ T + AL T HP++ ++ LP+D ++A P
Sbjct: 281 RKDTFSVYLFTLDLSASLDGATFSSSAEALDDGTARSAHPVVTTSTPLPYDCLYMVACPQ 340
Query: 317 PIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSFSVELDAAHAT 370
+GGV+VV +++ + QS + ALN + A+ +S EL S +L +
Sbjct: 341 TLGGVIVVCMSSLLHVDQSGRVMVTALNQWFKTTSAIEPESILEL---SDIADLQGSQLV 397
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ +L+ G++ DGR V+ + L + S
Sbjct: 398 FTSKTQGVLTLVNGEIYRFRCQTDGRSVEGIRLERMQES 436
>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
Length = 1459
Score = 108 bits (271), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/327 (23%), Positives = 149/327 (45%), Gaps = 22/327 (6%)
Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS-------TGRNADNPQNLVTEV 1148
+V N+ T+ + G A + ED+ G ++ TG+ N + E+
Sbjct: 1120 MVIQLNSRTRAKREYIVAGLANIGSEDLPPTGSFYIYDISPVLPETGKPDTNYK--FKEI 1177
Query: 1149 YSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
++++++G ++++ + G I KI++ + +AF D P +YV NF
Sbjct: 1178 FTEDVRGLVTSVCEISGRFTINQSQKIMVRDVQEDNSVVPVAFLDIP-VYVTDTKSFGNF 1236
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
+L+ D + + F+ + + ++ LL K L EF+ + + +D + IF
Sbjct: 1237 LLISDSMQGLQFVGFDAEPFRMILLGKSIPDLKISTVEFIANNGNIYFAATDYDNILHIF 1296
Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
YAP S GQKL+ + F++ + + + L +D + F L G
Sbjct: 1297 KYAPDEPNSLSGQKLVHCSSFNLHSSTSCMIML----PGNDEFSENEQDNFIPSFQTLGG 1352
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPD 1384
+DGSI + PL+E +RRL +Q+++ D V GLNP+ R +++ +P
Sbjct: 1353 QVDGSIFKVIPLEESPYRRLYVIQQQITDYEVQVGGLNPKMERLSNEYYQKSNMLKP--- 1409
Query: 1385 SIVDCELLSHYEMLPLEEQLEIAHQTG 1411
++D ++ + MLP++++ A + G
Sbjct: 1410 -MLDFNIIRRFSMLPIDKRRRTAQKAG 1435
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 58/116 (50%), Gaps = 17/116 (14%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC----F 162
++ G + + ++ Q G++ D ++L +AKIS+++FD+ ++ L+ S+H F
Sbjct: 54 FKFSGKITDIVLIPQRGSE----LDCLLLVTPNAKISIIKFDEELNTLKTISLHYYTDEF 109
Query: 163 ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
E L L AR ++V+P+ +C VL++ + I + + DED
Sbjct: 110 EKLSMLQL-------ARTSQLRVEPKKKC--VLLFNTESIAILPFTQQFNIDNDED 156
>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
Length = 1327
Score = 108 bits (271), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/321 (22%), Positives = 144/321 (44%), Gaps = 20/321 (6%)
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN-----LVTEVYSKELK 1154
++ T+ + + +G A V ED+ G +F P ++EV+ +E++
Sbjct: 995 LDSRTRRKKEYVIVGVAVVGTEDLPPSGSFFVFDITEVVPEPGKPDTNFKLSEVFQEEIR 1054
Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
G +S + + G LI K+++ + +AF D P ++V NF+++GD
Sbjct: 1055 GTVSTVCEISGRFLINQSQKVLVRDVQDDNSVVPVAFLDIP-VFVTDAKSFGNFMIIGDA 1113
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+ F+ + + ++ L + ++ + EFL++G + ++D + +F YAP
Sbjct: 1114 MQGFQFVGFDAEPYRMIPLGRSIAKMETVSVEFLVNGGDIFFAITDTDDILHVFKYAPDE 1173
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
S GQ+LL F++ + T +A P F + G +DGS+
Sbjct: 1174 PNSLSGQRLLHCTSFNLHSTNT------CMALLPKNEEFEPAQANMKNFQAIGGQVDGSV 1227
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1390
+ PL E +RRL +Q+++ + + GLNPR R + + RP ++D
Sbjct: 1228 FKLLPLREDVYRRLYVVQQQITEKELQLGGLNPRMERLSNEHYKTTHVLRP----MLDFN 1283
Query: 1391 LLSHYEMLPLEEQLEIAHQTG 1411
++ ++ L + + +I+ + G
Sbjct: 1284 VIQRFKRLSTDRRKQISQKVG 1304
Score = 73.6 bits (179), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 130/638 (20%), Positives = 269/638 (42%), Gaps = 86/638 (13%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
+A L L ++ HG + LA++ Q + D ++L AK+S+++FD + +
Sbjct: 45 SAKLFLTNEFKFHGKITDLALIPQVNSS----LDCLLLCTSIAKVSIVKFDPLSNSIETA 100
Query: 158 SMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG-LQMIILKASQGGSGLV 214
S+H +E + L+ ++S+ R +DP RC +L L ++ +A+
Sbjct: 101 SLHYYEDKFRDLSLLEIAQQSYFR-----LDPSKRCAIILNNDVLALLPFRAA------T 149
Query: 215 GDEDTFGSGGGFSARIES--------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILH 264
D++ + R+++ S + ++L ++++V D F++ + +P + IL
Sbjct: 150 DDDEEADAENNDVKRMKTSSDKVTYPSKIFVAKELHSEIRNVIDVQFLNNFSKPTIAILF 209
Query: 265 ERELTWAG--RVSWKHHTCMISALSISTTLKQHPL----IWSAMNLPHDAYKLLAVPSPI 318
E L WAG +++ + + MI L IS+T I L D + L+ + +
Sbjct: 210 EPTLIWAGNRQLNPQPISYMIFTLEISSTDNTTKFGATTIGKLTGLSWDFHSLVPISN-- 267
Query: 319 GGVLVVGANTIHYHSQSASC--ALALNNYA-VSLDSSQELPRSSFSVELDAAHA-TW--- 371
G ++VGAN + + S + + LN+++ +L + + S + + L + A W
Sbjct: 268 -GCMIVGANELAFADNSGALQSVILLNSFSDRNLRQGRIIDNSKYEILLPQSIARCWSPP 326
Query: 372 ----LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFF 424
+ ++ LL ++ + + +GR++ + D+ K N ++ + T + L
Sbjct: 327 TSDKVNDETLLLMDANSNVYYVQLESEGRLLIKFDIIKLPIVNDTLKNNQGCTCMSRLNS 386
Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-SSDALQDMVNGEE 483
S LL+ F G + + LK + ++ + S D +D + +E
Sbjct: 387 RSSNNNMDLLMGFKSGDALVVRLNNLKSAAESRDEHKIFSEAMESSFDKDEDEDNLYSDE 446
Query: 484 LSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGI--SKQSN 538
S G A +N E +T F + ++ NIGP+ + G + + G+ ++
Sbjct: 447 ASDAGKADDNKEVIVETVTPFDIELLSTIKNIGPITSLAVGKVCSVEKYVKGLLNPNRNE 506
Query: 539 YELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL- 595
Y +V G T S R + + + ++ + ++ R L T D
Sbjct: 507 YSMVATSGNGSGSHLTEIQGSVRPTVEVALKFISVTQIWN----LKIKNRDKYLVTTDSN 562
Query: 596 -----LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
+ E+ + +GR T+ G +R++QV + D ++ + L+
Sbjct: 563 KAKSDIYEIDNNFALHKEGRFRRDATTVCISMFGGDKRIVQVTTNNLILYDTNF--RRLT 620
Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
+ E V+ VS+ DPY+L+ +S G I++
Sbjct: 621 TMKFDYE---------VVHVSVMDPYILITVSRGDIKI 649
>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
Length = 1388
Score = 108 bits (269), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 148/322 (45%), Gaps = 24/322 (7%)
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKG 1155
N+ + + G ++ ED+ G ++ P + EV+ +E++G
Sbjct: 1055 NSKARRKREYIIAGVTHIGTEDLPPTGAFHIYDITEVVPEPGKPDTNYRLKEVFKEEVRG 1114
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGI---AFYDAPPLYVVSLNIVKNFILLGD 1212
+S + + G L+ K+++ E N + AF D P +++ +F++LGD
Sbjct: 1115 IVSTVCEISGRFLVNQSQKVMVRD--AQEDNSVVPVAFLDIP-VFINDAKSFGDFLILGD 1171
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ ++F+ + + ++ L K + + EF+++G L ++D + + YAP
Sbjct: 1172 AMQGLHFIGFDAEPYRMINLGKSVTKFETVSVEFVVNGGDLYFALTDRNNILHVLKYAPD 1231
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
S GQKL+ + F++ + + L L D T AP + F + G +DGS
Sbjct: 1232 ELNSLSGQKLVHCSSFNLFSGNSSLLLLPKNEEFED-TKNAPLT-----FQTIGGQVDGS 1285
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDC 1389
I + PL E T+RRL +Q+ + D P + GLNPR R +++ RP ++D
Sbjct: 1286 IFKVIPLREDTYRRLYVIQQHMNDKEPQLGGLNPRMERLSNEYYQLCHVMRP----MLDF 1341
Query: 1390 ELLSHYEMLPLEEQLEIAHQTG 1411
++ + LP++ + +A + G
Sbjct: 1342 NIIRRFSELPIDRRTRVAKRAG 1363
Score = 70.5 bits (171), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 148/725 (20%), Positives = 286/725 (39%), Gaps = 145/725 (20%)
Query: 58 LVVTAANVIEIYVVR-VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
L+V N++ IY + + S S + ET + A L L+ ++L+G V+ +
Sbjct: 29 LLVIRTNILSIYHLETILSPRSNTSSSQLETIEDATVTTSKQAKLFLINEFKLNGKVQDI 88
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + G NS + I+L+ AK+S+L FD SI+ S+H +E S
Sbjct: 89 ASIPLG---NSSSLECILLSTGTAKLSILNFDPSINSFETLSLHYYEEK---FKDISLVS 142
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL--------------KASQGGSGLVGDEDTFGS 222
A+ +++DP RC +L++ ++ L + ++ + + S
Sbjct: 143 LAKKSQLRMDPLNRC--LLMFNNDVMALLPLHSNNEDEEEEEEDENEEDEVLDNYEANLS 200
Query: 223 GGGFSARIE--------SSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
+ RI+ S + N+ L D+K++ D F++ + +P + +L++ LTWAG
Sbjct: 201 KTSPNKRIKYNNNQFEGKSKIFNINKLHEDVKNISDIQFLNNFNKPTIAVLYQPTLTWAG 260
Query: 273 RVSWKHHTC--MISALSI----STTLKQHP-----------LIWSAMNLPHDAYKLLAVP 315
V MI L I ST H +I L D +K++ +
Sbjct: 261 NVQLNPLPTHFMIFTLDILSENSTNNANHTTENNNNDLNLIIIAKLKELAWDWFKIIPIS 320
Query: 316 SPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQELPRSSFSVELD------- 365
+ G +V+G N I Y + + LN++A +L ++ + S F + +
Sbjct: 321 N---GCVVIGNNEIAYIDNTGVLQSIILLNSFADKNLKKTRIIDESKFQIFFNENVTHVW 377
Query: 366 ----AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-----------SKTNPSV 410
+ + T ++ LL +L + + +GR++ + D+ NP+
Sbjct: 378 SPSTSKNKTTEDDETLLLMDAQSNLYYVRLEAEGRLLTKFDIINLPIVNDVLRENCNPTC 437
Query: 411 LTSDITTIGNSL--FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
++ + NS F+G GDSL+V+ + LK + + S + +
Sbjct: 438 ISRLDSNATNSTMDLFIGFLSGDSLVVRL----------NNLKSAIDTRDEHSESNEHTQ 487
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQ------------KTFSFAVRDSLVNIGPLK 516
+ D +E +LY + E A+ + F SL NIGP+
Sbjct: 488 LNGFDE------EDEDNLYSDDEVDVEDARSKRDMETIIHTVQPFDIEYLTSLKNIGPIT 541
Query: 517 DFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH-NADSSRMAAYDDEY 575
+ G + D + G+ + E I T S+ H N + ++
Sbjct: 542 SLTVGKVSSLDLNVKGLQNPNKNEF-------SIVTTSGNSTGSHLNVIQQTVQPIVEKA 594
Query: 576 HAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR-----TIAAGNLF 618
++ ++ ++ + L T D + ++ + +GR T +F
Sbjct: 595 LKFISVTQIWNLKIKNKDKYLVTTDSTKSKSDIYDIDNNFSLHKEGRLRRDATTVYIAMF 654
Query: 619 GR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
G +RV+Q+ + D ++ + L+ + E V+ VS+ DPY+L+ +S
Sbjct: 655 GDGKRVVQITTNHLYLFDTNF--RRLTAIKFDFE---------VVHVSVMDPYILITVSR 703
Query: 678 GSIRL 682
G I++
Sbjct: 704 GDIKI 708
>gi|76157351|gb|AAX28300.2| SJCHGC08809 protein [Schistosoma japonicum]
Length = 225
Score = 107 bits (268), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 91/154 (59%), Gaps = 5/154 (3%)
Query: 242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
+ +V D F+HG+ EP +++L+E TWAGRVS + TC I ALS + + +P+IW
Sbjct: 52 KINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQ 111
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VSLDSSQELPR 357
+LP D ++ VP PIGGV+++ AN+I Y Q+ SC+L LN YA + Q++P
Sbjct: 112 ESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNCYAQISTNFPMRQDVP- 170
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
S + +D L L+ T++G+L LL++
Sbjct: 171 SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSL 204
>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 1 [Ciona intestinalis]
Length = 1150
Score = 107 bits (267), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 148/316 (46%), Gaps = 28/316 (8%)
Query: 1111 LAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTA+V E+ + GR+L+F DN LV E KE+KGA+ L GH+L
Sbjct: 836 FVVGTAFVYMEETEPKHGRILVF---HYIDNKLTLVAE---KEVKGAVFCLCQFNGHVLA 889
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
A + +++WT + + + + L +F+L+GD+ +S+ L++K L
Sbjct: 890 AINTSVSIYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNL 949
Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
+ +AKD+ A E L D + L ++ N+ I + + KL A FH
Sbjct: 950 DEIAKDYSPNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFH 1006
Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
VG + F ++ + T + ++ +LFGT+ GSIG I +DE + L S
Sbjct: 1007 VGDSINTFRHGSLVMQNVGETAVS------SKGHILFGTVHGSIGVITTVDEDLYAFLHS 1060
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+Q +L + V ++ S+R F +N K AHR VD +L+ + L E+ E+A
Sbjct: 1061 IQNRLAKVIKSVGNIDHESWRSFCTNEKTEAHR----GFVDGDLIECFLDLNREKMAEVA 1116
Query: 1408 HQTGTTRSQILSNLND 1423
+ ++ N ND
Sbjct: 1117 ------KGLMVKNFND 1126
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 92/438 (21%), Positives = 167/438 (38%), Gaps = 109/438 (24%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F+ RIE VI+ + F+HGY P +VI+++ +H I
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---- 341
+ + W N+ +A ++ VP P+ G +++G +I YH+ +A
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIAPPQI 253
Query: 342 ---LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYD 394
+N YA +D + +L D+A G L +L + + D
Sbjct: 254 KDTINCYA----------------PVDKDGSRYLLGDLA------GHLFILLLESDEMMD 291
Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
G V+ L + + I+ + N + ++GSRLGDS L++
Sbjct: 292 GTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIR----------------- 334
Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVN 511
+ D+ R + S L N G + + + Q T S A ++
Sbjct: 335 ---LPTDSSMEGRPKPSLISVLDTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKE---- 387
Query: 512 IGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
G L+ G+ I AS ++LPG KG+W + D+SR +Y
Sbjct: 388 -GSLRIIRNGIGIQEHAS------------IDLPGIKGLWPL-------RVFDTSR--SY 425
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
D L+IS + +L+ + E T+ + + +T N+ +++Q+ E+
Sbjct: 426 DT-----LVISFVGHSRILQLSGEEVEETDLPGFDDESQTFYCSNVC-HNQLVQITEKSI 479
Query: 632 RILDGSYMTQDLSFGPSN 649
R++ + Q + P N
Sbjct: 480 RLISHTERRQVHEWKPKN 497
>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 2 [Ciona intestinalis]
Length = 1142
Score = 107 bits (267), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 149/319 (46%), Gaps = 27/319 (8%)
Query: 1111 LAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTA+V E+ + GR+L+F DN LV E KE+KGA+ L GH+L
Sbjct: 832 FVVGTAFVYMEETEPKHGRILVF---HYIDNKLTLVAE---KEVKGAVFCLCQFNGHVLA 885
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
A + +++WT + + + + L +F+L+GD+ +S+ L++K L
Sbjct: 886 AINTSVSIYQWTTEKELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVEGNL 945
Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
+ +AKD+ A E L D + L ++ N+ I + + KL A FH
Sbjct: 946 DEIAKDYSPNWMTAVEILDDDNFLG---AENFYNVFICQKDSGATTDEERSKLREAALFH 1002
Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1349
VG + F ++ + T + ++ +LFGT+ GSIG I +DE + L S
Sbjct: 1003 VGDSINTFRHGSLVMQNVGETAVS------SKGHILFGTVHGSIGVITTVDEDLYAFLHS 1056
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGK--AHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+Q +L + V ++ S+R F +N K AHR VD +L+ + L E+ E+A
Sbjct: 1057 IQNRLAKVIKSVGNIDHESWRSFCTNEKTEAHR----GFVDGDLIECFLDLNREKMAEVA 1112
Query: 1408 -----HQTGTTRSQILSNL 1421
+ GT R + +L
Sbjct: 1113 KGLMVKEHGTKREATVDDL 1131
Score = 48.5 bits (114), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 92/217 (42%), Gaps = 40/217 (18%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F+ RIE VI+ + F+HGY P +VI+++ GR H I
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS---QGR----HVKTYIVD 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
+ + W N+ +A ++ VP P+ G +++G +I YH+ +A
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIA---- 249
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDG-RVVQR 400
L Q+ V+ D + +L D+A G L +L + + DG V+
Sbjct: 250 --PLCFFQDTINCYAPVDKDGSR--YLLGDLA------GHLFILLLESDEMMDGTNTVRD 299
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L + + I+ + N + ++GSRLGDS L++
Sbjct: 300 LKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRL 336
>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae DBVPG#7215]
Length = 1318
Score = 107 bits (266), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 146/321 (45%), Gaps = 16/321 (4%)
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1154
N+ TK L +G YV+ ED++ G L+ P T E++ ++++
Sbjct: 979 LNSNTKRKREYLVVGNTYVRDEDISGTGSFYLYDITEVVPEPGKPDTNYKFKEIFQEDIR 1038
Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
G +S + + G +I+ K ++ + +AF D P +++ N +++GD
Sbjct: 1039 GTVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMP-VFITDAKSFGNLMIIGDA 1097
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
F+ + + ++ L K L+ + EFL++ + +++D + + + YAP
Sbjct: 1098 MHGFTFVGFDAEPYRMITLGKSVTKLETMSLEFLVNNGDMYFIITDRSQVMHVLKYAPDE 1157
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
S GQ+L+ F++ + + +RL GS + F + +DGSI
Sbjct: 1158 PNSLSGQRLVYCTSFNLHS-INTCMRLIQKNNEFVDLRRNYGSHMST-FQCIGCHIDGSI 1215
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1390
+ PL E ++RRL +Q++++D + GLNPR R ++ G RP ++D
Sbjct: 1216 FKVVPLTESSYRRLYLVQQQIIDKEVQLCGLNPRMERLQNPYYQLGHLLRP----MLDFT 1271
Query: 1391 LLSHYEMLPLEEQLEIAHQTG 1411
+L + L + ++ +A + G
Sbjct: 1272 ILKKFSTLSISKRRSMASKAG 1292
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 135/646 (20%), Positives = 266/646 (41%), Gaps = 105/646 (16%)
Query: 97 SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
+ L L ++L G+V S+A++ Q G++ +++ K+S+L+FD L
Sbjct: 44 AKGQLVLSYEWKLSGHVHSMALIPQPGSE----LYCLVILTGCGKLSILKFDHMSQSLDT 99
Query: 157 TSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGV----LVYGLQMIILKASQGGS 211
S+H +E + L L + P + VD RC V + L + + K +
Sbjct: 100 LSLHYYEDKFKELSLLE----ISNTPSLIVDRSFRCLLVRNNDCIAILPLNVTKEEEEEE 155
Query: 212 GLVGDEDTFGSGGGFSAR------------IESSHVINLRDL--DMKHVKDFIFVHGYIE 257
++ +GG FS + + SS ++ L D+K+V D F+HG+ +
Sbjct: 156 EDNEKDEDRSNGGRFSFKRHKLNGGSVKQFVNSSTIMPASHLHSDIKNVLDVQFLHGFNK 215
Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
P + IL++ L W+G + T + LS+ ++ +I LP+D + L+ + +
Sbjct: 216 PTLAILYQPILAWSGNEKLRSQTVKVIILSLDFEDEKSTVINIIQGLPNDLHTLIPLSN- 274
Query: 318 IGGVLVVGANTIHYHSQSASC--ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ-- 373
+VVG N + Y + + ++LN+++ ++ +++ SS + +
Sbjct: 275 --ASIVVGVNELIYIDNTGALQGTVSLNSFSKTVLNTKVKDNSSLQAFFNRPVCQYTTIS 332
Query: 374 --NDVALLSTKTGDLVLLTVVYDGRVVQ-----RL----DLSKTN--PSVLTSDITTIGN 420
D+ LL + + + + +GR+V RL D+ K N P+ + D+
Sbjct: 333 KGKDIMLLMDEKSQMYNVIIESEGRLVTAFNCVRLPIVNDIFKNNHLPTCICGDVDLETG 392
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
+L F+G + GD++ V+ +S+ S G E +EAD +
Sbjct: 393 NL-FIGFKSGDAMRVRLN-NLRSSLASKGNVVE--TMEADEDYDE--------------- 433
Query: 481 GEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
LYG ++ + T F D+L+NIGPL + G + + + ++ +
Sbjct: 434 -----LYGGSTEVEKKNMDTETPFDIETLDNLINIGPLTSLAVGKVSSIEPTIAKLTNPN 488
Query: 538 NYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD------------DEYHAYLIISLEA 585
EL + G T H + + + A + YL+ + +
Sbjct: 489 RCEL-SIVATSGNSTGSHLTVFENTIVPTVEKALKFISVTQIWNLKIKDKDKYLVTTDSS 547
Query: 586 RTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQ 641
++ + + D + +S D+ T++ +R++QV +G + D ++ MT
Sbjct: 548 QSKSDIYSIDRDFKPFKSFDFKKNDTTVSTAVTGAGKRIVQVTSKGVYLFDINFKRMMTM 607
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
+ F V+ V I DP++LL S G I++ +P
Sbjct: 608 NFDF--------------EVVHVCINDPFLLLTNSKGDIKIYELEP 639
>gi|323309632|gb|EGA62840.1| Cft1p [Saccharomyces cerevisiae FostersO]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|323338222|gb|EGA79455.1| Cft1p [Saccharomyces cerevisiae Vin13]
gi|365766372|gb|EHN07870.1| Cft1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|151942273|gb|EDN60629.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
YJM789]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|71413583|ref|XP_808925.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|70873226|gb|EAN87074.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 444
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 104/451 (23%), Positives = 195/451 (43%), Gaps = 51/451 (11%)
Query: 996 QKIPLKATPHQITYFAEKNLYPLIVSV-----PVLKPLNQVLSLLIDQEVG--HQIDNH- 1047
++I L TPH + Y ++ S P P + L+++ D+E G I
Sbjct: 2 RRIHLGVTPHFVVYHPPARSCFVVTSKKEPFRPQRAPFDFQLNIVYDEESGGVQSITTEA 61
Query: 1048 ---NLSSVDLH---RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN 1101
N+ + + R + +E+R++ + W T+ ++ +E L +++ +
Sbjct: 62 PVCNMPPIAPNAGIRVPMADRFEIRLM----STTDWACTDTLLLEENERVLGAQMMEIHC 117
Query: 1102 TTTKE---NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAIS 1158
E + + TA+ GED+ RGR+LL +T + L+ +S+ L G +
Sbjct: 118 EKDAEGLHTAPVCVVSTAFPLGEDITCRGRILLLATMCTKKKRKILL--FHSEPLNGPAT 175
Query: 1159 ALASLQGHLLIASGPKIILHK--WTGTELN-GIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
A+ ++ H+ +A G I L + W +L G Y YV ++ +N+++ GD+ +
Sbjct: 176 AVVGIRHHIAVAVGGTIKLFRFDWEKRKLVVGALLYAGT--YVTRMSSFRNYLIYGDLSR 233
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
S + E+ L++L KD ++ + + L+ SD+++N+ + Y P++ E
Sbjct: 234 SCAIARFNEENHTLSVLGKDRNAVSVVHCDMMYHDRAFGLLCSDDERNLLVMGYTPRVQE 293
Query: 1276 SWKG--QKLLSR-----AEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ G K+L E+ + G + K LR + LA +S T L+
Sbjct: 294 TEAGSPNKVLESVLSLDGEYRLSGGCLVKSLRFRSLAGNSSVT--------------LYV 339
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF-HSNGKAHRPGPDSI 1386
T G IG I P+ E R L ++L +PH AGL PR F + + + +
Sbjct: 340 TNYGEIGFIVPIGEQANRTASWLMRRLQIDLPHSAGLTPRMFLGLSQGSPRTAMRAKEML 399
Query: 1387 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
V LL+ + L + + IA T ++
Sbjct: 400 VSASLLNEFFFLDIHSRKTIASAAYTQLERV 430
>gi|207346484|gb|EDZ72967.1| YDR301Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|6320507|ref|NP_010587.1| Cft1p [Saccharomyces cerevisiae S288c]
gi|74583567|sp|Q06632.1|CFT1_YEAST RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|849213|gb|AAB64737.1| Ydr301wp [Saccharomyces cerevisiae]
gi|256271799|gb|EEU06830.1| Cft1p [Saccharomyces cerevisiae JAY291]
gi|285811316|tpg|DAA12140.1| TPA: Cft1p [Saccharomyces cerevisiae S288c]
gi|392300415|gb|EIW11506.1| Cft1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|190404756|gb|EDV08023.1| 150 kDa protein associated with polyadenylation factor 1
[Saccharomyces cerevisiae RM11-1a]
gi|259145538|emb|CAY78802.1| Cft1p [Saccharomyces cerevisiae EC1118]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|349577352|dbj|GAA22521.1| K7_Cft1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1357
Score = 106 bits (264), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 155/346 (44%), Gaps = 25/346 (7%)
Query: 1077 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1134
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1135 GRNADNP-----QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1188
P + E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EF----GSPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1369 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
>gi|302652141|ref|XP_003017930.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
gi|291181516|gb|EFE37285.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
Length = 844
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 183/741 (24%), Positives = 288/741 (38%), Gaps = 110/741 (14%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V ++++++ + GS + R D A L L Y + G + L
Sbjct: 28 NLIVAKTSLLQVFSLVNVTYGSTTGTQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84
Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHL 170
+ NS+ D+I+++ DAK+S++E+D HG+ S+H +E E W+
Sbjct: 85 QRVR---ISNSKSGGDAILVSSRDAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVP- 140
Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDEDT- 219
S + G + VDP G C + +G+ + IL Q G LV GD+ T
Sbjct: 141 --DLGSCSSG--LTVDPNGNC-AIFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTD 195
Query: 220 FGSGGGFSARIESSH--------VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
S SA +SH V+ + LD + H F+H Y EP IL+ +
Sbjct: 196 MVSDAQKSAPGNTSHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVAR 255
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
+ S ++ K + + LP D +K++ +P P+GG L++G N
Sbjct: 256 STSLTIDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNEL 315
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
+H + A+ +N +A + +S + L+ L + LL G +
Sbjct: 316 VHVDQAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRM 375
Query: 387 VLLTVVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
+L+ DGR V + L +K PS S +G + F GS GDS+L+
Sbjct: 376 SILSFKVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLL 431
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
++ S T+ S K G E A D D + ++L AS E
Sbjct: 432 GWSRPSSTTKRPS--KSVDGVDENGAADLSDEADQDDDGDDDDMYEDDLYSVNPASTRQE 489
Query: 496 S------AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
+ F+F D L ++GP +D + G + + S +EL +G
Sbjct: 490 KQVVNGDSPADFTFRAYDRLWSLGPYRDITLGKPSKSKSKDQQDSVPEIAAPLELVAARG 549
Query: 550 I-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-----------MVLETA 593
TV + + DS +M DD Y + I ++ ++ +L
Sbjct: 550 FGKSGGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAK 606
Query: 594 DLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGS 637
+ ESV Y V + TI G L RV+QV R D
Sbjct: 607 SKGEDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD-- 664
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQ 695
Y P E SE TV+ S A+PY+L D S+ +L D + V VQ
Sbjct: 665 YNLGLAQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQ 722
Query: 696 TPAAIESSKKPVSSCTLYHDK 716
AA S K +S C LY DK
Sbjct: 723 GSAA---SGKWISGC-LYEDK 739
>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
Length = 1305
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 79/343 (23%), Positives = 150/343 (43%), Gaps = 33/343 (9%)
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-----EVYSKELK 1154
N+ TK L +G YV+ ED+ G L+ P T +++ ++++
Sbjct: 967 LNSNTKRRREYLVVGNTYVRDEDIGGTGSFYLYDITEVVPEPGKPDTNYKFKDIFQEDIR 1026
Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
G +S + + G +I+ K ++ + +AF D P +++ N +++GD
Sbjct: 1027 GTVSTVCEISGRFMISQSSKAMVRDIQEDNSVVPVAFLDMP-VFITDAKSFGNLMIIGDS 1085
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+ FL + + ++ L K L+ EFL++ + +V+D + + YAP
Sbjct: 1086 MQGFSFLGFDAEPYRMLTLGKSVSKLETMCVEFLVNNGDVYFLVTDRNNLMHVLKYAPDE 1145
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR--------FALL 1325
S GQ+L+ F++ + T L +D G K +R F +
Sbjct: 1146 PNSLSGQRLVHCTSFNLHSTNT----CMRLIKKNDEFG------KVSRGFGIYMPSFQCI 1195
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPG 1382
DG+I + PL E ++R L +Q++L+D + GLNPR R F+ G RP
Sbjct: 1196 GSQADGTIFKVVPLSEASYRSLYLIQQQLIDKEVQLCGLNPRMERLENPFYQMGHILRP- 1254
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTG-TTRSQILSNLNDL 1424
++D +L + L + ++ +A + G ++I +L D+
Sbjct: 1255 ---MLDFTVLKRFATLSIPTRMTMASKAGRQAHAEIWRDLIDI 1294
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 120/624 (19%), Positives = 248/624 (39%), Gaps = 124/624 (19%)
Query: 140 AKISVLEFDDSIHGLRITSMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
++S++ FD L S+H +++ E L G P ++ +P RC +LV+
Sbjct: 82 GRVSIVRFDAENQTLETESLHYYDAKFEELSALTVGA-----APRLEQEPAARC--LLVH 134
Query: 198 GLQMIILKASQGGSGLV-------------GDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+ + +G D G G S + +SH+ + D+K
Sbjct: 135 NGDCLAVLPLRGHEEEGEEAEEEEEHPAKRARTDADGRLVGASTVMPASHLHS----DIK 190
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
+VKD F+ G + + +L++ +L+W G T LS+ ++ +I L
Sbjct: 191 NVKDMRFLRGLNKSAVGVLYQPQLSWCGNEKLTRQTMKFIILSLDLDDEKSTVINMLQGL 250
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC--ALALNNYAVS-----------LDS 351
P+ + ++ + + G ++ G N + Y + + A++LN ++ S L +
Sbjct: 251 PNTLHTIIPLSN---GCVLAGVNELLYVDNTGALQGAISLNAFSNSGLNTRIQDNSKLQA 307
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL---------D 402
E P F+ + + D+ LL + + + + +GR++ +
Sbjct: 308 FFEQPLCYFATQSNG-------RDILLLMDEKARMYNVIIEAEGRLLTTFNCVQLPIVNE 360
Query: 403 LSKTN--PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ K N P+ + ++ SL F+G + GD++ V+ + L S L+
Sbjct: 361 IFKRNMMPTSICGNMNLETGSL-FIGFQSGDAMHVRL------NNLKSSLEH-------- 405
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGP 514
+ + S+ L+ + + + LYG NN E +K F D L+NIGP
Sbjct: 406 -------KGTVSETLE--TDEDYMELYG---NNAEKEKKNLETESPFDIECLDRLLNIGP 453
Query: 515 LKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADSSRMAA 570
+ + G + + + ++ + EL + G T+ + + + +
Sbjct: 454 VTSLAVGKASSIEHTVAKLANPNKDELSIVATSGNGTGSHLTILENTIVPTVQQALKFIS 513
Query: 571 YDDEYH-------AYLIISLEARTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
++ YL+ + ++T + + D + ++ D+ T++ G +R
Sbjct: 514 VTQIWNLKIKGKDKYLVTTDSSQTRSDIYSIDRDFKPFKAADFRKNDTTVSTAVTGGGKR 573
Query: 623 VIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
++QV +G + D ++ MT + F V+ V I DP++LL S G
Sbjct: 574 IVQVTSKGVHLFDINFKRMMTMNFDF--------------EVVHVCIKDPFLLLTNSKGD 619
Query: 680 IRLLVGDPSTCTVSVQT--PAAIE 701
I++ +P V+T P A++
Sbjct: 620 IKIYELEPKHKKKFVKTVLPDALK 643
>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
Length = 1481
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 115/507 (22%), Positives = 214/507 (42%), Gaps = 63/507 (12%)
Query: 899 AYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRER--LRVHPQLCDGSIV 955
A ET C R I F +++G+ G ++ G P + + R L + G +
Sbjct: 939 AMIESETQLTRHCSRSIIPFDSLAGNVGAYVCGRHPLFLLWDRRTGLLSGYRHQIQGPVR 998
Query: 956 AFTVLH-----NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYF 1010
F V C GF T ++ P G + W ++I + ATPH I+Y
Sbjct: 999 GFAPFPLMEGGFVYCGEGF---TDFAVMNTYCRPIG----HGWLGRRIDVGATPHFISYN 1051
Query: 1011 AEKNLYPLIVS-----VPVLKPLNQVLSLLIDQEVG--HQIDNHNLS-------SVDLHR 1056
++ S P P + L + ++E G I L+ S R
Sbjct: 1052 MPGRGCFVVTSHKQPFRPQRAPFDVQLKISYNEETGAIQSIATEPLTCSMPPIASSAGVR 1111
Query: 1057 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETL--LAI 1113
+ +EVR + A W T ++ +E L++++V + + K N T+ +
Sbjct: 1112 VPMADWFEVRFMST--AHVDWPCEDTFKLEENERVLSIQMVQIDGDRGMKINGTVPVCVV 1169
Query: 1114 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1173
TA+ G+DV RGR+ L +T + + + ++++ L G +A+A ++ H+ +A G
Sbjct: 1170 STAFPLGDDVTCRGRIHLLAT--KSLRRGHKIVHLHAEALNGPATAVAEIRHHIAVAVGG 1227
Query: 1174 KIILHKW---TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
I ++++ +G + + Y +Y L++++N+I+ GD+ S + E+ L
Sbjct: 1228 TIKIYRYDWQSGKLVVSVLLYAG--IYATKLSVIRNYIVYGDLIHSCAMARFNEENHTLT 1285
Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK-------LL 1283
+L ++ S+ + + ++ SD+Q+N+ + Y P++ E+ G+ L
Sbjct: 1286 VLGRNRNSISVVDCNMMYHDRSFGILCSDDQRNVLVMGYTPRVQEAGAGRPAKTLESLLT 1345
Query: 1284 SRAEFHV-GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
E+ + + K LR SD N +L+ + G +G I P+ E
Sbjct: 1346 LDGEYRLPSGCLAKSLRFS--------------SDFGNSSVMLYTSNYGEVGFIVPIGEQ 1391
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
R + ++L VP AGL PR F
Sbjct: 1392 ANRTALWVTRRLQTDVPCDAGLTPRMF 1418
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 103/231 (44%), Gaps = 41/231 (17%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMISALSISTTL 292
+++V+D F+ EP++ IL ER+ TWAGRV W+ T ++ + IS ++
Sbjct: 268 IRYVRDLQFIGSSGEPLLAILCERQPTWAGRVKLVEWRTKVVESNTLTMHVTWVQISASM 327
Query: 293 KQHP---LIWSAMNLPHDAYKLLAV---PSPIGGVLVVGANTIHYHSQSASCALALNNY- 345
HP LI +P++ +L V + GV+ G N I + + N+
Sbjct: 328 TAHPKLLLIGEVEGVPYNVTHMLPVEPFSQTMSGVVCFGTNVIMHITTKRGYGAYFNDTG 387
Query: 346 ----------AVS----------LDSSQELPRSSFSVELDAAHATW--LQNDVALLSTKT 383
AVS LD S L R + S+ AA + + +++ +L+
Sbjct: 388 REECINSKFSAVSFGKAVWSDPQLDKSSALARVNMSLANCAATSMVGKMGDELQVLALLE 447
Query: 384 GDLVLLTV--VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
D V++T+ V G V+ + ++ S ++ IG L FLGS +GDS
Sbjct: 448 EDGVVITLHFVARGSSVEEVRITMLGSGCYCSSVSRIGRQLVFLGSTVGDS 498
>gi|390358537|ref|XP_001201130.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 283
Score = 105 bits (261), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 126/273 (46%), Gaps = 51/273 (18%)
Query: 3 FAAYKMMHWPTGIANC-GSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVT 61
+A Y+ +H PTG+ +C F + P ++ NLVV
Sbjct: 2 YAFYREIHPPTGVEHCVYCHFFS----------------------PDQQ------NLVVA 33
Query: 62 AANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ 121
+ + +Y + + K+S + LE + + G V S+ Q
Sbjct: 34 KGSELTVYSMITVDSNKPTDKDSKPKNK-----------LEEAATFHIFGKVMSM----Q 78
Query: 122 GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGP 181
RD+++L+F +AK+S++E+D ++H L+ SMH FE E K G P
Sbjct: 79 SAQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHP 135
Query: 182 LVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL 241
+VKVDP RC +L YG ++++L + GLV D D S + S+VI L ++
Sbjct: 136 VVKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEM 193
Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
D + +V D F+HGY EP ++IL+E TWAG
Sbjct: 194 DDPICNVLDIQFLHGYYEPTLLILYEPLRTWAG 226
>gi|348679451|gb|EGZ19267.1| hypothetical protein PHYSODRAFT_492468 [Phytophthora sojae]
Length = 736
Score = 104 bits (259), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 111/427 (25%), Positives = 169/427 (39%), Gaps = 102/427 (23%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR+L++ V D F+ GY+EP +++LHE + + GR++ T I+ +SI+
Sbjct: 261 LLRLRELEITGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 320
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D +KL+ +P+GGV+V+ AN Y +Q+ LA N L
Sbjct: 321 TRLHPKIWTVKNLPSDCFKLIPCRAPLGGVVVLSANAFLYFNQTQFHGLATN----VLRE 376
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL-DLSKTNPSV 410
+ + ++ L +L LL+ GD +L++ Y+ R V+ + KT
Sbjct: 377 QDDHEMAQLNIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDRSVRFWRSIKKT---- 432
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF------TCGSGTSMLSSG----LKEEFGDIEAD 460
F+GSR GDS+L + G S L +KEE E
Sbjct: 433 ------------LFVGSRSGDSVLYALDQKKLTSAGGEASKLQEDEEMLIKEEVVKEEVT 480
Query: 461 -----------------------APSTKRLRRSSSDALQDMVNGEELSLYGSASN----N 493
AP+ + S S + VNG S N
Sbjct: 481 AEVKAEPAEEEEEDEDDLFLCGAAPTKEEPTTSGS---TEAVNGTNGSAVKKEENGHAVE 537
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV----------- 542
ES + D L +IG + + NAD S + ELV
Sbjct: 538 EESGPYDYVLHQIDVLPSIGQITSIELSIENNAD------SNEKREELVISGGYEHSGAI 591
Query: 543 ---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
EL GC+ +WTV + R Y+AYLI+S+ RT
Sbjct: 592 SVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR------SYNAYLILSVAHRT 645
Query: 588 MVLETAD 594
MVL T +
Sbjct: 646 MVLRTGE 652
>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
Length = 1135
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 157/339 (46%), Gaps = 25/339 (7%)
Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFST-GRNADNP 1141
P+ + EN ++ V +F KE L +GT YV+ ++ A GR+L+FS G+ A+
Sbjct: 789 PLDAFENGSSM-VSCVFANDKKE---YLVVGTGYVREDECEPAVGRLLVFSVEGQGAERK 844
Query: 1142 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY---V 1198
+L EV E +GA+ L G LL K+ L +W + +GI Y +
Sbjct: 845 VDLAAEV---ETRGAVYVLNGFNGKLLACINSKVQLFRWIEKD-DGIQELQTECGYHGHI 900
Query: 1199 VSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
++L++ +FI++GD+ +S+ L +K + +A+D+ + A E L D +
Sbjct: 901 LALHMQSRGDFIIVGDLMRSVSLLVYKAVDGAIEEVARDYHANWMTAVEMLNDDVYIG-- 958
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG- 1315
+ NI + + +L + EFH+G V KF R +L SS+ +PG
Sbjct: 959 -GEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKFCRGSLLMQSSEVN--SPGG 1015
Query: 1316 --SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
S LLFGT++G +G I L E R L LQ + V V G + +R F
Sbjct: 1016 MDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTKVVKGVGGFSHDEWRSF- 1074
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
+NG+ P + +D +L+ Y +P Q E+ T
Sbjct: 1075 TNGRRTSPSSN-FIDGDLVESYLDMPRHNQEEVLRHVDT 1112
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/418 (22%), Positives = 166/418 (39%), Gaps = 94/418 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
A+ + N+R L+ V D F+ G + + +L++ + + +H I
Sbjct: 143 MDAKGQLKDAFNIR-LEELEVLDIQFLSGCPKATIAVLYQDQR------NARH----IKT 191
Query: 286 LSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
+IST K+ W+ +N+ H+A +L+ VP+P GGVL++G TI YHS A + + N
Sbjct: 192 YTISTRDKEFDTGPWAQLNVEHNASELIPVPAPFGGVLILGHQTICYHSGKAFITIPIQN 251
Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL--VLLTVVYDGRVVQR 400
+ A+ W+ D + L+S +G L V+LT V+
Sbjct: 252 TRM------------------CAYG-WVDADGSRLLVSDHSGGLHVVILTPDATNTAVET 292
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
+ + S I+ + N + F+GS GDS L++ + E D
Sbjct: 293 AHIEALGETSCASSISYLDNGVVFIGSASGDSQLIKL------------------NPEKD 334
Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
A T + D L +++ + T S +D G L+
Sbjct: 335 AQGTYIQVLETYDNLGPILD----MCVADLDRQGQGQAVTCSGCSKD-----GSLRIIRN 385
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
G+ IN A+ +EL G KG+W++ R N + + YL+
Sbjct: 386 GIGINEHAA------------IELAGIKGMWSL-----RPSNTNHDK----------YLV 418
Query: 581 ISLEARTMVL---ETADLLTEVTE-SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + T VL E D ++ E + F +G T+ G G +QV +RG ++
Sbjct: 419 QAFISETRVLAFEEDEDGDHQLAEGEIAGFQEGCTLFCG-CVGGNMAVQVTKRGVVLI 475
>gi|242208344|ref|XP_002470023.1| predicted protein [Postia placenta Mad-698-R]
gi|220730923|gb|EED84773.1| predicted protein [Postia placenta Mad-698-R]
Length = 696
Score = 100 bits (250), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 95/313 (30%), Positives = 141/313 (45%), Gaps = 33/313 (10%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDV 1123
+ ++ P+ G W T E + VTL T+T + +GT ED+
Sbjct: 396 LELISPEPEG--WVTMDGFESAQKEFVTCLDCVTLETTSTGSGMMDFIIVGTTINCREDL 453
Query: 1124 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
A +G V +FS + Q + KG ++AL L L+ + G KI + +
Sbjct: 454 AVKGAVYIFSIVEVVPDLQ------CRDDAKGPVAALCGLNNSLVSSMGQKIFVRAFDLN 507
Query: 1184 E-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
E L G+AF D +Y+ SL VKN +++ D KS + +L +L KD + C
Sbjct: 508 ERLVGVAFLDVG-VYITSLRAVKNLLVISDAVKS-------KDPYKLVILGKDPYQV-CV 558
Query: 1243 ATE--FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1300
T F DG L++ DE I+I+ Y P ES GQ LL R EFH
Sbjct: 559 TTADLFFADGQVF-LLIGDEDGVIRIYEYDPHDPESRGGQHLLRRTEFHG---------- 607
Query: 1301 QMLATSSDRTGAAPGSD-KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVP 1359
QM + S G D + L+ G+ +GS+ +DE+ +RL LQ +L +V
Sbjct: 608 QMESRMSILIIRRRGKDTDIPQARLISGSTNGSLSMFTYVDEVASKRLHLLQGQLTRNVQ 667
Query: 1360 HVAGLNPRSFRQF 1372
HV GLNP+ FR +
Sbjct: 668 HVVGLNPKVFRPY 680
>gi|159470707|ref|XP_001693498.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283001|gb|EDP08752.1| predicted protein [Chlamydomonas reinhardtii]
Length = 366
Score = 100 bits (249), Expect = 6e-18, Method: Composition-based stats.
Identities = 68/216 (31%), Positives = 93/216 (43%), Gaps = 53/216 (24%)
Query: 921 SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILK 979
+ H G F++G+RP W + R L H +G + A T HNVNC GFI S +G LK
Sbjct: 164 ANHSGVFVAGARPLWLVAGRGGLAAHAMWSEGPVAALTPFHNVNCPLGFITACSARGQLK 223
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1039
+C LP + D W +++PLK TPH++ +F E IV+ +P
Sbjct: 224 VCCLPPHTRLDGAWATRRVPLKVTPHRLAWFREAG----IVAAITSRPAPS--------- 270
Query: 1040 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1099
R EE GG E AL ++ V L
Sbjct: 271 ----------------RPRPAEE----------PGG-------------EQALCLKFVYL 291
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1135
N TT + +TLLA+GT GED GR+LL+S
Sbjct: 292 RNATTGDTDTLLAVGTGTPLGEDYPCLGRLLLYSVA 327
Score = 56.6 bits (135), Expect = 9e-05, Method: Composition-based stats.
Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL------SFGPSNSESGSGSE 657
+Y TIAAGNLF ++Q G R+L+G + QDL + G + S G
Sbjct: 4 EYITDQPTIAAGNLFHNAVIVQACPGGVRLLEGMSLVQDLPLSELQALGGVAAASRPGVA 63
Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPA-AIESSKKPV 707
T+ + +ADPYVL+ +S+G+ LL D + T+ + A + ++PV
Sbjct: 64 PPTITHMQVADPYVLVSLSNGTACLLEADLLSMTLGLGGCAGGVPDCERPV 114
>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS 8797]
Length = 1345
Score = 100 bits (248), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 65/321 (20%), Positives = 147/321 (45%), Gaps = 18/321 (5%)
Query: 1100 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELK 1154
N+ TK + +GT++V ED+ A G ++ P + + + +EL+
Sbjct: 1010 LNSRTKRKIEYVVVGTSFVGTEDLPATGSFQMYDIAEVVPEPGKPDTNYKIKQFFKEELR 1069
Query: 1155 GAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
A++++ + G +I+ K+++ + +AF D P L+ + N +++GD
Sbjct: 1070 SAVTSVCDISGRFVISQSQKLMVRDAQEDNSVVPVAFLDIP-LFTADMKSFGNLLIIGDA 1128
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+ I + + + ++ L + + + EFL++G L + D + + YAP
Sbjct: 1129 MQGIQLVGFDAEPYRMIPLGRSVLKFETLSLEFLVNGGDLYFTLIDRNDILHVLKYAPDE 1188
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
S GQ+L+ + F++ + + L ++ P + + ++ G DGS+
Sbjct: 1189 PNSLSGQRLIHCSSFNMYSTTS----CTRLIPKNELFVDGPLNPAIQSYQVIGGQADGSL 1244
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---QFHSNGKAHRPGPDSIVDCE 1390
+ P+ E +RRL +Q++++D +AG+NP+ R ++ RP ++D
Sbjct: 1245 FKVMPVPETVYRRLYVVQQQIIDKETPLAGINPKMERLSNDYYQTSHLLRP----MLDYN 1300
Query: 1391 LLSHYEMLPLEEQLEIAHQTG 1411
++ + + + ++ +AH+ G
Sbjct: 1301 VVKQFCAMSIPKRTTLAHKLG 1321
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 149/708 (21%), Positives = 285/708 (40%), Gaps = 150/708 (21%)
Query: 61 TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
T A+ +E+ +VR + SG+ L L ++L + LA++
Sbjct: 22 TTADYVELLIVRTNLLSIYKVTESGK--------------LLLTHEFKLQARITDLALV- 66
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF--- 177
G +N+ + ++L + K+S+++F+ + L S+H +E K SF
Sbjct: 67 -GSVENTGL-NYLLLGIGNCKLSIVKFNSLNNSLETISLHYYEE------KFKANSFIEL 118
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA--------R 229
A+ +++DPQ RC +L ++IL SQ E+ +
Sbjct: 119 AKKTELRIDPQNRCA-LLFNNDNIVILPFSQQQEEEDYGEEEEEEDNYNMEDGPNVKKLK 177
Query: 230 IES--------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH- 278
+ES S + + + LD +++V D F+ + P + IL++ +LTWAG +
Sbjct: 178 LESASTNLTLPSIITDSKKLDSTIENVVDIQFLRNFSRPTLGILYQPKLTWAGNLQLNPL 237
Query: 279 -HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
++ +L+I+ + + +I LP D++ L +P+ G VL+ G+N + Y +
Sbjct: 238 PTKFLVISLNIAVSELEGTVITKLEGLPWDSHTL--IPTWNGCVLL-GSNEVSYIDNTGV 294
Query: 337 -SCALALNNYA-VSLDSSQELPRSSFSVEL--DAAHATWLQ--------NDVALLSTKTG 384
A+ LN+YA SL + + + + L D + W +++ LL ++
Sbjct: 295 LQSAIFLNSYADASLRKVRVVDHTDQQITLNKDLVKSLWSAPTKESGGADEILLLMDESS 354
Query: 385 DLVLLTVVYDGRVVQRLDL-----------SKTNPSVLT--SDITTIGNSLFFLGSRLGD 431
+L + + ++GR++ + D+ +P+ +T + N F+G + GD
Sbjct: 355 NLYYIQLEFEGRLMTKFDMINLPIVNDIFVHNLHPTCITRIDESKHNININLFIGFQTGD 414
Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---LYG 488
SL+V+ +I + + +++SS++ V E+ LYG
Sbjct: 415 SLVVRL-----------------NNIRSAIETRHEYKQTSSESGLGKVEDEDEDEDDLYG 457
Query: 489 -------SASNNTESAQ----KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK-- 535
+AS N ++A + F + L NIGP+ G + G+
Sbjct: 458 DDGAHDKNASVNNDNAVVHTVQPFDIEMMSCLRNIGPVTSLVIGEASSVQPVIKGLPNPN 517
Query: 536 QSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--------EART 587
+ Y LV G + G N +++ + A IS+ + R
Sbjct: 518 KGEYSLVATCG----------NGTGSNLMVGQISVQPEVELALKFISVTQIWNLKVKNRD 567
Query: 588 MVLETADL------LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILD 635
L T D + E+ + + QGR T+ G +R++QV + D
Sbjct: 568 KYLITTDSTKTKSDIYEIENNFALYKQGRLRRDATTVYISMFGGEKRIVQVTTNHLYLYD 627
Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
++ + L N E V+ VS+ DPY+L+ +S G I +
Sbjct: 628 TNF--RRLFLNKFNYE---------VVHVSVMDPYLLITLSRGDIMIF 664
>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
Length = 1350
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/340 (22%), Positives = 153/340 (45%), Gaps = 25/340 (7%)
Query: 1085 MQSSENALTVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1141
++ EN+L + ++ ++ TK + G + V ED+ G L+ P
Sbjct: 996 VEFEENSLVNDIRSMIIQIDSRTKRKREYIVAGFSAVGTEDLPPSGSFHLYDITAVVPEP 1055
Query: 1142 QNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGIAFYDAPP 1195
T + +E++G+++++ + G I+ KI++ + +AF D P
Sbjct: 1056 GKPDTNYKFERFFKEEVRGSVTSVCEISGRFAISQSQKIMVRDAQEDGSVVPVAFLDIP- 1114
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
++V + N +++ D F+ + + ++ L K + EFL++ +
Sbjct: 1115 IFVTDMKSFGNLMIISDAMHGFQFVGFDAEPYRMIQLGKSVSKFKTMSVEFLVNNGDIYF 1174
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG 1315
V+D + + YAP S+ GQKL+ + F++ A + +L +D
Sbjct: 1175 AVTDRDNILHVLKYAPDEPNSFSGQKLVHCSSFNLYADNS----CMVLLAKNDEFNKV-- 1228
Query: 1316 SDKTNR-FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR---Q 1371
D TNR + ++ G DGS+ I PL E ++RRL +Q++++D + GLNPR R Q
Sbjct: 1229 -DDTNRTYQVVGGQTDGSMFKIVPLSEESYRRLYVIQQQIIDKETQLGGLNPRMERLSNQ 1287
Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
+ RP ++D ++ + +P+ ++ +A + G
Sbjct: 1288 YLPLCHVMRP----MLDFNVIRKFSAMPISKRQALAQKLG 1323
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 144/669 (21%), Positives = 270/669 (40%), Gaps = 133/669 (19%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L ++ G + +A+L + A D ++L AKIS+++FD + + S+H
Sbjct: 48 LFLTNEFKFDGRITDIALLPRQDA----ALDYLLLCTAVAKISIVKFDLESNSIETVSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC------GGVLVYGLQMIILKASQGGSGLV 214
+E ++ L R +++DP RC + V M + G
Sbjct: 104 YYED-KFKDLSLAE--LTRESKLRLDPASRCLVLFNEDNIAVLPFVMKEDEEDDDEEGEE 160
Query: 215 GDEDTFGSG-GGFSARIES-----SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
DEDT+ F A I S +++ + + D++++ D F++ Y +P + IL++
Sbjct: 161 EDEDTYEPRIKRFRANINGRVTFPSTILSAKTIHEDIQNIIDIEFLNNYSKPTVAILYQP 220
Query: 267 ELTWAGRVSWKH--HTCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
+LTW G + +I L +T T H +I LP D ++L+ V + G
Sbjct: 221 KLTWVGNLQLHPLPTKLLIVTLECNTNGFETSLSHIVIARLNELPWDWHRLIPVTN---G 277
Query: 321 VLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQND 375
+++VG N + Y + + LN++A + L +S S E + + +++
Sbjct: 278 IVIVGINELAYVDNTGVLQTVILLNSFA-----DRNLKKSRIIDHSKEESVFNHSAMKH- 331
Query: 376 VALLSTKTGD---------------LVLLTVVYDGRVVQRLDLSK--------------T 406
+ +L T G+ L + ++ +GR++ + D+ K T
Sbjct: 332 ICILKTTDGNEDDADLLLLMDDRSNLYYVQMISEGRLMTQFDIIKLPIINNIFINNLNPT 391
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
+ S L S + + LFF G + GD+ F C + ++E D+ D PS
Sbjct: 392 SISRLDSSSSRVNLDLFF-GFQSGDA----FVCRLNNIKSAVETRKEHKDV-LDYPS--- 442
Query: 467 LRRSSSDALQDMVNGEEL----SLYGSASNNTESAQ-------------KTFSFAVRDSL 509
++D + +G +L LY + +T+ A + F A+ SL
Sbjct: 443 ----NADEYDE--DGADLYGDDDLYSDEATSTQRANSKENGRSNMIETVEPFDIALLSSL 496
Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADS 565
NIGPL + G D + G+S +N EL + G T S R +
Sbjct: 497 NNIGPLTSLTSGKVSAVDQNNKGLSNPNNNELSIVATSGNGTGSHLTAVLPSVRPEIELA 556
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADL------LTEVTESVDYFVQGR-----TIAA 614
+ + ++ + + + L T D + E+ + +GR T +
Sbjct: 557 LKFISITQIWN----LKFKGKDKFLVTTDSTKSKSDIYEIDNNFALHREGRLRRDATTVS 612
Query: 615 GNLFGR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
+FG +R++QV +LD ++ + + V+ VS+ DPY+L+
Sbjct: 613 IAMFGSDKRIVQVTTNHLYLLDTTF-----------RRLNTIKFDYEVVHVSVMDPYILI 661
Query: 674 GMSDGSIRL 682
+S G I++
Sbjct: 662 TVSRGDIKV 670
>gi|9794906|gb|AAF98387.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 279
Score = 97.8 bits (242), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 141/300 (47%), Gaps = 65/300 (21%)
Query: 425 LGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEFGDIEADAPSTKRLRRSSS 472
LGSRLG+SLL+ FT +++++ L++E ++E + +L + +
Sbjct: 1 LGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLE-EIFDVDQLEMAPT 59
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD----- 527
A + EEL +YGS + + + F F V DSL+N+ P+ G R+ +
Sbjct: 60 QAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERVEFEEDGVT 119
Query: 528 ---------------ASATGISKQS---------NYELV---ELPGCKGIWTVYHKSSRG 560
+ATG SK N +++ EL GC +WTV+
Sbjct: 120 LRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFD----- 174
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
D+++ ++ +D+ H ++++S T+VL+T + E+ E+ + V TI GNL +
Sbjct: 175 ---DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQ 229
Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
R ++QV R R+L G+ + Q++ S V+ VSIADPYV L + +G +
Sbjct: 230 RFIVQVTTRHVRLLQGTRLIQNVPIDVG----------SPVVQVSIADPYVCLRVLNGQV 279
>gi|303391353|ref|XP_003073906.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon intestinalis ATCC 50506]
gi|303303055|gb|ADM12546.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon intestinalis ATCC 50506]
Length = 601
Score = 96.3 bits (238), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 106/457 (23%), Positives = 206/457 (45%), Gaps = 61/457 (13%)
Query: 974 SQGILKICQLPS---GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLK--PL 1028
S+G L +C++PS G + + + +KIP+ TP I Y + Y ++ S ++ P
Sbjct: 185 SKGYLMVCRVPSIENGYVFGSGFIGKKIPVLRTPKHIEY---ADRYMVVASCEEVEFSPK 241
Query: 1029 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1088
N ++ G ++ + VDL+ E+YE +T ++ +
Sbjct: 242 N-------GKDCGVPVNTYRFY-VDLYS----EKYE--------------HISTYELEEN 275
Query: 1089 ENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1147
E ++ + L + ++ L + T +++GED A+GR+ + + ++ +
Sbjct: 276 EYVFDIQYLVLDDMQGNYGKSPFLLVCTTFIEGEDRPAKGRLHVLEIISVVPSLESPFKD 335
Query: 1148 VYSKEL-----KGAISALASLQGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVSL 1201
K L KG+I + ++G +++ G KI+++K G+ + I F+D + S+
Sbjct: 336 CKLKVLGIEKTKGSIVQCSEVRGKIVLCLGTKIMIYKIDRGSGIIPIGFHDLHT-FTSSI 394
Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1261
++VKN+IL DI++ + F ++ + +L+L++ + +TE LI G+ LS+V D +
Sbjct: 395 SVVKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNATSTELLITGNELSMVCCDIK 454
Query: 1262 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
I + Y+P S G KL+ R E T RL SS R G S
Sbjct: 455 GTIHAYTYSPNNIISMDGAKLVKRVEMK-----TNLGRL-----SSSRVGFRKNS----- 499
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
++ + + + +D ++ +L +Q ++ V GLNPR + +S+ H
Sbjct: 500 --IMLYSRSNQLVHVNGVDNSSYLKLLGIQTSIMGCFKAVFGLNPRDY--LNSDIHLHSL 555
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
S + +L+ + L Q I+ +R +IL
Sbjct: 556 SLKSPIVLHILNLFSYFDLSIQESISSSVVMSRREIL 592
>gi|301093651|ref|XP_002997671.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110061|gb|EEY68113.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 96.3 bits (238), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 101/387 (26%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 106 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 165
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 166 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 225
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
+ +L +V L +LQ LL+ G + +L++ Y+ + L
Sbjct: 226 TAQL-----NVVLYDCQFEYLQEKELLLTMPCGQVYVLSLPYEDTSSRGL---------- 270
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE----------ADA 461
G F+GSR GDS+L L + +EE D E A
Sbjct: 271 ---YGFGGKQTLFIGSRSGDSVLFVLD----KKKLVTATEEEPKDEEMPIKEVVIKQESA 323
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--------------------- 500
P K S A ++ + ++L LYG+A E A +
Sbjct: 324 PEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAP 378
Query: 501 ------FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV---------- 542
+ + +R D L +IG + G+ NAD S + ELV
Sbjct: 379 EQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGA 432
Query: 543 ----------------ELPGCKGIWTV 553
EL GC+ +WTV
Sbjct: 433 ISVLHNGLRPIVGTEAELNGCRAMWTV 459
>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1090
Score = 95.5 bits (236), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 164/355 (46%), Gaps = 35/355 (9%)
Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ 1119
+E + VR++E ++ + P+ EN ++ + T ++ +GTAY
Sbjct: 743 METHYVRLIEDQ----TFEIISGFPLDPYENGCSIITCSF----TDDSNVYYCVGTAYAL 794
Query: 1120 GEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
E+ ++GR+L+FS D LV E KE+KGA+ L + G LL KI L+
Sbjct: 795 PEESEPSKGRILVFSV---EDGKIQLVAE---KEVKGAVYNLNAFNGKLLAGINQKIALY 848
Query: 1179 KWT----GTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLL 1232
KWT GT + + + ++++L + +FI++GD+ KSI L +K + +
Sbjct: 849 KWTLRDDGTR--ELQYESSHHGHILALYVQSRGDFIVVGDLMKSISLLIYKPEEGAIEER 906
Query: 1233 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1292
A+D+ + A E L D + L ++ N+ + + +L E+H+G
Sbjct: 907 ARDYNANWMTAVEILDDDTYLG---AENSFNLFTVRKNNDAATDEERGRLEVVGEYHLGE 963
Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
V +F ++ P S+ + ++FGT++G IG IA L + F LQ LQ+
Sbjct: 964 FVNRFRHGSLVMR-------LPDSEASQIPTVIFGTVNGVIGVIASLPQDQFLFLQKLQQ 1016
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
LV + V GL+ +R F + K + +D +L+ + L + EIA
Sbjct: 1017 ALVKVIKGVGGLSHEQWRSFSNERKT--VDARNFLDGDLIESFLDLSRNKMEEIA 1069
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 126/557 (22%), Positives = 220/557 (39%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A+ L+ + ++G + +L + G +D + ++FE K VL++D
Sbjct: 39 RIEIHLLTASGLQPMLDVPIYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
GL +T S + GR + G + VDP R G+ +Y GL +I ++
Sbjct: 95 ET-GLLVTRAMGDVSD-----RIGRPT-DNGQIGIVDPDCRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCAKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P W NL + A L+ VP P+GG +++G T
Sbjct: 186 D-------ARHVKTYEVQLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y++ S A+ + + ++ V+ D + LLS G L L
Sbjct: 237 IVYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + +D V L++ + S ++ + N + F+GS GDS L++
Sbjct: 280 LVISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ +ADA + S + L+ VN G + L Q T S A +
Sbjct: 329 -------NHQADA------KNSYVEVLESYVNLGPIVDLCVVDLERQGQGQVVTCSGAFK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++ SS
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSLRASSS-------- 410
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
D Y +L++S E R + + T D L E TE + + +T+ N +++
Sbjct: 411 ------DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCYNAV-HDQLV 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D Q
Sbjct: 463 QVTAGSLRLVDAKTRRQ 479
>gi|159470705|ref|XP_001693497.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283000|gb|EDP08751.1| predicted protein [Chlamydomonas reinhardtii]
Length = 461
Score = 95.5 bits (236), Expect = 2e-16, Method: Composition-based stats.
Identities = 58/164 (35%), Positives = 95/164 (57%), Gaps = 7/164 (4%)
Query: 230 IESSHVINLRDL-DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+ +S+V+NL + ++ V+D +F+HGY EPV+++LHE + TWAGR+ + TC ++A+S+
Sbjct: 128 VGNSYVLNLHKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWAGRLRERKDTCCLTAISV 187
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYAV 347
S LK+H ++W A LP+D Y+LL +P LV+ + + SQ S A ALN+ A+
Sbjct: 188 SLRLKRHTVLWRAAGLPYDCYRLLPLPQR-PAALVLSPSLVMLTSQASQPQAAALNSTAL 246
Query: 348 SLDSSQEL----PRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
++ L R + SV A + ND A ++ LV
Sbjct: 247 PGEAPPPLVFDPAREAPSVTAARMAAEFALNDCAPALGRSAALV 290
>gi|301103688|ref|XP_002900930.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
gi|262101685|gb|EEY59737.1| cleavage and polyadenylation specificity factor subunit, putative
[Phytophthora infestans T30-4]
Length = 613
Score = 94.7 bits (234), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 159/383 (41%), Gaps = 93/383 (24%)
Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
++ LR++++ V D F+ GY+EP +++LHE + + GR++ T ++ +SI+
Sbjct: 241 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 300
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+ HP IW+ NLP D ++L+ +P+GGV+V+ AN I Y +Q+ LA N +A
Sbjct: 301 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 360
Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
+ +L +V L +LQ LL+ +G + +L++ Y+ + L
Sbjct: 361 TVQL-----NVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSRGL---------- 405
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI------EADAPSTK 465
G F+GSR GDS+L + K+E I + AP K
Sbjct: 406 ---YGFGGKQTLFIGSRSGDSVLFVLDKKKLVTATEEEPKDEEMPIKEVVIKQESAPEIK 462
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------------------------- 500
S A ++ + ++L LYG+A E A +
Sbjct: 463 -----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAPEQDT 517
Query: 501 --FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV-------------- 542
+ + +R D L +IG + G+ NAD S + ELV
Sbjct: 518 GPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGAISVL 571
Query: 543 ------------ELPGCKGIWTV 553
EL GC+ +WTV
Sbjct: 572 HNGLRPIVGTEAELNGCRAMWTV 594
>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
Length = 1150
Score = 94.7 bits (234), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 84/318 (26%), Positives = 145/318 (45%), Gaps = 33/318 (10%)
Query: 1107 NETLLAIGTAYVQGEDVAA-RGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
N + +GTAY+ E+ +GR+L+F+ TG + + LVTE KE+KGA+ L S
Sbjct: 802 NASYFVVGTAYIHEEEAEPHQGRILVFAVTGIHGERKLQLVTE---KEVKGAVYCLNSFN 858
Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFILLGDIHKSIYFL 1220
G +L K L+KW+ N Y V+ + +FI++GD+ KSI L
Sbjct: 859 GKVLAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSISLL 918
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
S+K+ + +AKD S + + ++D T + S+ N+ + +
Sbjct: 919 SYKQLDGTIEEIAKDLNS-NWMSAVGIVDDDTY--IGSETDFNLFTVQRNSGAASDEERG 975
Query: 1281 KLLSRAEFHVGAHVTKF----LRLQMLATSS---------------DRTGAAPGSDKTNR 1321
+L + EFH+G V +F L +Q +++S D +AP +
Sbjct: 976 RLETVGEFHLGEFVNRFRYGSLVMQNSSSTSQTPSGVVSTGPTAMVDVGESAPAAPVVQN 1035
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
++LFGT+ G IG I P+ + + L +Q+ L V V G + + +R F +
Sbjct: 1036 QSMLFGTVSGMIGVILPISKDQYSFLLRVQQALTHVVKGVGGFSHKDWRTFENRRSVSE- 1094
Query: 1382 GPDSIVDCELLSHYEMLP 1399
+ +D +L+ + LP
Sbjct: 1095 -ARNFIDGDLVESFLDLP 1111
Score = 74.7 bits (182), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 111/467 (23%), Positives = 183/467 (39%), Gaps = 108/467 (23%)
Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
R+S R + +DP+GR G+ +Y ++ G L DTF
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---RDTF--------- 154
Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
N+R LD V D F+HGY +P + +L+E + A V H L
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYNKPTICVLYE-DYKAARHVKTYH------ILLKE 199
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
+ P WS N+ A L+ VP+P GGVL+V TI YH+ S A+ + + + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPTGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ + S F LL+ + G L ++ + + G+ V + L +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVSGVHLEVLGET 300
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
+ S ++ + N + F+GS GDS L++ + AD T
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIK--------------------LNADRDETG---- 336
Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S + L VN + + + + + T S A +D G L+ G+ IN
Sbjct: 337 SYIEVLDSYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
ASA ELPG KG+W + AA D++ +S E R
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKFLLQSYVS-EVRI 427
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + D + E + + F +T+ N++G +QV E R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDYW-LQVTESEVRLI 471
>gi|401828022|ref|XP_003888303.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon hellem ATCC 50504]
gi|392999575|gb|AFM99322.1| pre-mRNA cleavage and polyadenylation specificity factor
[Encephalitozoon hellem ATCC 50504]
Length = 1155
Score = 94.4 bits (233), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 164/346 (47%), Gaps = 31/346 (8%)
Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1140
T ++ +E V+ + L + ++ L I T +++GED A+GR+ + +
Sbjct: 823 TYELEENEYVFDVKYLILDDMQGNYGKSPFLLICTTFIEGEDKPAKGRLHVLEIISVVPS 882
Query: 1141 PQNLVTEVYSKEL-----KGAISALASLQGHLLIASGPKIILHKWTGTELNGI---AFYD 1192
P++ + K L KG+I + ++G + + G KI+++K + NGI FYD
Sbjct: 883 PESPFKDCKLKVLGIEKTKGSIVQCSEIRGKIALCLGTKIMIYKIDRS--NGIIPIGFYD 940
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
++ S+++VKN+IL DI++ + F ++ + +L+L++ + +TE LI G+
Sbjct: 941 LH-IFTSSISVVKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNVTSTELLIAGNE 999
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
LS+V D + I + Y+P S G KL+ RAE + T+ R +
Sbjct: 1000 LSMVCCDSKGTIHAYTYSPNNIISMDGAKLVKRAE---------------MKTNLGRLFS 1044
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
+ + N + + + SI +A +D+L + +L +Q ++ + V GLN R +
Sbjct: 1045 SGIGFRKNSI-MFYSKTNLSIH-LAGIDDLNYPKLLEIQTSIMVHLKSVLGLNQRDY--L 1100
Query: 1373 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
+S+ H S + +L+ + L Q I+ +R +IL
Sbjct: 1101 NSDIHLHSLSLKSPIVMHILNLFSYFDLNTQKLISSSVRMSRREIL 1146
>gi|159470709|ref|XP_001693499.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283002|gb|EDP08753.1| predicted protein [Chlamydomonas reinhardtii]
Length = 279
Score = 94.0 bits (232), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 123/262 (46%), Gaps = 28/262 (10%)
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
L+ AF+D P L L VK+++L D+H+ ++FL + + L ++KDF D
Sbjct: 29 LDKRAFFDLPSL-ATGLVTVKDYLLASDVHQGLFFLRYSDASRVLEFMSKDFDGRDVLTC 87
Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMS---ESWKGQKLLSRAEFHVGAHVTKFLRLQ 1301
+I L + +D +Q+ + K E W GQ+L HV V +Q
Sbjct: 88 GVVIAEPKLHFLAADAAGTLQMMEFYGKRDTNPEFWAGQRLAPMGLLHVARRVGVAASVQ 147
Query: 1302 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPH 1360
+ + D NR ALL G+ +G + +AP+ D RL +LQ + ++PH
Sbjct: 148 LAS-----------RDGRNRHALLCGSAEGGLSFVAPVPDPQAAARLAALQAHMSATLPH 196
Query: 1361 VAGLNPRSFRQFH-------SNGKAHR-PGP----DSIVDCELLSHYEMLPLEEQLEIAH 1408
VAGLNPRSFR G+ HR P P ++D +LL + L ++Q E A
Sbjct: 197 VAGLNPRSFRHRFIRIPKALGGGEHHRAPLPPRNNSGLLDGQLLLGFPHLSRQQQAEAAE 256
Query: 1409 QTGTTRSQILSNLNDLALGTSF 1430
G++ Q+L +L +A +F
Sbjct: 257 AVGSSPQQLLEDLRAIAAAATF 278
>gi|298715583|emb|CBJ28136.1| cleavage and polyadenylation specificity factor CG10110-PA
[Ectocarpus siliculosus]
Length = 1906
Score = 94.0 bits (232), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 77/303 (25%)
Query: 215 GDEDTFGSGGGFSARIESSHVINLR-------DLDMKHVKDFI----FVHGYIEPVMVIL 263
G+ED G G G +A+ + NL DL+ + FI F+ G+ EP + +L
Sbjct: 265 GEEDG-GLGNGATAKGDGGAGGNLAVSKPFTIDLEEAGITGFIKAAAFLEGFHEPALALL 323
Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
+E T AGR++ K TC ++ LSI+ T + P+IW NLPHD++ L+ VPSPIGG+ V
Sbjct: 324 YEPIQTCAGRLASKRSTCRLALLSINLTQGRAPVIWQVENLPHDSWDLVPVPSPIGGLQV 383
Query: 324 VGANTIHYHSQS-ASCALALNNYA-VSLDSS-QELP------------------------ 356
+ N + + +QS LA+N YA ++D + E P
Sbjct: 384 ISTNAVMHVNQSEVRSILAVNGYARATVDPALLECPLRGGDSDWGWTSFRRSHPEREVVD 443
Query: 357 RSSFSV--ELDAAHATWLQNDVALLSTKTGD-----LVLLTVVYD--------------- 394
SS+ V ELD +L LLS +TG+ L L TV
Sbjct: 444 LSSYDVCIELDVVRCAFLTPTSMLLSLRTGEVYALRLHLTTVTAAAADAAGCSRPPGGAA 503
Query: 395 ----GRVVQR--LDLSKTNP-SVLT---------SDITTIGNSLFFLGSRLGDSLLVQFT 438
RVV + + + +P SVL + L F+GSR+GDSLLV ++
Sbjct: 504 FGTPNRVVGQSMRPVGRASPCSVLAVAASGGSGGDGGSGASKGLVFMGSRVGDSLLVDYS 563
Query: 439 CGS 441
S
Sbjct: 564 VAS 566
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 50/225 (22%), Positives = 91/225 (40%), Gaps = 60/225 (26%)
Query: 3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
+ Y+ +H PTG+ + G +T + + +LVV
Sbjct: 7 YTCYRQLHPPTGVDHAVFGSVTAAGSR---------------------------DLVVAK 39
Query: 63 ANVIEIYVVRVQEEGS----------KESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
A+ +E+Y V + S +++ N E D S LEL + L GN
Sbjct: 40 ASTLELYRVHRDDHSSTAAAAAAAAARDTSNGDERDD----DDASGYYLELAGTFPLAGN 95
Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
+ +LA++ D ++++F AK++++ +D + L S+H F++
Sbjct: 96 ITALAVIP----------DILVVSFGVAKMALVAYDSVLGRLETISIHNFDAGAIGPGAG 145
Query: 173 GRES-FARGPLVK--------VDPQGRCGGVLVYGLQMIILKASQ 208
G ES + +K DP GRC +V G Q+++L A +
Sbjct: 146 GVESGYGLAAALKDRPRTISSSDPAGRCLAAVVAGCQLVVLPARR 190
>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
Length = 1128
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 121/550 (22%), Positives = 229/550 (41%), Gaps = 94/550 (17%)
Query: 840 FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
+ A L DG +L Y+ L + ++T + R +S+ AS L
Sbjct: 612 YFLAALGDGRLLTYR--LDKSAKDTDSEKKFLYDQRQMSIGTQPAS-----------LSI 658
Query: 900 YTREETPH-GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
+ + H A C R T+ + SG G C + RE RV C S AF
Sbjct: 659 FETQNALHVFAACDRPTVIHSSSG------GGKLLCSNVNLREVTRV----CSFSSEAFP 708
Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP--VQKIPLKATPHQITYFAEKNLY 1016
+C + + ++G L + T DN ++ IPL P +I + +++
Sbjct: 709 -----DC----LALVTEGSLLL------GTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVF 753
Query: 1017 PLIVS---VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1073
++ + V + + N+ LS ++ ID+ + +++ +Y +E++E
Sbjct: 754 AVLTTKQVVTISEDGNEALSETTEEGYVRLIDD---TMMEIVHSYKLEQFETPC------ 804
Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLF 1132
+ I + ++A K+N+ +GTAY E +RGR+L+F
Sbjct: 805 -------SVITVNFGDDA-----------AAKDNQDYFVVGTAYSYADEPEPSRGRMLVF 846
Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
+ + +T V + KGA+ ++ + G +L + + L +W+ TE +
Sbjct: 847 AV------REQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVRWSETESGARTLTE 900
Query: 1193 A----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
++++ + + +FIL+GD+ +S+ L++K + +A+D E L
Sbjct: 901 ECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARDIDPSWITVIEML- 959
Query: 1249 DGSTLSLVVSDEQK-NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
L +S E N+ S + +L E+H+G V + +++
Sbjct: 960 ---DLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRLVL--- 1013
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
P S + +LL+GT +G++G IA +DE TF+ L SLQ L + + V G+
Sbjct: 1014 ----QIPESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQHE 1069
Query: 1368 SFRQFHSNGK 1377
+R+F S +
Sbjct: 1070 DWRRFTSERR 1079
Score = 41.2 bits (95), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L +LD V D F++G+ +P + +L + +H +L
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
PL NL A L+ VP+PIGGV+V+G T+ Y S S L Y S+ S +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
R+ ++ D LL + G L LL T + L +
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ L S I + N ++GS GDS L++
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335
>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
Length = 1138
Score = 92.8 bits (229), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/374 (24%), Positives = 167/374 (44%), Gaps = 50/374 (13%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ E G +++ HNL +D + + ++ MQ+ E AL++
Sbjct: 778 NTEFGQEVEVHNLLIIDQNTFEVLHAHQF-------------------MQT-EYALSLMS 817
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
L N + T +GT V E+ + GR++++ R ADN +V++ KE+KG
Sbjct: 818 AKLGN----DPNTYFIVGTGLVNPEEPEPKTGRIIIY---RYADNELKMVSD---KEVKG 867
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
A +L G +L + L++WT + L F + LY + +FIL+GD
Sbjct: 868 ACYSLVEFNGRVLACINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGD 924
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ +SI L +K+ +A+D+ A E L D + L +D N+ +
Sbjct: 925 LMRSITLLQYKQMEGSFEEIARDYQPNWMTAVEILDDDAFLG---ADNSNNLFVCLKDSA 981
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDG 1331
+ + Q++ A+FH+G V F ++ + S+R+ G +LFGT+ G
Sbjct: 982 ATTDEERQQMPEVAQFHLGDMVNVFRHGSLVMQNISERSTPTTG-------CVLFGTVSG 1034
Query: 1332 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
+IG + + + L+ LQ+ L +++ V ++ +R FH+ K R + +D +L
Sbjct: 1035 AIGLVTQIQSDFYEFLRKLQENLTNTIKSVGKIDHSYWRSFHTETKMER--CEGFIDGDL 1092
Query: 1392 LSHYEMLPLEEQLE 1405
+ + L E+ E
Sbjct: 1093 VESFLDLSREKMRE 1106
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 117/519 (22%), Positives = 195/519 (37%), Gaps = 126/519 (24%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D+DT +L
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DKDT-----------NELKATSL 157
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
R +M HV+D F++G P ++++H+ ++ +H I IS K+ I
Sbjct: 158 RMEEM-HVQDVEFLYGTTHPTLIVIHQD-------INGRH----IKTHEISLKDKEFTKI 205
Query: 299 -WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAVSL 349
W N+ +A L+AVP P+GG +V+G +I YH + A+A +N YA
Sbjct: 206 AWKQDNVETEATMLIAVPMPLGGAIVIGQESIVYHDGDSYVAVAPAIIKQSTINCYA--- 262
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLS 404
+D+ +L ++A G+L ++ + + V+ + +
Sbjct: 263 -------------RIDSKGLRYLLGNMA------GNLFMMFLETEENAKGQTTVRDIKVE 303
Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
+ IT + N + F+GSR GDS LV+ +G + L E F ++ AP
Sbjct: 304 LLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVMLMETFTNL---APIV 360
Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
L+ G+ ++ GS G L+ G+ I
Sbjct: 361 DMCVVD----LERQGQGQMITCSGSFKE--------------------GSLRIIRNGIGI 396
Query: 525 NADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
A ++LPG KG+W + R+ D Y LI+S
Sbjct: 397 QEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLILSFV 431
Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQD 642
T VL + E TE +T N+ +++QV AR++ D M +
Sbjct: 432 GHTRVLMLSGDEVEETEIAGILGDQQTFYCANV-SHGQILQVTPSSARLISCDNKAMICE 490
Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
P N G N+T + + A + + DG +
Sbjct: 491 WK-PPDNKRIGVVGANTTQIVCASAQDVYYVEIGDGKLE 528
>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1089
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 149/329 (45%), Gaps = 28/329 (8%)
Query: 1104 TKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
T ++ +GTAY E+ +GR+L+F D LV E KE+KGA+ L +
Sbjct: 779 TDDSNVYYCVGTAYALPEESEPTKGRILVFLV---EDGKLQLVAE---KEMKGAVYNLNA 832
Query: 1163 LQGHLLIASGPKIILHKWT---GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
G LL KI L+KWT GT + I + + + +FI++GD+ KSI
Sbjct: 833 FNGKLLAGINQKIALYKWTLRDGTRVLEIESSHHGHILALYVQSRGDFIVVGDLMKSISL 892
Query: 1220 LSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
L +K + + A+D+ + A E L D + L ++ N+ + +
Sbjct: 893 LIYKPEEGAIEERARDYNANWMTAVEILDDDTYLG---AENSFNLFTVRKNNDAATDEER 949
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
+L E+H+G V +F ++ P S+ + ++FGT++G IG IA L
Sbjct: 950 GRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSEASLIPTVIFGTVNGVIGVIASL 1002
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL------LS 1393
+ F LQ LQ+ LV + V GL+ +R F + K + +D +L LS
Sbjct: 1003 PQDKFLFLQKLQQALVKVIKGVGGLSHEQWRSFSNERKT--VDARNFLDGDLIESFLDLS 1060
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
+M + LEI+ + R + L+ L+
Sbjct: 1061 RNKMEEIAAPLEISVEELCKRVEELTRLH 1089
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 122/556 (21%), Positives = 214/556 (38%), Gaps = 119/556 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A+ L+ + L+G + +L + G +D + ++FE K VL++D
Sbjct: 39 RIEIHLLTASGLQSMLDVPLYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
G IT S + GR + G + VDP R G+ +Y ++
Sbjct: 95 ET-GSPITRAMGDVSD-----RTGRPT-DNGQIGIVDPDCRLIGLHLYDGMFKVIPIDNK 147
Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
G F+ R+E V++++ F++G P + +L++
Sbjct: 148 GQ----------LKEAFNIRLEELQVLDIK-----------FLYGCANPTIAVLYQDNKD 186
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
H + P W NL + A L+ VP P+GG +++G TI
Sbjct: 187 -------ARHVKTYEVNLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQTI 237
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
Y++ S A+ + + ++ V+ D + LLS G L LL
Sbjct: 238 VYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYLL 280
Query: 390 TVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
+ +D V L++ + S ++ + N + F+GS GDS L++
Sbjct: 281 VISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL------------ 328
Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRD 507
+ +AD ++ S + L+ VN G + L Q T S A +D
Sbjct: 329 ------NHQAD------VKGSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAFKD 376
Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
G L+ G+ IN AS VEL G KG+W++ SS
Sbjct: 377 -----GSLRIVRNGIGINEQAS------------VELQGIKGMWSLRASSS--------- 410
Query: 568 MAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
D Y +L++S E R + + T D L E TE + + +T+ N +++Q
Sbjct: 411 -----DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCHNAV-HDQLVQ 463
Query: 626 VFERGARILDGSYMTQ 641
V R+++ Q
Sbjct: 464 VTAGSLRLVNAKTRKQ 479
>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
Length = 1114
Score = 90.9 bits (224), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/325 (27%), Positives = 147/325 (45%), Gaps = 30/325 (9%)
Query: 1113 IGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GT Y + E RGR+L+F R D LV E KE+KGA+ L + G LL
Sbjct: 802 VGTGYSLPEEPEPTRGRILVF---RAEDGKLQLVAE---KEVKGAVYNLNAFNGKLLAGI 855
Query: 1172 GPKIILHKW---TGTELNGIAFYDAPPL-----YVVSLNIV--KNFILLGDIHKSIYFLS 1221
K+ L + G + G + Y+ ++V+L + FI++GD+ KS+ L+
Sbjct: 856 NSKVELFRGGDPVGADGAGGSTYELAKECSHHGHIVALYVAVRGEFIVVGDLMKSVSLLA 915
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+K + + + A+D+ + A + L D + L ++ N+ + + +
Sbjct: 916 YKPEESVIEERARDYNANWMTAVDILDDDTYLG---AENNFNLFTLRRQSDAATDEERSR 972
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
L E+HVG V +F R ++ D+ A + LLFGT+ G IG +A L
Sbjct: 973 LEVVGEYHVGEFVNRFRRGSLVMRLPDQENA-------DVPTLLFGTVSGVIGVLATLPR 1025
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP--GPDSIVDCELLSHYEMLP 1399
F L +LQ L +V V GL+ ++R F N HR G VD +L+ + L
Sbjct: 1026 EQFEFLSALQAALNKTVSGVGGLSHDAWRSFQ-NEHRHRAKDGARGFVDGDLIESFLDLR 1084
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
E+ E+A + ++ + DL
Sbjct: 1085 PEKAREVAAAVKLSVDELTRRVEDL 1109
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 89/212 (41%), Gaps = 33/212 (15%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F R+E +V++++ F+HG P + +L+E H
Sbjct: 153 FDVRLEELNVVDVK-----------FMHGCATPTICVLYED-------TKEARHVKTYEV 194
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
TL+ P WS ++ + ++ VP+P+GG +VVG + I Y ++
Sbjct: 195 DVKEKTLRDGP--WSQSDVEGGSSLIIPVPAPLGGAIVVGESVIVYLNKDGG-------- 244
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ SV + A LLS TG L LL +V+D R V L L
Sbjct: 245 -----NGAGGAIATKSVNVMAHGVVDADGSRYLLSDSTGMLHLLVLVHDRRRVHALKLES 299
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ + S ++ + N + ++GS GDS LV+
Sbjct: 300 LGQTSIASTLSYLDNGVVYVGSAYGDSQLVRL 331
>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
Length = 1137
Score = 90.9 bits (224), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 142/312 (45%), Gaps = 22/312 (7%)
Query: 1111 LAIGTAYVQGEDVAA--RGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
+GTAY+ ED A +GR+L+F+ TG + + LVTE KE+KGA+ L + G +
Sbjct: 806 FVVGTAYIH-EDEAEPHQGRILVFAVTGIHGERKLQLVTE---KEVKGAVYCLNAFNGKV 861
Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFILLGDIHKSIYFLSWK 1223
L K L+KW+ N Y V+ + +FI++GD+ KS+ LS+K
Sbjct: 862 LAGVNSKAQLYKWSENTDNEKELVSECGHYGHTLVLYMESRGDFIVVGDLMKSVSLLSYK 921
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
+ + +AKD S + + ++D T + S+ N+ + + +L
Sbjct: 922 QLDGTIEEIAKDLNS-NWMSALGIVDDDTY--IGSETDFNLFTVQRNSGAASDEERGRLE 978
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSS------DRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
+ EFH+G V +F + ++ D AP ++LFGT+ G IG I
Sbjct: 979 TVGEFHLGEFVNRFRYGSLTPAAAGPTDMVDVVEQAPIVPAAQNQSMLFGTVSGMIGVIL 1038
Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
PL + + L +Q+ L V V G + + +R F + + +D +L+ +
Sbjct: 1039 PLTKDQYSFLLRVQQALTQVVKGVGGFSHKDWRMFENRRSVSE--ARNFIDGDLVESFLD 1096
Query: 1398 LPLEEQLEIAHQ 1409
LP + ++ +
Sbjct: 1097 LPKAQMTKVVDK 1108
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 109/467 (23%), Positives = 182/467 (38%), Gaps = 108/467 (23%)
Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
R+S R + +DP+GR G+ +Y ++ G L +DTF
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---KDTF--------- 154
Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
N+R LD V D F+HGY +P + +L+E H L
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYTKPTICVLYED-------YKAARHIKTYHILLKE 199
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
+ P WS N+ A L+ VP+P+GGVL+V TI YH+ S A+ + + + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPVGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257
Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
+ + S F LL+ + G L ++ + + G+ V + L +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVTGVHLEVLGET 300
Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
+ S ++ + N + F+GS GDS L++ ++E G
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDENG-------------- 336
Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
S + L VN + + + + + T S A +D G L+ G+ IN
Sbjct: 337 SYIEVLDTYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
ASA ELPG KG+W + AA D+Y +S E R
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKYLLQSYVS-EIRI 427
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + D + E + + F +T+ N++G +QV E R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDVW-LQVTESEVRLI 471
>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
Length = 1089
Score = 90.1 bits (222), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 140/304 (46%), Gaps = 38/304 (12%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQN 1143
+ + EN T+ + T + T +GTAY + E+ ++GR+L+F+ D
Sbjct: 764 LDTFENGCTIITCSF----TDDPATYYCVGTAYALPEENEPSKGRILIFTV---EDGKFQ 816
Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GTELNGIAFYDAP--PLY 1197
LVTE KE KGA+ L + G LL KI L+KWT EL + LY
Sbjct: 817 LVTE---KETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGHHGHILALY 873
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTL 1253
V S +FI++GD+ KSI L +K + + A+D+ + A E L D G+
Sbjct: 874 VQSRG---DFIVVGDLMKSISLLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAEN 930
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
S + +KN + ++ +G +L E+H+G V +F ++
Sbjct: 931 SFNLFTVRKN------SDAATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMR-------L 976
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
P ++ + ++FGT++G IG +A L + F LQ LQ L + V GL+ +R F
Sbjct: 977 PDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFS 1036
Query: 1374 SNGK 1377
S K
Sbjct: 1037 SERK 1040
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 122/557 (21%), Positives = 217/557 (38%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++A L+ + ++G + +L + G +D + ++ E K VL++D
Sbjct: 39 RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + VDP+ R G+ +Y GL +I ++
Sbjct: 95 ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P W NL + A L+ VP+P+GGV+++G T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WLQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y+S SA A+ + + ++ V+ D + LLS TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + S ++ + N + ++GS GDS L++
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ + D+ R S + L+ VN G + L Q T S A +
Sbjct: 329 -------NAQVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
A D + +L++S E R + + D L E TE + + +T+ N ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D + Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479
>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
Length = 1089
Score = 90.1 bits (222), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 140/304 (46%), Gaps = 38/304 (12%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQN 1143
+ + EN T+ + T + T +GTAY + E+ ++GR+L+F+ D
Sbjct: 764 LDTFENGCTIITCSF----TDDPATYYCVGTAYALPEENEPSKGRILIFTV---EDGKFQ 816
Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT----ELNGIAFYDAP--PLY 1197
LVTE KE KGA+ L + G LL KI L+KWT EL + LY
Sbjct: 817 LVTE---KETKGAVYNLNAFNGKLLAGINQKIQLYKWTQRDSTRELQSECGHHGHILALY 873
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTL 1253
V S +FI++GD+ KSI L +K + + A+D+ + A E L D G+
Sbjct: 874 VQSRG---DFIVVGDLMKSISLLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAEN 930
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
S + +KN + ++ +G +L E+H+G V +F ++
Sbjct: 931 SFNLFTVRKN------SDAATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMR-------L 976
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
P ++ + ++FGT++G IG +A L + F LQ LQ L + V GL+ +R F
Sbjct: 977 PDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQRLQHCLAKVIKGVGGLSHEQWRSFS 1036
Query: 1374 SNGK 1377
S K
Sbjct: 1037 SERK 1040
Score = 70.1 bits (170), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 123/557 (22%), Positives = 218/557 (39%), Gaps = 121/557 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++A L+ + ++G + +L + G +D + ++ E K VL++D
Sbjct: 39 RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + VDP+ R G+ +Y GL +I ++
Sbjct: 95 ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WSQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y+S SA A+ + + ++ V+ D + LLS TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + S ++ + N + ++GS GDS L++
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
+ + D+ R S + L+ VN G + L Q T S A +
Sbjct: 329 -------NAQVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN ASA EL G KG+W++
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
A D + +L++S E R + + D L E TE + + +T+ N ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462
Query: 625 QVFERGARILDGSYMTQ 641
QV R++D + Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479
>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
Length = 1089
Score = 90.1 bits (222), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 149/331 (45%), Gaps = 33/331 (9%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ ++GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 778 SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 831
Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 832 FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 890
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY---APKMS 1274
L +K + + + LA+D+ + A E L D + + N IF + +
Sbjct: 891 SLLVYKHEESAIEELARDYNANWMSAVEMLDDE-----IYIGAENNYNIFTVRKNSDAAT 945
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSI 1333
+ +G +L E+H+G V + ++ D G P ++FGT++G I
Sbjct: 946 DEERG-RLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP--------TVIFGTINGVI 996
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
G IA L + L+ LQ LV + V L+ +R FH++ K + +D +L+
Sbjct: 997 GIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIE 1054
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ L + E+A G ++ + +L
Sbjct: 1055 SFLDLSRNKMEEVAKGMGVPVEELSKRVEEL 1085
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + ++ +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G ++P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y + +++ ++ Q + R+ V+ D + LL G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N + ++GSR GDS LV+
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++AD S + L+ VN + + + + + T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++D Y YL++S + T L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429
>gi|396082420|gb|AFN84029.1| pre-mRNA cleavage and polyadenylation [Encephalitozoon romaleae
SJ-2008]
Length = 1156
Score = 90.1 bits (222), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 110/480 (22%), Positives = 209/480 (43%), Gaps = 62/480 (12%)
Query: 960 LHNVNCNHGFIYVTSQGILKICQLPS----GSTYDNYWPVQKIPLKATPHQITYFAEKNL 1015
L++ N + S+G L +C++PS + + +KIP+ P I Y +
Sbjct: 725 LNSATVNKNQLIQLSRGHLMVCKVPSVRDEQYVFGDGLVGRKIPILRIPKHIEY---ADR 781
Query: 1016 YPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG 1075
Y ++ S D E + + V+ +R Y V+ Y R
Sbjct: 782 YMVVASCK-------------DVEFSSKDEKDCGIPVNTYRFY-VDLYSER--------- 818
Query: 1076 PWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRV---LL 1131
++ +T + +E V+ + L + ++ L + T +++GED ARGR+ +
Sbjct: 819 -YEHISTYELDENEYIFDVKYLVLDDMQGNYGKSPFLLVCTTFIEGEDRPARGRLHVLEI 877
Query: 1132 FSTGRNADNP-QNLVTEVYSKE-LKGAISALASLQGHLLIASGPKIILHKWT-GTELNGI 1188
S + ++P ++ +V E KG+I + ++G + + G KI+++K T + I
Sbjct: 878 ISVVPSLESPFRDCKLKVLGIEKTKGSIVQCSEVRGKIALCLGTKIMIYKIDRSTGIIPI 937
Query: 1189 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
FYD ++ S++++KN+IL DI++ + F ++ + +L+L++ + +TE L
Sbjct: 938 GFYDLH-IFTSSISVMKNYILASDIYRGLSFFFFQSKPIRLHLISSSEPLKNVTSTELLT 996
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
G+ LS+V D + I + Y+P S G KL+ R+E L L++S
Sbjct: 997 AGNELSMVCCDAKGTIHAYTYSPNNIISMDGAKLVKRSEMKTN--------LGRLSSSGI 1048
Query: 1309 --RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1366
R + KTN L G +D+ + +L +Q ++ + V GLN
Sbjct: 1049 GFRKNSIMFYSKTNLLIYLVG-----------MDDSYYLKLLKIQTSIMVHLKSVLGLNQ 1097
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
R + +S+ H S + +L+ + L Q I+ +R +IL L L +
Sbjct: 1098 RDY--LNSDIHLHSLSLKSPIIMHILNLFSYFDLNTQKLISTSVKMSRREILDVLASLNI 1155
>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
Length = 1090
Score = 90.1 bits (222), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 149/331 (45%), Gaps = 33/331 (9%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ ++GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 779 SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832
Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 833 FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 891
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY---APKMS 1274
L +K + + + LA+D+ + A E L D + + N IF + +
Sbjct: 892 SLLVYKHEESAIEELARDYNANWMSAVEMLDDE-----IYIGAENNYNIFTVRKNSDAAT 946
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSI 1333
+ +G +L E+H+G V + ++ D G P ++FGT++G I
Sbjct: 947 DEERG-RLEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIP--------TVIFGTINGVI 997
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
G IA L + L+ LQ LV + V L+ +R FH++ K + +D +L+
Sbjct: 998 GIIASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIE 1055
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ L + E+A G ++ + +L
Sbjct: 1056 SFLDLSRNKMEEVAKGMGVPVEELSKRVEEL 1086
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G ++P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCVKPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS NL + A L+ VP+P+GGV+++G TI Y + +++
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFR------ 247
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
++ Q + R+ V+ D + LL G L LL + ++ V L +
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHLLVLTHERERVTGLKIEY 297
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N + ++GSR GDS LV+ +++AD
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADPNG-- 337
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
S + L+ VN + + + + + T S A +D G L+ G+
Sbjct: 338 ----SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ KSS ++D Y YL++S
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDMYLVVSF 422
Query: 584 EARTMVL 590
+ T L
Sbjct: 423 ISETRFL 429
>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
Length = 1148
Score = 89.7 bits (221), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 161/355 (45%), Gaps = 40/355 (11%)
Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAA-RGRVLLFS-TGRNADNPQNLVTEVYSKEL 1153
++T T T +GTA+V E+ +GR+L+F+ +G + D LVTE KE+
Sbjct: 799 IITCIFTGDSSGGTYYVVGTAFVHEEEAEPHQGRILVFTVSGIHGDRRLQLVTE---KEV 855
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY----VVSLNIVKNFIL 1209
KG++ L + G LL K+ L KW+ +E NG + V+ + +FI+
Sbjct: 856 KGSVYCLNAFNGKLLAGVNSKVYLFKWSESEENGEELVSECGHHGHTLVLYMESRGDFIV 915
Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQ 1265
+GD+ KSI L+ K+ + +A+D S A + D GS + Q+N
Sbjct: 916 VGDLMKSISLLNHKQLDGSIEEIARDLNSNWMTAVGIIDDDNYVGSETDFNLFTVQRN-- 973
Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA-APG--------- 1315
+ S+ +G +L + E+H+G V +F ++ + GA APG
Sbjct: 974 ----SGAASDEERG-RLETIGEYHLGEFVNRFRYGSLVMQHNLSIGAEAPGISLSDDRPE 1028
Query: 1316 --SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
S + + ++LFGT+ G IG I P+ + L +Q L + V G + +R F
Sbjct: 1029 SLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQVIQGVGGFSHSEWRTFE 1088
Query: 1374 ---SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
S+ +AH + +D +L+ + L +E ++ + + + + L LA
Sbjct: 1089 NRRSSIEAH-----NFIDGDLIESFLDLSKDEMKQVVDELNRDQLEGKTTLEALA 1138
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 114/511 (22%), Positives = 193/511 (37%), Gaps = 115/511 (22%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I L + + VL +D ++ + + + R E G +DP G
Sbjct: 74 QDWIFLVTQRFQFCVLAYDTTLQQIITKANGSLRDT----IGRNSEILTNG---NIDPDG 126
Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDF 249
R G+ +Y ++ L F+ R++ LR LD+K
Sbjct: 127 RLIGMNIYEGYFKVIPIDNHSKSL---------KAAFNIRLD-----ELRILDIK----- 167
Query: 250 IFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAY 309
F++GY +P + +L+E H L + P WS N+ A
Sbjct: 168 -FLYGYNKPTICVLYED-------FKAARHVKTYFILLKEKDFAEGP--WSQSNVEAGAN 217
Query: 310 KLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
L+ VP P GGVL++ TI YH+ + A+ + N + + + S F
Sbjct: 218 LLIPVPMPYGGVLIISNQTIVYHNGTYFHAIPMQNTMIQVYGAVGDDGSRF--------- 268
Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
LL+ + G L ++ + +G+ V + L + + S ++ + N + F+GS
Sbjct: 269 --------LLADQYGALHVVALQTEGKEVLDVYLEVLGQTSIASCVSYLDNGVVFVGSTF 320
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
GDS LV+ + ++E G S + L VN + +
Sbjct: 321 GDSQLVKL----------NSKRDESG--------------SYIEVLDSYVNIGPIIDFCV 356
Query: 490 ASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
+ + + T S A +D G L+ G+ IN ASA ELPG
Sbjct: 357 MDLDRQGQGQIVTCSGADKD-----GSLRVIRNGIGINEQASA------------ELPGI 399
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDY 605
KG+W + + EY YL+ S E R M + +D + EV ++
Sbjct: 400 KGMWALRESLA--------------SEYDKYLVQSYLNEIRIMTIGDSDEMEEV--EIEA 443
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
F+ +T+ N+ +QV E RI+D
Sbjct: 444 FLDAKTLYCRNV-NEDGWLQVTETEVRIIDA 473
>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
Length = 1033
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)
Query: 1104 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+L+F D ++TE KE KGA+ +L S
Sbjct: 722 SDDNNLYYCVGTAYVMPEENEPTKGRILVFLV---EDGKLQVITE---KETKGAVYSLNS 775
Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 776 FNGKLLAAINQKIQLYKWMLRDDGSRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 834
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + A+D+ + A E L D L + N +F K SE
Sbjct: 835 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGA 888
Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 889 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTVNGVIG 941
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
IA L + L+ LQ L + V GL+ +R F++ K + +D +L+
Sbjct: 942 VIASLPHEQYIFLEKLQSNLRRVIKGVGGLSHEQWRSFNNEKKTVE--AKNFLDGDLIES 999
Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ L EI+ G + ++ + +L
Sbjct: 1000 FLDLSRNRMDEISKAIGVSVEELCKRVEEL 1029
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 146/362 (40%), Gaps = 85/362 (23%)
Query: 231 ESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
E+S +I L+ V D F++G +P +V+L++ +H AL
Sbjct: 95 ETSELIT--RLEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVALK-DK 145
Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
+ P W+ NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 146 DFGEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR------- 196
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV 410
+ R+ V+ D + LL G L LL + ++ V L + +
Sbjct: 197 --PSITRAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETS 246
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ S I+ + N++ ++GS GDS LV+ +++ DA + S
Sbjct: 247 IASTISYLDNAVVYIGSSYGDSQLVKL------------------NLQPDA------KGS 282
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADA 528
+ L+ VN + + + + T S A +D G L+ G+ IN A
Sbjct: 283 YVEVLESYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQA 337
Query: 529 SATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
S VEL G KG+W++ ++ DD + +L++S + T
Sbjct: 338 S------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETR 371
Query: 589 VL 590
+L
Sbjct: 372 IL 373
>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
Length = 1090
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 146/328 (44%), Gaps = 27/328 (8%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ ++GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 779 SDDNNVYYCVGTAYVLPEENEPSKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832
Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 833 FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYT-QTRGDFIVVGDLMKSI 891
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + + LA+D+ + A E L D + ++ NI +
Sbjct: 892 SLLVYKHEESAIEELARDYNANWMSAVEMLDDEIYIG---AENNYNIFTVRKNSDAATDE 948
Query: 1278 KGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
+ +L E+H+G +F ++ D G P ++FGT++G IG I
Sbjct: 949 ERGRLEVVGEYHLGEFGNRFRHGSLVMRLPDSEMGQIP--------TVIFGTINGVIGII 1000
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
A L + L+ LQ LV + V L+ +R FH++ K + +D +L+ +
Sbjct: 1001 ASLPHEQYVFLEKLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSE--ARNFLDGDLIESFL 1058
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
L + E+A G ++ + +L
Sbjct: 1059 DLSRNKMEEVAKGMGVPVEELSKRVEEL 1086
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + ++ +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G ++P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP+P+GGV+++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y + +++ ++ Q + R+ V+ D + LL G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N + ++GSR GDS LV+
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++AD S + L+ VN + + + + + T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++D Y YL++S + T L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429
>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
Length = 1145
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 146/311 (46%), Gaps = 25/311 (8%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1143
++ SE A+++ L N +++ +GTA + ++ ++ GR+++F + ++ P+
Sbjct: 805 LEGSEMAMSLASCQLGN----DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPER 857
Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNI 1203
+ VY KE+KGA ++ S+ G L++A + L +WT + + D + + L
Sbjct: 858 MRL-VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKT 916
Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
+ IL+GD+ +S+ LS+K + +A+DF + A E + S L + +
Sbjct: 917 KNDLILVGDLMRSLSLLSYKSVESTFEKVARDFMTNWMSACEIIDSDSFLG-----AENS 971
Query: 1264 IQIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
+F +K G +L F++G V F + AT D AP
Sbjct: 972 YNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYH 1024
Query: 1322 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
++L+GT DG IG I + + + L +QK+L D + ++ +R F + ++
Sbjct: 1025 SSILYGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHTQYRTFETEKRSEV- 1083
Query: 1382 GPDSIVDCELL 1392
P+ +D +L+
Sbjct: 1084 -PNGFIDGDLI 1093
Score = 47.0 bits (110), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A ++ VP P GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
+ +D +L D+A G+L +L + + G+ V+ + +
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
+ + + + N + F+GSR GDS L++ + GT +S L + + ++ AP
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362
Query: 465 KRLRRSSSDALQDM----VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
++DM NG++ L S + + + + + L +
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
VEL G K ++T+ + D E+ YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+S ++ T VL E T+ + V G T+ AG LF ++QV ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483
>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
Length = 1089
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/396 (23%), Positives = 170/396 (42%), Gaps = 36/396 (9%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ + L+ +VEE E +R+L+ +++ P+ E ++
Sbjct: 717 RRICHQEQSRTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVYPLDQYECGCSI 772
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
+ ++ +GTAYV E+ +GR+L+F+ D L+ E KE
Sbjct: 773 ISCSF----ADDSNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSLQLIVE---KET 822
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW EL + L + + +FI
Sbjct: 823 KGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYT-QTRGDFI 881
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + + A+D+ + A E L D V ++ N+
Sbjct: 882 VVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVGAENSYNLFTVR 938
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+ + +L E+H+G V +F ++ P SD ++FGT
Sbjct: 939 KNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDIGQIPTVIFGT 991
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
++G IG IA L + L+ LQ LV + V L+ +R FH++ K + +D
Sbjct: 992 INGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTAE--ARNFLD 1049
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+L+ + L + E++ G ++ + +L
Sbjct: 1050 GDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVEEL 1085
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 151/367 (41%), Gaps = 83/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F+HG +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLHGCAKPTIVVLYQ------DNKDVRHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS N+ + A L+ VP+P+GGV+++G I Y + +++
Sbjct: 197 LK-DKDFVEGP--WSQNNVDNGAGLLIPVPAPLGGVIIIGEEQIVYCNANSTFK------ 247
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
++ Q + R+ V+ D + LL TG L LL + ++ V L +
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSRY--------LLGDNTGILHLLVLTHERERVTGLKIEY 297
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N + ++GSR GDS LV+ +++ADA
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADASG-- 337
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
S + L+ VN + + + + + T S A +D G L+ G+
Sbjct: 338 ----SFVEILERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ KSS +D + YL++S
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------INDPFDMYLVVSF 422
Query: 584 EARTMVL 590
+ T L
Sbjct: 423 ISETRFL 429
>gi|226510488|ref|NP_001145925.1| uncharacterized protein LOC100279448 [Zea mays]
gi|219884971|gb|ACL52860.1| unknown [Zea mays]
Length = 416
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/407 (23%), Positives = 174/407 (42%), Gaps = 41/407 (10%)
Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATI 1083
PLN+ + + HQ + L+ +VEE E +R+L+ +++
Sbjct: 38 PLNEQA-----RRICHQEQSRTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVY 88
Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1142
P+ E ++ + ++ +GTAYV E+ +GR+L+F+ D
Sbjct: 89 PLDQYECGCSIISCSF----ADDSNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSL 141
Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLY 1197
L+ E KE KGA+ +L + G LL A KI L+KW EL + L
Sbjct: 142 QLIVE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILA 198
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
+ + +FI++GD+ KSI L +K + + + A+D+ + A E L D V
Sbjct: 199 LYT-QTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVG 254
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
++ N+ + + +L E+H+G V +F ++ P SD
Sbjct: 255 AENSYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSD 307
Query: 1318 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
++FGT++G IG IA L + L+ LQ LV + V L+ +R FH++ K
Sbjct: 308 IGQIPTVIFGTINGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKK 367
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ +D +L+ + L + E++ G ++ + +L
Sbjct: 368 T--AEARNFLDGDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVEEL 412
>gi|402913617|ref|XP_003919276.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Papio anubis]
Length = 132
Score = 88.6 bits (218), Expect = 3e-14, Method: Composition-based stats.
Identities = 48/121 (39%), Positives = 72/121 (59%), Gaps = 12/121 (9%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LEL + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 18 LELAASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 73
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
FE PE L+ G P V+VDP GRC +LVYG ++++L ++ GLVG
Sbjct: 74 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVG 130
Query: 216 D 216
+
Sbjct: 131 E 131
>gi|413948669|gb|AFW81318.1| hypothetical protein ZEAMMB73_456332 [Zea mays]
Length = 674
Score = 88.2 bits (217), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 158/363 (43%), Gaps = 45/363 (12%)
Query: 1027 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATI 1083
PLN+ + + HQ + L+ +VEE E +R+L+ +++
Sbjct: 296 PLNEQA-----RRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQ----TFESLCVY 346
Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1142
P+ E ++ + + +N +GTAYV E+ +GR+L+F+ D
Sbjct: 347 PLDQYECGCSIISCSFVD----DNNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSL 399
Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLY 1197
L+ E KE KGA+ +L + G LL A KI L+KW EL + L
Sbjct: 400 QLIVE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILA 456
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
+ + +FI++GD+ KSI L +K + + + A+D+ + A E L D V
Sbjct: 457 LYT-QTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANWMTAVEMLDDE---VYVG 512
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGS 1316
++ N+ + + KL E+H+G V +F ++ D G P
Sbjct: 513 AENGYNLFTVRKNSDAATDDERAKLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGKIP-- 570
Query: 1317 DKTNRFALLFGTLDGSIGCIA--PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
++FGT++G IG IA P D TF L+ Q LV + V ++ +R FH+
Sbjct: 571 ------TVIFGTINGVIGIIASLPHDHYTF--LEKFQSTLVKYIKGVGNMSHEQWRSFHN 622
Query: 1375 NGK 1377
+ K
Sbjct: 623 DKK 625
>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
Length = 655
Score = 87.8 bits (216), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 149/314 (47%), Gaps = 31/314 (9%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1143
++ SE A+++ L N +++ +GTA + ++ ++ GR+++F + ++ P+
Sbjct: 306 LEGSEMAMSLASCQLGN----DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPER 358
Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVS 1200
+ VY KE+KGA ++ S+ G L++A + L +WT + L F + LY+ +
Sbjct: 359 MRL-VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKT 417
Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
N + IL+GD+ +S+ LS+K + +A+DF + A E + S L
Sbjct: 418 KN---DLILVGDLMRSLSLLSYKSVESTFEKVARDFMTNWMSACEIIDSDSFLG-----A 469
Query: 1261 QKNIQIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
+ + +F +K G +L F++G V F + AT D AP
Sbjct: 470 ENSYNLFTVVKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP---- 522
Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
++L+GT DG IG I + + + L +QK+L D + ++ +R F + ++
Sbjct: 523 LYHSSILYGTSDGGIGVIVQMPPVLYTFLHDVQKRLADYTENCMRISHTQYRTFETEKRS 582
Query: 1379 HRPGPDSIVDCELL 1392
P+ +D +L+
Sbjct: 583 EV--PNGFIDGDLI 594
>gi|301093655|ref|XP_002997673.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110063|gb|EEY68115.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 176
Score = 87.8 bits (216), Expect = 4e-14, Method: Composition-based stats.
Identities = 56/175 (32%), Positives = 89/175 (50%), Gaps = 18/175 (10%)
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFA 1323
+AP+ ES GQ+LL ++FH+G V+ R ++ A+ S + AAP S+ N
Sbjct: 3 FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN--- 59
Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
+ GT +G +G + P+ E FRRL +LQ +V+++P LNPR FR +N + P
Sbjct: 60 -VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRP 118
Query: 1384 DS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
D+ +D +L + L Q E+A GTT ++ NL ++ TS
Sbjct: 119 DAWSKKKWKKSFLDAFVLFRFLQLDYVAQKELARCIGTTPEVVMHNLLEVQHATS 173
>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
Length = 1095
Score = 87.8 bits (216), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1139
+T P+ E ++ L + + ++ IGTAYV E+ +GR+L+F D
Sbjct: 764 STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 816
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1194
L+ E KE KGA+ +L + G LL A KI L+KW E G
Sbjct: 817 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 873
Query: 1195 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
LYV + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 874 HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 930
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
L + N +F K SE + +L E+H+G V +F ++
Sbjct: 931 YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 981
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
P SD ++FGT++G IG IA L + L+ LQ L + V GL+
Sbjct: 982 ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1037
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R F++ K + +D +L+ + L EI+ +++ + +L
Sbjct: 1038 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1091
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 113/502 (22%), Positives = 193/502 (38%), Gaps = 123/502 (24%)
Query: 95 GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
G+ L+ + ++G + +L + G +D + +A E K VL++D +
Sbjct: 49 GLQCICLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDTEASEV 104
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGL 213
+M + GR + G + +DP R G+ +Y GL +I ++G
Sbjct: 105 ITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK- 156
Query: 214 VGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
F+ R+E V++++ F++G +P +V+L++
Sbjct: 157 ----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ------DN 189
Query: 274 VSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
+H + +LK I W+ NL + A L+ VP P+ GVL++G TI
Sbjct: 190 KDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIGEETIV 243
Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
Y S SA A+ + + R+ V+ D + LL G L LL
Sbjct: 244 YCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGLLHLLV 286
Query: 391 VVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
+ ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 287 ITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL----------- 335
Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDS 508
P TK S + L+ VN + + + + T S A +D
Sbjct: 336 ----------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD- 381
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
G L+ G+ IN AS VEL G KG+W++
Sbjct: 382 ----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--------------R 411
Query: 569 AAYDDEYHAYLIISLEARTMVL 590
+A DD Y +L++S + T VL
Sbjct: 412 SATDDPYDTFLVVSFISETRVL 433
>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
pigmentation protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
Length = 1090
Score = 87.8 bits (216), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 40/356 (11%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1139
+T P+ E ++ L + + ++ IGTAYV E+ +GR+L+F D
Sbjct: 759 STYPLDQFEYGCSI----LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIV---ED 811
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP----- 1194
L+ E KE KGA+ +L + G LL A KI L+KW E G
Sbjct: 812 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWASREDGGSRELQTECGHHG 868
Query: 1195 ---PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
LYV + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 869 HILALYVQTRG---DFIVVGDLMKSISLLIFKHEEGAIEERARDYNANWMSAVEILDDDI 925
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
L + N +F K SE + +L E+H+G V +F ++
Sbjct: 926 YLG-----AENNFNLFT-VRKNSEGATDEERSRLEVVGEYHLGEFVNRFRHGSLVMR--- 976
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
P SD ++FGT++G IG IA L + L+ LQ L + V GL+
Sbjct: 977 ----LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLEKLQTNLRKVIKGVGGLSHEQ 1032
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R F++ K + +D +L+ + L EI+ +++ + +L
Sbjct: 1033 WRSFYNEKKT--VDAKNFLDGDLIESFLDLSRNRMEEISKAMSVPVEELMKRVEEL 1086
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 113/507 (22%), Positives = 196/507 (38%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
+ +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EASEVITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK I W+ NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S SA A+ + + R+ V+ D + LL G
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L LL + ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL------ 330
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
P TK S + L+ VN + + + + T S
Sbjct: 331 ---------------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
+A DD Y +L++S + T VL
Sbjct: 406 ----RSATDDPYDTFLVVSFISETRVL 428
>gi|301124072|ref|XP_002909688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262107255|gb|EEY65307.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 176
Score = 87.4 bits (215), Expect = 5e-14, Method: Composition-based stats.
Identities = 56/175 (32%), Positives = 89/175 (50%), Gaps = 18/175 (10%)
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS-----DRTGAAPGSDKTNRFA 1323
+AP+ ES GQ+LL ++FH+G V+ R ++ A+ S + AAP S+ N
Sbjct: 3 FAPQDIESRGGQRLLRVSDFHLGVQVSSMFRKRVDASGSVVSATNGRNAAPLSNYVN--- 59
Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
+ GT +G +G + P+ E FRRL +LQ +V+++P LNPR FR +N + P
Sbjct: 60 -VMGTSEGGVGALVPVGERVFRRLFTLQNVMVNTLPQNCALNPREFRMLKTNAQRRCGRP 118
Query: 1384 DS---------IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
D+ +D +L + L Q E+A GTT ++ NL ++ TS
Sbjct: 119 DAWSKKKWKKSFLDAFVLFRFLQLNYVAQKELARCIGTTPEVVMHNLLEVQHATS 173
>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
Length = 1088
Score = 87.4 bits (215), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 104/403 (25%), Positives = 176/403 (43%), Gaps = 48/403 (11%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ + S + EE E +R+L+ ++ +T P+ + E ++
Sbjct: 716 RRICHQEQSRTFSICSMKNQSNAEESEMHFIRLLDDQ----TFEFISTYPLDTFEYGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + + ++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + A+D+ + A E L D L + N +F
Sbjct: 881 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT 935
Query: 1269 YAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
K SE + +L E+H+G V +F ++ P SD ++
Sbjct: 936 -VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVI 987
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
FGT++G IG IA L + L+ LQ L + V GL+ +R F++ K +
Sbjct: 988 FGTVNGVIGVIASLPHEQYLFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKT--VDAKN 1045
Query: 1386 IVDCEL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
+D +L LS M + + +EI+ + R + L+ L+
Sbjct: 1046 FLDGDLIESFLDLSRSRMDEISKAMEISVEELCKRVEELTRLH 1088
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 109/507 (21%), Positives = 193/507 (38%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++ ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEINLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+HG +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLHGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + LK I WS NL + A L+ VP P GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S + A+ + + ++ V+ D + LL G
Sbjct: 234 EETIVYCSANVFRAIPIR---------PSITKAYGRVDADGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
L LL + ++ V L + + + S I+ + N+ F+GS GDS LV+
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA T + L VN + + + + T S
Sbjct: 329 ----------NLHPDAKGT------YVEVLDRYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
+ DD + +L++S + T +L
Sbjct: 406 ----RSLTDDPFDTFLVVSFISETRIL 428
>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
Length = 1275
Score = 87.0 bits (214), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/391 (24%), Positives = 175/391 (44%), Gaps = 52/391 (13%)
Query: 1061 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-- 1118
EE+ VR+ + ++T A P++ +EN ++ + +++ +GTA+
Sbjct: 892 EEFFVRLFD----NKTFETLAKYPLEPNENDASIISCSF----DGDDDIYFVVGTAFADP 943
Query: 1119 QGEDVAARGRVLLF-------STGRNA-----DNP-----------QNLVTEVYSKELKG 1155
E ++RGR+L+F S G NA D+ Q +T V KE +G
Sbjct: 944 HSEPESSRGRILVFKVSNTSSSGGGNAVVNGNDHGDGRASASSSVLQKSLTLVCEKETRG 1003
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAF-YDAPPL-YVVSLNI--VKNFILLG 1211
A+ L + G LL + L W ++ N ++ + ++++L + N I++G
Sbjct: 1004 AVYNLNAFCGKLLAGINSLVKLFNWGVSKENKRELVHECSHMGHIIALKVETKDNLIVVG 1063
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ KSI L ++ + ++ +A DF S A E L D + L S +Q A
Sbjct: 1064 DLMKSITLLQYQRESGRIEEVAHDFSSNWMTAVEILDDNTYLGAESSYNLFTVQ--RNAD 1121
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LLFG 1327
+E +G L A FH+G V +F R ++ D SD T+ + LFG
Sbjct: 1122 ADTEDKRGTLELCGA-FHLGDSVNRFRRGSLVMRMPDL------SDDTSSLSEISTWLFG 1174
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G +G +A L + F L +Q+ + V V + FR FH+ ++ + +
Sbjct: 1175 TISGGLGVVATLPKRDFMLLNKVQEAMQKVVTGVGNFSHSDFRSFHNVQRSVE--MRNFI 1232
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
D +L+ + L E+Q+ ++ +G + S+ L
Sbjct: 1233 DGDLVEIFLDLSKEDQVAVSELSGVSNSEDL 1263
>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
Length = 1092
Score = 87.0 bits (214), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 90/365 (24%), Positives = 156/365 (42%), Gaps = 31/365 (8%)
Query: 1060 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-V 1118
VEE +++ + ++ T +Q EN +V + + T +GTA V
Sbjct: 744 VEEQFIKLFDDQ----TFEILDTYQLQEFENTCSVECASFSDDPT----LYYIVGTATAV 795
Query: 1119 QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
E GR+L+F D +L SKE+KGA + G LL KI L
Sbjct: 796 PQESEPKEGRLLVFEV---IDRKLHLKA---SKEIKGAPYQIKPFNGKLLAGINSKIELF 849
Query: 1179 KWTGTELNGIAFYDA----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
+ + ++ + + V+ L +FI+ GD+ +SI L++K+ Q+ +A+
Sbjct: 850 RLSDSDTGHMELVSECCHRGHILVLYLQTRGDFIVAGDLMRSISLLTYKQVDGQIEEIAR 909
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
DF + A + L D + L ++ N+ + + +L E+H+G V
Sbjct: 910 DFNANWMTAVDILDDDTFLG---AEGYFNLFTVRKNTDATSDEERARLEVVGEYHLGDMV 966
Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
+F R ++ SSD P +D ++FGT++G IG IA L + + L +Q L
Sbjct: 967 NRFQRGSLVLRSSD----TPTTD-----TIIFGTVNGMIGVIAVLSKEEYEFLLKVQDAL 1017
Query: 1355 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
+ V GL +R F + P +D +L+ + L E+ E+ H G+
Sbjct: 1018 NFVIKGVGGLRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRREKMEEVCHAIGSIT 1077
Query: 1415 SQILS 1419
+ LS
Sbjct: 1078 VEELS 1082
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 128/601 (21%), Positives = 236/601 (39%), Gaps = 127/601 (21%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+++ ++ L+ V ++G + ++ + + GA+ R+S+ + E K ++E+D
Sbjct: 37 RLVIYTLTPEGLQPVLDTGIYGRIAAIELYTVAGAE----RESLYILTERLKFCIVEYDS 92
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY-GLQMIIL 204
S L +M + +S R GP+ +DP+ R G L+Y GL +I
Sbjct: 93 STGELITKAMGDVQ-----------DSVGRPVDGGPIAHIDPERRMIGFLLYDGLFKVIP 141
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
++ G F+ R+E V++++ F++GY +P +V+L
Sbjct: 142 IDTRNGQ----------LREAFNIRLEELQVLDVQ-----------FLYGYAQPTIVLL- 179
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGV 321
++ M + +++ I WS + A ++ VP+PIGG
Sbjct: 180 -----------YQDPKEMRHLKTYQVSIRDKDFIAGPWSQTGVEIGATMIIPVPTPIGGC 228
Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
+++G TI Y + + + +D + + R+ ++ D LL
Sbjct: 229 ILLGEQTISYLNGDKG-----DTKTIHMDMT--VIRAWGKIDEDGRR--------YLLGD 273
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
G L +L + +DG V L L + IT + + + F+GS GDS L++
Sbjct: 274 HLGQLYVLVLEFDGNKVLGLKLDTLGETSSAKTITYLDSGVVFIGSCFGDSQLIRL---- 329
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
K S+ + L+ N + + + +
Sbjct: 330 --------------------HPDKDENDSNIEVLESFTNLGPIQDFCVVDLERQGQGQVV 369
Query: 502 SFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH 561
+ + G LKD S LR+ + GI++Q+ VELPG KG+W++
Sbjct: 370 TCS--------GTLKDGS--LRVVRN--GIGINEQAA---VELPGIKGLWSLRE------ 408
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
+ D +Y YLI S T VLE AD TE + +TI N+ G
Sbjct: 409 --------SIDAQYDKYLIQSFVNETRVLEIADEELSETEIDGFDHNAQTIFCSNVLG-D 459
Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSE--SGSGSENSTVLSVSIADPYVLLGMSDGS 679
++Q+ E R++ + P N E + +G V+ S + L +S+G
Sbjct: 460 CLLQITEVSLRLVSTKSKQLLKEWFPPNGERITVAGGNVQQVVLTSGKRTLIYLDVSNGD 519
Query: 680 I 680
+
Sbjct: 520 V 520
>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1138
Score = 86.7 bits (213), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 138/304 (45%), Gaps = 26/304 (8%)
Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T +GTA V E+ + GR++++ A +T+V KE+KGA +L G +
Sbjct: 826 TYYIVGTAMVNPEEREPKVGRIIIYHYADGA------LTQVSEKEIKGACYSLVEFNGRV 879
Query: 1168 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
L + L++WT + L F + LY + +FIL+GD+ +SI L +K+
Sbjct: 880 LATINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936
Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
+A+D+ A E L D + L ++ N+ + + + Q++
Sbjct: 937 MEGSFEEIARDYQPKWMTAVEILDDDAFLG---AENSNNLFVCLKDSAATTDDERQQMPE 993
Query: 1285 RAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
A+FH+G V F ++ + +RT G +LFGT+ G+IG + +
Sbjct: 994 VAQFHLGDMVNVFRHGSLVMQNIGERTTPTSG-------CVLFGTVSGAIGLVTQIPPDY 1046
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+ L+ LQ+ L +++ V ++ +R FH+ K + +D +L+ + L E+
Sbjct: 1047 YEFLRKLQENLTNTIKSVGRIDHTYWRSFHTEMKTE--NSEGFIDGDLVESFLDLTREKM 1104
Query: 1404 LEIA 1407
E A
Sbjct: 1105 HEAA 1108
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 105/473 (22%), Positives = 175/473 (36%), Gaps = 129/473 (27%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEMHVQDVEFLYGTAHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYA- 262
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+D+ + LL G L ++ + + +L +
Sbjct: 263 ---------------RVDSRGFRY------LLGNMIGHLFMMFLETEENTRGQLTVKDIK 301
Query: 408 PSVL-----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
+L IT + N + F+GSR GDS LV+ + S + E F ++ AP
Sbjct: 302 VELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAAASGAYVTVMETFTNL---AP 358
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
L+ G+ ++ GS G L+ G+
Sbjct: 359 IIDMCIVD----LERQGQGQMITCSGSYKE--------------------GSLRIIRNGI 394
Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
I A ++LPG KG+W + R+ D Y L++S
Sbjct: 395 GIQEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLS 429
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
T +L + E TE + +T N+ FG ++IQV AR++
Sbjct: 430 FVGHTRILMLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPMTARLI 480
>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
Length = 655
Score = 86.3 bits (212), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 140/293 (47%), Gaps = 27/293 (9%)
Query: 1106 ENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
+++ +GTA + ++ ++ GR+++F + ++ P+ + VY KE+KGA ++ S+
Sbjct: 331 DSQPYFVVGTAVIMSDETESKMGRIMMF---QASEGPERMRL-VYEKEIKGAAYSIQSMD 386
Query: 1165 GHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
G L++A + L +WT + L F + LY+ + N + IL+GD+ +S+ LS
Sbjct: 387 GKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKTKN---DLILVGDLMRSLSLLS 443
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK--G 1279
+K + +A+DF + A E + + L + + +F +K G
Sbjct: 444 YKSMESTFEKVARDFMTNWMSACEIIDSDNFLG-----AENSYNLFTVMKDSFTVFKEEG 498
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
+L F++G V F + AT D AP ++L+GT DG IG I +
Sbjct: 499 TRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYHSSILYGTSDGGIGVIVQM 551
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+ + LQ +QK+L + + ++ +R F + ++ P+ +D +L+
Sbjct: 552 PPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSE--APNGFIDGDLI 602
>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
Length = 1138
Score = 86.3 bits (212), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/304 (25%), Positives = 141/304 (46%), Gaps = 26/304 (8%)
Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T +GTA V E+ + GR++++ AD NL T+V KE+KG+ +L G +
Sbjct: 826 TYYIVGTALVNPEEPEPKVGRIIIY---HYADG--NL-TQVSEKEIKGSCYSLVEFNGRV 879
Query: 1168 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
L + + L++WT + L F + LY + +FIL+GD+ +SI L +K+
Sbjct: 880 LASINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936
Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
+A+D+ A E L D + L +D N+ + + + Q++
Sbjct: 937 MEGSFEEIARDYQPNWMTAVEILDDDAFLG---ADNSNNLFVCLKDGAATTDDERQQMPE 993
Query: 1285 RAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
A+ H+G V F ++ + +RT G +LFGT+ G+IG + +
Sbjct: 994 VAQVHLGDMVNVFRHGSLVMENIGERTTPTSG-------CVLFGTVSGAIGLVTQIPADY 1046
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+ L+ LQ+ L D++ V ++ +R FH+ K R + +D +L+ + L E+
Sbjct: 1047 YEFLRKLQENLTDTIKSVGKIDHAYWRSFHTEMKTER--CEGFIDGDLVESFLDLSREKM 1104
Query: 1404 LEIA 1407
E A
Sbjct: 1105 HEAA 1108
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+ S L +N LLS K + LL + T
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
P IT + N + F+GSR GDS LV+ +G + + E F ++ AP
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L+ G+ ++ GS G L+ G+ I
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
A ++LPG KG+W + R+ D Y L++S T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
+L + E TE + +T N+ FG ++IQV AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480
>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
Length = 1068
Score = 85.9 bits (211), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 139/306 (45%), Gaps = 33/306 (10%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1139
+T P+ + E ++ L + + ++ +GTAYV E+ +GR+L+F D
Sbjct: 738 STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 790
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1194
L+ E KE KGA+ +L + G LL A KI L+KW GT EL + +
Sbjct: 791 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 847
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
L + + +FI++GD+ KSI L +K + + A+D+ + A E L D L
Sbjct: 848 IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 906
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1311
+ N IF K SE + +L E+H+G V +F ++
Sbjct: 907 -----AENNFNIFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 954
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
P SD ++FGT++G IG IA L + L+ LQ L + V GL+ +R
Sbjct: 955 -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRS 1013
Query: 1372 FHSNGK 1377
F++ K
Sbjct: 1014 FNNEKK 1019
Score = 57.4 bits (137), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 88/391 (22%), Positives = 153/391 (39%), Gaps = 84/391 (21%)
Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMV 261
+I +A S +G G F + + N+R L+ V D F++G +P +V
Sbjct: 99 VITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIV 157
Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
+L++ +H AL + P W+ NL + A L+ VP P+ GV
Sbjct: 158 VLYQ------DNKDARHVKTYEVALK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGV 208
Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
L++G TI Y S SA A+ + + ++ V+ D + LL
Sbjct: 209 LIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDADGSR--------YLLGD 251
Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
G L LL + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 252 HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKI---- 307
Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK-- 499
++ DA + S + L+ VN + + + +
Sbjct: 308 --------------HLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 347
Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSR 559
T S A +D G L+ G+ IN AS VEL G KG+W++
Sbjct: 348 TCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL------ 384
Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 385 --------RSSTDDPHDTFLVVSFISETRIL 407
>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
Length = 1089
Score = 85.9 bits (211), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 139/306 (45%), Gaps = 33/306 (10%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1139
+T P+ + E ++ L + + ++ +GTAYV E+ +GR+L+F D
Sbjct: 759 STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 811
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1194
L+ E KE KGA+ +L + G LL A KI L+KW GT EL + +
Sbjct: 812 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 868
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
L + + +FI++GD+ KSI L +K + + A+D+ + A E L D L
Sbjct: 869 IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 927
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1311
+ N IF K SE + +L E+H+G V +F ++
Sbjct: 928 -----AENNFNIFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 975
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
P SD ++FGT++G IG IA L + L+ LQ L + V GL+ +R
Sbjct: 976 -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANLRKVIKGVGGLSHEQWRS 1034
Query: 1372 FHSNGK 1377
F++ K
Sbjct: 1035 FNNEKK 1040
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P W+ NL + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL G L LL + ++ V L +
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ ++ DA
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421
Query: 584 EARTMVL 590
+ T +L
Sbjct: 422 ISETRIL 428
>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
Length = 1068
Score = 85.9 bits (211), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE KGA+ L +
Sbjct: 757 SDDNNVYYCVGTAYVLPEENEPTKGRILVFAV---EDGKLQLIAE---KETKGAVYCLNA 810
Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 811 FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 869
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + A+D+ + A E + D L +N + K SE
Sbjct: 870 SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 923
Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 924 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 976
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 977 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1019
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 85/367 (23%), Positives = 148/367 (40%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L L + P WS NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 176 LKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL TG L LL + ++ V L +
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLLSLLVITHEKEKVTGLKIEP 275
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ +++ DA
Sbjct: 276 LGETSIASTISYLDNAFVYIGSSYGDSQLIKL------------------NLQPDA---- 313
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 314 --KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400
Query: 584 EARTMVL 590
+ T +L
Sbjct: 401 ISETRIL 407
>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
Length = 1171
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+L+FS + LV E KE KGA+ L +
Sbjct: 860 SDDNNVYYCVGTAYVLPEENEPTKGRILVFSV---EEGKLQLVAE---KETKGAVYCLNA 913
Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 914 FNGKLLAAINQKIQLYKWVLREDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 972
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + A+D+ + A E L D L +N + K SE
Sbjct: 973 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDVYLG------AENSFNLFTVRKNSEGA 1026
Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ +L E+H+G + +F ++ P SD ++FGT++G IG
Sbjct: 1027 TDEERGRLEVAGEYHLGEFINRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 1079
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 1080 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1122
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/349 (22%), Positives = 145/349 (41%), Gaps = 60/349 (17%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + L+G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQSILDVPLYGRIATLELFRPHG----ETQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L SM + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 EKSELVTRSMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS +L + A L+ VP P+ GVL++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNSLDNGADLLIPVPPPLCGVLIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S + A+ + + ++ V+ D + LL TG L L
Sbjct: 237 IVYCSANGFKAIPIR---------AAITKAYGRVDPDGSRY--------LLGDHTGLLSL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL 328
>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
Length = 1149
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/401 (24%), Positives = 167/401 (41%), Gaps = 73/401 (18%)
Query: 1041 GHQIDNHNLSSVDLHRTYTV--------EEYEVRILEPDRAGGPWQTRATIPMQSSENAL 1092
G +++ HNL +D H T+ V +EY + I+ +R GG
Sbjct: 791 GQEVEVHNLLIIDQH-TFEVLHAHQLMQQEYAMSIISTNRLGGDM--------------- 834
Query: 1093 TVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSK 1151
NE + +GTA V E+ + GR+L+F N +T+V K
Sbjct: 835 --------------NEYYI-VGTATVNPEESEPKQGRILIFQWNDNK------LTQVSEK 873
Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
E+KGA +LA G LL + + L +WT + + + + L +FILLG
Sbjct: 874 EIKGACYSLAEFNGKLLASINSTVRLFEWTVEKELRLECSHFNNILTLFLKTKGDFILLG 933
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ A E L D L ++ NI +
Sbjct: 934 DLMRSMTLLQYKTMEGSFEEIARDYNPNWMTAVEILDDDIFLG---AENSFNIFVCQKDS 990
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLA-TSSDRTGAAPGSDKTNRFALLF 1326
+ + ++ FHVG + F L +Q L TS+ TG +LF
Sbjct: 991 AATTDEERSQMHEVGRFHVGDMINVFRHGSLVMQNLGETSTPTTG-----------CVLF 1039
Query: 1327 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1386
GT+ G+IG + + + + L LQ KL + V ++ +R F+++ K +
Sbjct: 1040 GTVSGAIGLVTQITQDFYDFLLELQNKLSTVIKSVGKIDHSQWRAFNTDIKTEP--SEGF 1097
Query: 1387 VDCEL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+D +L LSH +M + + L+I + G + + +L
Sbjct: 1098 IDGDLIESFLDLSHDKMKEVADGLQITGEGGMKQDCTVDDL 1138
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 104/469 (22%), Positives = 175/469 (37%), Gaps = 110/469 (23%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G L +DP+ R G+ +Y I+ + S L N+R
Sbjct: 119 GILAVIDPKARVIGLRLYDGLFKIIPLEKDNSELKAS--------------------NIR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG P ++++H+ V+ +H + IS K+ +
Sbjct: 159 -IDELQVHDVEFLHGCANPTLILIHQD-------VNGRH----VKTHEISLREKEFVKVP 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VPSP+GG +++G I YH +A + +S
Sbjct: 207 WRQDNVETEASMIIPVPSPLGGAIIIGQENILYHDGITPVVVA----------PAVIKQS 256
Query: 359 SFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVL 411
+ ++D +L D+A G L +L + D R VV+ L +
Sbjct: 257 TIVCYAKVDPGGLRYLLGDMA------GHLFMLFLEVDNRGDGNDVVKDLKVELLGEIAT 310
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
IT + N + F+GSRLGDS LV+ T S + E F ++ AP L
Sbjct: 311 PECITYLDNGVLFIGSRLGDSQLVKLTTKPNESGSYVTVMESFTNL---API---LDMCV 364
Query: 472 SDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
D L+ G+ ++ G+ G L+ G+ I AS
Sbjct: 365 VD-LERQGQGQLVTCSGAFKE--------------------GSLRIIRNGIGIQEHAS-- 401
Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
++LPG KG+W + A D Y L+++ +T VL
Sbjct: 402 ----------IDLPGIKGMWAL--------------QVASDGRYDNTLVLAFVGQTRVLS 437
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
E T+ + +T GN+ +++Q+ AR++ T
Sbjct: 438 LNGEEVEETDIAGFASDQQTFFCGNVI-HEQIVQITPISARLISAQNKT 485
>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1120
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/394 (23%), Positives = 175/394 (44%), Gaps = 31/394 (7%)
Query: 1038 QEVGHQIDNHNLS-SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ + HQ+D + + +V+ + +E +R+++ G + T ++ E A ++
Sbjct: 746 RRIAHQVDTNTFAVAVEHLMSKGDQELFIRLIDD----GSFDTLHQFRLEEHELASSLMS 801
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
+ F ++E ++ G AY Q ED +RGR+L+ +A LV+E KE++GA
Sbjct: 802 CS-FAGDSREY-YVVGTGFAYEQ-EDEPSRGRILVLRVEADA---LELVSE---KEVRGA 852
Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA----PPLYVVSLNIVKNFILLGD 1212
+ L + +G LL K+ L KWT E + + S+ ++IL+GD
Sbjct: 853 VYNLNAFKGKLLAGINSKLELFKWTPREDDAHELVSECSHHGQIITFSVKTRGDWILVGD 912
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ KS+ L +K + ++ +A+DF + A L D T + ++ +F A
Sbjct: 913 LLKSMSLLQYKPEEGAIDEIARDFNANWMTAVAMLDDDETY----LGAENSLNLFTVARN 968
Query: 1273 MSESWKGQ--KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1330
M+ + +L E+H+G V F ++ + D D LLFGT +
Sbjct: 969 MNAMTDEERSRLEITGEYHLGEFVNVFSPGSLVMSLKD-------GDSLEVPTLLFGTGN 1021
Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
G IG +A L + + + LQ + + V GL +R F + + VD +
Sbjct: 1022 GVIGVLASLPKDAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRHTLRRKSDPSRNFVDGD 1081
Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
L+ + L +E+ +A R++I+ + +L
Sbjct: 1082 LVESFLDLKVEQADVVAADMKCDRAEIIRRVEEL 1115
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 127/549 (23%), Positives = 224/549 (40%), Gaps = 113/549 (20%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
+ A L+ V ++G + ++++ G D R + L E +VL +D++ L+
Sbjct: 58 LHAEGLKPVLDVPINGRIATMSLCQTGSGDGKAR---LYLTTERYGFTVLSYDEANEELK 114
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLV 214
+ + GR + G + VD R G+ +Y GL +I +GG
Sbjct: 115 TEAFGDVQD------NIGRPA-DDGQIGIVDDTCRAIGLRLYDGLFKVIPCDEKGG---- 163
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
++ + I L +L V+D F+HG +P + +L+ R+ A V
Sbjct: 164 ---------------VKEAFNIRLEEL---RVEDIKFLHGTPKPTIAVLY-RDTKDA--V 202
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
K + I ++ P W+ +L + K++ VP+PIGGV+V+G I Y
Sbjct: 203 HIKTYEIGIREKEFVSS----P--WAQNDLEGGSNKIIPVPAPIGGVVVLGQEIIVY--- 253
Query: 335 SASCALALNNYAVSLD---SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
LN + D + +P + A LL G L LL +
Sbjct: 254 -------LNKFEDDADVFLKAINIPNIPDRTNITCYGAIDPDGSRYLLGDADGMLYLLVI 306
Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
++DG+ V+ L + + + + S ++ + N + F+GS GDS L++ L
Sbjct: 307 LHDGKRVRELKIERLGDTSIASTLSYLDNGVVFVGSTYGDSQLIK-------------LH 353
Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGE--ELSLYGSASNNTESAQKTFSFAVRDSL 509
E I+ D T L +V+ +L +G T S
Sbjct: 354 AEKTSIDKDGNPTYVQILEEFTNLGPIVDFAFVDLERHGQGQVVTCS------------- 400
Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMA 569
G LKD S LR+ + GI +Q+ +++LPG KG++++ ++D S+M
Sbjct: 401 ---GALKDGS--LRVVRN--GIGIDEQA---VIQLPGVKGLFSL-------RDSDDSQM- 442
Query: 570 AYDDEYHAYLIISLEARTMVL----ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
YL+++ T +L + D L E TE + + +T+ GN+ G +Q
Sbjct: 443 ------DKYLVVTFINETRILGFVGDEGDTLDE-TEIAGFDAEAQTLCCGNMQG-NVFLQ 494
Query: 626 VFERGARIL 634
V RG R++
Sbjct: 495 VTHRGVRLV 503
>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
Length = 1088
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/397 (24%), Positives = 170/397 (42%), Gaps = 38/397 (9%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ + L + EE E VR+L+ ++ +T P+ + E ++
Sbjct: 716 RRICHQEQTRTFAICCLRNQPSAEESEMHFVRLLDAQ----SFEFLSTYPLDAFEYGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + A+D+ + A E L D L +D N+
Sbjct: 881 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMAAVEILDDDIYLG---ADNCFNLFTVK 937
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
+ + + ++ E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G IG IA L + + L+ LQ L + V GL+ +R F N + S +
Sbjct: 990 TVSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKSYL 1047
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
D +L+ + L + EI+ ++ + +L
Sbjct: 1048 DGDLIESFLDLSRGKMEEISKGMDVQVEELCKRVEEL 1084
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 112/513 (21%), Positives = 203/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATLELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK+ + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKEKDFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
+G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHSGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDATG------SYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
Length = 1089
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE KGA+ L +
Sbjct: 778 SDDNNVYYCVGTAYVLPEENEPTKGRILVFAV---EDGKLQLIAE---KETKGAVYCLNA 831
Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 832 FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 890
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + A+D+ + A E + D L +N + K SE
Sbjct: 891 SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 944
Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 945 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 997
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 998 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1040
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 110/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETAELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL L + P WS NL + A L+ VP P+ GVL++G T
Sbjct: 183 ---DNKDARHVKTYEVALKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + ++ V+ D + LL TG L L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLLSL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NLQPDA------KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|440492924|gb|ELQ75450.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
(CPSF subunit) [Trachipleistophora hominis]
Length = 1254
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/231 (26%), Positives = 123/231 (53%), Gaps = 25/231 (10%)
Query: 1094 VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST----GRNADNPQNLVTEVY 1149
+++++L ++ N+ ++ + + E RGR+L+F AD ++
Sbjct: 946 LKIMSLCDSNGDHNDFVVVVLSQRTSNE--ILRGRILVFEVIDVISDTADRKTKKALKLL 1003
Query: 1150 -SKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNF 1207
S+ KG IS A+++G + ++ +++++++ T + IAFYD +Y VSL ++KN+
Sbjct: 1004 GSERTKGPISCCAAVRGRIAVSLATRLMVYEFDRNTGIVAIAFYDLY-MYAVSLAVIKNY 1062
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAK-----DFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
I++GDI ++F+ ++ + +L+LL+K + GSLD F G L + D+
Sbjct: 1063 IVVGDIMMGLHFVYFQSEPVKLHLLSKSGRVANLGSLDFFNA-----GDRLFITGIDKTG 1117
Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
+QIF ++P S +G+KL+ R +F AH Q + T++ R+ A+
Sbjct: 1118 EVQIFSFSPGNLYSNEGEKLVKRQQFETYAH------FQSIRTNTYRSYAS 1162
>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1067
Score = 85.1 bits (209), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 695 RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 750
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 751 ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 800
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 801 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 859
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 860 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 916
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 917 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 968
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 969 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1018
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 100/453 (22%), Positives = 178/453 (39%), Gaps = 108/453 (23%)
Query: 147 FDDSIHGLRITSMHCF----ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
D I+G RI ++ F E+ ++L + R F +++ DP+ +
Sbjct: 54 LDVPIYG-RIATLELFRPHGEAQDFLFIATERYKFC---VLQWDPES----------SEL 99
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I +A S +G G F + + N+R L+ V D F+ G +P + +
Sbjct: 100 ITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAV 158
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS +L + A L+ VP P+
Sbjct: 159 LYQ------DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLC 206
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S SA A+ + + ++ V++D + LL
Sbjct: 207 GVLIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLL 249
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 250 GDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-- 307
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
++ DA + S + L+ +N + + + +
Sbjct: 308 ----------------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQ 345
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 346 VVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KS 386
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 387 S------------IDEAFDTFLVVSFISETRIL 407
>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
Length = 1088
Score = 85.1 bits (209), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 152/350 (43%), Gaps = 36/350 (10%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 716 RRICHQEQTRTFGICSLGNQSNAEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T + +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 881 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLVTVK 937
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 990 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S +A A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ +N + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428
>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
DNA-binding protein 1a; Short=DDB1a
gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1088
Score = 85.1 bits (209), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 716 RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 881 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 937
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 990 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDP 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS +L + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S SA A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGM 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ +N + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428
>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
Length = 727
Score = 85.1 bits (209), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 153/350 (43%), Gaps = 36/350 (10%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ L EE E VR+L+ ++ +T P+ S E ++
Sbjct: 355 RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQ----TFEFMSTYPLDSFEYGCSI 410
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T++ +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 411 ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGRLQLIAE---KET 460
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 461 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 519
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 520 VVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 576
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 577 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 628
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 629 TVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 678
>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
Length = 596
Score = 84.7 bits (208), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)
Query: 1104 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+L+F + L+ E KE KG++ +L +
Sbjct: 285 SDDNNVYYCVGTAYVMPEENEPTKGRILVFVV---EEGKLQLIAE---KETKGSVYSLNA 338
Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KWT GT EL + L + + +FI++GD+ KSI
Sbjct: 339 FNGKLLAAINQKIQLYKWTLRDDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 397
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + A+D+ + A E L D L +N + K SE
Sbjct: 398 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG------AENYFNLFTVRKNSEGA 451
Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ +L E+H+G V +F ++ P SD ++FG+++G IG
Sbjct: 452 TDEERSRLEVVGEYHLGEFVNRFQHGSLVMR-------LPDSDVGQIPTVIFGSVNGVIG 504
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
IA L + L+ LQ L + V GL+ +R F N + + +D +L+
Sbjct: 505 VIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKNFLDGDLIES 562
Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ L + EI+ + ++ + +L
Sbjct: 563 FLDLNRSKMEEISRAMSVSAEELCKRVEEL 592
>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
Length = 1068
Score = 84.7 bits (208), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+++F+ D L+ E KE KGA+ L +
Sbjct: 757 SDDNNVYYCVGTAYVLPEENEPTKGRIIVFAV---EDGKLQLIAE---KETKGAVYCLNA 810
Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 811 FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 869
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + A+D+ + A E + D L +N + K SE
Sbjct: 870 SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 923
Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 924 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 976
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 977 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1019
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 148/367 (40%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS NL + A L+ VP P+ GVL++G TI Y S +A A+ +
Sbjct: 176 LK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL TG + LL ++++ V L +
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLVSLLVIIHEKEKVTGLKIEP 275
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ +++ DA
Sbjct: 276 LGETSIASTISYLDNAFVYVGSSYGDSQLIKL------------------NLQPDA---- 313
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 314 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400
Query: 584 EARTMVL 590
+ T +L
Sbjct: 401 ISETRIL 407
>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
Length = 1089
Score = 84.7 bits (208), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 127/283 (44%), Gaps = 29/283 (10%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+++F+ D L+ E KE KGA+ L +
Sbjct: 778 SDDNNVYYCVGTAYVLPEENEPTKGRIIVFAV---EDGKLQLIAE---KETKGAVYCLNA 831
Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW GT EL + L + + +FI++GD+ KSI
Sbjct: 832 FNGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 890
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + A+D+ + A E + D L +N + K SE
Sbjct: 891 SLLIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLG------AENSFNLFTVRKNSEGA 944
Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ +L E+H+G V +F ++ P SD ++FGT++G IG
Sbjct: 945 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-------LPDSDVGQIPTVIFGTINGVIG 997
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IA L + L+ LQ L + V GL+ +R F++ K
Sbjct: 998 VIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKK 1040
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 108/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ETGELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G +P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
+H AL + P WS NL + A L+ VP P+ GVL++G T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + ++ V+ D + LL TG + L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLVSL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L ++++ V L + + + S I+ + N+ ++GS GDS L++
Sbjct: 280 LVIIHEKEKVTGLKIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
Length = 1093
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 145/330 (43%), Gaps = 31/330 (9%)
Query: 1104 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+L+F + L+ E KE KG++ +L +
Sbjct: 782 SDDNNVYYCVGTAYVMPEENEPTKGRILVFVV---EEGKLQLIAE---KETKGSVYSLNA 835
Query: 1163 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KWT GT EL + L + + +FI++GD+ KSI
Sbjct: 836 FNGKLLAAINQKIQLYKWTLRDDGTRELQSECGHHGHIL-ALYVQTRGDFIVVGDLMKSI 894
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L +K + + A+D+ + A E L D L +N + K SE
Sbjct: 895 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG------AENYFNLFTVRKNSEGA 948
Query: 1278 KGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ +L E+H+G V +F ++ P SD ++FG+++G IG
Sbjct: 949 TDEERSRLEVVGEYHLGEFVNRFQHGSLVMR-------LPDSDVGQIPTVIFGSVNGVIG 1001
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
IA L + L+ LQ L + V GL+ +R F N + + +D +L+
Sbjct: 1002 VIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSF--NNEKRTAEAKNFLDGDLIES 1059
Query: 1395 YEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ L + EI+ + ++ + +L
Sbjct: 1060 FLDLNRSKMEEISRAMSVSAEELCKRVEEL 1089
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 107/504 (21%), Positives = 197/504 (39%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F++G P +V+L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSRPTIVVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + + P WS NL + A L+ VP P+ GV+++G T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + R+ V+ D + LL G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS LV+
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|385304556|gb|EIF48568.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
[Dekkera bruxellensis AWRI1499]
Length = 289
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/272 (22%), Positives = 126/272 (46%), Gaps = 14/272 (5%)
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP-----QNLVTEVYSKELKGAISAL 1160
+ + + +G+ + ED+A +G +++ +P +N + + S+ +G+I
Sbjct: 3 DTKNYVIVGSGKYRVEDLATKGSWMVYEIIDVVPDPNHPEAKNRLKLIKSESSRGSILGS 62
Query: 1161 ASLQGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
++ G + ++++ K G + +AF D LY + ++ +++GD +
Sbjct: 63 CNISGRFSLVQAQRMLVRTIKKDGNAV-PVAFXDTS-LYTKDVKSFEDMMIIGDAFDGLS 120
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
+ + ++ L K+ +L A +F++ L ++ +DE + + Y P ES K
Sbjct: 121 LYGFDAEPYRMLKLGKETQNLSLTACDFIVXEGGLYIIAADEDSVLHLLEYDPYDPESMK 180
Query: 1279 GQKLLSRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
G KLL+R+ F + T R + + D PG+D F ++ ++GS
Sbjct: 181 GXKLLTRSVFRFNGYTTAMRLCDRKNSIFSMLDTLAIPPGADLG--FEVIGCNIEGSFYK 238
Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
+ P +E T+RRL +LQ + D H GLNP+
Sbjct: 239 VTPANEYTYRRLYALQNHISDKESHWLGLNPK 270
>gi|154421858|ref|XP_001583942.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
gi|121918186|gb|EAY22956.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
Length = 1297
Score = 83.6 bits (205), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 49/368 (13%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV + G + + GG DSII+ + +K+ VL+ D+ L+ T H
Sbjct: 48 LRLVWEKKFWGEIFGVYRHKSGG-----EYDSIIVGCDTSKVIVLQVIDN--DLKETEYH 100
Query: 161 CFESPEWLHLKRGRES--------FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
F P + ++ DP G C +L+ + +L +
Sbjct: 101 EFNRPGPPEPDPPKPERPFDISTRLRNKTIMDADPTGTCLALLLAQNILYVLPLANK--- 157
Query: 213 LVGDEDTFGSGGGFSAR---IESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHE 265
+ E T +G + + I+ + ++ D K ++D +F+ GY P + I+HE
Sbjct: 158 -IKIESTEKAGDEYHSSWKVIKDAFAYDVH-TDFKSPLYRIRDMVFLDGYKNPTLAIIHE 215
Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLI---------WSAMNLPHDAYKLLAVPS 316
TW+ R+ + T +S +S K+ LI W++ LPH+++ L+ VP
Sbjct: 216 LIPTWSVRLPLQKSTVAVSIVSPPLKKKETVLISASIDKVTMWTSRALPHNSFGLVHVPD 275
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYA-----VSLDSSQELPRSSFSVELDAAHATW 371
PIGG LV+ N I Y + ALALN A V +D + P EL + T
Sbjct: 276 PIGGFLVLSKNAIIYMDHTNIVALALNKLAYLDDEVPVDITANGPGCH---ELYSKVGTA 332
Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVV-----QRLDLSKTNPSVLTSDITTIGNSLFFLG 426
+ LL+ L +LT+ Y+G V + +PS S T SL F+G
Sbjct: 333 IDKSHILLTVDQHYLSILTLHYNGVKVTNLSLNVNLNLEFHPSCFLSLNYTNNRSLVFMG 392
Query: 427 SRLGDSLL 434
S DS L
Sbjct: 393 STTHDSTL 400
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 98/463 (21%), Positives = 175/463 (37%), Gaps = 50/463 (10%)
Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWP-----VQKIPLKATPHQITYFAEKNLYPLIV 1020
NH I Q +++C L + N++ V++IP+ T +I Y N I
Sbjct: 810 NHFLIADEDQ--IRLCNLENIKPEHNFFIIDGCIVERIPVGMTVRRIAYCQNPNCVAFIA 867
Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-VRILEPDRAGGPWQT 1079
S P +P ID EV + H + Y E+YE + +R T
Sbjct: 868 SHP--EPFTTENEKKIDVEVYENLQVHYQEPPSPAKVYPDEDYETIPKWNEERYSLFLYT 925
Query: 1080 RATIP-MQSSENALTVRVVTLFNTTTKENE------TLLAIGTAYVQGEDVAARGRVLLF 1132
+ + M N V V +TT + T LA+G+ ++ + RG + ++
Sbjct: 926 KDGLQQMVDYANHEIVNTVQFVHTTPMPEDGITLLNTYLAVGSGFLSQPEKMMRGVLYIY 985
Query: 1133 ST-------GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
G N + L E +K K I + G++ I G + L ++
Sbjct: 986 QIRYMQNDEGFNEITLRPLYNET-NKIYKNPIIEITDNSGYMAIFCGNLLYLMRFFNENT 1044
Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
I + + S+ +KN++L D ++ W++ G +L +A+D + +
Sbjct: 1045 VKIEAFLVGRFFASSIVSLKNYLLYADSYEGFEVARWRKYGKKLISMARDTMTKLPLSAA 1104
Query: 1246 FLIDGSTLSLVVSDEQKNIQIF----YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1301
FL L VV D+ N IF Y P ++ ++ F++G +
Sbjct: 1105 FLQYEDCLGGVVFDDDGNAHIFDVDEYAIP-------ADAVVRKSIFYIGGRAISSGQFP 1157
Query: 1302 MLATSSDRTGAAPGSDKTNRFALL----------FGTLDGSIGCIAPLDELTFRRLQSLQ 1351
+ A + T P + L + T G IG P+DE +L +Q
Sbjct: 1158 IKAVTQ-ATQQNPNEEIDEELLQLQTKIGGHIAWYVTTHGKIGAFTPIDENDRHKLVGVQ 1216
Query: 1352 KKLVDSVPHVAGLNPRS--FRQFHSNGKAHRPGPDSIVDCELL 1392
S+ ++ L RS F+ ++ P +++DC++L
Sbjct: 1217 SAYEKSLCGLSHLEYRSGKFKNMIEQDIFNQ-SPKNVIDCDML 1258
>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
Length = 1136
Score = 83.6 bits (205), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 165/380 (43%), Gaps = 62/380 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E+G +I+ HNL +D H T E L P +E AL+
Sbjct: 777 EIGQEIEVHNLLIIDQH---TFEVLHAHTLMP-----------------TEYALS----- 811
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA++ ++ + GR+LLF S G+ +++V KE+K
Sbjct: 812 LISTRLGEDSTSYYVVGTAFINPDETEPKMGRILLFHWSDGK--------LSQVAEKEIK 863
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 864 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKSDFVLVG 920
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ + E L D + L ++ N+ I
Sbjct: 921 DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFICQKDS 977
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 978 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1027
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G+IG + + + L++L+ KL + V + +R F ++ K + + +
Sbjct: 1028 TVSGAIGLVTQIPFGFYEFLRNLEDKLTSVIKSVGKIEHNFWRSFKTDLKIEQ--CEGFI 1085
Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
D +L+ + L ++ E+A
Sbjct: 1086 DGDLIESFLDLSHDKMAEVA 1105
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEINLREKEFSKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLELEKKPDGTQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336
>gi|255316764|gb|ACU01763.1| putative DNA damage binding protein [Brachypodium distachyon]
Length = 384
Score = 83.6 bits (205), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 35/332 (10%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 73 SDDNNFYYCVGTAYVLPEENEPTKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 126
Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 127 FNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFT-QTRGDFIVVGDLMKSI 185
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQIFYYAPKM 1273
L +K + + + LA+D+ + A E + D G+ S + +KN +
Sbjct: 186 SLLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKN------SDAA 239
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGS 1332
++ +G +L E+H+G V +F ++ D G P ++FGT++G
Sbjct: 240 TDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIP--------TVIFGTINGV 290
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
IG IA L + L+ LQ L + V L+ +R FH+ K + +D +L+
Sbjct: 291 IGIIASLPHDQYVFLEKLQSILGKFIKGVGSLSHDQWRSFHNEKKT--AEARNFLDGDLI 348
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ L + E++ G + + + +L
Sbjct: 349 ESFLDLNRSKMEEVSKGMGVSVENLSKRVEEL 380
>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
Length = 1150
Score = 83.2 bits (204), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 123/565 (21%), Positives = 230/565 (40%), Gaps = 102/565 (18%)
Query: 840 FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
+ A L DG +L Y+ L + ++T + R +S+ AS L
Sbjct: 612 YFLAALGDGRLLTYR--LDKSAKDTDSEKKFLYDQRQMSIGTQPAS-----------LSI 658
Query: 900 YTREETPH-GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
+ + H A C R T+ + SG G C + RE RV C S AF
Sbjct: 659 FETQNALHVFAACDRPTVIHSSSG------GGKLLCSNVNLREVTRV----CSFSSEAFP 708
Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP--VQKIPLKATPHQITYFAEKNLY 1016
+C + + ++G L + T DN ++ IPL P +I + +++
Sbjct: 709 -----DC----LALVTEGSLLL------GTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVF 753
Query: 1017 PLIVS---VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1073
++ + V + + N+ LS ++ ID+ + +++ +Y +E++E
Sbjct: 754 AVLTTKQVVTISEDGNEALSETTEEGYVRLIDD---TMMEIVHSYKLEQFETPC------ 804
Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLF 1132
+ I + ++A K+N+ +GTAY E +RGR+L+F
Sbjct: 805 -------SVITVNFGDDA-----------AAKDNQDYFVVGTAYSYADEPEPSRGRMLVF 846
Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
+ + +T V + KGA+ ++ + G +L + + L +W+ TE +
Sbjct: 847 AV------REQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVRWSETESGARTLTE 900
Query: 1193 A----PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
++++ + + +FIL+GD+ +S+ L++K + +A+D E L
Sbjct: 901 ECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARDIDPSWITVIEML- 959
Query: 1249 DGSTLSLVVSDEQK-NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF----LRLQML 1303
L +S E N+ S + +L E+H+G V + L LQ+
Sbjct: 960 ---DLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIRHGRLVLQIP 1016
Query: 1304 AT-----SSDRTGAAPGSDKT------NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1352
+ S G D +++ GT +G++G IA +DE TF+ L SLQ
Sbjct: 1017 ESGISILKSLLYGMYICFDDNLKELFMHKYRFNLGTANGALGVIASIDEKTFQFLHSLQT 1076
Query: 1353 KLVDSVPHVAGLNPRSFRQFHSNGK 1377
L + + V G+ +R+F S +
Sbjct: 1077 ALNEVIKGVGGIQHEDWRRFTSERR 1101
Score = 40.8 bits (94), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L +LD V D F++G+ +P + +L + +H +L
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
PL NL A L+ VP+PIGGV+V+G T+ Y S S L Y S+ S +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
R+ ++ D LL + G L LL T + L +
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ L S I + N ++GS GDS L++
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335
>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1156
Score = 83.2 bits (204), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/339 (25%), Positives = 149/339 (43%), Gaps = 42/339 (12%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQN 1143
+ S+E AL++ L + + T +GTA V ED + GR+L+F + D+ +
Sbjct: 816 LNSNEYALSIISAKLGD----DPATYYILGTAVVNPEDQDPKLGRILIF----HWDDSSS 867
Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVS 1200
+T + KE+KGA +A G LL A + L +WT + L F + L+V +
Sbjct: 868 KLTPITEKEVKGACYGMAEFNGKLLAAVNCTVRLFEWTAEKELRLECSHFNNIVALFVKT 927
Query: 1201 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1260
+FI+ GD+ +S+ L +K +A+D+ A E + D L ++
Sbjct: 928 KG---DFIVCGDLMRSLTLLQYKTMEGSFEEIARDYNPKWSTAIEIIDDDVFLG---AEN 981
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
KN+ I + ++ +L +FH G + F R G+ T+
Sbjct: 982 DKNLFIIHKDSTLTSDEARHQLQEIGQFHCGDLINVF-----------RHGSLVMQHFTD 1030
Query: 1321 RFA-----LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
+ +L+GT G++G + L F L L+K L V V +N + +R +H+
Sbjct: 1031 TYVSVQGGILYGTCSGALGLVTQLTPKMFDFLSDLEKSLATVVKGVGKINHQFWRSYHTE 1090
Query: 1376 GKAHRPGPDSIVDCEL------LSHYEMLPLEEQLEIAH 1408
+ +S VD +L LS EM+ + + L+ A+
Sbjct: 1091 IRTE--PSESFVDGDLIESFLDLSKREMIAVVDALQGAY 1127
Score = 47.8 bits (112), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 58/267 (21%), Positives = 109/267 (40%), Gaps = 59/267 (22%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G + +DP R G+ +Y GL II D G + R+E + +
Sbjct: 122 GAMAVIDPSARVIGLKLYDGLFKII------------PLDKEGELKAYCLRMEE---VEV 166
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH-PL 297
+D+D F++G P ++I+H+ + GR I A +S K+
Sbjct: 167 QDID--------FLYGCANPTIIIIHQDTM---GR--------HIKAKELSIKDKEFVKT 207
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
W N+ +A ++ VP P+ G +++G ++ YH+ S+ A+ S + +
Sbjct: 208 PWKQENVETEASMIIPVPEPLCGAIIIGRESVLYHNGSSFIAI----------SPPVIKQ 257
Query: 358 SSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL---- 411
S+ +D +L D+A G L +L + Y+ + +L
Sbjct: 258 STIVCYARIDPEGTRYLLGDMA------GHLFMLLLNYEKNPDGTFKIKDPKVDLLGEIS 311
Query: 412 -TSDITTIGNSLFFLGSRLGDSLLVQF 437
+T + N + ++ SR+GDS L++
Sbjct: 312 IPESLTYLDNKIIYVASRVGDSQLIKL 338
>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
DNA-binding protein 1b; Short=DDB1b
gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
Length = 1088
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ + L + EE E VR+L+ ++ ++ P+ + E ++
Sbjct: 716 RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL + KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+GD+ KSI L +K + + A+D+ + A E L D L +D NI
Sbjct: 881 AVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVK 937
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+ + + ++ E+H+G V +F ++ P SD ++FGT
Sbjct: 938 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 990
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 991 VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + ++ + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA + S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
Length = 1088
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ + L + EE E VR+L+ ++ ++ P+ + E ++
Sbjct: 716 RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL + KI L+KW GT EL + L + + +FI
Sbjct: 822 KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+GD+ KSI L +K + + A+D+ + A E L D L +D NI
Sbjct: 881 AVGDLMKSISLLIYKHEEGAIEERARDYNANWMAAVEILNDDIYLG---TDNCFNIFTVK 937
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+ + + ++ E+H+G V +F ++ P SD ++FGT
Sbjct: 938 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 990
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 991 VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ + L+G + ++ + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
+ES E + G S G + +DP R G+ +Y GL +
Sbjct: 94 ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141
Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
I ++G F+ R+E V++++ F++G +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179
Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
L++ +H + +LK + WS NL + A L+ VPSP+
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVL++G TI Y S +A A+ + + ++ V+LD + LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328
Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
+++ DA + S + L+ VN + + + +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366
Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
T S A +D G L+ G+ IN AS VEL G KG+W++ KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
S D+ + +L++S + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428
>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
anatinus]
Length = 1045
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 145/333 (43%), Gaps = 42/333 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 728 KDANTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 781
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 782 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 836
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 837 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 893
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 894 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 943
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 944 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLIES 1001
Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1002 FLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1034
>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
Length = 1103
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 159/346 (45%), Gaps = 44/346 (12%)
Query: 1103 TTKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
T +NE + +GTA +G E+ ++GR+L+ D+ L E K++KGA+ L
Sbjct: 775 TDDDNEYFI-VGTAITEGDEEEPSKGRILVLQV---QDDKLVLKAE---KDVKGAVMVLH 827
Query: 1162 SLQGHLLIASGPKIILHKWTGTEL--NGIAFYDAP---PLYVVSLNIVKNFILLGDIHKS 1216
S G LL +++L KW ++ N + +Y++ ++ +FIL+GD+ KS
Sbjct: 828 SFNGKLLAGVSGRLMLFKWAESDDGDNKDLVQECSCSGGIYILDIDSHGDFILIGDMMKS 887
Query: 1217 IYFLSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
++ ++ Q L L++KD+ + +++ S V D+Q N+
Sbjct: 888 VHLFVYENPEEQHVSGNLRLISKDY-QYSWLSCSLMLNES--EYVAVDQQGNMITLKKND 944
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTK----FLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + + ++L+ +++ V + F+ ++ +SSD P LFG
Sbjct: 945 EAASEEERKQLVRVGKYYCSDRVNRIQPGFIGMRFANSSSD-INTQPVK------TALFG 997
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI- 1386
T+ G IG +A L TF + +QK + V +A ++ ++RQ+ S R DS+
Sbjct: 998 TISGGIGVLAQLPPETFAFVTKIQKAMSSVVTGLANISRETYRQYRS----ERTREDSVG 1053
Query: 1387 -VDCELLSHYEMLPLE------EQLEIAHQTGTTRSQILSNLNDLA 1425
+D + + + E E+L HQ T +++ N+ DL+
Sbjct: 1054 FIDGDFVESFLEFDFETQQRVIEELSNNHQEQITLEELVKNIEDLS 1099
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 77/358 (21%), Positives = 151/358 (42%), Gaps = 60/358 (16%)
Query: 82 KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAK 141
KN R+ +G+S+ V + G ++++++ G ++D + + ED
Sbjct: 35 KNQYLQVNRLSEEGVSS-----VVEFEAPGRIDTMSLFRPSG----EKQDLLFITIEDTF 85
Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQ 200
++ D I L S+ + P GR S G + +DP R + +Y GL
Sbjct: 86 FTLGFIDGKIETLSSGSI---DDP------VGRRS-ESGSITTIDPLCRAVALSIYEGLL 135
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
II ++ F F+ R+E +VI++ L+ K P
Sbjct: 136 KII--------PFENNKHQFKEA--FNVRLEELNVIDIAFLESLGSK------SKSGPTF 179
Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
+L++ + H ++ +++ L + +N+ H A L+ VP+P+GG
Sbjct: 180 ALLYQDHV-------GSRHVKTYEVKTLDKDMEESSL--NQLNVDHGANILIPVPAPLGG 230
Query: 321 VLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
V+ VG + Y ++S N++V+ ++ + S+ +LD + W D
Sbjct: 231 VICVGEAQVSYINESN------KNHSVASPANSRMAIRSYG-KLD--NTRWFLGD----- 276
Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
++G L LL++ V L L + + ++S I+ + N F+GS GDS +++ +
Sbjct: 277 -QSGQLYLLSLQVSDSEVTGLTLKELGVTSISSCISYLDNGYVFIGSNYGDSQVIRIS 333
>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1090
Score = 82.4 bits (202), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 149/332 (44%), Gaps = 35/332 (10%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE KGA+ +L +
Sbjct: 779 SDDNNFYYCVGTAYVLPEENEPTKGRILVFAV---EDGRLQLIVE---KETKGAVYSLNA 832
Query: 1163 LQGHLLIASGPKIILHKWT-----GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL A KI L+KW EL + L + + +FI++GD+ KSI
Sbjct: 833 FNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFT-QTRGDFIVVGDLMKSI 891
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLID----GSTLSLVVSDEQKNIQIFYYAPKM 1273
L +K + + + LA+D+ + A E + D G+ S + +KN +
Sbjct: 892 SLLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKN------SDAA 945
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGS 1332
++ +G +L E+H+G V +F ++ D G P ++FGT++G
Sbjct: 946 TDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIP--------TVIFGTINGV 996
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
IG IA L + L+ LQ L + V L+ +R FH+ K + +D +L+
Sbjct: 997 IGIIASLPHDQYVFLEKLQSILGKFIKGVGSLSHDQWRSFHNEKKTAE--ARNFLDGDLI 1054
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ L + E++ G + + + +L
Sbjct: 1055 ESFLDLNRSKMEEVSKGMGVSVENLSKRVEEL 1086
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G + P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCLRPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P WS NL + A L+ VP+P+GGV+++G TI Y + +++
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFK------ 247
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
++ Q + R+ V+ D + LL TG L LL + + V L +
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNTGILHLLVLTQERERVTGLKIEH 297
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N + ++GSR GDS LV+ +++ADA
Sbjct: 298 LGETSVASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADATG-- 337
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
S + L+ VN + + + + + T S A +D G ++ G+
Sbjct: 338 ----SFVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSIRVVRNGIG 388
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ KSS ++D Y +L++S
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDTFLVVSF 422
Query: 584 EARTMVL 590
+ T L
Sbjct: 423 ISETRFL 429
>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
Length = 1102
Score = 82.4 bits (202), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 153/349 (43%), Gaps = 34/349 (9%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ + L + EE E VR+L+ ++ ++ P+ + E ++
Sbjct: 730 RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQ----SFEFLSSYPLDAFECGCSI 785
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + T + +GTAYV E+ +GR+L+F + L+TE KE
Sbjct: 786 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KET 835
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL + KI L+KW GT EL + L + + +FI
Sbjct: 836 KGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 894
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+GD+ KSI L +K + + A+D+ + A E L D L +D NI
Sbjct: 895 AVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVK 951
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+ + + ++ E+H+G V +F ++ P SD ++FGT
Sbjct: 952 KNNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVM-------KLPDSDIGQIPTVIFGT 1004
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 1005 VSGMIGVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKR 1053
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 85/370 (22%), Positives = 150/370 (40%), Gaps = 90/370 (24%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P + +L++ +H
Sbjct: 158 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQ------DNKDARH------V 204
Query: 286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
+ +LK + WS NL + A L+ VPSP+ GVL++G TI Y S +A A+ +
Sbjct: 205 KTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 264
Query: 343 NNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
+ ++ V+LD + LL G + LL + ++ V L
Sbjct: 265 R---------PSITKAYGRVDLDGSR--------YLLGDHAGLIHLLVITHEKEKVTGLK 307
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
+ + + S I+ + N++ F+GS GDS L++ +++ DA
Sbjct: 308 IELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL------------------NLQPDA- 348
Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSY 520
+ S + L+ VN + + + + T S A +D G L+
Sbjct: 349 -----KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRN 398
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
G+ IN AS VEL G KG+W++ KSS D+ + +L+
Sbjct: 399 GIGINEQAS------------VELQGIKGMWSL--KSS------------IDEAFDTFLV 432
Query: 581 ISLEARTMVL 590
+S + T +L
Sbjct: 433 VSFISETRIL 442
>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
Length = 1140
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 127/286 (44%), Gaps = 34/286 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
+ L E + L +Q +L + V + +R FH+ K +
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ 1084
>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
boliviensis]
Length = 1140
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
Length = 1239
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 922 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 975
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 976 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 1030
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 1031 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1087
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1088 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1137
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1138 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1180
>gi|159155577|gb|AAI54419.1| Cpsf1 protein [Danio rerio]
Length = 400
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 51/240 (21%)
Query: 483 ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINADAS---- 529
E+ +YGS A + T+ A T+SF V DS++NIGP S G + N +
Sbjct: 54 EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVV 111
Query: 530 -ATGISKQSNYELV------------ELPGCKGIWTVYHKSSR---------GHNADSSR 567
+G K ++ ELPGC +WTV + + G + + +
Sbjct: 112 VCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEK 171
Query: 568 MAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
D + H +LI+S E TM+L+T + E+ S + QG T+ AGN+ + +I
Sbjct: 172 REPTIEDDKKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYII 230
Query: 625 QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
QV G R+L+G L F P + S ++ S+ADPYV++ ++G + + V
Sbjct: 231 QVSPMGIRLLEG---VNQLHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 280
>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
C-169]
Length = 1101
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 143/321 (44%), Gaps = 27/321 (8%)
Query: 1113 IGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA V E +GR+L+F +LV E KE+KGA L QG L+
Sbjct: 796 VGTAITVAEEPEPTKGRILVFGA---KGGKLSLVCE---KEVKGAAYNLHPFQGKLIAGI 849
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPL--YVVSLNIVK--NFILLGDIHKSIYFLSWKEQGA 1227
++ L KWT +E + +V++L IV +F+++GD+ +S+ L ++
Sbjct: 850 NSRVQLFKWTQSEDGSRELTNECSHVGHVLALYIVTRGDFVIVGDLMRSLQLLIYRADEG 909
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
L + A+D+ + A E L D + L ++ NI + +L + +
Sbjct: 910 ILEVRARDYKTHWMTAVEVLDDDTYLG---AENSNNIFTLRKNTDAAADEDRNRLETVGQ 966
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
+H+G V +F ++ P S+ +LF T++GSIG IA L + F+ L
Sbjct: 967 YHLGVFVNRFRHGSLVMK-------LPDSEAAKIPTVLFVTINGSIGVIASLPQQQFQFL 1019
Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLEI 1406
LQ L + V GL+ ++R F H P + VD +L+ + L + +
Sbjct: 1020 SRLQDCLRKVIKGVGGLSHVAWRTFQDE---HTKMPSQNFVDGDLIEQFLDLKRDSMERV 1076
Query: 1407 AHQT--GTTRSQILSNLNDLA 1425
A + G T +L + +L+
Sbjct: 1077 AREMGEGVTSEDLLRMVEELS 1097
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 125/551 (22%), Positives = 206/551 (37%), Gaps = 113/551 (20%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ V ++G V ++ + G +D + L+ E K VLE+D
Sbjct: 44 RIEIHTLTPEGLKGVADVAIYGRVATMELFRPVG----ESKDLLFLSTERYKFCVLEYDS 99
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
L + E + GR G + VDP G +MI L G
Sbjct: 100 ETGELVTRANGDIED------QVGRPC-DNGQIGIVDP----------GCRMIGLHLYDG 142
Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
++ +D F+ RI+ +VI D IF+ G +P + +L++
Sbjct: 143 LFKVIPIDDKGQLHEAFNMRIDELNVI-----------DMIFLEGCAKPTIAVLYQDN-- 189
Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
H + L + P W NL A +++AVP P+GG LVVG + I
Sbjct: 190 -----KDARHIKTYEVVLKEKDLTEGP--WRQSNLDAGASRVIAVPEPLGGALVVGESVI 242
Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVL 388
Y Q Q + + + AH ++ LL G+L L
Sbjct: 243 AYMGQ-----------------GQAMKCTPIKATIIRAHGRVDEDGSRYLLGDYVGNLYL 285
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + +DG V L + + S +T + N + F+GS GDS LV+
Sbjct: 286 LVLQHDGEHVAGLKVEPLGRTSAPSTLTYLDNGVVFVGSSGGDSQLVRL----------- 334
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDS 508
P T + + + L+ M N + + + + +
Sbjct: 335 ----------HPTPVTPQEPSNFVEVLETMTNLGPIIDFVVVDLERQGQGQVVMCS---- 380
Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
G + D S LRI + G+ +Q+ VELPG KG+W + +S M
Sbjct: 381 ----GIMADGS--LRIVRN--GIGMIEQAT---VELPGIKGMWALR----------ASHM 419
Query: 569 AAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
A+D +L+IS E R + + D L E E + +T+ GN ++QV
Sbjct: 420 DAFD----TFLVISFVGETRILAINADDELDE-AELPGFSADAQTLCCGNTVS-DHLVQV 473
Query: 627 FERGARILDGS 637
R++D S
Sbjct: 474 AGADVRLVDAS 484
>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
sapiens]
Length = 923
Score = 81.6 bits (200), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 606 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 659
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 660 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 714
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 715 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 771
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 772 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 821
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 822 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 864
>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
Length = 1108
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 125/292 (42%), Gaps = 22/292 (7%)
Query: 1109 TLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T +GTA+ E +GR+ + + + V KE +GA+ +LA QG L
Sbjct: 798 TYYVVGTAFAPPNEPEPTKGRIFVLAAAGGK------LCVVCEKETRGAVYSLAEFQGRL 851
Query: 1168 LIASGPKIILHKWTGTELNGIAFYD----APPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
L ++ ++KW G A A + + L + +++GD+ KSI L+W
Sbjct: 852 LAGINSRVQMYKWLEQGEGGRALVPECSHAGHVLALYLATRGDLVVVGDLMKSIQLLAWG 911
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
E+ L L A+DF A L D + + ++ N+ + + +L
Sbjct: 912 EEEGALELRARDFHPNWMSAVTVLDDDTYMG---AENSYNLFTVRRNADAATDEERSRLE 968
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
+ +H+G V +F ++ P S+ + +LFGT++G IG +A L
Sbjct: 969 TVGRYHLGEFVNRFQPGSLVMR-------LPDSELSQIPTVLFGTINGVIGVVASLPHAQ 1021
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
++ L+SLQ+ + V V G + +R F + P VD +L+ +
Sbjct: 1022 YQLLESLQEAMRKVVKGVGGFDHAQWRAFSNQHMPATPA-RQFVDGDLIEQF 1072
>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
JAM81]
Length = 1098
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 148/333 (44%), Gaps = 27/333 (8%)
Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ 1142
P + + + +T+R T ++ +GT + ED RGR+L+F N
Sbjct: 765 PFEIASSLITIRF-------TDDDTLYYTVGTGFAFPHEDEPVRGRILVFKV-----NDM 812
Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW-TGTELNGIAFYDAPPLYVVSL 1201
L+ V+ +++G+ + S+ G L+ +++ +W + T L + + + +SL
Sbjct: 813 RLLQLVHEYDIRGSAYSFVSVHGRLVAGVNSNVMVLRWNSDTSLLELQSMNHGHVLALSL 872
Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1261
+ +FIL+ D+ KSI L + L LA D S A E + D + L +D
Sbjct: 873 AVRGDFILVADLIKSITLLQFDLATDSLKELAYDADSNWMTAAELIDDDTFLG---ADSS 929
Query: 1262 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
NI + Q+L + FH G + +F + + ++D T A P +
Sbjct: 930 MNIFALSKQGDQVSEEERQRLRPKGWFHTGELINRFRKGSLTLHATDETLALPAIPE--- 986
Query: 1322 FALLFGTLDGSIGCIA--PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
+L+ T+ G+IG +A P DE T + L +LQ+ L V V GL +R++ + ++
Sbjct: 987 --ILYCTVHGAIGVVARIPSDE-TAKILSTLQEALKSVVQGVGGLIHSDWRRYRTERRSI 1043
Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1412
+ I+D +L+ + L Q + Q T
Sbjct: 1044 KSA--GIIDGDLIESFLELDRSMQDHVFTQVAT 1074
>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|300707023|ref|XP_002995737.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
gi|239604943|gb|EEQ82066.1| hypothetical protein NCER_101290 [Nosema ceranae BRL01]
Length = 1155
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/236 (25%), Positives = 116/236 (49%), Gaps = 12/236 (5%)
Query: 1071 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN--ETLLAIGTAYVQGEDVAARGR 1128
D ++ +T ++S E L ++ ++L N + N + I V+GED +RGR
Sbjct: 820 DLYTSAYKFISTFDLESDEYVLDIKELSL-NDSIGINGKNNFIVICVTKVEGEDKHSRGR 878
Query: 1129 VLLFSTGRNADNPQNL-----VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
+++F + N+ + + S+ +KG I+ ++G+L++A G K +++K +
Sbjct: 879 IIVFELIDIIVDKANVHKDKKLKVLASENIKGCITKCDEIKGNLIVALGIKTMIYKIDRS 938
Query: 1184 E-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
E L I +D L S+ +KNF+L DI++ + F ++ + +LNL+ +
Sbjct: 939 EGLIPIGIHDLYTL-TTSMITIKNFVLFSDIYRGLSFFYYQNKPVRLNLVCTSESIKNAV 997
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE--FHVGAHVTK 1296
+F++ L ++ +D NI + Y+P S G K + R E F++G V K
Sbjct: 998 HVDFIVKEPALGIICTDFAGNIHTYTYSPVNILSCNGTKFVKRCETNFNLGKLVIK 1053
>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
Length = 1193
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 876 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 929
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 930 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 984
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 985 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1041
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1042 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1091
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1092 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1134
>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
Length = 1088
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 151/350 (43%), Gaps = 36/350 (10%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ L EE E VR+L+ ++ +T P+ + E ++
Sbjct: 716 RRICHQEQTRTFGICSLGNQTNAEESEMHFVRLLDDQ----SFEFVSTYPLDAFEYGCSI 771
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
L + + +GTAYV E+ +GR+L+F D L+ E KE
Sbjct: 772 ----LSCSFADDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KET 821
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KG++ +L + G LL A KI L+KW GT EL + L + + +FI
Sbjct: 822 KGSVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL-ALYVQTRGDFI 880
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD+ KSI L +K + + A+D+ + A E L D L ++ N+
Sbjct: 881 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLLTVK 937
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFG 1327
+ + + +L E+H+G V +F ++ D G P ++FG
Sbjct: 938 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP--------TVIFG 989
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
T++G IG IA L + + L+ LQ L + V GL+ +R F++ +
Sbjct: 990 TVNGVIGVIASLPQEQYMFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKR 1039
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 109/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++ L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTPQGLQPMLDVPMYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + G + +DP R G+ +Y GL +I ++
Sbjct: 95 ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
G F+ R+E V++++ F+ G +P + +L++
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
+H + +LK + WS NL + A L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233
Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
TI Y S +A A+ + + ++ V++D + LL G
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276
Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
+ LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328
Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
++ DA + S + L+ VN + + + + T S
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
A +D G L+ G+ IN AS VEL G KG+W++ KSS
Sbjct: 373 AFKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408
Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
D+ + +L++S + T VL
Sbjct: 409 -------IDEAFDTFLVVSFISETRVL 428
>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
sapiens]
Length = 896
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/284 (26%), Positives = 126/284 (44%), Gaps = 34/284 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 579 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 632
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 633 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 687
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 688 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 744
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 745 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 794
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 795 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT 838
>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Hbx
gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Whx
gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Ddb2
gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdtc1
gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Iqwd1
gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr40a
gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr22
gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr21a
gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
Length = 1143
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 826 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 879
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 880 NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 934
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 935 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 991
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 992 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1041
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1042 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1084
>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)
Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 830 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASI 883
Query: 1172 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 884 NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 938
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 939 GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 995
Query: 1287 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
FH+G V F L LQ L SS T + +LFGT++G IG + L E
Sbjct: 996 VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1045
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1046 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1096
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG VV + L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479
>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
Length = 1158
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 841 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 894
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 895 NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 949
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 950 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1006
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1007 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1056
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1057 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1099
>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
Length = 1122
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1144
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 827 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 880
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 881 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 935
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 936 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 992
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 993 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1042
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1043 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1085
>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
cuniculus]
gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DDBa; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
Containing Dna-Duplex
gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
Containing Dna-Duplex
gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
Length = 1158
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 841 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 894
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 895 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 949
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 950 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1006
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1007 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1056
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1057 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1099
>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
Length = 826
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 509 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 562
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 563 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 617
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 618 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 674
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 675 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 724
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 725 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 767
>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 126/285 (44%), Gaps = 34/285 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE 1083
>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Nomascus leucogenys]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
Length = 1159
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 842 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 951 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 96/461 (20%), Positives = 172/461 (37%), Gaps = 116/461 (25%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP+ R G+ +Y ++ + L F+ R+E HVI+++
Sbjct: 143 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 187
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
F++G P + +++ GR H + P W N+
Sbjct: 188 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 231
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
+A ++AVPSP GG +++G +I YH+ A+A + + S V
Sbjct: 232 EAEASMVIAVPSPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 280
Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
+D + +L D+ G L +L + DG V ++ L + + + +T
Sbjct: 281 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 334
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ N + F+GSRLGDS LV+ S G+ +++ G I D R+
Sbjct: 335 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 393
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ T S A ++ G L+ G+ I+ AS
Sbjct: 394 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 420
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +R E L++S +T VL
Sbjct: 421 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 458
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + + +T GN+ +++IQ+ R++
Sbjct: 459 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 498
>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Callithrix jacchus]
gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DNA damage-binding protein a;
Short=DDBa; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=HBV X-associated protein 1;
Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
AltName: Full=UV-damaged DNA-binding protein 1;
Short=UV-DDB 1; AltName: Full=XPE-binding factor;
Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
E-complementing protein; Short=XPCe
gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
sapiens]
gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
Length = 1150
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 833 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 886
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 887 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 941
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 942 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 998
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 999 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1048
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1049 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1091
>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 126/285 (44%), Gaps = 34/285 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE 1083
>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1159
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 842 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 951 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100
>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
Length = 1159
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 842 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 951 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100
>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 151/365 (41%), Gaps = 60/365 (16%)
Query: 1041 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1100
G +++ HNL VD H +EV +P SE AL++ L
Sbjct: 783 GEEVEVHNLLVVDQHT------FEV-----------LHAHQFLP---SEYALSLVSCRL- 821
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISA 1159
K+ +GTA V E+ + GR+++F D V E KE+KGA+ +
Sbjct: 822 ---GKDPSVYFIVGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYS 872
Query: 1160 LASLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
+ G L + + L++WT TE N + + LY L +FIL+GD+
Sbjct: 873 MVEFNGKFLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLM 927
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+S+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 928 RSVLLLAYKSMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAAT 984
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1330
+ Q L FH+G V F L LQ L SS T + +LFGT++
Sbjct: 985 TDEERQHLQEVGLFHLGEFVNVFCHGSLVLQNLGESSTPTQGS----------VLFGTVN 1034
Query: 1331 GSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
G IG + L E + L LQ +L + V + +R FH+ K + +D +
Sbjct: 1035 GMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGD 1092
Query: 1391 LLSHY 1395
L+ +
Sbjct: 1093 LIESF 1097
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 74/341 (21%), Positives = 129/341 (37%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479
>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
(Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
Length = 1159
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 842 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 895
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 896 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 950
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 951 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 1007
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 1008 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1057
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1058 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1100
>gi|224587439|gb|ACN58665.1| DNA damage-binding protein 1 [Salmo salar]
Length = 444
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)
Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 135 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMMEFNGKLLASI 188
Query: 1172 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 189 NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 243
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 244 GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 300
Query: 1287 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
FH+G V F L LQ L SS T + +LFGT++G IG + L E
Sbjct: 301 VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 350
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 351 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 401
>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQRDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|385865228|gb|AFI92852.1| DNA damage-binding protein 1 [Danio rerio]
Length = 1140
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)
Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 831 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASI 884
Query: 1172 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 885 NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 939
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 940 GSFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 996
Query: 1287 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
FH+G V F L LQ L SS T + +LFGT++G IG + L E
Sbjct: 997 LFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1046
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1047 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1097
>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKLGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
Length = 1125
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 808 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 861
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 862 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 916
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 917 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 973
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 974 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1023
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1024 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1066
>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
Length = 1138
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 161/378 (42%), Gaps = 58/378 (15%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
++ +I+ HNL +D H T E +L P+ E AL+
Sbjct: 779 DISQEIEVHNLLIIDQH---TFEVLHAHMLMPN-----------------EYALS----- 813
Query: 1099 LFNTTTKENETLL-AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
L +T E+ T +GTA V ++ + GR+LL+ +T+V KE+KG+
Sbjct: 814 LISTKLGEDPTFYYVVGTALVNPDETEPKMGRILLYHWNDGK------LTQVAEKEIKGS 867
Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
+L G LL + + L +WT + L F + LY L +F+L+GD+
Sbjct: 868 CYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVGDL 924
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 925 MRSLTLLQYKTMEGSFEEIARDYNPNWMTAVEILDDDTFLG---AENCFNLFVCQKDSAA 981
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
+ + Q++ +FH+G V F L +Q L SS T +LFGT+
Sbjct: 982 TSEDERQQMQEIGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFGTV 1031
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
G+IG + + + L+ L+ +L + + V + R +R F++ K + +D
Sbjct: 1032 SGAIGLVTQIPFTFYEFLRHLEYRLTEVIKSVGKIEHRFWRSFNTELKVE--NCEGFIDG 1089
Query: 1390 ELLSHYEMLPLEEQLEIA 1407
+L+ + L ++ E+A
Sbjct: 1090 DLIESFLDLSPDKMAEVA 1107
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 92/422 (21%), Positives = 166/422 (39%), Gaps = 90/422 (21%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 211 RQDNVEREATMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 260
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DG-RVVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + DG +VV+ L + +
Sbjct: 261 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKNPDGTQVVKDLKVELLGEISIP 314
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
IT + N + F+GSRLGDS L++ +AD + + +
Sbjct: 315 ECITYLDNGVIFVGSRLGDSQLIKLIT------------------KADENGSYCVPMETF 356
Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG 532
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 357 TNLAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS--- 404
Query: 533 ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 ---------IDLPGIKGMWAL---RIGGGNFDNT------------LVLSFVGQTRILTL 440
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
E T+ + +T GN+ IQ+ AR++ T + P N +
Sbjct: 441 NGEEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRT 499
Query: 653 GS 654
S
Sbjct: 500 IS 501
>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 80.9 bits (198), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 36/293 (12%)
Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 830 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMMEFNGKLLASI 883
Query: 1172 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 884 NSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 938
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 939 GNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 995
Query: 1287 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
FH+G V F L LQ L SS T + +LFGT++G IG + L E
Sbjct: 996 VFHLGEFVNVFSHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEG 1045
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1046 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 1096
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 76/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG VV + L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + ++ R E L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479
>gi|19074861|ref|NP_586367.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
cuniculi GB-M1]
gi|19069586|emb|CAD25971.1| CLEAVAGE AND POLYADENYLATION SPECIFIC FACTOR [Encephalitozoon
cuniculi GB-M1]
Length = 1156
Score = 80.9 bits (198), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 72/323 (22%), Positives = 148/323 (45%), Gaps = 26/323 (8%)
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-----KGAISALASLQ 1164
L + T +++GED ARGR+ + + ++ + K L KG+I ++
Sbjct: 853 FLLVCTTFIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVR 912
Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
G + + G KI+++K + + I FYD ++ S+++VKN+IL DI++ + F ++
Sbjct: 913 GKIALCLGTKIMIYKIDRSSGIIPIGFYDLH-IFTSSISVVKNYILASDIYRGLSFFFFQ 971
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
+ +L+L++ + +TE L G+ LS++ D + I + Y+P S G +L+
Sbjct: 972 SKPIRLHLISSSEPLRNATSTELLSTGNELSMLCCDAKGTIHGYTYSPNNIISMDGARLV 1031
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT 1343
RAE + T+ R + K N +++F + + ++ +D+
Sbjct: 1032 KRAE---------------IKTNLGRLSSFGAGFKKN--SIMFYSRSNMLIHVSGIDDAH 1074
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
+ +L +Q ++ + V GLN R + +S+ H S + +L+ + + Q
Sbjct: 1075 YLKLLGVQTAIMAHLKSVFGLNQRDY--LNSDIHLHSLSLKSPIVLHILNLFSYFDMSTQ 1132
Query: 1404 LEIAHQTGTTRSQILSNLNDLAL 1426
++ R +I + L L
Sbjct: 1133 ESVSSSARIDRKEISDMIASLNL 1155
>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
Length = 1140
Score = 80.5 bits (197), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ +
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYPMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|429965418|gb|ELA47415.1| hypothetical protein VCUG_01066 [Vavraia culicis 'floridensis']
Length = 1176
Score = 80.5 bits (197), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 99/181 (54%), Gaps = 18/181 (9%)
Query: 1150 SKELKGAISALASLQGHLLIASGPKIILHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFI 1208
S+ KG IS A+++G + ++ K+++++ + + IAFYD +Y VSL ++KN+I
Sbjct: 927 SERTKGPISCCAAVRGKIAVSLATKLMVYECDRNSGIVAIAFYDLY-MYAVSLAVIKNYI 985
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAK-----DFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
++GDI ++F+ ++ + +L+LL+K + GSLD F G +L + D+
Sbjct: 986 IVGDIMMGLHFVYFQSEPVKLHLLSKSDRIANLGSLDFFNA-----GESLFITGIDKTGK 1040
Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+QIF ++P S G+KL+ R EF A Q + TSS R+ A+ S + A
Sbjct: 1041 VQIFSFSPSNLYSNGGEKLVKRQEFETYAC------FQSIKTSSYRSYASFFSSQNFLIA 1094
Query: 1324 L 1324
L
Sbjct: 1095 L 1095
>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
Length = 1140
Score = 80.5 bits (197), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 163/380 (42%), Gaps = 62/380 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E+G +++ HNL VD H T E L P T ++
Sbjct: 781 EIGQEVEIHNLLIVDQH---TFEVLHAHTLVP----------------------TEYAMS 815
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA + ++ + GR+LL+ + G+ +T+V KE+K
Sbjct: 816 LISTKLGEDPTPYYIVGTAMINPDESEPKSGRILLYHWNDGK--------LTQVAEKEIK 867
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 868 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 924
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ + E L D + L ++ N+ +
Sbjct: 925 DLMRSVTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 981
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 982 AATSEEERQQMQEVGQFHLGDMVNVFRHGSLVMQHLGESSTPTHGC----------VLFG 1031
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G+IG + + + L++L+ +L + V + +R F+++ K + + +
Sbjct: 1032 TVCGAIGLVTQIPSTFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTDLKIEQ--CEGFI 1089
Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
D +L+ + L E+ E+A
Sbjct: 1090 DGDLIESFLDLSHEKMAEVA 1109
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 90/420 (21%), Positives = 160/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D ++V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MDEQNVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + Y + + S
Sbjct: 211 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTT--------YVTVVPPIIKQSTIS 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L + D + V++ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDLA------GHLFMLFLEQDKKADGSMVIKDLKVELLGEVSIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS L++ + E F ++ AP A
Sbjct: 317 ITYLDNGVIFIGSRLGDSQLIKLNTKPDENGSYCSTMETFTNL---APIVDM-------A 366
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ D+ + T S A ++ G L+ G+ I AS
Sbjct: 367 VVDL------------ERQGQGQIVTCSGAFKE-----GSLRIIRNGIGIQEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + S N L++S +T +L
Sbjct: 405 -------IDLPGIKGMWALKVDSVNFDNT---------------LVLSFVGQTRILMLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E TE + +T GN+ +IQ+ AR++ + + P N + S
Sbjct: 443 EEVEETEIPGFVADEQTFHTGNV-TNDVIIQITPTSARLISNKSSSVISEWEPDNKRTIS 501
>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
Length = 451
Score = 80.5 bits (197), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 126/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 134 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 187
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 188 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 242
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 243 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 299
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 300 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 349
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 350 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 392
>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
Length = 1140
Score = 80.5 bits (197), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 42/333 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V ++ + GR+++F N L T V KE+KGA+ ++
Sbjct: 823 KDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKLQT-VAEKEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIES 1096
Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1097 FLDISRPKMQEVIANLQIDDGSGMKRETTVDDL 1129
>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
tropicalis]
Length = 1140
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 42/333 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V ++ + GR+++F N L T V KE+KGA+ ++
Sbjct: 823 KDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKLQT-VAEKEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSPPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIES 1096
Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1097 FLDISRPKMQEVIANLQIDDGSGMKRETTVDDL 1129
>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
Length = 1140
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 145/333 (43%), Gaps = 42/333 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V ++ + GR+++F +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPDEAEPKQGRIVVF---HYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LYV + +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALYVKTKG---DFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEFGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1039 LVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLIES 1096
Query: 1395 Y------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1097 FLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 1123
Score = 80.1 bits (196), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 142/302 (47%), Gaps = 31/302 (10%)
Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLF 1132
G ++ R + + S+E ++ + F + ++ L +GTA+V ED RGR+L+F
Sbjct: 840 GQTFEIRDSFQLPSTETIMSF-ISCSFANDSSDSTVYLVVGTAFVIPSEDEPKRGRILVF 898
Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW--TGTELNGIAF 1190
A +LVT +K++KG + +L + G LL K+ L KW TG + +
Sbjct: 899 DVAGGA---LHLVT---AKDVKGCVYSLNAFNGKLLAGINSKVNLFKWNLTGDGIRELVS 952
Query: 1191 YDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKD-----FGSLDCFA 1243
+ ++++L + +FI++GD+ +SI L +K + + +A+D ++D
Sbjct: 953 ECSHHGHILTLYLKSRGDFIIVGDLMRSISLLMYKSGTSSIEEIAQDTCPNWVTAVDMLD 1012
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 1303
+ I G + S + ++N++ S + ++L EFHVG + +F ++
Sbjct: 1013 DDVFIGGES-SFNIFTCRRNLE-------ASTDEERKRLEVVGEFHVGEFINQFRAGSLV 1064
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1363
D ++ + + LFGT +G IG IA L + LQ +Q + + V G
Sbjct: 1065 MKLPDE------QEQPIQPSTLFGTGNGVIGVIARLTRSQYEFLQLVQAAMAKVIKGVGG 1118
Query: 1364 LN 1365
LN
Sbjct: 1119 LN 1120
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 75/318 (23%), Positives = 127/318 (39%), Gaps = 69/318 (21%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
N+R L+ V D F+ GY P +++L++ H L + P
Sbjct: 208 NIR-LEELQVFDIKFLRGYDRPTILVLYQD-------TKETRHVKTYQVLLKEKEFAEGP 259
Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
W+ N+ A L+ V P+GGVL+VG TI YHS SA ++A+ +
Sbjct: 260 --WAQNNVEGGASLLIPVLMPLGGVLIVGEQTITYHSGSAFRSVAMRPAII--------- 308
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
+SV + + LL+ G+L+ + + +D + V + + + + + S +
Sbjct: 309 -KCYSV---------IDTNRFLLADSEGNLLSVLLTHDRQDKVTAIKIDRLGVTSILSCL 358
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
T + N + F GS+ GDS L++ ++E G S L A+
Sbjct: 359 TYLDNGVVFGGSQFGDSQLLRLATE----------RDETGSFVRVLESFSNLGPICDMAV 408
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
D+ + T S A +D G L+ G+ GI +
Sbjct: 409 VDL------------ERQGQCQVVTCSGAFKD-----GSLRVVRNGV---------GIEE 442
Query: 536 QSNYELVELPGCKGIWTV 553
Q+ +ELPG KGIW++
Sbjct: 443 QAT---IELPGIKGIWSL 457
>gi|449328561|gb|AGE94838.1| cleavage and polyadenylation specific factor [Encephalitozoon
cuniculi]
Length = 1156
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/325 (22%), Positives = 149/325 (45%), Gaps = 30/325 (9%)
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL-----KGAISALASLQ 1164
L + T +++GED ARGR+ + + ++ + K L KG+I ++
Sbjct: 853 FLLVCTTFIEGEDRPARGRLHVLEIISVVPSLESPFKDCKLKVLGIEKTKGSIVRCEEVR 912
Query: 1165 GHLLIASGPKIILHKWTGTELNGI---AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
G + + G KI+++K + NGI FYD ++ S+++VKN+IL DI++ + F
Sbjct: 913 GKIALCLGTKIMIYKIDRS--NGIIPIGFYDLH-IFTSSISVVKNYILASDIYRGLSFFF 969
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
++ + +L+L++ + +TE L G+ LS++ D + I + Y+P S G +
Sbjct: 970 FQSKPIRLHLISSSEPLRNATSTELLSTGNELSMLCCDAKGTIHGYTYSPNNIISMDGAR 1029
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1341
L+ RAE + T+ R + K N +++F + + ++ +D+
Sbjct: 1030 LVKRAE---------------IKTNLGRLSSFGAGFKKN--SIMFYSRSNMLIHVSGIDD 1072
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
+ +L +Q ++ + V GLN R + +S+ H + + +L+ + +
Sbjct: 1073 AHYLKLLGVQTAIMAHLKSVFGLNQRDY--LNSDIHLHSLSLKNPIVLHILNLFSYFDMS 1130
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLAL 1426
Q ++ R +I + L L
Sbjct: 1131 TQESVSSSARIDRKEISDMIASLNL 1155
>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
Length = 599
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+ +F + +D V E KE+KGA+ ++
Sbjct: 282 KDPNTYFIVGTAMVYPEEAEPKQGRIAVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 335
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 336 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 390
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 391 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 447
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 448 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGEASTPTQGS----------VLFGTVNGMIG 497
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 498 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 540
>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
Length = 1146
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 128/291 (43%), Gaps = 40/291 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS------FRQFHSNGKAH 1379
+ L E + L +Q +L + ++P S +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKRCFQISPNSLTDMSTWRSFHTERKTE 1089
>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
mellifera]
Length = 1141
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 92/380 (24%), Positives = 166/380 (43%), Gaps = 62/380 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
++ +I+ HNL +D H T E +L P +E AL+
Sbjct: 782 DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA V ++ + GR+LL+ S G+ +T+V KE+K
Sbjct: 817 LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY+ S +FIL+G
Sbjct: 869 GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKG---DFILVG 925
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 926 DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 983 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G+IG + + + + L++L+ +L + V + +R F++ K + + +
Sbjct: 1033 TVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090
Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
D +L+ + L ++ E+A
Sbjct: 1091 DGDLIESFLDLSPDKMAEVA 1110
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKADGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501
>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
Length = 1134
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/336 (23%), Positives = 152/336 (45%), Gaps = 35/336 (10%)
Query: 1080 RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED---VAARGRVLLFSTGR 1136
+ + +Q E +++ T N + E +GTA+V ED R+L + G+
Sbjct: 798 QCALQLQDCEWGMSLISCTFEN----DPEAYYCVGTAFVNLEDKEPTKGNIRILKYFEGK 853
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE--LNGIAFYDAP 1194
+ +V+SKE+ GA+ + + G LL + + +++WT + + +F++
Sbjct: 854 --------IQQVHSKEVSGAVYCMVAFNGRLLASVNSTVSVYEWTSNKELVEETSFHNN- 904
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
+ + L +FIL+GD+ +SI +++ ++ L+ K+ A E + D S L
Sbjct: 905 -VLALYLKTKGDFILIGDLMRSISLCAYRPMNNEIELICKNNDPNWMTAVEIIDDDSYLG 963
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQK-LLSRAEFHVGAHVTKFLRLQMLATSS-DRTGA 1312
+ + +F S S + QK L + +HVG V F + ++ ++ D +
Sbjct: 964 -----GENSHNLFTCQKNSSSSEEEQKHLPTVGVYHVGEFVNVFRQGSLVMQNTVDIPDS 1018
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
GS +LFGT+ G++G + L F + ++ KL V V + + +R F
Sbjct: 1019 VQGS-------ILFGTVSGAVGVVVTLAPAMFEFVSAIANKLSTVVKGVGKIEHQFWRSF 1071
Query: 1373 HSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1408
SN + P S VD +L+ + L E+ +A+
Sbjct: 1072 -SNDRKTEPC-QSFVDGDLVESFLDLSPEDMQRVAN 1105
Score = 47.4 bits (111), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 49/201 (24%), Positives = 91/201 (45%), Gaps = 31/201 (15%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT-LKQHPLIW 299
L+ V D F++G+ EP + +++E S ++ +S+ + + P W
Sbjct: 158 LEELQVLDVKFLYGFTEPTIALIYE---------SGQNRYLKTYEISLQNADIHRQP--W 206
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
+ + +A+ +L VP P G++V+GA +I Y+ S L+ SL R +
Sbjct: 207 NIGKVEEEAFMILPVPPPSCGMVVIGAGSISYYKGQDS----LHITPASLKD-----RIT 257
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----GRVVQRLDLSKTNPSVLTSDI 415
+D+ +L D + G L +L +V + G V+ L L + + S I
Sbjct: 258 CFGRVDSNGCRYLLGDYS------GRLFMLILVQEHSQSGIKVKDLCLEYLGETSIPSCI 311
Query: 416 TTIGNSLFFLGSRLGDSLLVQ 436
T + N+ ++GS GDS L++
Sbjct: 312 TYLDNAFAYIGSSCGDSQLIK 332
>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
Length = 1009
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 692 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 745
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 746 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 800
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N + + +
Sbjct: 801 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNSFVCQKDSAATTDEE 857
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 858 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 907
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 908 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 950
>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
Length = 1140
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/283 (26%), Positives = 125/283 (44%), Gaps = 34/283 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KG + ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGDVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKDVRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T + +LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+ L E + L +Q +L + V + +R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
Length = 1138
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 164/380 (43%), Gaps = 62/380 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E+G +I+ HNL +D H T E L P +E AL+
Sbjct: 779 EIGQEIEVHNLLIIDQH---TFEVLHAHTLMP-----------------TEYALS----- 813
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA + ++ + GR+LL+ S G+ +T+V KE+K
Sbjct: 814 LISTRLGEDPTSYFVVGTALINPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 865
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 866 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 922
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ + E L D + L ++ N+ +
Sbjct: 923 DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 979
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 980 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTLG----------CVLFG 1029
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G+IG + + + L++L+ +L + V + +R F++ K + + +
Sbjct: 1030 TVSGAIGLVTQIPFAFYEFLRNLEDRLNSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1087
Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
D +L+ + L ++ E+A
Sbjct: 1088 DGDLIESFLDLNHDKMAEVA 1107
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/205 (24%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYIAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + F+GSRLGDS L++
Sbjct: 312 ECITYLDNGVIFVGSRLGDSQLIKL 336
>gi|224135035|ref|XP_002321967.1| predicted protein [Populus trichocarpa]
gi|222868963|gb|EEF06094.1| predicted protein [Populus trichocarpa]
Length = 60
Score = 78.6 bits (192), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/48 (75%), Positives = 41/48 (85%)
Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
+ DPSTC VSV TP+A +SSKK VS+CTLYHDKGPEP LRKTS +AWL
Sbjct: 1 MTDPSTCMVSVNTPSAFQSSKKSVSACTLYHDKGPEPLLRKTSPNAWL 48
>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
Length = 1135
Score = 78.2 bits (191), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 143/308 (46%), Gaps = 26/308 (8%)
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
K++E + +GTA E GR+ +FS + + +T V +K++ GA+ ++ +L
Sbjct: 821 KKDEQFIVVGTAITADEQECKNGRICVFSYSK-----EEKLTLVSTKQVNGAVYSVKALN 875
Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFY--DAP---PLYVVSLNIVKN-FILLGDIHKSIY 1218
G+ +I + I + E+N +AP + V++++ KN FIL D+ +SI
Sbjct: 876 GNKIICA----INQQLKVFEMNEQTTLQSEAPIANHITCVAVDVSKNGFILSADLMRSIS 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
S+K L +A+D+ A + + D + + ++ +NI I + +
Sbjct: 932 VFSYKPLEGALEEIARDYHPNWMTAIKMIDDDNYIG---AENSENIFICTRNTEAPDEED 988
Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
Q+LL +HVG H+ + ++ + P T F L G++ G +G +A
Sbjct: 989 RQQLLPTGYYHVGEHINTIVEGNLVMDVHVESSITP----TRTF--LMGSVSGYVGLLAI 1042
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
E ++ L L+ K+ + V ++ S+R+F S+ + VD +L+ ++ L
Sbjct: 1043 FPEKQWQFLSKLEAKMRKVIRGVGKIDHESWRRFESDSRME--DCKGFVDGDLIEMFQDL 1100
Query: 1399 PLEEQLEI 1406
E+Q E+
Sbjct: 1101 RPEKQKEV 1108
Score = 49.3 bits (116), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 130/626 (20%), Positives = 228/626 (36%), Gaps = 173/626 (27%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ ++ + L+ V + L+G + + + + ++D + + E +LE+ D
Sbjct: 43 RIEVNLSTQTGLKPVTEFNLYGRIAVIEVFRY----KNEKKDCLFILTESCYACILEYVD 98
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
G IT + ++ S ++ G VDP+ RC + +Y + I+ +
Sbjct: 99 ---GKIITRA-------YGDMRDKNYSVSQSGMHACVDPEARCIALRLYDGVLKIINLNS 148
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
L E RIE V+ D F+H +P + +L++
Sbjct: 149 SSKHLTSAEQ----------RIEEILVV-----------DMCFLHTANKPTLALLYDDN- 186
Query: 269 TWAGRVSWKH-HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
S +H T I+ + + H + + D ++AVP P+ G+L++G
Sbjct: 187 ------SSRHLSTIAITLDNSGSGASIHKGPFRHTQVEQDTILIVAVPEPLAGILLLGHV 240
Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
I YH ++ N V+ T + L G+L
Sbjct: 241 NITYHDSKNRSTCSIENI----------------VKRTIECVTPIDKHRYLCGDSNGELF 284
Query: 388 LLTVVYDGRVV----QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
LL + Y+ + RL + L + ++ I N + F+GS GDS L++
Sbjct: 285 LLLLDYNENRIPEERMRLATKYLGRTTLPNTLSYIDNYVVFVGSTFGDSELIR------- 337
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
IE + NN S Q S
Sbjct: 338 -------------IEV------------------------------SDNN--SGQHFTSL 352
Query: 504 AVRDSLVNIGPLKDFSY--------GLRINADASATGISKQ--------SNYELVELPGC 547
D+L GP+KD G + A TG S + Y ++L G
Sbjct: 353 HQYDNL---GPIKDMCIVDFEKQGQGQLVTASGVGTGGSLRIIRNGVGIHEYASIDLEGV 409
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMV--LETADLLTEVTESVDY 605
KG+W + + SS S++ + L++S +T+ LE D +TEV E +
Sbjct: 410 KGLWALKYLSS------STKQDS--------LLLSFVGQTIFLRLEGQD-VTEV-EEIPG 453
Query: 606 FVQG-RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS----ENST 660
F G +T+ AGN+ ++ +Q+ E+ R++ ES GS EN+
Sbjct: 454 FTNGEQTMYAGNV-TDQQFLQITEKQVRLI--------------ADESLKGSWEPEENTQ 498
Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGD 686
+ S+ VLLG+ +I L + D
Sbjct: 499 INLCSVNKNQVLLGVGSTAIYLEIND 524
>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 78.2 bits (191), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 125/285 (43%), Gaps = 38/285 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
K+ T +GTA V E+ + GR+++F S G+ + V KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSGGK--------LQTVAEKEVKGAVYSMV 874
Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 875 EFNGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 930 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986
Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ Q L FH+G V F L +Q L +S T ++L GT++G
Sbjct: 987 EERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLLGTVNGM 1036
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IG + L E + L +Q +L + V + +R FH+ K
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERK 1081
>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
Length = 1138
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 164/380 (43%), Gaps = 62/380 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E+G +I+ HNL +D H + + + ++E AL+
Sbjct: 779 EIGQEIEVHNLLIIDQHTFEVLHAH--------------------TLMATEYALS----- 813
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA++ ++ + GR+LL+ S G+ T+V KE+K
Sbjct: 814 LISTKLGEDPTSYFVVGTAFINPDETEPKMGRILLYHWSEGK--------FTQVAEKEIK 865
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 866 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 922
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ + E L D + L ++ N+ +
Sbjct: 923 DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFLG---AENCFNLFVCQKDS 979
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 980 AATSEDERQQMQEIGQFHLGDMVNVFRHGSLVMQNLGESSTPTLG----------CVLFG 1029
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G+IG + + + L++++ +L + V + +R F++ K + + +
Sbjct: 1030 TVSGAIGLVTQIPVTFYEFLRNMEDRLNSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1087
Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
D +L+ + L ++ E+A
Sbjct: 1088 DGDLIESFLDLNHDKMAEVA 1107
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFAKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH + A+ + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257
Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
+ ++D +L D+A G L +L + + + VV+ L + +
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGSQVVKDLKVELLGEISIP 311
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336
>gi|326919947|ref|XP_003206238.1| PREDICTED: DNA damage-binding protein 1-like [Meleagris gallopavo]
Length = 1079
Score = 77.4 bits (189), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 762 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 813
Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 814 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 868
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 869 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 925
Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ Q L H+G V F L +Q L +S T + +LFGT++G
Sbjct: 926 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 975
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
IG + L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 976 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKT-EPAT-GFIDGDLI 1033
Query: 1393 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1034 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1068
>gi|224050582|ref|XP_002191856.1| PREDICTED: DNA damage-binding protein 1 [Taeniopygia guttata]
Length = 1140
Score = 77.4 bits (189), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 874
Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 875 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 930 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986
Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ Q L H+G V F L +Q L +S T + +LFGT++G
Sbjct: 987 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 1036
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
IG + L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLI 1094
Query: 1393 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1095 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
Length = 1140
Score = 77.4 bits (189), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 123/296 (41%), Gaps = 26/296 (8%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ +GTA V E+ + GR+++F D V E KE+KGA+ ++
Sbjct: 823 KDPSVYFVVGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
G LL + + L++WT + + + L +FIL+GD+ +S+ L++K
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECSHYNNIMALYLKTKGDFILVGDLMRSVLLLAYK 936
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
+A+DF A E L D + L ++ N+ + + Q L
Sbjct: 937 PMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEDRQHLQ 993
Query: 1284 SRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
FH+G V F L LQ L +S T + +LFGT+ G IG + L
Sbjct: 994 EVGVFHLGEFVNVFCHGSLVLQNLGETSTPTQGS----------VLFGTVTGMIGLVTSL 1043
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
E L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1044 SEGWHSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQ--AKGFIDGDLIESF 1097
Score = 47.4 bits (111), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 73/341 (21%), Positives = 127/341 (37%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGD LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDPQLVKLNVDSNDQGSFVTVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 479
>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
sapiens]
Length = 1146
Score = 77.0 bits (188), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 76/289 (26%), Positives = 128/289 (44%), Gaps = 40/289 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGL------NPRSFRQFHSNGK 1377
+ L E + L +Q +L + + +P ++R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKRCFLISTCSLTHPSTWRSFHTERK 1087
>gi|45383688|ref|NP_989547.1| DNA damage-binding protein 1 [Gallus gallus]
gi|82098863|sp|Q805F9.1|DDB1_CHICK RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|28375613|dbj|BAC56999.1| damaged-DNA binding protein DDB p127 subunit [Gallus gallus]
gi|53130071|emb|CAG31438.1| hypothetical protein RCJMB04_6h2 [Gallus gallus]
Length = 1140
Score = 77.0 bits (188), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 144/335 (42%), Gaps = 46/335 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 874
Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 875 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 929
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 930 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 986
Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ Q L H+G V F L +Q L +S T + +LFGT++G
Sbjct: 987 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQGS----------VLFGTVNGM 1036
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
IG + L E + L +Q +L + V + +R FH+ K P +D +L+
Sbjct: 1037 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLI 1094
Query: 1393 SHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+ +M + L+I +G R + +L
Sbjct: 1095 ESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
Length = 1152
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 74/305 (24%), Positives = 129/305 (42%), Gaps = 26/305 (8%)
Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T IGTA V E+ + GR+++F + +V KE+KGA+ +L L
Sbjct: 834 TYFIIGTAMVYPEESEPKSGRIIVFQYTDGK------LQQVAEKEVKGAVYSLVQFNNKL 887
Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
L + + L +WT + + + + L +FIL+GD+ +S+ L++K
Sbjct: 888 LASINSTVRLFEWTAEKELRVECNHYNNILALYLKTKGDFILVGDLMRSVTLLAYKPMEG 947
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW---KGQKLLS 1284
+A+DF A E L D + L +N F+ K S + + Q L
Sbjct: 948 CFEEIARDFNPNWMSAVEILDDDNFLG------AENSFNFFTCQKDSAATTDEERQHLQE 1001
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-NRFALLFGTLDGSIGCIAPLDELT 1343
FH+G V F ++ PG T + ++LFGT++G++G + L
Sbjct: 1002 VGHFHLGEFVNVFRHGSLVMQH-------PGETSTPTQGSVLFGTVNGAVGLVTQLPADF 1054
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
F LQ +Q KL + V + +R F++ K +D +L+ + L ++
Sbjct: 1055 FNFLQEVQSKLTRVIKSVGKIEHSFWRSFNTERKTE--ACQGFIDGDLIESFLDLSRDKM 1112
Query: 1404 LEIAH 1408
E+
Sbjct: 1113 QEVVQ 1117
Score = 48.5 bits (114), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 53/228 (23%), Positives = 98/228 (42%), Gaps = 46/228 (20%)
Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
F+ R+E +VI+++ F++G P +V +++ H +
Sbjct: 154 AFNIRLEELNVIDVK-----------FLYGCQVPTVVFVYQ-----------DPHGRHVK 191
Query: 285 ALSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
IS K+ W N+ +A ++AVP P G L++G +I YH+ A+A
Sbjct: 192 TYEISVRDKEFSKGPWKQDNVETEASMVIAVPEPFCGSLIIGQESITYHNGDKYVAVA-- 249
Query: 344 NYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV 397
+ +S+ +DA + +L D++ G L +L + + DG V
Sbjct: 250 --------PPAIKQSTLICHGRVDANGSRYLLGDMS------GRLFMLLLEKEELIDGSV 295
Query: 398 -VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
V+ L + + + +T + N + +LGSRLGDS L++ + S
Sbjct: 296 TVKDLKVELLGETSIAECLTYLDNGVVYLGSRLGDSQLIKLNVDADDS 343
>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
Length = 1140
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 129/301 (42%), Gaps = 36/301 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ +GTA V E+ + GR+++F D V E KE+KGA+ ++
Sbjct: 823 KDPSVYFIVGTAMVYPEEPEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L LQ L SS T + +LFGT++G IG
Sbjct: 989 RQHLQEVGVFHLGEFVNVFCHGSLVLQNLGESSTPTQGS----------VLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ L E L LQ +L + V + +R F++ K + +D +L+
Sbjct: 1039 LVTSLSEGWHSLLLDLQNRLNKVIKSVGKIEHSFWRSFYTERKTEQ--ATGFIDGDLIES 1096
Query: 1395 Y 1395
+
Sbjct: 1097 F 1097
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 74/341 (21%), Positives = 130/341 (38%), Gaps = 73/341 (21%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S + E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVTVMETFTNL---------------G 357
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 358 PILDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + +S G +D L++S +T VL +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRESDD------------MLVLSFVGQTRVLMLS 439
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ +++IQ+ R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479
>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
Length = 278
Score = 77.0 bits (188), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 64/250 (25%), Positives = 116/250 (46%), Gaps = 22/250 (8%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIV 1204
VY KE+KGA ++ S+ G L++A + L +WT + L F + LY+ + N
Sbjct: 15 VYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWTADKELRLECSDFDNVTALYLKTKN-- 72
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
+ IL+GD+ +S+ LS+K + +A+DF + A E + + L + +
Sbjct: 73 -DLILVGDLMRSLSLLSYKSMESTFEKVARDFMTNWMSACEIIDSDNFLG-----AENSY 126
Query: 1265 QIFYYAPKMSESWK--GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
+F +K G +L F++G V F + AT D AP
Sbjct: 127 NLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVD---VAP----LYHS 179
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
++L+GT DG IG I + + + LQ +QK+L + + ++ +R F + ++
Sbjct: 180 SILYGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSE--A 237
Query: 1383 PDSIVDCELL 1392
P+ +D +L+
Sbjct: 238 PNGFIDGDLI 247
>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
Length = 1128
Score = 76.6 bits (187), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 136/299 (45%), Gaps = 27/299 (9%)
Query: 1112 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
A+GTA + E+ + GR+LLF +T+V KE+KG L G LL +
Sbjct: 819 AVGTAILNPEESEPKQGRILLFHWCEGK------LTQVAEKEIKGGCYTLVEFNGKLLAS 872
Query: 1171 SGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
+ L +WT + L F + LY L + +FIL+GD+ +S+ L +K+
Sbjct: 873 INSTVRLFEWTSEKELRLECSHFNNIVALY---LKVKGDFILVGDLMRSMSLLQYKQMEG 929
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
+A+D+ A E L D + L ++ N+ + + + Q++ +
Sbjct: 930 SFEEIARDYSPNWMTAVEILDDDTFLG---AENSFNLFVCQKDSAATTDEERQQMGYMGQ 986
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
FHVG V R ++A +D AAP + +L T+ G+I + L + F L
Sbjct: 987 FHVGDMVNVMRRGALVAQLADT--AAPVAR-----PVLLATVSGAICLVVQLSQELFDFL 1039
Query: 1348 QSLQKKLVDSVPHVAGLNPRSF-RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
L+++L ++ V + P SF R F+++ K P + +D +L+ + L + Q E
Sbjct: 1040 HQLEERLTHTIKSVGKI-PHSFWRSFNTDIKTE-PA-EGFIDGDLIESFLDLSRDMQQE 1095
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 102/460 (22%), Positives = 176/460 (38%), Gaps = 107/460 (23%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G L +DPQ R G+ +Y I+ + + L S R+E +N+
Sbjct: 120 GILAVIDPQARVIGLRLYDGLFKIIPLDKDSTEL----------KAASLRLEE---LNVY 166
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
DL+ F+HG P ++++H+ ++ +H I I+ K+ I
Sbjct: 167 DLE--------FLHGCSNPTLILIHQD-------LNGRH----IKTHEINLRDKEFMKIP 207
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSP+GG +V+G +I YH + A+A ++
Sbjct: 208 WKQDNVETEASILIPVPSPLGGAIVIGQESIVYHDGQSYVAVAPPQIKTPINC------- 260
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----VVQRLDLSKTNPSVLTSD 414
+D +L D+A G L +L + R V+ L + +
Sbjct: 261 --YCRVDVRGLRYLLGDIA------GRLFMLLLELSERDGTASVRDLKVELLGDIPIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + N + F+GSRLGDS LV+ + DA + + + +
Sbjct: 313 MTYLDNGVVFVGSRLGDSALVRLAA-----------------VRDDASQYVQPMETFT-S 354
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L N + F +G L+ G+ I AS
Sbjct: 355 LAPIVDMCVVDLERQGQNQLITCSGAF---------KMGSLRIIRNGIGIQEQAS----- 400
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + + G +H L++S +T VL
Sbjct: 401 -------IDLPGIKGMWAL----TLGQGP-----------HHDTLVLSFVGQTRVLTLNG 438
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQV + G R++
Sbjct: 439 EEVEETEIKGFVSDRQTFFTGNVC-HDQLIQVTDEGIRLI 477
>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
Length = 1143
Score = 76.6 bits (187), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 37/286 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF---RQFHSNGK 1377
+ L E + L +Q +L + V + + R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERK 1084
>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
Length = 1143
Score = 76.6 bits (187), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 126/286 (44%), Gaps = 37/286 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF---RQFHSNGK 1377
+ L E + L +Q +L + V + + R FH+ K
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSQRSFHTERK 1084
>gi|387593561|gb|EIJ88585.1| hypothetical protein NEQG_01275 [Nematocida parisii ERTm3]
gi|387597215|gb|EIJ94835.1| hypothetical protein NEPG_00359 [Nematocida parisii ERTm1]
Length = 1261
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/251 (23%), Positives = 126/251 (50%), Gaps = 26/251 (10%)
Query: 1054 LHRTYTVEEY---EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ET 1109
L R Y+++ Y E+++ R G ++++E ++VTL + E
Sbjct: 894 LTRAYSLKIYSHEEMKVC--SRTEGVLMAVDEYRLENNEYIAYHKIVTLPDKQNTEGFSE 951
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQ 1164
+ + T Y+ ED+ ARGR+++ R+ ++ + + +++ KGA + ++
Sbjct: 952 FVIVCTTYITDEDLMARGRLIVLEIASVVPQRDRIETRHKLKALAAEKTKGATTCCDIVK 1011
Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
G++++ G K++++ + E L +AF+D +++ S +++N I+ GD +K + L ++
Sbjct: 1012 GNIVVCVGTKLMIYMFDRNEGLRAVAFHDIH-VFLTSCMVMRNIIVCGDAYKGTFLLFYQ 1070
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDG-------STLSLVVSDEQKNIQIFYYAPKMSES 1276
L++L++ G + +L+ G + LSL+ D K + I+ Y+P+ S
Sbjct: 1071 SDPPLLHMLSQSSGGV------YLLKGIGMTLHDTALSLISYDSLKTVCIYTYSPQHILS 1124
Query: 1277 WKGQKLLSRAE 1287
G +L+SR E
Sbjct: 1125 QDGSRLISRGE 1135
>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
[Strongylocentrotus purpuratus]
Length = 630
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/303 (24%), Positives = 132/303 (43%), Gaps = 30/303 (9%)
Query: 1113 IGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+G A V ++ + GR+++F S G+ + E+ KE+KGA +L G LL
Sbjct: 322 VGLANVHPDEAEPKSGRIVVFQYSDGK--------LQEIAEKEIKGAPYSLVEFNGKLLA 373
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
+ + L +WT + + + L +FI++GD+ +SI L++K L
Sbjct: 374 SVNSVVRLFEWTPEHSLRVECSHYNNVLALYLKTKGDFIVVGDLMRSITLLAYKPMEGCL 433
Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
+A+D+ A E L D + L ++ N+ + + + L FH
Sbjct: 434 EEIARDYSPNWMSAVEILDDDTFLG---AENSSNLFTCQKDSAATTDEERRHLQEVGLFH 490
Query: 1290 VGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
+G V F L +Q + S+ T + +LFGT+ GS+G + L+E +R
Sbjct: 491 LGEFVNVFRHGSLVMQNIGESTIPTTGS----------VLFGTVSGSVGLVTQLNEEFYR 540
Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
L +Q KL + V + +R F+S K D+ +D +LL + L + E
Sbjct: 541 FLLEVQNKLTKVIKSVGKIKHSFWRSFYSERKTE--PMDNFIDGDLLESFLDLSRDTMDE 598
Query: 1406 IAH 1408
+A
Sbjct: 599 VAQ 601
>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
Length = 1124
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 128/285 (44%), Gaps = 36/285 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 805 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 858
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 859 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 913
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 914 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 970
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 971 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1020
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAG--LNPRSFRQFHSNGK 1377
+ L E + L +Q +L + ++ + ++R FH+ K
Sbjct: 1021 LVTSLSESWYNLLLDMQNRLNKVIKNITHSLTHLSTWRSFHTERK 1065
>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
Length = 836
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/326 (25%), Positives = 141/326 (43%), Gaps = 50/326 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+S+L +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSILRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTNPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ GF ++S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIPL------------------GFD---KTSYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL---KQHPLIWSAMNL 304
I + GY P + LH + TW GR V+ T I LS+ + KQ + +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIVLSLEPDVIHEKQDIVATVSYQF 228
Query: 305 PHDAYKLLAVPS--PIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF 360
+ Y L + P G+L++ N+I Y S ++ S L + V + + P SSF
Sbjct: 229 SYVPYNALQIVDCYPTNGILILTINSIIYLSTTSFESFILPFGKFFV-IPKNNNRPLSSF 287
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSVL 411
+ T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 288 QI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYKY 344
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 345 HPTTFTISSNHLFIGSTVHDSYIYNY 370
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
K+ + L +G ED +G+ +F N +N L+ ++ + K ++ A+ +
Sbjct: 519 KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572
Query: 1165 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1212
G L +ASG ++ L +W + I+ + PL V+ K ILL D
Sbjct: 573 GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMEKGNEKECYLILLSD 632
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++S+ L +K + L KD ++ C + F+I S++ D ++N+ + Y+
Sbjct: 633 FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+E ++ A F++G ++ KF RL G + ++ T++GS
Sbjct: 693 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+G I+ ++E ++ L+ + K+ H AG N +R G +D ++L
Sbjct: 735 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794
Query: 1393 SHYEMLPLEEQ 1403
+ +L E+Q
Sbjct: 795 KQFRLLNEEQQ 805
>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
Length = 1141
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 159/368 (43%), Gaps = 62/368 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
++ +I+ HNL +D H T E +L P +E AL+
Sbjct: 782 DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA V ++ + GR+LL+ S G+ +T+V KE+K
Sbjct: 817 LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY L +FIL+G
Sbjct: 869 GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFILVG 925
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 926 DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 983 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G+IG + + + L++L+++L + V + +R F++ K + + +
Sbjct: 1033 TVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090
Query: 1388 DCELLSHY 1395
D +L+ +
Sbjct: 1091 DGDLIESF 1098
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 90/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501
>gi|166158025|ref|NP_001107422.1| damage-specific DNA binding protein 1, 127kDa [Xenopus (Silurana)
tropicalis]
gi|157422734|gb|AAI53474.1| Zgc:63840 protein [Danio rerio]
gi|163916541|gb|AAI57552.1| LOC100135265 protein [Xenopus (Silurana) tropicalis]
Length = 306
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 121/279 (43%), Gaps = 35/279 (12%)
Query: 1126 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG--- 1182
+GR+++F D V E KE+KGA+ ++ G LL + + L++WT
Sbjct: 11 QGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKE 64
Query: 1183 --TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
TE N + + LY L +FIL+GD+ +S+ L++K +A+DF
Sbjct: 65 LRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPNW 119
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF--- 1297
A E L D + L ++ N+ + + + Q L FH+G V F
Sbjct: 120 MSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHG 176
Query: 1298 -LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
L LQ L SS T + +LFGT++G IG + L E + L LQ +L
Sbjct: 177 SLVLQNLGESSTPTQGS----------VLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNK 226
Query: 1357 SVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ V + +R FH+ K + +D +L+ +
Sbjct: 227 VIKSVGKIEHSFWRSFHTERKTEQ--ATGFIDGDLIESF 263
>gi|119580419|gb|EAW60015.1| hCG2010549, isoform CRA_a [Homo sapiens]
Length = 323
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 46/73 (63%)
Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKA 1002
LR+HP +G + +F + HNVNC GF+Y QG L+I LP+ +YD+ WPV+KIPL
Sbjct: 184 LRLHPVGINGPVNSFALFHNVNCPRGFLYFNRQGKLRISVLPAYLSYDSPWPVRKIPLCC 243
Query: 1003 TPHQITYFAEKNL 1015
T H + Y E +
Sbjct: 244 TVHCVAYHVESKI 256
>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
Length = 1043
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 128/283 (45%), Gaps = 33/283 (11%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNAD 1139
+T P+ + E ++ L + + ++ +GTAYV E+ +GR+L+F D
Sbjct: 759 STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 811
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1194
L+ E KE KGA+ +L + G LL A KI L+KW GT EL + +
Sbjct: 812 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGH 868
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
L + + +FI++GD+ KSI L +K + + A+D+ + A E L D L
Sbjct: 869 IL-ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 927
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1311
+ N IF K SE + +L E+H+G V +F ++
Sbjct: 928 -----AENNFNIF-TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------ 975
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
P SD ++FGT++G IG IA L + L+ LQ L
Sbjct: 976 -LPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEKLQANL 1017
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P +V+L++ +H A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
L + P W+ NL + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251
Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+ ++ V+ D + LL G L LL + ++ V L +
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + N+ ++GS GDS L++ ++ DA
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334
Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
+ S + L+ VN + + + + T S A +D G L+ G+
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
IN AS VEL G KG+W++ ++ DD + +L++S
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421
Query: 584 EARTMVL 590
+ T +L
Sbjct: 422 ISETRIL 428
>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
Length = 1141
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/368 (24%), Positives = 159/368 (43%), Gaps = 62/368 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
++ +I+ HNL +D H T E +L P +E AL+
Sbjct: 782 DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA V ++ + GR+LL+ S G+ +T+V KE+K
Sbjct: 817 LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEIK 868
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY L +FIL+G
Sbjct: 869 GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFILVG 925
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 926 DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + Q++ +FH+G V F L +Q L SS T +LFG
Sbjct: 983 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLFG 1032
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G+IG + + + L++L+++L + V + +R F++ K + + +
Sbjct: 1033 TVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090
Query: 1388 DCELLSHY 1395
D +L+ +
Sbjct: 1091 DGDLIESF 1098
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 88/420 (20%), Positives = 160/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSR GDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRFGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +++ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIIDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501
>gi|119594339|gb|EAW73933.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_a [Homo
sapiens]
Length = 1094
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 132/296 (44%), Gaps = 38/296 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKL---VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
+ L E + L +Q +L + SV + +P F +G+ ++P S V
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSHPPG-DPFTPSGRQNQPQVSSTV 1093
>gi|297267724|ref|XP_001082958.2| PREDICTED: DNA damage-binding protein 1 [Macaca mulatta]
Length = 1092
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 125/286 (43%), Gaps = 38/286 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
+ L E + L +Q +L + V + FH +HR
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIE----HSFHLEILSHR 1080
>gi|221040048|dbj|BAH11787.1| unnamed protein product [Homo sapiens]
Length = 1092
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 125/286 (43%), Gaps = 38/286 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1279 GQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
Q L FH+G V F L +Q L +S T ++LFGT++G IG
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMIG 1038
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
+ L E + L +Q +L + V + FH +HR
Sbjct: 1039 LVTSLSESWYNLLLDMQNRLNKVIKSVGKIE----HSFHLEILSHR 1080
>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Apis florea]
Length = 1141
Score = 75.1 bits (183), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 164/380 (43%), Gaps = 62/380 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
++ +I+ HNL +D H T E +L P +E AL+
Sbjct: 782 DICQEIEVHNLLIIDQH---TFEVLHAHMLMP-----------------TEYALS----- 816
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA V ++ + GR+LL+ S G+ +T+V KE K
Sbjct: 817 LISTKLGEDPTSYYIVGTALVHPDETEPKMGRILLYHWSDGK--------LTQVAEKEXK 868
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY+ S +FIL+G
Sbjct: 869 GSCYSLTEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALYLKSKG---DFILVG 925
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ +S+ L +K +A+D+ A E L D + L ++ N+ +
Sbjct: 926 DLMRSLTLLQYKTMEGCFEEIARDYNPNWMTAIEILDDDTFLG---AENCFNLFVCQKDS 982
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ + Q++ +FH+G V F L +Q L SS T +L G
Sbjct: 983 AATSEDERQQMQEVGQFHLGDMVNVFRHGSLVMQNLGESSTPTQGC----------VLXG 1032
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
T+ G+IG + + + + L++L+ +L + V + +R F++ K + + +
Sbjct: 1033 TVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVGKIEHNFWRSFNTELKIEQ--CEGFI 1090
Query: 1388 DCELLSHYEMLPLEEQLEIA 1407
D +L+ + L ++ E+A
Sbjct: 1091 DGDLIESFLDLSPDKMAEVA 1110
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
++ V+D F+HG P ++++H+ ++ +H + IS K+ I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSPI G +++G +I YH N Y + + +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L V + + VV+ L + +
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKTDGTQVVKDLKVELLGEISIPEC 316
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + F+GSRLGDS LV+ +AD + + +
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + T S A ++ G L+ G+ I AS
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + G N D++ L++S +T +L
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
E T+ + +T GN+ IQ+ AR++ T + P N + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501
>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
Length = 1175
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 96/469 (20%), Positives = 191/469 (40%), Gaps = 60/469 (12%)
Query: 983 LPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGH 1042
LP T++ P+Q+I + TPH I +Y + + + P ++ H
Sbjct: 743 LPKFETFNYGIPIQEIIVGGTPHNIISSPIGLVYTISTTTTLESP-------DCPPQLPH 795
Query: 1043 QIDNHN-------LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1095
N L++ L +E Y++ + +++ T N+L R
Sbjct: 796 SSTQQNALEKPPELTATPLRAELFLEHYKLVYFDQNQS-------FTFDKGMFVNSL--R 846
Query: 1096 VVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
V+ L T L+ G ED +G V LFS ++ ++ V K
Sbjct: 847 VLEL--TINSVKRVLVGCGVNTQTTEDDPVKGNVFLFSLESTSEGTIRHISTVCDG--KK 902
Query: 1156 AISALASLQGHLLIASGPKIILHK------WTGTELNGIAFYDAPPLYVVSLNIVKN--- 1206
A+ A+ S+ G+L +A G ++ + K W + I+ + + + + KN
Sbjct: 903 AVHAINSIGGYLAVAEGNELQILKGKTESLWVKKCFSDISIL-INTITFLPMTLSKNKVD 961
Query: 1207 ----FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1262
ILL D+++S+ L ++ Q + L KD + F++D ++ D ++
Sbjct: 962 EMCYLILLNDMYRSVILLLFQPQKKSVIPLGKDGRDIHAIDAAFVLDKDYFHVLEIDYER 1021
Query: 1263 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
N+ + Y +E+ + A F+VG + + RL++ N +
Sbjct: 1022 NLSVMNYL--RTETERISIFEVAATFNVGVDILRLTRLRL----------------GNGY 1063
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
++ + GS+G + ++E +++ L+ + K+ H AG NP FR G +
Sbjct: 1064 VFVYLSAQGSVGYLTVVNERSYQTLRQINAKMNREPWHFAGTNPEEFRMEKGYGVGYGRR 1123
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
I+D ++L + L E+Q + + T+ S +++ L++ +S L
Sbjct: 1124 KQVILDGDILKEFHFLTQEQQKRVCLR-NTSISDVVNILDNALQRSSLL 1171
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 82/363 (22%), Positives = 153/363 (42%), Gaps = 48/363 (13%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
+IL F+ A++SV+ ++ + + S+HCFE PE ++ + P + +D +GRC
Sbjct: 74 LILLFKQARLSVMRYNTETNRFVVHSLHCFEYPELRIREKCTPTAYDDPRMFIDKKGRCI 133
Query: 193 GVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFV 252
+L Y + ++ GS SS+ ++L + + D I +
Sbjct: 134 SLLCYDRLLWVIP--------------LGSN-------RSSYRVDLEKFGVSRIVDVISL 172
Query: 253 HGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL--KQHPLIWSAMN----LP 305
GY P + LH TW R V+ T I+ ++++ + ++ + +N LP
Sbjct: 173 SGYETPTLAFLHMTVPTWDARTVNTGEATNEIAIINVNPGVVGEEEQECANVVNRISRLP 232
Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE----------L 355
++ K++ P+ G+L++ + ++ Y S ++S + L + + + L
Sbjct: 233 YNTLKMVEC-YPLPGILLLASVSVLYISTTSSESFIL-PFGTYFNPPEVWKGVVPFLKLL 290
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
P ++L + QN + L T GD + + +VQ + LS P +
Sbjct: 291 PMKIRIIQLVKSIHQLSQN-LYLTFTDKGDSYYIHLNCVEGIVQEIVLSNA-PYKFIPNT 348
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
++ + FLGS DS L +T C G G + FG DA K L+ S
Sbjct: 349 VSLYDDYIFLGSVFHDSYLFNYTICEYG-----KGDIKPFGIHCGDAVRIKNLQERSGQM 403
Query: 475 LQD 477
+D
Sbjct: 404 EED 406
>gi|324501533|gb|ADY40680.1| Splicing factor 3B subunit 3 [Ascaris suum]
Length = 1214
Score = 74.7 bits (182), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 129/290 (44%), Gaps = 37/290 (12%)
Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
++A+ +G L+ G KI L+ +L P+ VV + + I++ D +S
Sbjct: 946 VNAVHDFRGMALVGVGKKIRLYDLGKKKLLAKCENKQLPVQVVDIRSMGQRIVVSDSQES 1005
Query: 1217 IYFLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++F+ +K+Q QL++ D F + C ++D T++ V D +I +
Sbjct: 1006 LHFMRYKKQDNQLSIFCDDTSPRFVTCIC-----ILDYDTVA--VGDRFGSIAVLRLPKG 1058
Query: 1273 MSESWKGQKLLSRAEFHVG---------AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
++E + RA + G H+ +F + TS +T PG++
Sbjct: 1059 VTEEVQEDPTGVRALWDRGNLNGASQKVEHIGQFY-VGDTVTSMQKTSLVPGAND----C 1113
Query: 1324 LLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
L++ T+ G+IG + P DE F Q+L+ L P + G + +FR F+ K
Sbjct: 1114 LVYTTISGTIGMLVPFVSRDEFDF--FQNLEMHLRVEFPPLCGRDHLAFRSFYFPVKC-- 1169
Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
I+D +L Y ++PL++Q +A + G ++I L D+ +F
Sbjct: 1170 -----IIDGDLCEQYALMPLDKQKAVAEELGRKPAEIHKKLEDIRTRYAF 1214
>gi|378755148|gb|EHY65175.1| hypothetical protein NERG_01621 [Nematocida sp. 1 ERTm2]
Length = 822
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 114/212 (53%), Gaps = 9/212 (4%)
Query: 1084 PMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
P++ +E ++VTL + E + + + T Y+ ED+ ARGR+++ R+
Sbjct: 486 PLEDNEYIAHHKIVTLPDKQNTEGVSEFVIVCTTYITDEDLMARGRLIVLEIASVVPQRD 545
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPL 1196
++ + + +++ KGA + ++G++++ G K++++ + E L +AF+D +
Sbjct: 546 RIETRHKLKALAAEKTKGATTCCDIVKGNIVVCVGTKLMIYMFDRNEGLRAVAFHDIH-V 604
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF-ATEFLIDGSTLSL 1255
++ S +++N I+ GD +K + L ++ + + L+LL++ G + + GS LSL
Sbjct: 605 FLTSCMVMRNIIVCGDAYKGTFLLFYQSEPSLLHLLSQSSGGVYLLKGIGMTLYGSVLSL 664
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
+ D K + I+ Y+P+ S G +L+SR E
Sbjct: 665 LSYDSAKTVCIYSYSPQHILSQGGTRLISRGE 696
>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
Length = 1148
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 95/398 (23%), Positives = 166/398 (41%), Gaps = 64/398 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E G +++ HNL +D H + ++ MQS E AL++
Sbjct: 789 EFGQEVEVHNLLVIDQHTFEVLHAHQF-------------------MQS-EYALSLISTK 828
Query: 1099 LFNTTTKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI 1157
L + + T +GTA V E + +GR+L+F + + +V KE+KGA
Sbjct: 829 LGD----DPNTYYIVGTAMVNPDESESKQGRILIFQF------QEGKLYQVAEKEIKGAA 878
Query: 1158 SALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
+L G LL + + L +WT + L F + LY L +FIL+GD+
Sbjct: 879 YSLVEFNGKLLASINSTVRLFEWTAEQELRLECSHFNNIISLY---LKTKGDFILVGDLI 935
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+S+ L +K +A+D A E + D + L ++ N+ + +
Sbjct: 936 RSMTLLQYKTMEGCFEEMARDHNPNWMTAVEIIDDDTFLG---AENSFNLFVCQKDSAAA 992
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKF----LRLQMLA-TSSDRTGAAPGSDKTNRFALLFGTL 1329
+ Q++ + FH+G V F L +Q + TS+ TG +LFGT+
Sbjct: 993 TDEERQQMHAVGMFHLGDMVNVFRHGSLVMQNVGETSTPTTGC-----------ILFGTV 1041
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
G+IG + + + L L+ KL + + V + +R F + K D +D
Sbjct: 1042 SGAIGLVTQISANFYNFLHELECKLTEVIKSVGKIKHSFWRSFTTEIKTEP--CDGFIDG 1099
Query: 1390 EL------LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+L LSH +M + L+I + +G + + +L
Sbjct: 1100 DLIESFLDLSHEKMKEVAAGLQIDNGSGMKQEATVDDL 1137
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/276 (21%), Positives = 116/276 (42%), Gaps = 61/276 (22%)
Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
G++S G + +DP+ R G+ +Y + I+ + S L S R+E
Sbjct: 113 GKQS-ETGIIAVIDPEARVIGLRLYDGLLKIIPLGKDNSELKAS----------SIRMEE 161
Query: 233 SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
V +D F+HG P ++++H+ ++ +H + IS
Sbjct: 162 VEV-----------QDLNFLHGCQNPTIILIHQD-------INGRH----VKTHEISLRD 199
Query: 293 KQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---LNNYAVS 348
K+ + W N+ DA ++ VP P+ G +++G +I YH+ + A+A +N ++
Sbjct: 200 KEFVKMPWKQDNVEPDASIVIPVPEPLCGAIIIGQESILYHNGAGYVAVAPPVINQSTIT 259
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-------VQRL 401
+ ++D+ + +L D+A G L +L + + ++ L
Sbjct: 260 CYT-----------QVDSNGSRYLLGDMA------GHLFMLLLETEEKIDGTPCVKENGL 302
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ + IT + N + F+GSR GDS LV+
Sbjct: 303 KVELLGEISIPEAITYLDNGVLFIGSRCGDSQLVKL 338
>gi|297820284|ref|XP_002878025.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
lyrata]
gi|297323863|gb|EFH54284.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
lyrata]
Length = 454
Score = 73.6 bits (179), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 88/381 (23%), Positives = 161/381 (42%), Gaps = 57/381 (14%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RIL+P A T + +Q +E A +V V N KE TLLA+GT V+G
Sbjct: 103 IRILDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 153
Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
+ ++ R + ++L ++ +++G AL QG LL GP + L+
Sbjct: 154 PKKNLVAGFIHIYRFVEEGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 212
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1235
L P ++S+ ++ I +GDI +S ++ ++ QL + A D
Sbjct: 213 KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 272
Query: 1236 ---------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
D F + + LS + ++ +I + K++ + K+
Sbjct: 273 ASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGA--PNKVDEI 330
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
+FHVG VT + M+ S+ ++++GT+ GSIG + D++
Sbjct: 331 VQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDDV 376
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 377 DF--FSHLEMHMRQEYPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 427
Query: 1403 QLEIAHQTGTTRSQILSNLND 1423
Q +IA + T ++IL L D
Sbjct: 428 QRKIADELDRTPAEILKKLED 448
>gi|255588145|ref|XP_002534515.1| spliceosomal protein sap, putative [Ricinus communis]
gi|223525135|gb|EEF27867.1| spliceosomal protein sap, putative [Ricinus communis]
Length = 1214
Score = 73.6 bits (179), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 93/387 (24%), Positives = 168/387 (43%), Gaps = 67/387 (17%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1120
+R+L+P A T + +Q +E A +V V N KE+ TLLA+GTA +
Sbjct: 863 IRVLDPRTAA----TTCLLELQDNEAAFSVCTV---NFHDKEHGTLLAVGTAKGLQFWPK 915
Query: 1121 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
++A G + ++ + D+ + L ++ +++G AL+ QG LL GP + L+
Sbjct: 916 RSLSA-GFIHIY---KFVDDGRALEL-LHKTQVEGVPLALSQFQGRLLAGIGPVLRLYDL 970
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----- 1235
L P +VS+ ++ I +GDI +S +F ++ QL + A D
Sbjct: 971 GKKRLLRKCENKLFPNSIVSIQTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDCVPRW 1030
Query: 1236 -----------FGSLDCFATEFLIDGSTLSLVVSDEQKNI----QIFYYAPKMSESWKGQ 1280
D F + + L VSDE + +I + K++ +
Sbjct: 1031 LTASHHVDFDTMAGADKFGNIYFV---RLPQDVSDEIEEDPTGGKIKWEQGKLNGA--PN 1085
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL- 1339
K+ +FH+G VT + ++ PG + +++GT+ GS+G + P
Sbjct: 1086 KVEEIVQFHIGDVVTSLSKASLI----------PGGGE----CIIYGTVMGSVGALLPFT 1131
Query: 1340 --DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
D++ F L+ L P + G + ++R A+ P D ++D +L +
Sbjct: 1132 SRDDVDF--FSHLEMHLRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPT 1182
Query: 1398 LPLEEQLEIAHQTGTTRSQILSNLNDL 1424
LPL+ Q +IA + T +IL L ++
Sbjct: 1183 LPLDAQRKIADELDRTPGEILKKLEEV 1209
>gi|443922899|gb|ELU42250.1| splicing factor 3B subunit 3 [Rhizoctonia solani AG-1 IA]
Length = 1212
Score = 73.6 bits (179), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 117/486 (24%), Positives = 212/486 (43%), Gaps = 49/486 (10%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPV 1024
C G I + +L+I Q+P S D V +PL TP +I E L+ +I S
Sbjct: 753 CPEGLIGIAGS-VLRIFQIPKLS--DKLKQV-TMPLSYTPRKIAVHPEHQLFYVIESDHR 808
Query: 1025 L---KPLNQVLSLLIDQEVGHQIDNH--NLSSVD--LHRTYTVEEYE-VRILEPDRAGGP 1076
+ N+ L+ L+ G QID +L + D L R + +RI++P
Sbjct: 809 TWGSEAKNKRLAELV--RAGRQIDQELVDLPAEDFGLPRAGAGQWASCIRIIDPTEVFCS 866
Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
T I + ++E+A +V VV +ENE L +GTA + +V R V ++G
Sbjct: 867 -ATLFKIELDNNESAFSVAVVPF---AARENELFLVVGTA--KDTNVLPRQCVGAVTSGS 920
Query: 1137 --NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
+++T E+ AL ++G L G + +++ +L +
Sbjct: 921 LVKLGWSTHILTRPIQTEVDDVPLALLGIKGRLCAGVGKALRIYEMGKKKLLRKSENKGF 980
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
+V+L + I++G++ +S+++ ++K + +L + A D S + L+D T++
Sbjct: 981 ATAIVTLTSQGSRIIVGEMQESVHYATYKPESNRLLVFADD-TSARWVTSAALVDYDTVA 1039
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEF-HVGAHVTKFL---RLQMLATS 1306
V D+ NI + +S+ G ++ EF H H TK L + + TS
Sbjct: 1040 --VGDKFGNIFVNRLPANISQQVDDDPTGAGIMHEREFLHGAPHKTKLLAHYNVGDIVTS 1097
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
R PG R + + L G+IG + PL +++ F + +L++ + + G
Sbjct: 1098 VHRAALVPG----GRDVVAYTGLHGTIGVLIPLASKEDVDF--ITTLEQHMRSEHSSLVG 1151
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+ ++R ++ KA +VD +L + MLP +Q IA + T ++L L
Sbjct: 1152 RDHLAYRGYYVPVKA-------VVDGDLCERFAMLPSTKQKSIAGELDRTVGEVLKKLEG 1204
Query: 1424 LALGTS 1429
L + S
Sbjct: 1205 LRVAGS 1210
>gi|18410222|ref|NP_567015.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
gi|18410226|ref|NP_567016.1| putative splicing factor [Arabidopsis thaliana]
gi|7019653|emb|CAB75754.1| spliceosomal-like protein [Arabidopsis thaliana]
gi|7019655|emb|CAB75756.1| spliceosomal-like protein [Arabidopsis thaliana]
gi|332645831|gb|AEE79352.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
gi|332645833|gb|AEE79354.1| putative splicing factor [Arabidopsis thaliana]
Length = 1214
Score = 73.2 bits (178), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 90/382 (23%), Positives = 163/382 (42%), Gaps = 59/382 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+R+L+P A T + +Q +E A +V V N KE TLLA+GT V+G
Sbjct: 863 IRVLDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 913
Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
+ ++ R ++ ++L ++ +++G AL QG LL GP + L+
Sbjct: 914 PKKNLVAGFIHIYRFVEDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 972
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
L P ++S+ ++ I +GDI +S ++ ++ QL + A D
Sbjct: 973 KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 1032
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR------ 1285
A+ +D T++ +D+ N+ +SE W+ KL
Sbjct: 1033 ASHH-VDFDTMA--GADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDE 1089
Query: 1286 -AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
+FHVG VT + M+ S+ ++++GT+ GSIG + D+
Sbjct: 1090 IVQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDD 1135
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
+ F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 1136 VDF--FSHLEMHMRQEYPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPTLPMD 1186
Query: 1402 EQLEIAHQTGTTRSQILSNLND 1423
Q +IA + T ++IL L D
Sbjct: 1187 LQRKIADELDRTPAEILKKLED 1208
>gi|242089089|ref|XP_002440377.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
gi|241945662|gb|EES18807.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
Length = 1783
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 151/373 (40%), Gaps = 59/373 (15%)
Query: 1038 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1094
+ + HQ + L+ +VEE E +R+L+ +++ P+ E ++
Sbjct: 717 RRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQT----FESLCVYPLDQYEFGCSI 772
Query: 1095 RVVTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
+ +N +GTAYV E+ +GR+L+F+ D L+ E KE
Sbjct: 773 ISCSF----ADDNNVYYCVGTAYVIPEENEPTKGRILVFAV---EDGSLQLIVE---KET 822
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFI 1208
KGA+ +L + G LL A KI L+KW EL + L + + +FI
Sbjct: 823 KGAVYSLNAFNGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYT-QTRGDFI 881
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLL-----------------------AKDFGSLDCFATE 1245
++GD+ KSI L +K + L A+D+ + A E
Sbjct: 882 VVGDLMKSISLLVYKVVPLTVCLTHIVLSVIFFVSLFVVLESAIEERARDYNANWMTAVE 941
Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
L D V ++ N+ + + +L E+H+G V +F ++
Sbjct: 942 MLDDE---VYVGAENGYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMR 998
Query: 1306 SSD-RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
D G P ++FGT++G IG IA L + L+ LQ LV + V L
Sbjct: 999 LPDSEIGQIP--------TVIFGTINGVIGIIASLPHDQYVFLEKLQSTLVKYIKGVGNL 1050
Query: 1365 NPRSFRQFHSNGK 1377
+ +R FH++ K
Sbjct: 1051 SHEQWRSFHNDKK 1063
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 108/464 (23%), Positives = 189/464 (40%), Gaps = 112/464 (24%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D + +A E K VL++D L +M + GR + G + +DP
Sbjct: 75 QDFLFIATERYKFCVLQWDAEKSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDC 127
Query: 190 RCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
R G+ +Y GL +I ++G F+ R+E V++++
Sbjct: 128 RLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQVLDIK--------- 167
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
F+HG ++P +V+L++ +H AL + P WS N+ + A
Sbjct: 168 --FLHGCVKPTIVVLYQ------DNKDVRHVKTYEVALK-DKDFVEGP--WSQNNVDNGA 216
Query: 309 YKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAH 368
L+ VP+P+GGV+++G I Y + +++ ++ Q + R+ V+ D +
Sbjct: 217 GLLIPVPAPLGGVIIIGEEQIVYCNANSTFK--------AIPIKQSIIRAYGRVDPDGSR 268
Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSR 428
LL TG L LL + ++ V L + + + S I+ + N + ++GSR
Sbjct: 269 Y--------LLGDNTGILHLLVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSR 320
Query: 429 LGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
GDS LV+ +++ADA S + L+ VN + +
Sbjct: 321 FGDSQLVKL------------------NLQADASG------SFVEILERYVNLGPIVDFC 356
Query: 489 SASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
+ + + T S A +D G L+ G+ IN AS VEL G
Sbjct: 357 VVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQG 399
Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
KG+W++ KSS ++D Y YL++S + T L
Sbjct: 400 IKGLWSL--KSS------------FNDPYDMYLVVSFISETRFL 429
>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1223
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/360 (24%), Positives = 147/360 (40%), Gaps = 62/360 (17%)
Query: 1113 IGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA+ E+V +RGR+L+ R + +LV E KE+KGA+ L + G LL
Sbjct: 873 VGTAFSLPEEVEPSRGRILVL---RADEGRLSLVAE---KEVKGAVYNLNAFNGKLLAGI 926
Query: 1172 GPKIILHKW---------------------------------------TGTELNGIAFYD 1192
K+ L KW T EL +
Sbjct: 927 NSKVQLFKWVSRGAGAGAGAGGGAEGGAVAMADGGGGGGGGGGAPAAATTCELASECSHH 986
Query: 1193 APPLYVVSL--NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDG 1250
++V+L ++ +FI++GD+ KSI L +K + A+DF A L D
Sbjct: 987 G---HIVALYVDVRGDFIVVGDLMKSISLLVYKPDEGVIEERARDFNPNWMTAVCALDDE 1043
Query: 1251 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF------LRLQMLA 1304
+ L ++ N+ + + +L E+H+G V +F +RL
Sbjct: 1044 TYLG---AENSFNLFTVRKNSDAAADEERSRLDVIGEYHLGEFVNRFRAGSLVMRLPGDG 1100
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
+ S++ LFGT++G+IG +A L E T L +LQK + V V G
Sbjct: 1101 DGAGLGLGLDASNEAP--TQLFGTVNGAIGVVASLPESTHTFLAALQKAMNKVVSGVGGF 1158
Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ ++R FH+ ++ VD +L+ + L E+ E+A G ++ + +L
Sbjct: 1159 SHDAWRSFHNEHRSRLVEARGFVDGDLIESFLDLRPEKASEVASVVGVGVEELTKRIEEL 1218
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 113/262 (43%), Gaps = 35/262 (13%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GP+ VDP+ R G+ +Y ++ Q G FS R+E V +++
Sbjct: 130 GPIGAVDPECRMYGLHLYDGLFKVIPMDQTGQ----------LREAFSVRLEELQVFDVK 179
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
F+ G +P + +L++ T GR + C+ +P W
Sbjct: 180 -----------FLAGTPKPTIAVLYQD--TKEGRHIKTYEVCLKDK-------DFNPGPW 219
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
+ ++ + L+AVP+P+GGV+VVG I Y ++ + + + ++
Sbjct: 220 AQNDVESGSRFLIAVPAPLGGVVVVGEKVIAYLNKETTHGVGDGGGGGGGGGGGMIVKA- 278
Query: 360 FSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
+++ DA T+ D LLS G L LL +++D V+ L L + + S +
Sbjct: 279 IAMQSDATIMTYGAVDKDGSRYLLSDSAGRLHLLVLMHDKTRVRALKLESLGQTSIASSL 338
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
+ + N + ++GS GDS LV+
Sbjct: 339 SYLDNGVVYVGSAYGDSQLVRL 360
>gi|297816810|ref|XP_002876288.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322126|gb|EFH52547.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 88/381 (23%), Positives = 162/381 (42%), Gaps = 57/381 (14%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+R+L+P A T + +Q +E A +V V N KE TLLA+GT V+G
Sbjct: 282 IRVLDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 332
Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
+ ++ R + ++L ++ +++G AL QG LL GP + L+
Sbjct: 333 PKKNLVAGFIHIYRFVEEGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 391
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1235
L P ++S+ ++ I +GDI +S ++ ++ QL + A D
Sbjct: 392 KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 451
Query: 1236 ---------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
D F + + LS + ++ +I + K++ + K+
Sbjct: 452 ASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGA--PNKVDEI 509
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
+FHVG VT + M+ PG ++ +++GT+ GSIG + D++
Sbjct: 510 VQFHVGDVVTCLQKASMI----------PGGSES----IMYGTVMGSIGALHAFTSRDDV 555
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 556 DF--FSHLEMHMRQEYPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 606
Query: 1403 QLEIAHQTGTTRSQILSNLND 1423
Q +IA + T ++IL L D
Sbjct: 607 QRKIADELDRTPAEILKKLED 627
>gi|430810873|emb|CCJ31593.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 301
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 128/292 (43%), Gaps = 41/292 (14%)
Query: 245 HVKDFIFV-HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
H+ D F+ + Y EP + IL+ T G + ++ T +A I++
Sbjct: 6 HIVDLWFIFYDYREPTLAILYSAFQTSTGLLPYRQDTMTSTA------------IYTVDK 53
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFSV 362
LP+D + +L +P+PIGG L++G N + Y Q+A A+++N++A + ++
Sbjct: 54 LPYDLFSVLPLPNPIGGTLLIGNNELVYVDQAARVKAVSVNSFARKCTHLDFIEDYDLNL 113
Query: 363 ELDAAHATWL-----QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV----LTS 413
L+ A +L Q LL + G V + DGRVV L + + SV L S
Sbjct: 114 RLNGAVGVYLELLDDQPGAVLLVIEDGRFVQVGFKLDGRVVSSLSVKILDQSVKNDFLKS 173
Query: 414 D---ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ I + N F+GS++ +S+L+++ S + E + +
Sbjct: 174 EASCIVLLNNEQLFIGSKVSNSVLLEWKRQSEIA-------------EKLLSEPRVIFDE 220
Query: 471 SSDALQDMVNGEELSLYGSASN-NTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ L D+ GE+ + ++S F + D+L + GP+ D + G
Sbjct: 221 DREVLNDLY-GEDFDIVDTSSILQRNGVFGDIQFRLFDTLYSCGPIVDMTIG 271
>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1074
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/323 (24%), Positives = 143/323 (44%), Gaps = 44/323 (13%)
Query: 1112 AIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
+GTAY+ ++ +GR+L+F + LV E +E KGA+ +L +L G LL A
Sbjct: 782 CVGTAYILPYEIEPTKGRILIFLV---EERKLRLVAE---RETKGAVYSLNALTGKLLAA 835
Query: 1171 SGPKIILHKWTGT----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
KII++KW +L Y L + +FI++GD+ +S+ L +K +
Sbjct: 836 VNQKIIVYKWVRRDNRHQLQSECSYRGCVL-ALHTQTHGHFIVVGDMVRSVSLLRYKYEE 894
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA-PKMSESWKGQKLLSR 1285
+ ++ +DF + A L D + +D N+ + P +
Sbjct: 895 GLIEVVTRDFNTKWITAVAMLDDDIYIG---ADNCCNLFTLHSGRPGVV----------- 940
Query: 1286 AEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFALLFGTLDGSIGCIA--PLDEL 1342
E+H+G V + ++ +D G P ++FGT+ G+IG IA P D+
Sbjct: 941 GEYHLGDLVNRMHHGSLVMHHTDSEIGQIP--------TVIFGTISGAIGVIASFPYDQY 992
Query: 1343 TF-RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
F +LQS+ K + SV +++ + RSF +A + VD +L+ + L
Sbjct: 993 VFLEKLQSVLVKFIKSVGNLSHVEWRSFYNVSRTAEAR-----NFVDGDLIESFLSLSPS 1047
Query: 1402 EQLEIAHQTGTTRSQILSNLNDL 1424
+ E++ G ++ + +L
Sbjct: 1048 KMEEVSQVMGLRADELCKIVEEL 1070
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 135/605 (22%), Positives = 240/605 (39%), Gaps = 141/605 (23%)
Query: 57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
NL+V N +EIY++ Q L+L+ L+G + +L
Sbjct: 31 NLIVAKCNRMEIYLLTPQ-------------------------GLQLMVDVPLYGTIATL 65
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+ S +D + ++ E + VL +D L S + +++ GR +
Sbjct: 66 ELFRS----RSETQDFLFISMERYRCIVLHWDGRNSELITRSGG--DVSDFI----GRPT 115
Query: 177 FARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
G + +DPQ R G+ +Y GL +I F + G
Sbjct: 116 -DNGQIGVIDPQNRLIGLSLYDGLFKVI---------------PFDNKGNLK------EA 153
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
+N+R L V D F++G P +V+LH+ +H AL ++
Sbjct: 154 LNIR-LQEFLVLDIKFLYGCARPTVVVLHQ------DNKDSRHVKTYEVALEDKDFVEGS 206
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
WS NL + A+ L +P P+GGV+++G +TI Y S + AL++ Q +
Sbjct: 207 ---WSQSNLDNSAH--LLIPVPLGGVIIIGEHTIVYCSATTFKALSIK---------QSI 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
R+ V+ D + + N TG L L+ + ++ V L + + S I
Sbjct: 253 IRAVGRVDPDGSRYLYGDN--------TGALHLIVITHEWGRVTDLKTHYMGETSIASTI 304
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
+ + + L ++GSR GDS L++ +I+ADA + S + L
Sbjct: 305 SYLDSGLVYIGSRFGDSQLIKL------------------NIQADASA------SFVEIL 340
Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
+ +N + + + + + G KD S I A + I+
Sbjct: 341 EQFMNTGPIVDFCVVDTERRGQGQVITCS--------GAYKDGS----IRAVRNGVVITD 388
Query: 536 QSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
Q++ VEL G KG+W++ KSS D+ + + +E H +L +++E LE D+
Sbjct: 389 QAS---VELRGMKGLWSM--KSSLNDPYDTFLVVTFINETH-FLAMNMENE---LEEVDI 439
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGS 654
+E+ +T+A G+ ++IQV R R++ S D F P+
Sbjct: 440 KGFDSET-------QTLACGSAI-HNQLIQVTSRSVRLVSSVSLELLDQWFAPARFSVNV 491
Query: 655 GSENS 659
+ N+
Sbjct: 492 AAANA 496
>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
Length = 1282
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/343 (23%), Positives = 157/343 (45%), Gaps = 33/343 (9%)
Query: 1038 QEVGHQIDNHNLSSVDLHR-TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ + HQ + + + V H + + ++ VR+++ G ++T + ++ E ++
Sbjct: 907 RRIAHQPETNTFAVVVEHLWSKSSQDCFVRLVD----DGSFETLSQFQLEDQELTSSLTS 962
Query: 1097 VTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
T +T T +GT ++ ED +RGR+L+F D+ LV+E KE++G
Sbjct: 963 CTFAGDST----TYYVVGTGIALETEDEPSRGRILVFKVD---DDQLVLVSE---KEVRG 1012
Query: 1156 AISALASLQGHLLIASGPKIILHKWT--GTELNGIAFYDAPPLYVVSLNIVK--NFILLG 1211
A+ L + +G LL K+ L KWT E++ + + +V+ + ++IL+G
Sbjct: 1013 AVYNLNAFKGKLLAGINSKLELFKWTPREDEVHELVSECSHHGQIVTFAVKTRGDWILVG 1072
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
D+ KS+ L +K + ++ +A+DF + A L D T + ++ +F +
Sbjct: 1073 DLMKSMSLLLYKPEEGAIDEVARDFNANWMTAVAMLDDDETY----LGAENSLNLFTVSR 1128
Query: 1272 KMSE--SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
++ + +L E+H+G V F ++ + D + + LLFGT
Sbjct: 1129 NVNAVTDEERSRLEITGEYHLGELVNAFAPGSLVMSLRD-------GESLSVPTLLFGTA 1181
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
+G IG +A L + + + LQ + + V GL +R F
Sbjct: 1182 NGVIGVLASLPKDVYEFTERLQASINKHIQGVGGLKHADWRSF 1224
Score = 44.7 bits (104), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 101/206 (49%), Gaps = 24/206 (11%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
N+R L+ V+D F+HG +P + +L+ R++ A V K + + ++
Sbjct: 376 EAFNIR-LEELRVEDIQFLHGTAKPTIAVLY-RDMKEA--VHIKTYEIGVREKEFVSS-- 429
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
W+ +L + K++ VP+P+GGV+V+G TI Y ++++ ++ V L +
Sbjct: 430 ----PWAQNDLEGGSSKIIPVPAPVGGVVVLGEETIVYLNKTS------DDTDVFLKAIN 479
Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
RSS +D + +L D G L LL +V+DG+ V L + + + +
Sbjct: 480 IPERSSIVCYGAIDPDGSRYLLGD------HDGTLYLLVLVHDGKRVNELKIERLGETSI 533
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
S ++ + N + F+GS GDS L++
Sbjct: 534 PSTVSYLDNGVVFVGSAYGDSQLIKL 559
>gi|397627714|gb|EJK68584.1| hypothetical protein THAOC_10223, partial [Thalassiosira oceanica]
Length = 456
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 123/289 (42%), Gaps = 47/289 (16%)
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
G + AL QG LL+ G + L++ +L P V +L + +GD+
Sbjct: 189 GPVLALVHFQGRLLVGIGKSLRLYEMGKRQLLKKCELRGLPTMVKTLQAAGDRAFVGDMM 248
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+S+ F+ + +L L+A+D + E L+D +T++ V D+ N+ P+ +
Sbjct: 249 QSMQFVRYDATANRLVLVARDRSARPITCQE-LLDVNTVA--VGDKFGNVTTLRL-PRGA 304
Query: 1275 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
++ W KL + +HVG VT R ++A ++
Sbjct: 305 DTGAVDVSGTRALWDSSREDATPKLETLCTYHVGEVVTSLTRASLVAGGAE--------- 355
Query: 1318 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
+L++ T+ G IG + P +++ F SL+ + VP G +P+S+R F+
Sbjct: 356 -----SLIYVTVTGRIGALVPFTSREDVEF--YTSLESHVRSEVPRPTGRDPQSYRSFYC 408
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
K ++D +L Y LP E + IA Q + +++ L D
Sbjct: 409 PVK-------HVIDGDLCEAYGGLPYEARERIADQMERSTGEVMKKLED 450
>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
C-169]
Length = 1205
Score = 72.0 bits (175), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 118/495 (23%), Positives = 201/495 (40%), Gaps = 75/495 (15%)
Query: 964 NCNHGFIYVTSQGILKICQLPS-GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
C GF V S+ +L+I L G ++ Q L+ TP + E N+ + +
Sbjct: 747 QCPEGFCAV-SKSMLRILTLERLGEAFNQ----QVTRLRYTPRKFVVHPESNMLIVAEAD 801
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--------------VRIL 1068
PL + ++ E G ++D ++ EE + +R+L
Sbjct: 802 HAAVPLAERRAV----EDGMEMDAALTEGIEFDEERAAEEEQHGAPKNSTGRWASCIRVL 857
Query: 1069 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG----EDVA 1124
+P QT + + + +E A+++ ++ N E +LA+GT VQG A
Sbjct: 858 DPTS----LQTSSVLELDGNEAAVSLCLLRFSNW--PEEGMVLAVGT--VQGLAFYPRTA 909
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI-SALASLQGHLLIASGPKIILHKWTGT 1183
G + L+ R D+ + L E+ K G I ALA+ +G LL GP + +++
Sbjct: 910 DEGYIRLY---RFRDSGRQL--ELIHKTPTGGIPGALAAFKGRLLAGVGPTLRIYEAGKK 964
Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
+L + P ++ +L + I +GD+ +S+++ +K L A D A
Sbjct: 965 KLLRKCEHRKLPTHIATLATSGDRIFVGDLQESMHYFRYKANENALYEYADDIAPRHLTA 1024
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFY------YAPKMSESWKGQKLLSRAEFHVGA----- 1292
+D T V+ K IF + ++ E G K A GA
Sbjct: 1025 A-LPLDYDT----VAGADKFCNIFVTRLPRDVSTQVEEDPTGGKFAGAAGLLNGAPHKLE 1079
Query: 1293 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQS 1349
V F + L TS R PG R LL+ T+ G+IG + P +++ F
Sbjct: 1080 DVVNF-HVGDLVTSLQRAVLQPG----GREVLLYATVMGAIGAMLPFPSREDVDF--FSH 1132
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
L+ L P + G + S+R ++ P D ++D +L H+ LP +Q IA +
Sbjct: 1133 LEMHLRQEHPPMGGRDHMSYR------GSYFPVKD-VIDGDLCEHFSQLPAAKQKSIADE 1185
Query: 1410 TGTTRSQILSNLNDL 1424
T +IL L D+
Sbjct: 1186 LERTPGEILKKLEDI 1200
>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
Length = 1160
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/319 (23%), Positives = 131/319 (41%), Gaps = 30/319 (9%)
Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA V E+ + GR+++F N ++ KE+KGA L G LL +
Sbjct: 851 VGTALVHPEEAEPKQGRIVIFHFHEGKLN------QIAEKEIKGAAYTLVEFNGKLLASI 904
Query: 1172 GPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
+ L +WT + L F LY L +FIL+GD+ +SI L +K
Sbjct: 905 NSTVRLFEWTTDKELRLECNYFNSIVALY---LKTKGDFILVGDLMRSITLLLYKPMEGT 961
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
+A+D A E L D + L ++ N+ + Q L F
Sbjct: 962 FEEIARDCNPNWTTAVEILDDDNFLG---AENSFNLFTCQKDSASTTDEDRQNLQEVGMF 1018
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1348
H+G V F ++ S T + ++L+GT++G++G + + + + LQ
Sbjct: 1019 HLGEFVNVFRHGSLVMQHSGETSTP------TQGSVLYGTVNGAVGLVTQVPQEFYSFLQ 1072
Query: 1349 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY------EMLPLEE 1402
+Q +L + V + +R FH+ K + +D +L+ + +M +
Sbjct: 1073 DIQSRLAKVIKSVGKIEHSFWRSFHTERKTE--ACEGFIDGDLIESFLDLNRDKMQETVK 1130
Query: 1403 QLEIAHQTGTTRSQILSNL 1421
L+I +G R + +L
Sbjct: 1131 GLQIDDGSGMKREATVDDL 1149
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 98/219 (44%), Gaps = 35/219 (15%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW--------AGRVSWKH--HTCMISAL 286
N+R L+ V D F+HG P ++++H+ L +S+K H +
Sbjct: 156 NIR-LEELTVIDIQFLHGCTTPTLILIHQANLNCYHLMTLCITNLLSFKQDQHGRHVKTY 214
Query: 287 SISTTLKQ-HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
IS K+ W N+ +A L+AVP P GG L++G +I YH +A
Sbjct: 215 EISLRDKEFQKGPWKQDNVETEACMLIAVPEPFGGALIIGQESITYHKGDNFIPIA---- 270
Query: 346 AVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----V 398
+ +S+ + ++DA + +L D+ G L +L + + ++ V
Sbjct: 271 ------PPAIKQSTLTCYGKVDANGSRYLLGDMM------GRLFMLMLEKEEKMDSTVTV 318
Query: 399 QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ L + + + IT + N++ ++GSRLGDS LV+
Sbjct: 319 KDLKVELLGETTIAECITYLDNAVVYIGSRLGDSQLVKL 357
>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
Length = 836
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTVNSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
K+ + L +G ED +G+ +F N +N L+ ++ + K ++ A+ +
Sbjct: 519 KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572
Query: 1165 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1212
G L +ASG ++ L +W + I+ + PL V+ K ILL D
Sbjct: 573 GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 632
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++S+ L +K + L KD ++ C + F+I S++ D ++N+ + Y+
Sbjct: 633 FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+E ++ A F++G ++ KF RL G + ++ T++GS
Sbjct: 693 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+G I+ ++E ++ L+ + K+ H AG N +R G +D ++L
Sbjct: 735 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794
Query: 1393 SHYEMLPLEEQ 1403
+ +L E+Q
Sbjct: 795 KQFRLLNEEQQ 805
>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
Length = 1084
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/270 (24%), Positives = 118/270 (43%), Gaps = 16/270 (5%)
Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T +GTA V E+ + GR+++F R D +T+V KE+KGA L G L
Sbjct: 771 TYYVVGTAMVYPEEAEPKQGRIIVF---RFHDGK---LTQVAEKEIKGAAYTLTEFNGKL 824
Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
L + + L +WT + + + + L +FIL+GD+ +S+ LS+K
Sbjct: 825 LASINSTVRLFEWTAEKELRVECSYFNNIIALYLKTKGDFILVGDLMRSVTLLSYKPMEG 884
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
+A+D+ + + L D + L ++ NI + + Q L
Sbjct: 885 CFEEIARDYNPNWMTSIDVLDDDTFLG---AENSFNIFTCQKDSAATTDEERQHLQEVGL 941
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1347
+H+G V F R L +P + ++LFGT++G++G + L + + L
Sbjct: 942 YHLGEFVNVF-RHGSLVMQHPGECTSP-----TQGSVLFGTVNGALGLVTQLPQEFYLFL 995
Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+Q KL ++ V + +R FH+ K
Sbjct: 996 LEVQNKLAKTIKSVGKVEHAFWRSFHTERK 1025
>gi|357478323|ref|XP_003609447.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355510502|gb|AES91644.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 1225
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 165/379 (43%), Gaps = 51/379 (13%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+R+L+P R G T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 874 IRVLDP-RTG---NTTCLLELQENEAAFSICTV---NFHDKEYGTLLAVGTAKGLQFTPK 926
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
V R D+ ++L ++ +++G AL QG LL GP + L+
Sbjct: 927 RSLTVGFIHIYRFLDDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGKRR 985
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
L + P+ +VS++ ++ I +GDI +S ++ ++ QL + A D S+ + T
Sbjct: 986 LLRKCENKSFPISIVSIHAYRDRIYVGDIQESFHYCKYRRDENQLYIFADD--SVPRWLT 1043
Query: 1245 -EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL---LSRAEFH 1289
+ ID T++ +D+ NI +S+ W+ KL L++ E
Sbjct: 1044 ASYHIDFDTMA--GADKFGNIFFARLPQDVSDEVEEDPTSGKIKWEQGKLNGALNKVEEI 1101
Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR--- 1346
V HV + TS + PG + +++GT+ +GC+ L T R
Sbjct: 1102 VQFHVGDVI------TSLQKAALVPGGGE----CIVYGTV---MGCVGALHAFTSRDDVD 1148
Query: 1347 -LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
L+ + P + G + ++R A+ P D ++D +L + LP++ Q +
Sbjct: 1149 FFSHLEMHMRQDNPPLCGRDHMAYR------SAYFPVKD-VLDGDLCEQFPTLPMDLQRK 1201
Query: 1406 IAHQTGTTRSQILSNLNDL 1424
IA + T +IL L +L
Sbjct: 1202 IADELDRTPGEILKKLEEL 1220
>gi|351699158|gb|EHB02077.1| DNA damage-binding protein 1 [Heterocephalus glaber]
Length = 1144
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 133/306 (43%), Gaps = 42/306 (13%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKEL-KGAISALAS 1162
K+ T +GTA V E+ + GR+++F + +D + EV S+ L KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDEER----EVSSRGLVKGAVYSMVE 875
Query: 1163 LQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1217
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 876 FNGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSV 930
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 931 LLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDE 987
Query: 1278 KGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
+ Q L FH+G V F L +Q L +S T ++LFGT++G I
Sbjct: 988 ERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGMI 1037
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVA----GLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
G + L E + L +Q +L + V L P FH+ K + +D
Sbjct: 1038 GLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSRAVSFHTERKTEQ--ATGFIDG 1095
Query: 1390 ELLSHY 1395
+L+ +
Sbjct: 1096 DLIESF 1101
>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
Length = 1158
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 128/297 (43%), Gaps = 17/297 (5%)
Query: 1112 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
+GTA V E+ + GR++LF + AD +T V KE+KGA +L +L A
Sbjct: 847 VVGTALVVPEESEPKQGRIVLF---QWADGK---LTTVAEKEVKGACYSLVDFNSKILAA 900
Query: 1171 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
+ L++WT + + + + + L +FIL+GD+ +SI L +K
Sbjct: 901 INNVVRLYEWTAEKELRLECSNFNHIIALYLKRKGDFILVGDLMRSITLLQYKTMEGSFE 960
Query: 1231 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1290
+A+D A E L D + L ++ N+ + + + Q+L FH+
Sbjct: 961 EMARDSNPNWMSAVEILDDDTFLG---AENSFNLFVCQKDSAATTEEERQQLTEVGRFHL 1017
Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1350
G V F ++ + T P + +LFGT+ G+IG + L + L +
Sbjct: 1018 GDMVNVFRHGSLVMDHAAETLTTP-----TQGCVLFGTVHGAIGVVTQLPSEFYHFLSEV 1072
Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
Q ++ + V + +R F + K + +D +L+ + L ++ E+A
Sbjct: 1073 QTRMARVIKPVGKIEHSFWRSFATERKVEP--CEGFIDGDLIESFLDLSSDKMKEVA 1127
Score = 56.6 bits (135), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 89/395 (22%), Positives = 149/395 (37%), Gaps = 83/395 (21%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-WSAMNL 304
++D F++G P +VI+H+ H + IS K+ W N+
Sbjct: 164 IQDIAFLYGCANPTVVIIHQ-----------DAHGRHVKTREISLRDKEFAKTSWKQDNV 212
Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
+A LL VP P GG L++G +I YH+ NY + + ++
Sbjct: 213 ETEAAMLLPVPEPYGGALIIGQESITYHNG--------QNYVTIAPPIIKQSTVTCYGKV 264
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDITTIG 419
D + +L D+A G L +L + DG V V+ + + + +T +
Sbjct: 265 DPNGSRYLLGDLA------GHLFMLVLEKEEKMDGTVTVRDIKIELLGEVSIPECLTYLD 318
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
N + F+GSR GDS LV+ + + E F ++ AP + DM
Sbjct: 319 NGVVFIGSRFGDSQLVKLNVTPDDNNSYVTVMETFTNL---AP------------IVDMT 363
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
+ T S A ++ G L+ G+ I+ AS
Sbjct: 364 -------IVDLDRQGQGQLVTCSGAYKE-----GSLRIIRNGIGIHEQAS---------- 401
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
++LPG KGIW + SS + D + +++S +T VL E
Sbjct: 402 --IDLPGIKGIWALKMGSSGNPSVDDT------------VVLSFVGQTRVLMLNGEEMEE 447
Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
TE +T GN+ G+ V+Q+ R++
Sbjct: 448 TEIPGLTADQQTFFCGNV-GKDSVLQITTGSVRLI 481
>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
histolytica HM-1:IMSS]
gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica HM-1:IMSS]
Length = 1150
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/315 (21%), Positives = 137/315 (43%), Gaps = 36/315 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
K+ + L +G ED +G+ +F N +N L+ ++ + K ++ A+ +
Sbjct: 833 KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 886
Query: 1165 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1212
G L +ASG ++ L +W + I+ + PL V+ K ILL D
Sbjct: 887 GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 946
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++S+ L +K + L KD ++ C + F+I S++ D ++N+ + Y+
Sbjct: 947 FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 1006
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+E ++ A F++G ++ KF RL G + ++ T++GS
Sbjct: 1007 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 1048
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+G I+ ++E ++ L+ + K+ H AG N +R G +D ++L
Sbjct: 1049 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 1108
Query: 1393 SHYEMLPLEEQLEIA 1407
+ +L E+Q +
Sbjct: 1109 KQFRLLNEEQQKRVC 1123
>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
Length = 1088
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 161/351 (45%), Gaps = 48/351 (13%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1141
++ +E+AL++ + + ENE + IGTA+ + +V + GR+L+ GR
Sbjct: 756 LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804
Query: 1142 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1195
+ ++ K++ GA+ ++ +L + +L ++ K+++ ++ NG + +
Sbjct: 805 ---LEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861
Query: 1196 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1247
LYV +L N IL+GD+ KSI S+ G N L ++DF + A EF+
Sbjct: 862 VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
+ LS SD NI IF +ES + +L + A HVG + + + T S
Sbjct: 919 DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1366
+ + +LFG + G IG I + + L +Q +++ + + P
Sbjct: 975 TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTP 1027
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
++++ + K R +I+D ++ Y + E+Q EIAH +G QI
Sbjct: 1028 DNWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQI 1076
>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba histolytica KU27]
Length = 836
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)
Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
++L F++AK+SVL +D++ + I S+HCFE P LKR +E P + +D +
Sbjct: 74 LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129
Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
GRC ++ Y M ++ +G + T S+ INL + + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168
Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
I + GY P + LH + TW GR+ T +I +L ++ ++ +
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
+P++A +++ P G+L++ N+I Y S ++ S L + V + + P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
F + T + N V + T L ++ V+ + R+ D+ TN P
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
TI ++ F+GS + DS + +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 136/311 (43%), Gaps = 36/311 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
K+ + L +G ED +G+ +F N +N L+ ++ + K ++ A+ +
Sbjct: 519 KQLKNYLVVGVNKQTTEDNPVKGKTYIF----NIENQIQLINKI--GDGKKSVHAVNEIG 572
Query: 1165 GHLLIASGPKIIL------HKWTGTELNGIAFY----DAPPLYVVSLNIVKN--FILLGD 1212
G L +ASG ++ L +W + I+ + PL V+ K ILL D
Sbjct: 573 GFLAVASGNELELIERVDETRWIKKCFSDISILINSIEYLPLKVMERGNEKECYLILLSD 632
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
++S+ L +K + L KD ++ C + F+I S++ D ++N+ + Y+
Sbjct: 633 FYRSVVLLLFKPYDYTVIPLGKDARNIHCIDSTFIITKDYFSVLEFDSEQNLSLLNYSSA 692
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+E ++ A F++G ++ KF RL G + ++ T++GS
Sbjct: 693 ATEQLSIFEI--DATFNLGMNLLKFTRLW--------NGKG--------YIYMYVTVEGS 734
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+G I+ ++E ++ L+ + K+ H AG N +R G +D ++L
Sbjct: 735 VGYISVVEEKIYQVLRQINIKMNREPWHFAGTNAEEYRFEKGYGMGFGTRKHVFLDGDML 794
Query: 1393 SHYEMLPLEEQ 1403
+ +L E+Q
Sbjct: 795 KQFRLLNEEQQ 805
>gi|323447810|gb|EGB03719.1| hypothetical protein AURANDRAFT_72671 [Aureococcus anophagefferens]
Length = 760
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 168/403 (41%), Gaps = 65/403 (16%)
Query: 1019 IVSVPVL-KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
+VSVP+ +PL + H + +H + + + +R L R P+
Sbjct: 374 VVSVPLAEQPLC----------ICHDLQSHLFAVCTIDHREGDNQGVIRFL---RDEAPY 420
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTT-KENETLLAIGTAYVQGED--VAARGRVLLFST 1134
++ E L +++L + +T K+ +GTA+ E+ GR+++F +
Sbjct: 421 NDVHREALEPLEIPLCCSIISLDSISTYKDQRAHFVVGTAFAAQENDFEPCSGRMIIFRS 480
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-------ELNG 1187
G+ P L + E GA+ +A+++ LL+ + I H + L
Sbjct: 481 GQANVAPSVL----FFVEANGAVYDVAAMRASLLVCAVNHAI-HIYDPVVRDNRRGHLKP 535
Query: 1188 IAFYDAPPLYVVSLNI--VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
A YD VV+L + N I++GD+ +S+ L+ Q + +A D+ + A E
Sbjct: 536 RASYDG---LVVALKVQCYGNLIVVGDMMRSVTLLNLIRQKMIIVEVACDYNTNWVCALE 592
Query: 1246 FLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
+ DGS + S ++ Y S KG L SRA+ H+G VT F R ++
Sbjct: 593 VIGDGSFIIADASGSLVALESLY-----GNSDKGYFLESRAKMHLGDVVTCFARGSIMTQ 647
Query: 1306 SSDRTGAAPGSDKTNRFA--LLFGTL-----------------DGSIGCIAPLDELTFRR 1346
R + K ++ A L+FG + G+IGCI +D++T
Sbjct: 648 DDWR------NPKVSKVATPLIFGCVTSSRVLVLTPLIARYQVSGAIGCIVSIDDVTHSL 701
Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVD 1388
L+ L K L++ V + +F+ H+N P D +D
Sbjct: 702 LERLSKVLLEFHSGVGDFDHETFQALHNNVATCNAAPMDDFID 744
>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
HM-1:IMSS]
Length = 1088
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/351 (24%), Positives = 160/351 (45%), Gaps = 48/351 (13%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1141
++ +E+AL++ + + ENE + IGTA+ + +V + GR+L+ GR
Sbjct: 756 LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804
Query: 1142 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1195
+ ++ K++ GA+ ++ +L + +L ++ K+++ ++ NG + +
Sbjct: 805 ---LEIIFEKDVNGAVYSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861
Query: 1196 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1247
LYV +L N IL+GD+ KSI S+ G N L ++DF + A EF+
Sbjct: 862 VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
+ LS SD NI IF +ES + +L + A HVG + + + T S
Sbjct: 919 DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1366
+ + +LFG + G IG I + + L +Q +++ + + P
Sbjct: 975 TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDVLIKVQNQILLQMKGIVECTTP 1027
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+++ + K R +I+D ++ Y + E+Q EIAH +G QI
Sbjct: 1028 DDWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEQI 1076
>gi|224100909|ref|XP_002312063.1| predicted protein [Populus trichocarpa]
gi|222851883|gb|EEE89430.1| predicted protein [Populus trichocarpa]
Length = 1213
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 167/384 (43%), Gaps = 61/384 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+R+L+P A T + +Q +E A +V V N KE+ TLLA+GTA +G
Sbjct: 862 IRVLDPRSA----TTTCLLELQDNEAAFSVCTV---NFHDKEHGTLLAVGTA--KGLQFW 912
Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
+ ++ + D+ ++L ++ +++G AL QG LL G + L+
Sbjct: 913 PKRSLIAGFIHIYKFVDDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGSVLRLYDLGK 971
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
L P +VS++ ++ I +GDI +S +F ++ QL + A D S+ +
Sbjct: 972 KRLLRKCENKLFPNSIVSIHTYRDRIYVGDIQESFHFCKYRRDENQLYIFADD--SVPRW 1029
Query: 1243 AT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR----- 1285
T + +D T++ +D+ NI +S+ W+ KL
Sbjct: 1030 LTASYHVDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVE 1087
Query: 1286 --AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
+FH+G V + ++ PG + +++GT+ GS+G + P D
Sbjct: 1088 EIVQFHIGDVVNSLQKASLI----------PGGGE----CIMYGTVMGSVGALLPFTSRD 1133
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
++ F L+ L P + G + ++R A+ P D ++D +L + LPL
Sbjct: 1134 DVDF--FSHLEMHLRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFPTLPL 1184
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDL 1424
+ Q +IA + T +IL L ++
Sbjct: 1185 DAQRKIADELDRTPGEILKKLEEV 1208
>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
Length = 1088
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 160/351 (45%), Gaps = 48/351 (13%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLL--FSTGRNADNP 1141
++ +E+AL++ + + ENE + IGTA+ + +V + GR+L+ GR
Sbjct: 756 LKENEHALSIEQIVI-----DENE-MFVIGTAFAKPNEVEPSSGRILIVQIKDGR----- 804
Query: 1142 QNLVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP-- 1195
+ V+ K++ GA+ ++ +L + +L I+ K+++ ++ NG + +
Sbjct: 805 ---LEIVFEKDVNGAVYSMKTLLKKYLAISIEKKLVVFEYQRVITNGEFEVKLQEKGSCN 861
Query: 1196 -----LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL---AKDFGSLDCFATEFL 1247
LYV +L N IL+GD+ KSI S+ G N L ++DF + A EF+
Sbjct: 862 VKLIGLYVKTLG---NKILVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFV 918
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSS 1307
+ LS SD NI IF +ES + +L + A HVG + + + T S
Sbjct: 919 DEDCYLS---SDSNSNILIFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHS 974
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-P 1366
+ + +LFG + G IG I + + L +Q +++ + + P
Sbjct: 975 TY-------ETVQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTP 1027
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+++ + K R +I+D ++ Y + E+Q EIAH +G +I
Sbjct: 1028 DDWKKVIDDWK--RMPSSNIIDGSIVESYLEMSKEKQCEIAHLSGVNEEKI 1076
>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
Length = 270
Score = 70.9 bits (172), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 69/253 (27%), Positives = 113/253 (44%), Gaps = 24/253 (9%)
Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP--PLYVVSLNI 1203
KE KGA+ +L + G LL A KI L+KW GT EL + LYV +
Sbjct: 1 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG- 59
Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
+FI++GD+ KSI L +K + + A+D+ + A E L D L ++ N
Sbjct: 60 --DFIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFN 114
Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRF 1322
+ + + + +L E+H+G V +F ++ D G P
Sbjct: 115 LLTVKKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIP-------- 166
Query: 1323 ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
++FGT++G IG IA L + + L+ LQ L + V GL+ +R F N +
Sbjct: 167 TVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSF--NNEKRTAE 224
Query: 1383 PDSIVDCELLSHY 1395
+ +D +L+ +
Sbjct: 225 ARNFLDGDLIESF 237
>gi|397615212|gb|EJK63291.1| hypothetical protein THAOC_16062, partial [Thalassiosira oceanica]
Length = 322
Score = 70.9 bits (172), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 123/289 (42%), Gaps = 47/289 (16%)
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
G + AL QG LL+ G + L++ +L P V +L + +GD+
Sbjct: 55 GPVLALVHFQGRLLVGIGKSLRLYEMGKRQLLKKCELRGLPTMVKTLQAAGDRAFVGDMM 114
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+S+ F+ + +L L+A+D + E L+D +T++ V D+ N+ P+ +
Sbjct: 115 QSMQFVRYDATANRLVLVARDRSARPITCQE-LLDVNTVA--VGDKFGNVTTLRL-PRGA 170
Query: 1275 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
++ W KL + +HVG VT R ++A ++
Sbjct: 171 DTGAVDVSGTRALWDSSREDATPKLETLCTYHVGEVVTSLTRASLVAGGAE--------- 221
Query: 1318 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
+L++ T+ G IG + P +++ F SL+ + VP G +P+S+R F+
Sbjct: 222 -----SLIYVTVTGRIGALVPFTSREDVEF--YTSLESHVRSEVPRPTGRDPQSYRSFYC 274
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
K ++D +L Y LP E + IA Q + +++ L D
Sbjct: 275 PVK-------HVIDGDLCEAYGGLPYEARERIADQMERSTGEVMKKLED 316
>gi|385304555|gb|EIF48567.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
[Dekkera bruxellensis AWRI1499]
Length = 353
Score = 70.5 bits (171), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 129/293 (44%), Gaps = 39/293 (13%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERE-LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
+K++ D+ F++ Y EP + IL+ E L+WAG + + LS++ + I
Sbjct: 69 VKNIMDYQFLYSYREPTIAILYAPEGLSWAGYLXKLKDNMKVVVLSLNLDTHKADSIMVL 128
Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSF 360
NLP+D + +PSPI G L++G+N I H +S + + N Y + S
Sbjct: 129 PNLPYDLNSIYPLPSPINGFLLIGSNEILHVNSLGSIKGVYTNKYFPETSDMKLRDESDL 188
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG------RVVQRLDLSKTNPSVLTSD 414
++E + +++ +D LL ++ G +L+ G ++++ + + N SV ++
Sbjct: 189 NLECEGCSVSFVGDDQVLLISQIGKFYVLSFNESGGISNLNKIIEIPEANYCNVSV--NN 246
Query: 415 ITTIGN----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
+ I N + FL + DS+L+ + + P+ + +S
Sbjct: 247 VLQITNIEDCNSAFLCCQGSDSILLHWN--------------------YNVPTRGTVSKS 286
Query: 471 SSDALQDMVNGEELSLY--GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
++ ++ E+ LY S + + +F D LVN GP DF+ G
Sbjct: 287 NAGIEKE---DEDSWLYHEDETSQTSNRPLTSCTFTXIDKLVNCGPTSDFTIG 336
>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
Length = 1129
Score = 70.1 bits (170), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 65/275 (23%), Positives = 127/275 (46%), Gaps = 23/275 (8%)
Query: 1112 AIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
A+GTA V ++ ++ GR+L+F +++ + + V+ KE+KGA ++ L G L++A
Sbjct: 815 AVGTAVVLTDETESKSGRLLIFQVAPSSEGGRMRL--VHDKEIKGAAYSIQVLMGKLVVA 872
Query: 1171 SGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
+ L +WT + L F + LY+ + N + +L+GD+ +S+ L++K +
Sbjct: 873 INSCVRLFEWTAEKELRLECSDFDNVTALYLRTKN---DVVLVGDLMRSLSVLAYKPMES 929
Query: 1228 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR-- 1285
+A+DF + A E +ID T + +F S +G +L +
Sbjct: 930 SFEKIARDFVTNWMTACE-IIDMETF----LGAEIMFNLFTVVKDCSSKDEGIRLQLQET 984
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1345
+++G V F ++AT D T + +L+GT DG +G I L +
Sbjct: 985 GMYYLGESVNAFCHGSLIATHIDLT-------PSFTTPILYGTSDGGLGVIVQLTPQFYD 1037
Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
+ L+ ++ + + +R F S+G+ +
Sbjct: 1038 FVHELETRIAAVTKNCMRIEHGQYRTFESDGRTEQ 1072
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 12/144 (8%)
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
P +W N+ +A ++ +P P GGV+VVG I YH + N Y+
Sbjct: 200 PPLWKQDNIEAEACMVIPIPQPYGGVIVVGHEAISYHKDA-------NAYSAIAPPLIHQ 252
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVL-LTVVYDGRV-VQRLDLSKTNPSVLTS 413
+ S ++D +L D LS + L+L L V DG V+ L + + +
Sbjct: 253 SQISCYGKIDRDGQRYLLGD---LSGRIFMLLLDLDVATDGTASVKDLKVELLGETSIPE 309
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
+ + N + F+GSR GDS LV+
Sbjct: 310 CVVYLDNGVVFIGSRFGDSQLVRL 333
>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
Length = 1157
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 135/316 (42%), Gaps = 31/316 (9%)
Query: 1102 TTTKENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAIS 1158
T + + T +GTAYV E+ + GR+LLF S G+ + +V KE+KGA+
Sbjct: 833 TLSDDPHTYYCVGTAYVFPEEPEPKAGRLLLFHLSEGK--------LVQVAEKEVKGAVY 884
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTE--LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
+L G +L + + +WT + ++YD + + L +FIL+GD+ +S
Sbjct: 885 SLVEFNGKVLAGINSTVSIFEWTADKEFRYECSYYDN--ILALYLKTKGDFILVGDLMRS 942
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L + +A DF A E L D + L ++ N+ +
Sbjct: 943 MTLLVYLPLEGSFQEIAHDFSPKWMTAIEILDDDTFLG---AENSYNLFTCTKDSGATTD 999
Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF--ALLFGTLDGSIG 1334
+ L ++H+G V F ++ PG D + F +LFGT++G IG
Sbjct: 1000 EERYHLQDAGQYHLGEFVNVFRHGSLVMEH-------PG-DASTPFQGCVLFGTVNGRIG 1051
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD---SIVDCEL 1391
+A + + F L +QKKL + V ++ + H + +H + +D +L
Sbjct: 1052 IVAQIAQDLFNFLIQVQKKLNKVIKSVGKIDHSLYPFPHCSNLSHSRKMEPAHGFIDGDL 1111
Query: 1392 LSHYEMLPLEEQLEIA 1407
+ + LP E+
Sbjct: 1112 IESFLDLPRARMEEVV 1127
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 95/212 (44%), Gaps = 40/212 (18%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
N+R L+ HV D F++G P +V +++ H + I+ L+ H
Sbjct: 155 NIR-LEELHVVDIQFLYGCANPTIVFIYQDP-----------HGRHVKTYEIN--LRDHE 200
Query: 297 LI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
W N+ +A +++AVP+P+GG L++G +I YH S A+A
Sbjct: 201 FAKGPWKQDNVEVEACRVIAVPNPLGGALIIGQESITYHKGSNYHAIA----------PP 250
Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKT 406
L +SS + ++D + +L D+ G L +L + + DG V+ L L
Sbjct: 251 ALKQSSLTCHGKIDTNGSRYLLGDM------NGRLYMLLLERQELIDGTYEVKDLKLEML 304
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+ + + + N + F+GS LGDS L + +
Sbjct: 305 GETSIAHCLVYLDNGVVFIGSMLGDSQLAKLS 336
>gi|145348011|ref|XP_001418451.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578680|gb|ABO96744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1196
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 88/377 (23%), Positives = 158/377 (41%), Gaps = 49/377 (12%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
VRI++P A ++ + + SE AL++ V L T NE LLA+GTA A
Sbjct: 847 VRIVDPKEA----KSTFVLELHKSEAALSLCHVFL----TGPNELLLAVGTAV--NLTFA 896
Query: 1125 AR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
R G + L+ G N + V+S G + AL +GHLL + ++ +
Sbjct: 897 PRNCDGGFIHLYRYG----NDGRTLNLVHSTPTDGPVGALCGYKGHLLAGVNNSLRIYDY 952
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
+L P ++ +L+ + I +GD+ +SI+++ +K + + A D
Sbjct: 953 GKKKLLRKVENRNFPNFITTLHAAGDRIYVGDVQESIHYVKYKADEGSIYIFADDTKPRY 1012
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----------GQKLLSRAEFHV 1290
AT +D TL+ +D+ NI + +SE Q +L+ A
Sbjct: 1013 ITAT-LPLDYDTLA--GADKFGNIFVNRLPKDVSEDMDDDPTGGKNIYSQGVLNGAPNKS 1069
Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRL 1347
++ + A + + PG + +++GT G IGC+ P E+ F
Sbjct: 1070 ETSAQTYIGETVCALT--KGALQPGGIEI----IMYGTFMGGIGCLLPFSSRSEIEF--F 1121
Query: 1348 QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
L+ + P + G + +FR +++ K +++D +L + LP + Q IA
Sbjct: 1122 THLEMHMRQEAPSIVGRDHMAFRSYYAPVK-------NVIDGDLCEQFGALPADVQRRIA 1174
Query: 1408 HQTGTTRSQILSNLNDL 1424
+ T +IL L +
Sbjct: 1175 EEMDRTPGEILKKLEQV 1191
>gi|312072035|ref|XP_003138882.1| hypothetical protein LOAG_03297 [Loa loa]
gi|307765956|gb|EFO25190.1| hypothetical protein LOAG_03297 [Loa loa]
Length = 1197
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 81/364 (22%), Positives = 148/364 (40%), Gaps = 32/364 (8%)
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
+T + P E A + +V F + L+ G A G + F N
Sbjct: 855 ETLSHFPFAEDEAAFAIAMVQ-FQNQSDTQFVLVGCGCELQLKPRKANGGCIYTFLLAAN 913
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
Q L+ + E+ ++A+ +G L G K+ L+ +L P
Sbjct: 914 GTTLQ-LLHRTATDEV---VNAIHDFRGMALAGVGKKVRLYDLGKRKLLAKCENRQIPTQ 969
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
VV + + I++ D +S++F+ +K+Q QL++ D S L+D T++ V
Sbjct: 970 VVDIRSMGQRIVVSDSQESVHFMRYKKQDGQLSIFC-DETSPRYVTCVCLLDYDTVA--V 1026
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSSDR 1309
D NI + ++E + RA + G +L+ +A TS +
Sbjct: 1027 GDRFGNIAVLRLPKGVTEEVQEDPTGVRALWDRGNLNGASQKLEAIAHLYIGDAITSMQK 1086
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
T PG++ L + T+ G IG + P DE F Q+L+ + P + G +
Sbjct: 1087 TSLVPGAND----CLCYTTISGIIGILVPFMSRDEFEF--FQNLEMHMRVEYPPLCGRDH 1140
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
++R ++ K SI+D +L Y ++PL++Q + + G ++I L D+
Sbjct: 1141 LAYRSYYFPVK-------SIIDGDLCEQYSLMPLDKQKSVGEELGRKSTEIHKKLEDIRT 1193
Query: 1427 GTSF 1430
+F
Sbjct: 1194 RYAF 1197
>gi|168064351|ref|XP_001784126.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664326|gb|EDQ51050.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1214
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 167/385 (43%), Gaps = 62/385 (16%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1121
+R+L+P + T + +Q +E A ++ V + KE TL+A+GTA +
Sbjct: 862 IRVLDPKTS----TTTCLLELQENEAAFSLCAVNFHDN--KELGTLIAVGTAKDLQFMPK 915
Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
A+ G + ++ R AD + ++ V+ + G +AL QG LL+ G + ++
Sbjct: 916 KEASGGFIHIY---RFADEGK-VLELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIYDLG 971
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
+L P +++++ + I +GDI +S +++ ++ QL A D S
Sbjct: 972 KRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--SCPR 1029
Query: 1242 FATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKG-------QKL 1282
+ T L ID T++ +D+ NI + +SE W+ K+
Sbjct: 1030 WLTASLHIDFDTMA--GADKFGNIYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKV 1087
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
+FHVG VT + ++ PG ++ +L+GT+ GS+G + P
Sbjct: 1088 EEIIQFHVGEVVTSLQKASLI----------PGGGES----VLYGTIMGSMGALLPFSSR 1133
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
+++ F L+ L P + G + FR A+ P D ++D +L Y ML
Sbjct: 1134 EDVDF--FSHLEMHLRQENPPLCGRDHMGFR------SAYFPVKD-VIDGDLCEQYPMLT 1184
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
E Q +IA T +IL L D+
Sbjct: 1185 SELQKKIADDLDRTPGEILKKLEDI 1209
>gi|449283451|gb|EMC90093.1| DNA damage-binding protein 1 [Columba livia]
Length = 1140
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 147/346 (42%), Gaps = 57/346 (16%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 812 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 863
Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
G LL + + L++WT TE N + + LY L +FIL+GD+ +S
Sbjct: 864 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRS 918
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 919 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 975
Query: 1277 WKGQKLLSRAEFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ Q L H+G V F L +Q L +S T ++LFGT++G
Sbjct: 976 EERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGETSTPTQG----------SVLFGTVNGM 1025
Query: 1333 IGCIAPLDELTFRRLQSLQKKL---VDSV--------PHVAGLNPRSFRQFHSNGKAHRP 1381
IG + L E + L +Q +L + SV P + L + + FH+ K P
Sbjct: 1026 IGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSLYPSLVQLRAWASQSFHTERKT-EP 1084
Query: 1382 GPDSIVDCELLSHY------EMLPLEEQLEIAHQTGTTRSQILSNL 1421
+D +L+ + +M + L+I +G R + +L
Sbjct: 1085 AT-GFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATVDDL 1129
>gi|224109600|ref|XP_002315251.1| predicted protein [Populus trichocarpa]
gi|222864291|gb|EEF01422.1| predicted protein [Populus trichocarpa]
Length = 1213
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 92/385 (23%), Positives = 167/385 (43%), Gaps = 63/385 (16%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+R+L+P A T + +Q +E A ++ V N KE+ TLLA+GTA +G
Sbjct: 862 IRVLDPRSAA----TTCLLELQDNEAAFSLCTV---NFHDKEHGTLLAVGTA--KGLQFW 912
Query: 1125 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
+ ++ + D+ ++L ++ +++G AL QG LL G + L+
Sbjct: 913 PKRSLVTGFIHIYKFVDDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGSVLRLYDLGK 971
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD------- 1235
L P +VS++ ++ I +GDI +S +F ++ QL + A D
Sbjct: 972 KRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLT 1031
Query: 1236 ------FGSL---DCFATEFLIDGSTLSLVVSDEQKNI----QIFYYAPKMSESWKGQKL 1282
F S+ D F + + L VSDE + +I + K++ + K+
Sbjct: 1032 SSYHVDFDSMAGADKFGNIYF---ARLPQDVSDEIEEDPTGGKIKWEQGKLNGA--PNKV 1086
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
+FH+G V + ++ PG + +++GT+ GS+G + P
Sbjct: 1087 EEIVQFHIGDVVNSLQKASLI----------PGGGE----CIIYGTVMGSVGALLPFTSR 1132
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
D++ F L+ L P + G + S+R A+ P D ++D +L + LP
Sbjct: 1133 DDVDF--FSHLEMHLRQDHPPLCGRDHMSYR------SAYFPVKD-VIDGDLCEQFPTLP 1183
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
L+ Q +IA + T +IL L ++
Sbjct: 1184 LDAQRKIADELDRTPGEILKKLEEV 1208
>gi|224004656|ref|XP_002295979.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
associated protein [Thalassiosira pseudonana CCMP1335]
gi|209586011|gb|ACI64696.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
associated protein [Thalassiosira pseudonana CCMP1335]
Length = 1212
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 122/289 (42%), Gaps = 47/289 (16%)
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
G + +L QG LL+ G + L++ +L P V +L + +GD+
Sbjct: 945 GPVLSLVHFQGRLLVGVGKTVRLYEMGKRQLLKKCELRGMPTMVKTLQAAGDRAFVGDMM 1004
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+S+ F+ + +L L+AKD E L+D +T++ V D+ N+ I P+ +
Sbjct: 1005 QSMQFIRYDSTANRLVLVAKDRNPRPITCQE-LLDINTVA--VGDKFGNVTILRL-PRGA 1060
Query: 1275 ES-----------WKGQ------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
++ W KL + +HVG VT R ++A ++
Sbjct: 1061 DAGAIDVTGTRALWDSARDDATPKLETLCTYHVGEVVTSMTRASLVAGGAE--------- 1111
Query: 1318 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
+L++ T+ G +G P D++ F SL+ L P G +P+S+R +++
Sbjct: 1112 -----SLIYVTVTGRVGAFVPFTSRDDVEF--YTSLEGFLRTETPRPTGRDPQSYRSYYA 1164
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
K IVD +L + LP E + +IA + +++ L D
Sbjct: 1165 PMK-------HIVDGDLCDAFAQLPYETKQKIAESLDRSVGEVMKKLED 1206
>gi|356576847|ref|XP_003556541.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
Length = 1214
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/389 (23%), Positives = 168/389 (43%), Gaps = 71/389 (18%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1120
+R+L+P R G T + +Q +E A ++ + N KE TLLA+GTA ++
Sbjct: 863 IRVLDP-RTG---NTTCLLELQENEAAFSICTI---NFHDKEYGTLLAVGTAKGLQFLPK 915
Query: 1121 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
+ A G + ++ R ++ ++L ++ +++G AL QG LL GP + L+
Sbjct: 916 RTITA-GFIHIY---RFVEDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDL 970
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
L P +VS++ ++ I +GD+ +S ++ ++ QL + A D
Sbjct: 971 GKRRLLRKCENKLFPNTIVSIHAYRDRIYVGDVQESFHYCKYRRDENQLYIFADD----- 1025
Query: 1241 C----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR 1285
C + ID T++ +D+ NI +S+ W+ KL
Sbjct: 1026 CVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPTGGRIKWEQGKLNGA 1083
Query: 1286 -------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
+FH+G VT + ++ PG + ++FGT+ GS+G +
Sbjct: 1084 PNKVEEIVQFHIGDVVTCLQKASLI----------PGGGE----CIVFGTVMGSVGALHA 1129
Query: 1339 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
D++ F L+ + P + G + ++R A+ P D ++D +L Y
Sbjct: 1130 FTSRDDVDF--FSHLEMHMRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQY 1180
Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
LP++ Q +IA + T +IL L ++
Sbjct: 1181 PTLPMDLQRKIADELDRTPGEILKKLEEV 1209
>gi|356536504|ref|XP_003536777.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
Length = 1214
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/389 (23%), Positives = 167/389 (42%), Gaps = 71/389 (18%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA----YVQG 1120
+R+L+P + T + +Q +E A ++ V N KE TLLA+GTA ++
Sbjct: 863 IRVLDPRTS----NTTCLLELQENEAAFSICTV---NFHDKEYGTLLAVGTAKGLQFLPK 915
Query: 1121 EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
V A G + ++ R ++ ++L ++ +++G AL QG LL GP + L+
Sbjct: 916 RTVTA-GFIHIY---RFVEDGRSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDL 970
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
L P ++S++ ++ I +GD+ +S ++ ++ QL + A D
Sbjct: 971 GKKRLLRKCENKLFPNTIISIHAYRDRIYVGDVQESFHYCKYRRDENQLYIFADD----- 1025
Query: 1241 C----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR 1285
C + ID T++ +D+ NI +S+ W+ KL
Sbjct: 1026 CVPRWLTASYHIDFDTMA--GTDKFGNIYFVRLPQDVSDEIEEDPTGGRIKWEQGKLNGA 1083
Query: 1286 -------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
+FHVG VT + ++ PG + ++FGT+ GS+G +
Sbjct: 1084 PNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIVFGTVMGSVGALHA 1129
Query: 1339 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
D++ F L+ + P + G + ++R A+ P D ++D +L Y
Sbjct: 1130 FTSRDDVDF--FSHLEMHMRQDHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQY 1180
Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
LP++ Q +IA + T +IL L ++
Sbjct: 1181 PTLPMDLQRKIADELDRTPGEILKKLEEV 1209
>gi|340520436|gb|EGR50672.1| predicted protein [Trichoderma reesei QM6a]
Length = 1212
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 104/461 (22%), Positives = 200/461 (43%), Gaps = 67/461 (14%)
Query: 998 IPLKATPHQITYFAEKNLY-----------PLIVSVPVLKP--LNQVLSLLIDQEVGHQI 1044
IPL TP ++ ++ L+ P + + + P +N +L +E GH
Sbjct: 791 IPLTYTPKKMVKHPDQPLFYVIEADNHTLSPALCAQLLADPARVNGDSKVLPPEEFGHPR 850
Query: 1045 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1104
N +S + +++P G Q I ++ +E A+++ +VT +
Sbjct: 851 GNRRWASC------------ISVVDPLAEDG--QVLQRIDLEENEAAVSLAIVTF---AS 893
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
+ENET L +GT G+D+ R R + + LV ++ +++ A+
Sbjct: 894 QENETFLVVGT----GKDMVLNPRSFSDAFVHIYRFERDGRGLVF-IHKTKVEEPPMAMI 948
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
QG +L+ G + ++ +L + + P +VSLN + I++GD+ + I ++
Sbjct: 949 PFQGRVLVGIGKMLRIYDLGMRQLLRKSQAEVAPQQIVSLNAQGSRIVVGDVQQGITYVV 1008
Query: 1222 WKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK-- 1278
+K+Q +L D ++ + T ++D T + D+ NI I K SE
Sbjct: 1009 FKQQTNKLIPFVDD--TVARWTTCSTMVDYETTA--GGDKFGNIFIVRAPQKASEEADEE 1064
Query: 1279 --GQKLL-SRAEFHVGAHVTKF---LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
G LL +R+ H +H L Q + TS +T G + LL+ L G+
Sbjct: 1065 PAGLHLLNARSYLHGTSHRLDLMCHLYTQDIPTSITKTSLVVGGQE----VLLWSGLMGT 1120
Query: 1333 IGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
IG + P ++ F QSL++ L P +AG + +R +++ K ++D
Sbjct: 1121 IGVLIPFVTREDADF--FQSLEQHLRAEDPPLAGRDHLMYRSYYAPMKG-------VIDG 1171
Query: 1390 ELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+L Y +LP +++ IA + + +I ++D+ ++F
Sbjct: 1172 DLCERYALLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1212
>gi|358056808|dbj|GAA97158.1| hypothetical protein E5Q_03834 [Mixia osmundae IAM 14324]
Length = 1243
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 139/321 (43%), Gaps = 35/321 (10%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR----GRVLLFSTGRNADN 1140
+QS E+ + V+L A+GTA+ V AR GRVL F R+ D
Sbjct: 914 LQSDEHGTALETVSLHGAAH------FAVGTAF-SDRTVDAREPKKGRVLTFM--RDGDK 964
Query: 1141 PQNLVTEVYSKELKGAISALASLQGHLL--IASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
+ V V L+G + L L L IA+ + H ++ + + A
Sbjct: 965 FEQHVHAV----LEGGVFGLCQLPNSFLAAIANAQVKVFHVTEQAHIDQMTCW-AGTFLA 1019
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
S++ + I++GD+++S+ L W E L+ +A++ A EFL G T +
Sbjct: 1020 QSISSRDSQIIVGDLYRSVVLLQWDEAKDTLSEVAREHHVNGMSAVEFL--GFTDDRYIG 1077
Query: 1259 DEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT-GAAPGSD 1317
EQ+ + IF K + L + FH+G +VT+ + ++ +D + GAAP
Sbjct: 1078 TEQE-LNIFTLT-KTKTRERIDILETEGMFHIGEYVTRIRKGALVPGYTDTSFGAAP--- 1132
Query: 1318 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
LLFGT DGS+G I +L +L++ + + GL +R F + +
Sbjct: 1133 -----QLLFGTSDGSLGVIVNCTPEVSLKLFALERNMRAVIRAFGGLEQVDWRAFRAPHR 1187
Query: 1378 AHRPGPDSIVDCELLSHYEML 1398
H P VD +++ + L
Sbjct: 1188 VHEPV--GFVDGDMIGRFAEL 1206
>gi|268568396|ref|XP_002640241.1| C. briggsae CBR-TAG-203 protein [Caenorhabditis briggsae]
Length = 1218
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/320 (21%), Positives = 145/320 (45%), Gaps = 39/320 (12%)
Query: 1126 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
RG V F N D L + E + A+ +G L+ G + ++ +L
Sbjct: 923 RGCVYTFHLSPNGDRFDFL----HRTETPLPVGAIHDFRGMALVGFGKFLRMYDIGQKKL 978
Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLDC 1241
P+ +V++ I++ D +S++FL +++ QL + A D + S C
Sbjct: 979 LAKCENKNFPVNIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVSCVC 1038
Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQ 1301
++D T++ ++D+ N+ + +++E + +S++ + G +++
Sbjct: 1039 -----VLDYHTVA--IADKFGNLSVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVE 1091
Query: 1302 MLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSL 1350
++A TS +T PG+++ AL++ T+ G+IGC+ DE+ F +L
Sbjct: 1092 LVANFFIGDTITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNL 1145
Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
+ + P + G + S+R +++ K S++D ++ + ++ L +Q E+A +
Sbjct: 1146 EMHVRSEYPPLCGRDHLSYRSYYAPCK-------SVIDGDICEQFSLMELSKQKEVAEEL 1198
Query: 1411 GTTRSQILSNLNDLALGTSF 1430
G T S+I L D+ +F
Sbjct: 1199 GKTVSEISKKLEDIRTRYAF 1218
>gi|324518783|gb|ADY47203.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 108
Score = 68.9 bits (167), Expect = 2e-08, Method: Composition-based stats.
Identities = 37/100 (37%), Positives = 60/100 (60%), Gaps = 5/100 (5%)
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
A +FLID ++ ++SDE NI +F Y P+ ES G++L+ R+E ++G +V F+R++
Sbjct: 10 AAQFLIDNRQMAFIMSDEAANIAVFNYLPEALESSGGERLILRSEINIGTNVNSFMRVKG 69
Query: 1303 LATSSDRTGAAPGSDKT-NRFALLFGTLDGSIGCIAPLDE 1341
+S G + NR ++LF +LDGS G + PL E
Sbjct: 70 HISS----GFVENEHYSLNRQSVLFCSLDGSFGFVRPLSE 105
>gi|68471462|ref|XP_720279.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
gi|46442139|gb|EAL01431.1| likely Cleavage and Polyadenylation Specificity Factor subunit
fragment [Candida albicans SC5314]
Length = 423
Score = 68.6 bits (166), Expect = 3e-08, Method: Composition-based stats.
Identities = 45/199 (22%), Positives = 93/199 (46%), Gaps = 15/199 (7%)
Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
F+H Y EP + +L ++ WAG + L++ LK ++ NLP++ +
Sbjct: 3 FLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTLDLNLKSTISVFKIDNLPYEIDR 62
Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELD 365
++ +PSP+ G L+VG N IH + +A+N + S S Q+ +S +++L+
Sbjct: 63 VIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLE 120
Query: 366 AAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITT 417
+ +D LL +TG+ + DG+ ++R+ + + ++
Sbjct: 121 NCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAI 180
Query: 418 IGNSLFFLGSRLGDSLLVQ 436
+ ++ F+ + G+S L+Q
Sbjct: 181 LDKNMLFIANSNGNSPLIQ 199
>gi|358378986|gb|EHK16667.1| hypothetical protein TRIVIDRAFT_40938 [Trichoderma virens Gv29-8]
Length = 1212
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 104/460 (22%), Positives = 195/460 (42%), Gaps = 65/460 (14%)
Query: 998 IPLKATP--------HQITYFAEKN---LYPLIVSVPVLKP--LNQVLSLLIDQEVGHQI 1044
IPL TP H + Y E + L P + + + P +N +L +E GH
Sbjct: 791 IPLTYTPKKMVKHPDHPLFYVIEADNHTLAPELCAKLLADPARVNGDTKILPAEEFGHPR 850
Query: 1045 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1104
N +S + +++P G Q I ++ +E A++V +VT +
Sbjct: 851 GNRRWASC------------ISVVDPLAEDG--QVLQRIDLEENEAAVSVAIVTF---AS 893
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
+ENET L +GT G+D+ R R + + LV ++ +++ A+
Sbjct: 894 QENETFLVVGT----GKDMVVNPRSFSDAFVHIYRFERDGRGLVF-IHKTKVEEPPMAMI 948
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
QG +L+ G + ++ +L A + P ++SL+ + I++GD+ + I +
Sbjct: 949 PFQGRVLVGIGKTLRIYDLGMRQLLRKAQAEVAPQQIISLSTQGSRIVVGDVQQGITYAV 1008
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+K+ +L D + T ++D T++ D+ NI I K SE ++
Sbjct: 1009 YKQSTNKLIPFVDDTVARWTTCTT-MVDYETVA--GGDKFGNIFIVRSPQKASEEADEEQ 1065
Query: 1282 -----LLSRAEFHVGAHVTKF---LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
L +R H +H L Q + TS +T G LL+ L G+I
Sbjct: 1066 AGLHLLNARDYLHGTSHRLDLMCHLFTQDIPTSIAKTSLVVGGQD----VLLWSGLMGTI 1121
Query: 1334 GCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
G + P ++ F QSL++ + P +AG + +R +++ K I+D +
Sbjct: 1122 GVLIPFITREDTDF--FQSLEQHMRAEDPPLAGRDHLMYRSYYAPMKG-------IIDGD 1172
Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
L Y +LP +++ IA + + +I ++D+ ++F
Sbjct: 1173 LCERYALLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1212
>gi|168045572|ref|XP_001775251.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673464|gb|EDQ59987.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1201
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/385 (22%), Positives = 163/385 (42%), Gaps = 62/385 (16%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1121
+R+L+P + T + +Q +E A ++ V + KE TL+A+GTA
Sbjct: 849 IRVLDPKTS----TTTCLLELQENEAAFSICTVNFHDN--KELGTLIAVGTAKDLQFMPR 902
Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
A+ G + ++ ++ V+ + G +AL QG LL+ G + ++
Sbjct: 903 KEASGGFIHIYRFAEEG----RVLELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIYDLG 958
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
+L P +++++ + I +GDI +S +++ ++ QL A D S
Sbjct: 959 KRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--SCPR 1016
Query: 1242 FATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSRA--- 1286
+ T L ID T++ +D+ N+ + +SE W+ +L
Sbjct: 1017 WLTASLHIDFDTMA--GADKFGNVYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKV 1074
Query: 1287 ----EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
+FHVG VT + ++ PG ++ +L+GT+ GS+G + P
Sbjct: 1075 DEIIQFHVGEVVTSLQKASLI----------PGGGES----MLYGTVMGSMGALLPFSSR 1120
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
+++ F L+ L P + G + +FR A+ P D ++D +L Y ML
Sbjct: 1121 EDVDF--FSHLEMHLRQENPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYSMLT 1171
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
E Q +IA T +I+ L D+
Sbjct: 1172 SELQKKIADDLDRTPGEIVKKLEDI 1196
>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
queenslandica]
Length = 1142
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 137/318 (43%), Gaps = 27/318 (8%)
Query: 1102 TTTKENETLLAIGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
TT E ++ +GTA V+ E+ ++ GR+L+F+ + ++ K GA+ +
Sbjct: 821 TTNDEERSVYVVGTALVKPEEKESSTGRILVFAVNSGK------LELLHEKLENGAVFQV 874
Query: 1161 ASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
G +L + + ++ L Y L + L +FIL+GDI +S+ L
Sbjct: 875 LGFNGKILNSVNSGVFVNALVDGALKEECAYKNNIL-ALYLKTKGDFILVGDILRSLKLL 933
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
+KE+ L + D CF T +ID + + ++I + K +E+
Sbjct: 934 VYKEE-LGLEEIGVDHNISPCFCTAIEMIDDENY---LGADGRHI---FICQKNTEATSE 986
Query: 1280 QKLLSRAE---FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
LL + + G +V F R + D GA S + +LFGT+ G+IG I
Sbjct: 987 ADLLYMVQPSRMYFGDNVNVFSRGSFVM---DHPGAGASSLLQGK-PILFGTVHGAIGLI 1042
Query: 1337 APLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHY 1395
L+ T+ L LQ+K+ ++ V + +R F + HR P +D +L+ +
Sbjct: 1043 GTLNMDTYTLLSKLQQKMAANIKSVGNIEHEIYRSFSNE---HRSKPFAGFIDGDLVEKF 1099
Query: 1396 EMLPLEEQLEIAHQTGTT 1413
LP + +I TT
Sbjct: 1100 LELPRPQMSQIVQGIKTT 1117
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 104/447 (23%), Positives = 165/447 (36%), Gaps = 92/447 (20%)
Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
I L DL ++ D F+HG P + + E GRV + IS K+
Sbjct: 156 IRLEDL---YITDIQFLHGTENPTIAYISEEPSVATGRV--------LKTFVISQRDKEL 204
Query: 296 -PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
P W + A L +VPSP G++VVGA+++ Y N+ + ++D
Sbjct: 205 LPGPWKPNTIEGQASLLCSVPSPYNGLIVVGADSVAY----------FNDTSHTVDPIV- 253
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV------VQRLDLSKTNP 408
+ S S H+ +L D G L+ L + + + + + L
Sbjct: 254 IKESVISCIEPLDHSRYLLGDFR------GRLLTLFLEFSEEMESGMTNIVNMKLEVLGE 307
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ ++ + N + F+GS GDS LV+ LSS E G I
Sbjct: 308 ISIPHTLSYLDNGVVFVGSTKGDSQLVK---------LSSSPLENGGYI----------- 347
Query: 469 RSSSDALQDMVN-GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
D L+ M N G L + S Q L G L+ G+ IN
Sbjct: 348 ----DVLESMTNIGPILDM----SVVDLDKQGRDVLVCCSGLGKDGALRIVKSGIGINEA 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
AS ++LPG KGIW++ + A +DE ++++ +T
Sbjct: 400 AS------------IDLPGIKGIWSL-------------KCAGREDELDDTVVLTFVGQT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
M L A E TE +T N+ G +IQ+ + R++D M + P
Sbjct: 435 MALRLAGEEVEETELPALVTDQQTFYCSNVTG-NAIIQITTKSVRLMDDKAMELICDWSP 493
Query: 648 SNSE--SGSGSENSTVLSVSIADPYVL 672
+ S + +S V+ D Y L
Sbjct: 494 PDGRGISTAACNSSQVMVAVGCDLYYL 520
>gi|339259094|ref|XP_003369733.1| splicing factor 3B subunit 3 [Trichinella spiralis]
gi|316965959|gb|EFV50595.1| splicing factor 3B subunit 3 [Trichinella spiralis]
Length = 1241
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 125/276 (45%), Gaps = 29/276 (10%)
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
A++ALAS +G LL ++G + ++ +L P + + + + I +GD+ +
Sbjct: 982 AVTALASFRGRLLASAGKMLRIYDLGKKKLLRKCENKHMPNLITHILTMGHRIFVGDVQE 1041
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL-VVSDEQKNIQ-----IFYY 1269
S++F +K QL + A D C A ++D T++L + SD ++Q I
Sbjct: 1042 SVFFYRYKPIENQLVVFADDTHQRFCSAM-CILDYDTVALRLPSDCTDDVQEDPTGIRAL 1100
Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
K + QK A F+VG VT + ++ PGS ++ L++ T+
Sbjct: 1101 WDKGILNGASQKCEMVATFYVGECVTCLQKAMLI----------PGSSES----LVYSTM 1146
Query: 1330 DGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
G IG + P + + Q L+ L P + G + ++R F++ K ++D
Sbjct: 1147 SGMIGALVPFSSKEDYEFFQHLEMHLRTEYPPLCGRDHLAYRSFYAPVKG-------VID 1199
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+L Y +L +Q EI+++ S+I+ L D+
Sbjct: 1200 GDLCEQYCLLEYGKQKEISNELDRVPSEIMKKLEDI 1235
>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1517
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/314 (23%), Positives = 137/314 (43%), Gaps = 42/314 (13%)
Query: 1104 TKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQ----NLVTEVYSKELK---- 1154
T E + + IGTAY ED +GR+L+ P + + + YS+ ++
Sbjct: 1124 TSEYKPYILIGTAYAYPDEDEPTQGRILVVECNSGEAEPHLKSDDDMEDTYSRYVRHVTQ 1183
Query: 1155 ----GAISALASLQGHLLIAS-GPKIILHKWT--GTELNGIAFYDAP------PLYVVSL 1201
G + +++ G ++A+ K L + + ++ + F A L+V SL
Sbjct: 1184 MPTRGGVYSISPFYGGTVLATVNSKTHLCRLSIGCDQIGELKFVGAGHHGHMLSLFVKSL 1243
Query: 1202 -------------NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1248
K ++GD+ +SI + ++ + + LA+D+ + C A E L
Sbjct: 1244 AGSESESESSGTNRQAKQLAIVGDLMRSISLVEYQPKHNVIEELARDYNANFCTAVEMLT 1303
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
+G+ L S+ N+ + + S +L + E+H+G KF+ ++ S+
Sbjct: 1304 NGTYLG---SEGFNNLFVLRHNANASSEEARVRLDTVGEYHLGEMTNKFMGGSLIMPSN- 1359
Query: 1309 RTGAAPGSDKTNRFA-LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
+G G+ + LFGT+DGSIG + LD TF L LQ+ ++ V V ++
Sbjct: 1360 -SGGIMGAQNAYVGSQTLFGTVDGSIGSVLGLDGPTFAFLACLQRAILSIVKTVGDISHE 1418
Query: 1368 SFRQFHSNGKAHRP 1381
+R F + + RP
Sbjct: 1419 EYRAFRAERQV-RP 1431
>gi|156097003|ref|XP_001614535.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803409|gb|EDL44808.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 2558
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 105/215 (48%), Gaps = 12/215 (5%)
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
P +++SL++V+N+I++GDI S+ LS+ + A LN + +D+ ++ C + L S
Sbjct: 2320 PSSWIMSLDVVENYIVVGDIMTSVTLLSYDFENAILNEVCRDYANIWCTSVSAL---SEN 2376
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTG 1311
+VSD + N + + + KL ++F+ G+ V K L+ L +R
Sbjct: 2377 HFLVSDMESNFLVLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMFSTSLRNLVDDEERRN 2436
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPLDE-LTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
+++ +L + +GSI + P L F+R ++ + D++ + L+ S+R
Sbjct: 2437 EILQKEQS----ILCASSEGSISALIPFSNFLQFKRALCIEIAINDNISSLGNLSHSSYR 2492
Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
++ + + +VD EL + LP E QL+
Sbjct: 2493 EYKVSLAS--KNCKGVVDGELFKMFFYLPFERQLK 2525
>gi|357478269|ref|XP_003609420.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355510475|gb|AES91617.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 1225
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 96/416 (23%), Positives = 180/416 (43%), Gaps = 68/416 (16%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALT 1093
E G + ++++ S D H Y E + +R+L+P R G T + +Q +E A +
Sbjct: 843 ENGGEDEDNDDSLSDEHYGYPKSESDKWVSCIRVLDP-RTG---NTTCLLELQENEAAFS 898
Query: 1094 VRVVTLFNTTTKENETLLAIGTA----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVY 1149
+ V N KE TLLA+GTA + + A G + ++ R D+ ++L ++
Sbjct: 899 ICTV---NFHDKEYGTLLAVGTAKGLQFTPKRSLTA-GFIHIY---RFLDDGRSLEL-LH 950
Query: 1150 SKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1209
+++G AL QG LL GP + L+ L + P +VS++ ++ I
Sbjct: 951 KTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGKRRLLRKCENKSFPSSIVSIHAYRDRIY 1010
Query: 1210 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1269
+G I +S ++ ++ QL + A D + + ID T++ +D+ NI
Sbjct: 1011 VGGIQESFHYCKYRRDENQLYIFADD-SVPRWLTSSYHIDFDTMA--GADKFGNIFFARL 1067
Query: 1270 APKMSE-----------SWKGQKLLSR-------AEFHVGAHVTKFLRLQMLATSSDRTG 1311
+S+ W+ KL +FHVG +T + ++
Sbjct: 1068 PQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVITSLQKASLV-------- 1119
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
PG + +++GT+ GS+G + D++ F L+ + P + G + +
Sbjct: 1120 --PGGGE----CIVYGTVMGSVGALHAFTSRDDVDF--FSHLEMHMRQDNPPLCGRDHMA 1171
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R A+ P D ++D +L + LP++ Q +IA + T +IL L ++
Sbjct: 1172 YRS------AYFPVKD-VIDGDLCEQFPTLPMDLQRKIADELDRTPGEILKKLEEV 1220
>gi|428180158|gb|EKX49026.1| hypothetical protein GUITHDRAFT_68305 [Guillardia theta CCMP2712]
Length = 1202
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 89/391 (22%), Positives = 157/391 (40%), Gaps = 71/391 (18%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
VR+++P+ +T+ I + +E AL+V V T ++ ++ T L GTA G V
Sbjct: 847 VRVIDPNE----RETKQIIELDPNEAALSVCVATFYD---RKGHTFLCFGTAV--GHKVG 897
Query: 1125 ARGRVLLFSTGRNADNPQNLV----TEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
+R TG + ++V T V+ + G AL S QG LL+ G + L++
Sbjct: 898 SR-------TGSGFLHTYSVVGSQLTFVHKTPIDGVPRALCSFQGRLLVGVGSALRLYEM 950
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----- 1235
+L P VV+++ + + I +GD+ +SI FL + +L + A D
Sbjct: 951 GKRKLLRKCENRNIPNLVVTISTMGDRIYVGDVAESISFLKYNRILNELVIFADDTHPRW 1010
Query: 1236 -----------FGSLDCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
D F FL +S +S+E + +F +K ++++
Sbjct: 1011 MTAACPVDYDTVAGADKFGNIFLTRLPDNVSDEISEEPGAVGMFEGNDLQGAHYKAEEIV 1070
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---- 1339
++HVG V + + SD A+++GT+ G IG + P
Sbjct: 1071 ---QYHVGETVCSLQKATLSPGGSD--------------AIIYGTMYGGIGALQPFVSRE 1113
Query: 1340 DELTFRRLQSLQKKLVDSVPH------VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
D F L+ + + H + G + SFR ++ K +VD +L
Sbjct: 1114 DVDFFLHLEMHLRGAAGAREHKPAGEGICGRDQLSFRSYYFPVK-------DVVDGDLCE 1166
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ L Q +IA T ++ L D+
Sbjct: 1167 TFNYLSPSRQKQIAEDLDRTPGEVAKKLEDM 1197
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 110/554 (19%), Positives = 203/554 (36%), Gaps = 97/554 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
++ +C G + S+A G++ +D ++L + +ISVLEF + +
Sbjct: 49 IQSICQMECFGLIRSMASFRLPGSN----KDYLVLGADSGRISVLEFSKERNQFERVHLE 104
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
+ K G G + DP+GR + Q ++ V + D
Sbjct: 105 TYG-------KSGCRRIVPGQFLASDPKGRAVMISAIEKQKLVY---------VFNRDA- 147
Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSW 276
S+++ S + H G+ P+ L + + G+ S
Sbjct: 148 ------SSKLTISSPLEAHKASTIHFSIVGVDVGFDNPIFAALEMDYSDADADETGQ-SA 200
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP-----IGGVLVVGANTIHY 331
+ +++ + L + + P DA + +P P GVLV N I Y
Sbjct: 201 EEFNKVLTFYELDLGLNH---VVRKASEPIDAASNMLIPVPGDTDGPSGVLVCAENKIAY 257
Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
+AL + Q L + +S H LL ++ GDL LT+
Sbjct: 258 KKPDHEDVVALIPRRQGMPLDQPLLITGYS------HLKQKDGFFFLLQSEIGDLYRLTL 311
Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSS 448
Y+ V ++++ + + IT + F+ S G+ L QF G+ M+
Sbjct: 312 TYNDEEVSEINITYFDTVPVAQSITILKTGFLFVASEFGNHALYQFLSIKGSDESDMMPV 371
Query: 449 GLKEEFGDIE----ADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
++ E IE A P L ++L +++ L L G E + ++
Sbjct: 372 EVEIEGETIEIPHFAPRPLKNLLLVDEMESLSPILDMRVLDLAG------EETPQIYA-- 423
Query: 505 VRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHKSSRG 560
L GP L+ +GL + + + ELP +WTV +G
Sbjct: 424 ----LCGKGPRSTLRTLRHGLAV------------AEMAVSELPSNPLAVWTV-----KG 462
Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
+ D++ Y++++ T+VL D + EVT+S + +T++ +L G
Sbjct: 463 SSKDAA---------DKYIVVTFANATIVLSIGDTVEEVTDS-GFLATNKTLSV-SLLGD 511
Query: 621 RRVIQVFERGARIL 634
++QV G R +
Sbjct: 512 DSLLQVHPNGLRTV 525
>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
Length = 702
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 159/358 (44%), Gaps = 25/358 (6%)
Query: 1077 WQTRATIPMQSSENALTVRVVTLFN-TTTKENETLLAIGTAYV-QGEDVAARGRVLLFST 1134
+ R +P++ SE AL V ++F + E + +GTA+V E+ ++GR+L+
Sbjct: 317 YAIRDELPLKPSEIALCVASGSIFPLSNAPERNEVFVVGTAFVLPEENEPSQGRLLVL-- 374
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLL--IASGPKIILHKWTGTELNGIAFYD 1192
R ++ LV E L G ++ +G ++ + S ++ + ++ +A +
Sbjct: 375 -RAVEHRLELVAETM---LSGGCLSICLFKGKVVCGVNSELQVFDVDEKTSTISKLA-SE 429
Query: 1193 APPLYVVSL--NIVKNFILLGDI------HKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
+ V SL N I LGDI +K + + Q AQL +A + D A
Sbjct: 430 VACISVTSLSPNEADETIALGDILYSVVVYKLVLEVVRGRQLAQLECIASERRRRDVTAL 489
Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPK--MSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
E L + + +VV D N+ + + + S + ++++ FH+ + +F+ +Q+
Sbjct: 490 ERLPEAQS-EMVVGDAYGNLMVMQVVEEADLDRSNPQKIVVTKESFHLDDQINRFVPVQL 548
Query: 1303 LAT-SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHV 1361
+ + D+ + F L F T+ G IG I L++ FR L++++ + + + V
Sbjct: 549 FRSGAEDKKKEKRAEESEIAFNLAFATVSGRIGMIGALNDREFRMLRAIETAMENVITPV 608
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
GL+ + +R SN +D +L+ + L E Q +IA T LS
Sbjct: 609 GGLDHKQWR--CSNTPFGIKNLAYCIDGDLVEMFLELDDESQAKIADSVSTELRSALS 664
>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
Length = 1329
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 75/322 (23%), Positives = 142/322 (44%), Gaps = 35/322 (10%)
Query: 1098 TLFNTTTKENETLLAIGTAYVQGEDVAA--RGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
++ + T E++ I +A V D +GR+L+ R+ +L V+ KE+ G
Sbjct: 995 SILSCTMGEDQNPFFILSAAVITADETEPLQGRLLMLRYERDGQGNSSL-NLVHEKEVNG 1053
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS--LNIVKNFILLGDI 1213
+ A+AS + LL+A ++L +W +++ G+ + L+V + L + IL+GDI
Sbjct: 1054 CVYAMASFKSKLLVAMNSSVLLFEW--SDVTGLQLVSSCSLFVTAMHLKVRDEVILVGDI 1111
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDF-----GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+SI L + + A+D+ +++ ++ + SL ++ QK++Q
Sbjct: 1112 QRSIAVLRYVPSESSFVEEARDYHPNWISAIEVIDNDYFMAAEN-SLNITVSQKDLQ--- 1167
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+SES Q + S H+G ++ F S + A S +N ++ GT
Sbjct: 1168 -QQPVSES---QVVKSAGRLHLGEYINVFKH----GALSMYSYAGISSLVSN--PIMIGT 1217
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL----NPRSFRQFHSNGKAHRPGPD 1384
+GSI + + FR L LQ+ D VP G + R + + N A
Sbjct: 1218 AEGSILIYCQIHDSHFRVLNDLQRCFSDIVPDNVGCIAYDSYRRYVVYEKNAPAF----- 1272
Query: 1385 SIVDCELLSHYEMLPLEEQLEI 1406
+D +L+ +P +E + +
Sbjct: 1273 GFIDGDLIEQLLEMPRQEAIRL 1294
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/359 (22%), Positives = 152/359 (42%), Gaps = 74/359 (20%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R + +SA L+ V ++ G + + + + G + + +++ ++++E+D+
Sbjct: 205 RFEVHSVSAEGLQYVTEGKMFGRIGAAKLFTPKGENKAL----MVIVTLKQDVAIVEYDN 260
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVYGLQMIILK 205
RI + L + E+F R G L+ V P G G+ + +
Sbjct: 261 G----RIKT---------LASRNISENFGRPASNGILLSVHPDGEVIGLRIMSSTFKCIT 307
Query: 206 ASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHE 265
++ S L S++ +N + H+ DF+F+HG+ PV+ +++
Sbjct: 308 WNRATSKL------------------STYSLNY---SLTHLSDFVFLHGFQFPVIALIY- 345
Query: 266 RELTWAGRVSWKHH-TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
G + +H TC IS + P WS ++ +A+ L+AVP P+ GV+VV
Sbjct: 346 ------GDLVGRHVITCRISL--DEQEFENGP--WSRGHIEWEAHTLIAVPPPLCGVIVV 395
Query: 325 GANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
G +++ Y +N +S S L +S + DAA L G
Sbjct: 396 GCSSLLY---------IRDNSTISTVSPPFLSKSIVNC-YDAAP----DGLTYFLGQLDG 441
Query: 385 DLVLLTVVYDGRVVQRLDLSKTNPSVL--TSDITTIG----NSLFFLGSRLGDSLLVQF 437
L LL + + ++ LS+ ++L TS ++ SL F+GSR+ DS L++
Sbjct: 442 TLSLLKLDIETDAEGKVTLSRMRATILGVTSPPDSLSYMHKESLLFVGSRIADSKLLRL 500
>gi|221055487|ref|XP_002258882.1| CPSF (cleavage and polyadenylation specific factor), subunit A
[Plasmodium knowlesi strain H]
gi|193808952|emb|CAQ39655.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium knowlesi strain H]
Length = 2478
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 70/322 (21%), Positives = 139/322 (43%), Gaps = 36/322 (11%)
Query: 1109 TLLAIGTAYVQGEDVA--ARGRVLLFSTGRNADNPQNLVTEVYSKELK-GAISALASLQG 1165
TL+ +GTA E + + G + +F + + Q + +Y+ + G I+ L +
Sbjct: 2135 TLICVGTA-NNNERITEPSSGHIYVFVAKKKTN--QFEIKHIYTYNVNCGGITHLKQFRD 2191
Query: 1166 HLLIASGPKIILHKWTGTELN-GIAFYDA-------------------PPLYVVSLNIVK 1205
++ A +++ N G Y+A P +++SL++VK
Sbjct: 2192 KIVAAVNNTVLILDIRNFLTNLGTYIYNASKAMKVESNDAFLEVASFTPSSWIMSLDVVK 2251
Query: 1206 NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
N+I++GDI S+ LS+ + A LN + +D+ ++ C A S +VSD + N
Sbjct: 2252 NYIVVGDIMTSVTLLSYDFENAILNEVCRDYANIWCTAL------SEDHFLVSDMESNFL 2305
Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTGAAPGSDKTNRFAL 1324
+ + + KL ++F+ G+ V K L + + + P ++
Sbjct: 2306 VLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMLSTSLRNLVDEYESEERPNEIVQKERSI 2365
Query: 1325 LFGTLDGSIGCIAPLDE-LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
L + +GSI + P + F+R ++ + D++ + L+ S+R++ +
Sbjct: 2366 LCASSEGSISTLIPFSNFIQFKRALCIEIAINDNISSLGNLSHSSYREYKITLAS--KNC 2423
Query: 1384 DSIVDCELLSHYEMLPLEEQLE 1405
+VD EL + LP E QL+
Sbjct: 2424 KGVVDGELFKMFFYLPFERQLK 2445
>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
Length = 1054
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 137/300 (45%), Gaps = 28/300 (9%)
Query: 1111 LAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+ +GTA+ + E ++GR+L+F R DN L+ EV L + L G LL
Sbjct: 754 VVVGTAFHNEVESQQSKGRILVF---RIEDNRLILLDEV---ALPACVYCLLPFNGRLLA 807
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK--NFILLGDIHKSIYFLSWKEQGA 1227
++ W G + N + ++ + +S ++V +F+L+ D+ KS+ L +QGA
Sbjct: 808 GINKRVQAFNW-GVDTNKLTKAESYSGHTLSHSMVSRGHFVLVADLMKSMTLLVEDQQGA 866
Query: 1228 QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+ LA++ L + + +ID T + D N+ + + S + L +
Sbjct: 867 -IKELARN--PLPIWLSRIEMIDDETF--IGGDNSYNLIVVQKNAEASSEIDNELLDTVG 921
Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
+FH+G + KF + L TS P D +LFGT+ G+IG I + + +
Sbjct: 922 QFHLGETINKF-KHGSLVTS-------PDMDSPKLPTILFGTVSGAIGVIVSISKDDYEF 973
Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLE 1405
+ LQK L V V GL ++R F + H P + +D +L+ + L ++ LE
Sbjct: 974 FEKLQKGLNRVVHGVGGLPFENWRSFSTE---HMTIPSKNFIDGDLIETFLDLRHDKMLE 1030
>gi|241560031|ref|XP_002400960.1| spliceosomal protein sap, putative [Ixodes scapularis]
gi|215501812|gb|EEC11306.1| spliceosomal protein sap, putative [Ixodes scapularis]
Length = 1019
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 91/395 (23%), Positives = 167/395 (42%), Gaps = 64/395 (16%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1123
+R+L P Q+ + ++ +E AL+V +V T+ +E + +G A +Q
Sbjct: 660 IRVLNPSDG----QSLCKVALEQNEAALSVALVRF---TSHPDEQFVVVGAAREMQLNPR 712
Query: 1124 AARGRVLLFSTGRNADNPQNLVTE------VYSKELKGAISALASLQGHLLIASGPKIIL 1177
RG LL T R A NP+ + V++ ++ A +AL QG LL G + L
Sbjct: 713 VCRGGGLLL-TYRLAPNPEEPMAGPTQLELVHATPVEEAPTALCPFQGRLLAGVGKCLRL 771
Query: 1178 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1237
+ +L P +VS+ + N +++ D+ +S +FL +K Q QL + A D
Sbjct: 772 YDLGRKKLLRKCENKYIPNAIVSIQAMGNRVVVSDVQESFFFLRYKRQENQLVIFADD-- 829
Query: 1238 SLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES---------------WKG-- 1279
S+ + T ++D T++ +D+ N+ I +S+ W G
Sbjct: 830 SVPRWITASCMLDYETVA--GADKFGNVSIIRLPSSISDDVDEDPTGIKSLWDRGWLGGS 887
Query: 1280 -QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
QK + FH+G V + ++ PG ++ L++ TL G++G + P
Sbjct: 888 SQKADVISNFHIGETVLSLQKATLI----------PGGSES----LVYVTLSGTVGVLVP 933
Query: 1339 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
++ F Q L+ + P + G + SFR + K +++D +L +
Sbjct: 934 FTAHEDHDF--FQHLEMHMRYENPPLCGRDHLSFRSSYFPVK-------NVIDGDLCEQF 984
Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
L +Q IA + S++ L D+ +F
Sbjct: 985 NSLDPSKQKSIAEELDRNPSEVSKKLEDIRTRYAF 1019
>gi|449684814|ref|XP_004210722.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
magnipapillata]
Length = 725
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 121/272 (44%), Gaps = 19/272 (6%)
Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T +GT+ V E+ + G+++LF + ++ SK + GA+ L G L
Sbjct: 414 TYYCVGTSMVYPEESEPKEGKIILFQLFEGK------LVQIGSKTVNGAVYVLQGFNGKL 467
Query: 1168 LIASGPKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
L + +++WT EL Y L + L +FIL+GD+ +S+ L++K G
Sbjct: 468 LAGVNSLVSVYEWTSDKELKQECCYHNTIL-ALYLKSKGDFILVGDLMRSMTLLAYKPLG 526
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+L +A DF A E + D + L ++ N+ I + L +
Sbjct: 527 -RLEEIAHDFSPNWMTAVEIIDDDTFLG---AENSFNLFICQKDNSSVNDEERHHLQTIG 582
Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
++H+G V F ++ S S ++L+GT+ G+IG +A L + TF
Sbjct: 583 KYHLGDFVNVFKHGSLVMHHSTEQLTPISS------SILYGTVRGAIGLVAGLPKNTFDF 636
Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
L +Q+KL ++ V + +R F+++ K
Sbjct: 637 LSQVQEKLSKTIKSVGKIEHEFWRSFYNDKKT 668
>gi|449459948|ref|XP_004147708.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
gi|449513493|ref|XP_004164340.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
Length = 1214
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 97/403 (24%), Positives = 172/403 (42%), Gaps = 70/403 (17%)
Query: 1053 DLHRTYTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1107
D H Y E E +R+L+P A T + +Q +E A +V V N KE
Sbjct: 846 DEHYGYPKAESEKWVSCIRVLDPRSA----TTTCLLELQDNEAAFSVCTV---NFHDKEY 898
Query: 1108 ETLLAIGTA----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
TLLA+GTA + + A G + ++ R ++ ++L ++ +++G ALA
Sbjct: 899 GTLLAVGTAKGLQFFPKRSLVA-GYIHIY---RFLEDGKSLEL-LHKTQVEGVPLALAQF 953
Query: 1164 QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
QG LL G + L+ L P +VS+ ++ I +GDI +S ++ ++
Sbjct: 954 QGRLLAGLGSVLRLYDLGKRRLLRKCENKLFPNTIVSIQTYRDRIYVGDIQESFHYCKYR 1013
Query: 1224 EQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE------- 1275
QL + A D S+ + T + +D T++ +D+ NI +S+
Sbjct: 1014 RDENQLYIFADD--SVPRWLTASYHVDFDTMA--GADKFGNIYFVRLPQDVSDEIEEDPT 1069
Query: 1276 ----SWKGQKLLSRA-------EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL 1324
W+ KL +FH+G VT + ++ PG + +
Sbjct: 1070 GGKIKWEQGKLNGAPNKVEEIIQFHIGDVVTSLQKASLI----------PGGGE----CI 1115
Query: 1325 LFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
L+GT+ GS+G + D++ F L+ + P + G + +R A+ P
Sbjct: 1116 LYGTVMGSLGALHAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMGYR------SAYFP 1167
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
D ++D +L + LPL+ Q +IA + T +IL L ++
Sbjct: 1168 VKD-VIDGDLCEQFPSLPLDMQRKIADELDRTPGEILKKLEEV 1209
Score = 44.7 bits (104), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 126/324 (38%), Gaps = 57/324 (17%)
Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
GVLV N + Y +Q A+ + +LP + + AA LL
Sbjct: 246 GVLVCAENFVIYKNQGHPDVRAV------IPRRADLPAERGVLIVSAAMHKQKTMFFFLL 299
Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
T+ GD+ +T+ ++ V+ L + + +T+ + + + F S G+ L QF
Sbjct: 300 QTEYGDIFKVTLEHNNDSVKELKIKYFDTIPVTASMCVLKSGFLFAASEFGNHSLYQFQA 359
Query: 440 -GSGTSMLSSG-----LKEEFGDIEADAPSTKRLRR-SSSDALQDMVNGEELSLYGSASN 492
G + SS +E F + K L R ++L +++ + ++L+
Sbjct: 360 IGEDADVESSSATLMETEEGFQPVFFQPRRLKNLMRIDQVESLMPIMDMKIINLF----- 414
Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
E + F+ R GP L+ GL I S + ELPG
Sbjct: 415 -EEETPQIFTLCGR------GPRSSLRILRPGLAI------------SEMAVSELPGVPS 455
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+WTV +DE+ AY+++S T+VL + + EV++S F+
Sbjct: 456 AVWTVKKN--------------INDEFDAYIVVSFANATLVLSIGETVEEVSDS--GFLD 499
Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
A +L G ++QV G R
Sbjct: 500 TTPSLAVSLIGDDSLMQVHPNGIR 523
>gi|170580631|ref|XP_001895346.1| splicing factor 3B subunit 3 [Brugia malayi]
gi|158597745|gb|EDP35799.1| splicing factor 3B subunit 3, putative [Brugia malayi]
Length = 1181
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/364 (21%), Positives = 145/364 (39%), Gaps = 32/364 (8%)
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRN 1137
+T + P E A + +V F + L+ G A G + F N
Sbjct: 839 ETLSHFPFAEDEAAFAIAMVQ-FQNQSDTQFVLVGCGCDLQLKPRKANGGCIYTFLLAAN 897
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
Q L + ++A+ +G L G K+ L+ +L P
Sbjct: 898 GTTLQLL----HRTPTDEVVNAIHDFRGMALAGVGKKVRLYDLGKRKLLAKCENRQIPTQ 953
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
VV + + I++ D +S++F+ +K+Q QL++ D S L+D T++ V
Sbjct: 954 VVDIRSMGQRIVVSDSQESVHFMRYKKQDGQLSIFC-DETSPRYVTCVCLLDYDTVA--V 1010
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSSDR 1309
D N+ + ++E + RA + G +L+ +A TS +
Sbjct: 1011 GDRFGNVAVLRLPKGVTEEVQEDPTGVRALWDRGNLNGASQKLEAIAHLYIGDAITSMQK 1070
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
T PG++ L + T+ G IG + P DE F Q+L+ + P + G +
Sbjct: 1071 TSLVPGAND----CLSYTTISGIIGILVPFMSRDEFEF--FQNLEMHMRVEYPPLCGRDH 1124
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
++R ++ K S++D +L Y ++PL++Q + + G ++I L D+
Sbjct: 1125 LAYRSYYFPVK-------SVIDGDLCEQYSLMPLDKQKSVGEELGRKPTEIHKKLEDIRT 1177
Query: 1427 GTSF 1430
+F
Sbjct: 1178 RYAF 1181
>gi|395330962|gb|EJF63344.1| hypothetical protein DICSQDRAFT_153890 [Dichomitus squalens LYAD-421
SS1]
Length = 1263
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 141/314 (44%), Gaps = 42/314 (13%)
Query: 1111 LAIGTAYVQGEDV-AARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASLQG 1165
A+GT Y++ E+ ++GR+LLFS G N ++L T + S + G + ALA+L
Sbjct: 944 FALGTVYIRPEEREPSKGRILLFSVSSTEGARGANVRSLHT-LASVNVGGCVYALANLSE 1002
Query: 1166 HLLIAS-GPKIILHKWTGTELNGIAFYDAPPL------------YVVSLNIVKNFILLGD 1212
+L++A+ ++L K T E ++ PL +V ++ + IL+GD
Sbjct: 1003 NLIVAAINTSVVLFKSTENEAG-----ESTPLSLEKVTEWNHNHFVTNVVVDGERILVGD 1057
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
S+ L W E+ +L +A+D+G L A I+G+ L+ ++ N+ F
Sbjct: 1058 AISSVSVLKWNERLERLESIARDYGPLWPIA----IEGTGNGLIGANADCNLFSFSLQSV 1113
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
++ L +H+ KF+R + TS+D D+ + + +F T G
Sbjct: 1114 PHRTY----LEKDGVYHLNDVTNKFVRGAL--TSTDV-----AEDQVVKASHVFFTSTGC 1162
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS--IVDCE 1390
IG I ++++T + +LQ+ + ++ G N R S + H S +D +
Sbjct: 1163 IGAILDMNDVTSLHMTALQRNMAKTLTGPGGDNHTKLRA-PSTPRGHTDAEASYGFLDGD 1221
Query: 1391 LLSHYEMLPLEEQL 1404
L Y P EQ
Sbjct: 1222 FLEQYLTHPHPEQF 1235
>gi|168046759|ref|XP_001775840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672847|gb|EDQ59379.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1214
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/388 (22%), Positives = 166/388 (42%), Gaps = 68/388 (17%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1121
+R+L+P + T + +Q +E A ++ V + KE TL+A+GTA +
Sbjct: 862 IRVLDPKTS----TTTCLLELQENEAAFSLCAVNFHDN--KELGTLIAVGTAKNMQFMPK 915
Query: 1122 DVAARGRVLLF---STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
++ G + ++ GR ++ V+ + G +AL QG LL+ G + ++
Sbjct: 916 KESSGGFIHIYRFVEEGR-------ILELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIY 968
Query: 1179 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
+L P +++++ + I +GDI +S +++ ++ QL A D S
Sbjct: 969 DLGKRKLLRKCENKNFPNTIIAIHTYGDRIYVGDIQESFHYVKYRRDENQLYTFADD--S 1026
Query: 1239 LDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKG------- 1279
+ T L ID T++ +D+ N+ + +SE W+
Sbjct: 1027 CPRWLTASLHIDFDTMA--GADKFGNVYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAP 1084
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
K+ +FHVG VT + ++ PG ++ +L+GT+ GS+G + P
Sbjct: 1085 NKVEEIIQFHVGEVVTSLQKASLI----------PGGGES----VLYGTIMGSVGALLPF 1130
Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
+++ F L+ L P + G + +FR A+ P D ++D +L Y
Sbjct: 1131 SSREDVDF--FSHLEMHLRQENPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYP 1181
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
ML E Q +IA T ++L L D+
Sbjct: 1182 MLTSELQRKIADDLDRTPGEVLKKLEDI 1209
>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
Length = 1140
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/368 (22%), Positives = 146/368 (39%), Gaps = 44/368 (11%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D + +L P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHHFVSPETISSLMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ + + + L LQ++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLQERLKKIIKSVGKIEHTYYRNFQINNKVE--PSEGFIDGDLIES 1097
Query: 1395 YEMLPLEE 1402
+ L E+
Sbjct: 1098 FLDLSREK 1105
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 107/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVMAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTSETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|353236335|emb|CCA68332.1| probable splicing factor 3B subunit 3 [Piriformospora indica DSM
11827]
Length = 1243
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 113/494 (22%), Positives = 213/494 (43%), Gaps = 70/494 (14%)
Query: 965 CNHGFIYVTSQGILKICQLPS-GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C GFI + + L+I Q+P GS + +PL TP + + + + LI S
Sbjct: 789 CPDGFIGI-ANNTLRIFQVPKLGSKLKQ----EILPLSYTPRKFVSHPQNSYFYLIES-- 841
Query: 1024 VLKPLNQVLSLLIDQEV------GHQIDNHNLS-SVDLHRTYTVEEYE----VRILEPDR 1072
+ +I++ V G ++D L ++ + E +RI++P
Sbjct: 842 ---DHRAMSETMIEERVKAIEMSGEKVDREMLELDPRIYGHFKAPEGVWASCIRIIDPVN 898
Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG----- 1127
++ A + ++E A ++ VV + E LL +GTA +A R
Sbjct: 899 ----LRSVAAFSLDNNEAAFSIAVVPF---AARNGELLLVVGTAV--DTHLAPRSCSTGY 949
Query: 1128 -RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELN 1186
RV F+ G + + ++ ++ +AL + QG L+ G + L+ +L
Sbjct: 950 LRVYSFTEGGSG------LELLHKTDIDEVPTALMAFQGRLIAGVGKALRLYDIGKKKLL 1003
Query: 1187 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1246
A +V+L+ + IL GDI++SIY++++K +L + A D S
Sbjct: 1004 RKAENRQFATAIVTLSTQGSRILAGDINQSIYYVAYKAAENRLLIFADD-TSARWITAST 1062
Query: 1247 LIDGSTLSLVVSDEQKNIQIFYY----APKMSESWKGQKLLSRAEFHVGA-HVTKFL--- 1298
++D +T +V +D+ N+ + + ++ + G +L +GA H TK L
Sbjct: 1063 MLDYNT--VVAADKFGNVFVNRLDEAVSKQVDDDPTGAGILHEKSVLMGAPHKTKMLTHF 1120
Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLV 1355
+ + TS R G+ R +++ L G+IG + PL D++ F + +L++ +
Sbjct: 1121 HVGDVITSIQRVALVAGA----REVVVYFGLHGTIGILVPLVTKDDVDF--ISTLEQHMR 1174
Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
+ G + S+R +++ KA VD +L ++ LP ++QL IA + T
Sbjct: 1175 TENLSLVGRDHLSWRGYYTPVKA-------TVDGDLCEYFAKLPTQKQLAIAGELDRTVG 1227
Query: 1416 QILSNLNDLALGTS 1429
+L L L + S
Sbjct: 1228 DVLKKLESLRVTAS 1241
>gi|17508021|ref|NP_491953.1| Protein TEG-4 [Caenorhabditis elegans]
gi|351060889|emb|CCD68627.1| Protein TEG-4 [Caenorhabditis elegans]
Length = 1220
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/394 (19%), Positives = 169/394 (42%), Gaps = 55/394 (13%)
Query: 1070 PDRAGGPWQTRATIPMQSSENALT---------VRVVTLFNTTTKENETLLAIGTAYVQG 1120
P A G W + ++ +S + L+ + V L + NE ++ +G +
Sbjct: 849 PRAARGKWASAISLISATSGDKLSYFELPQDENAKCVALVQFSKHPNEAMVLVGCGVNEV 908
Query: 1121 EDV-----------AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+V RG V F N D L + E + A+ +G L+
Sbjct: 909 LNVHDIDPNDTSIRPTRGCVYTFHLSANGDRFDFL----HRTETPLPVGAIHDFRGMALV 964
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
G + ++ +L P+ +V++ I++ D +S++FL +++ QL
Sbjct: 965 GFGRFLRMYDIGQKKLLAKCENKNFPVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQL 1024
Query: 1230 NLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
+ A D + C ++D T++ V+D+ N+ + +++E + +S++
Sbjct: 1025 VVFADDTTPRYVTCVC---VLDYHTVA--VADKFGNLAVVRLPERVNEDVQDDPTVSKSV 1079
Query: 1288 FHVGAHVTKFLRLQMLA--------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
+ G ++++++ TS +T PG+++ AL++ T+ G+IGC+
Sbjct: 1080 WDRGWLNGASQKVELVSNFFIGDTITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSF 1135
Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
DE+ F +L+ + P + G + ++R +++ K S++D ++ +
Sbjct: 1136 MSKDEVDF--FTNLEMHVRSEYPPLCGRDHLAYRSYYAPCK-------SVIDGDICEQFS 1186
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
++ ++Q ++A + G T S+I L D+ +F
Sbjct: 1187 LMDTQKQKDVAEELGKTVSEISKKLEDIRTRYAF 1220
>gi|392570042|gb|EIW63215.1| hypothetical protein TRAVEDRAFT_161375 [Trametes versicolor FP-101664
SS1]
Length = 1213
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 127/556 (22%), Positives = 221/556 (39%), Gaps = 103/556 (18%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
NI + SRP ++ + P + + A++ + C G I + S +L
Sbjct: 714 NIQQNPAILALSSRPWLNYTYQNFMHFTPLIFENLDYAWSFSAEL-CTEGLIGI-SGSLL 771
Query: 979 KICQLPSGSTY--DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1036
+I Q+P T + P+ P K PH P N +L L+
Sbjct: 772 RIFQIPKLGTKLKQDSLPLSYTPRKFMPH---------------------PTNGLLYLI- 809
Query: 1037 DQEVGHQIDNHNLSSVDLH----RTYTVEEYEVRILEPDRAGGP------WQT--RATIP 1084
E H++ + +S L R ++E EV +L P++ G P W + R P
Sbjct: 810 --EGDHRVMSEEAASKKLQEMRARGERIDE-EVLLLPPEQFGRPKAPAGTWASCIRIINP 866
Query: 1085 MQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQGEDVAARGRVLLF- 1132
++SS TV+V+ L N + NE L +GTA Q ++ R F
Sbjct: 867 LESS----TVKVIHLDNNEAAFSMAIVPFAARGNELHLVVGTA--QDTFLSPRSCTSGFL 920
Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
T R D+ ++L ++ E A+ + QG L+ G + L+ +L
Sbjct: 921 RTYRFIDDGRDL-EFLHKTETSDVPLAVMAFQGKLIAGVGKSLRLYDVGKKKLLRKVENK 979
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
P +V+LN + I++GD+ +S+++ +K +L + A D AT L D +T
Sbjct: 980 GFPAAIVTLNTQGSRIIVGDMQESVFYAVYKAPENRLLVFADDAQPRWVTATTML-DYNT 1038
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLLSRAEFHVGAHV 1294
+V D N+ + K+S+ KG K + + FHVG V
Sbjct: 1039 --VVAGDRFGNVFVNRLDSKISDQIDDDPTGAGILHEKGVLFGAPHKSVMLSHFHVGDIV 1096
Query: 1295 TKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKK 1353
T ++ ++A R LL+ L G+IG + P + + L +L++
Sbjct: 1097 TSLHKVALVA--------------GGREVLLYTCLHGTIGILVPFVSKEDVDLLTTLEQH 1142
Query: 1354 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
+ + G + ++R ++ K S+VD +L + LP +Q IA + T
Sbjct: 1143 MRTEQLSLVGRDHLTWRGYYVPVK-------SVVDGDLCESFAKLPANKQSTIAGELDRT 1195
Query: 1414 RSQILSNLNDLALGTS 1429
++L L L + S
Sbjct: 1196 VGEVLKKLEQLRVTAS 1211
>gi|341884150|gb|EGT40085.1| CBN-DDB-1 protein [Caenorhabditis brenneri]
Length = 1134
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/310 (19%), Positives = 129/310 (41%), Gaps = 19/310 (6%)
Query: 1106 ENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
+N + +GTA V ++ + GR+++F +TE+ ++GA + L
Sbjct: 816 DNNSYYIVGTALVYPDESETKIGRIIVFEVDETDKTKLRFMTEIV---VRGAPMGIRILN 872
Query: 1165 GHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
G L+ A + + +WT + + + V L ++ I + D+ +S+ LS++
Sbjct: 873 GKLVAAINSSVRMFEWTAEKELRVECSTFNHIAAVDLKVLNEEIAVADVMRSVSLLSYRT 932
Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
+AKD+ S EF+ S L +++ P + G+ +L
Sbjct: 933 LEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVTDD---GRYVLE 989
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLDEL 1342
+ +TK + +L D D + R+ +++GT GS+G + +D++
Sbjct: 990 PTGYWYLGELTKVMIRAVLVPQPD--------DNSIRYTQPIMYGTNQGSLGLVVQIDDM 1041
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
+ L S++K + D+ + + ++R F N + P +D +L+ +
Sbjct: 1042 YKKFLLSIEKAISDAEKNCMQIEHSTYRSFTYNKRIEPPS--GFIDGDLIESILDMDRSR 1099
Query: 1403 QLEIAHQTGT 1412
+EI + T
Sbjct: 1100 AIEILEKANT 1109
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 143/354 (40%), Gaps = 82/354 (23%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VPSPI GV+V+G +++ Y S N+ V SS L + F+ +
Sbjct: 210 DSSMLIPVPSPISGVVVLGTHSLLYKSSE-------NDGEVVPYSSPLLENTIFTSHSIV 262
Query: 365 DAAHATWLQNDV--ALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGN 420
D ++ +D LL ++LL V + G V+ + + + + I I N
Sbjct: 263 DPTGERFIVSDTDGRLL------MLLLNAVENQSGLSVKEIRIDLLGDTSVAESINYIDN 316
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
+ F+GSR GDS L++ S S L + + ++DM+
Sbjct: 317 GVVFIGSRFGDSQLIRLLSEKTNSSYISVLDTYY----------------NIGPIRDMIM 360
Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
E ++ + T S A +D G L+ G+ I A+
Sbjct: 361 VE---------SDGQPQLVTCSGAEKD-----GSLRVIRNGIGIEELAT----------- 395
Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
V+LPG GI+ + SS AD+ Y+I+SL T VL+ E
Sbjct: 396 -VDLPGVVGIFPIRLDSS----ADN------------YVIVSLVEETHVLQITGEELEDV 438
Query: 601 ESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSE 651
+ + T+ AG LFG V+QV ER R++ +++ + P+N E
Sbjct: 439 QFLQIDTALPTMFAGTLFGPNDSGLVVQVTERQVRLMSNGGLSK--FWEPANGE 490
>gi|307109500|gb|EFN57738.1| hypothetical protein CHLNCDRAFT_56079 [Chlorella variabilis]
Length = 1144
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 162/385 (42%), Gaps = 63/385 (16%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
VRI++P QT I M+ +E AL+V +V + E+ TLLA+GTA QG
Sbjct: 793 VRIVDPVS----LQTTHCIEMEDNEAALSVCLVEF--DSHPEHGTLLAVGTA--QGLKFY 844
Query: 1125 AR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
+ G V L+ R D+ + + ++ ++G A+A+ +G LL+ + L+
Sbjct: 845 PKECQNGFVHLY---RFLDDGKR-IELLHKTAVEGVPGAMAAFKGRLLVGVDAVLRLYDM 900
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
+ Y P + +L++ + I +GD +S +F+ +K+ Q + A D
Sbjct: 901 GKKRMLRKCEYRRLPTRIATLHVSGSRIYVGDGQESTFFMRYKKGDNQFYIFADDIVPRH 960
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG------------------QKL 1282
A L D TL+ +D N+ + ++S + KL
Sbjct: 961 VTAALHL-DYDTLA--GADRFGNVFVSRLPQEVSAQVEDDPTGGKYATETGLLGGAPNKL 1017
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
+ FHVG VT R + PG R +++GT++G+IG + P
Sbjct: 1018 RTINSFHVGETVTALQRAVL----------QPG----GRELIVYGTINGAIGVLYPFTSK 1063
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
++ F Q L+ + P + G + ++R F+ K +VD +L Y L
Sbjct: 1064 EDCDF--FQHLEMHMRQEHPPLLGRDHLAYRSFYFPVK-------DVVDGDLCEQYPQLA 1114
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDL 1424
++ +A + + ++L L D+
Sbjct: 1115 ADKARGVAEELDRSPGEVLKKLEDI 1139
>gi|147779836|emb|CAN63685.1| hypothetical protein VITISV_020449 [Vitis vinifera]
Length = 64
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/47 (74%), Positives = 42/47 (89%)
Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
+PRSSFSVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGR+ L
Sbjct: 1 MPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLXYDGRLFTDL 47
>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
Length = 1140
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ + + + L L+++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIES 1097
Query: 1395 Y 1395
+
Sbjct: 1098 F 1098
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 92/207 (44%), Gaps = 33/207 (15%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
NLR +D +V D F+HG + P ++++H+ GR H I+ K+
Sbjct: 156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFM 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
I W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 204 KIAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA------- 249
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSV 410
+F +A N + LL G L +L + G V+ + + +
Sbjct: 250 -PLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEIS 308
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
+ IT + N ++G+R GDS LV+
Sbjct: 309 IPECITYLDNGFLYIGARHGDSQLVRL 335
>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
Length = 1140
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/371 (21%), Positives = 149/371 (40%), Gaps = 44/371 (11%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D T+ V +P ++ ++ ++
Sbjct: 780 NAEVGQEIDVHNLLIID-QNTFEV----------------LHAHQFVPPETISTLMSAKL 822
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
+ T + T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 823 -------GDDPNTYYVVATSLVYPEEPEPKVGRIIIFHYNDNK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L D N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G + F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ + + + L L+++L + V ++ +R + N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKVE--PSEGFIDGDLIES 1097
Query: 1395 YEMLPLEEQLE 1405
+ L E+ E
Sbjct: 1098 FLDLNREKMRE 1108
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 109/264 (41%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H GR H I+ K+ I
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHRDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPI GV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPICGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L T G V+ + + + +
Sbjct: 260 CYA-RIDEKGLRY------LLGNMDGQLYMLFLGTTETSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++GSR GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGSRHGDSQLVRLS 336
>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=Protein piccolo
gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
melanogaster]
gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
Length = 1140
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ + + + L L+++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKVE--PSEGFIDGDLIES 1097
Query: 1395 Y 1395
+
Sbjct: 1098 F 1098
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
Length = 1140
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 44/361 (12%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ + + + L L+++L + V + +R F N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIES 1097
Query: 1395 Y 1395
+
Sbjct: 1098 F 1098
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCMNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|157128866|ref|XP_001655232.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882187|gb|EAT46412.1| AAEL002407-PA [Aedes aegypti]
Length = 980
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
G L +DP+ R G+ +Y GL II D DT H +
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154
Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
L M+ HV+D F++G P ++++H+ ++ +H I I+ K
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203
Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
I W N+ +A L+ VP+P+GG +V+G ++ YH + A+A +N YA
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
+ S L +N LLS K + LL + T
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
P IT + N + F+GSR GDS LV+ +G + + E F ++ AP
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363
Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
L+ G+ ++ GS G L+ G+ I
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399
Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
A ++LPG KG+W + R+ D Y L++S T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434
Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
+L + E TE + +T N+ FG ++IQV AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 73/149 (48%), Gaps = 13/149 (8%)
Query: 1109 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
T +GTA V E+ + GR++++ AD NL T+V KE+KG+ +L G +
Sbjct: 826 TYYIVGTALVNPEEPEPKVGRIIIY---HYADG--NL-TQVSEKEIKGSCYSLVEFNGRV 879
Query: 1168 LIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
L + + L++WT + L F + LY + +FIL+GD+ +SI L +K+
Sbjct: 880 LASINSTVRLYEWTDDKDLRLECSHFNNVLALYCKTKG---DFILVGDLMRSITLLQYKQ 936
Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
+A+D+ A E L D + L
Sbjct: 937 MEGSFEEIARDYQPNWMTAVEILDDDAFL 965
>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
Length = 1140
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 76/343 (22%), Positives = 134/343 (39%), Gaps = 40/343 (11%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D + +L + P A + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISALMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
V E E + GR+++F N +T+V ++ G
Sbjct: 832 VATSLVIPDEPEPKV---------------GRIIIFHYHDNK------LTQVAETKVDGT 870
Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +S
Sbjct: 871 CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
I L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 931 ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987
Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 988 EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040
Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+ + + + L L+++L + V + +R F N K
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKV 1083
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 112/264 (42%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMEKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELMVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D+ + LL G L +L + G V+ + + + +
Sbjct: 260 CYA-RVDSKGLRY------LLGNMHGQLYMLFLGTSESSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336
>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
Length = 1140
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/361 (20%), Positives = 147/361 (40%), Gaps = 44/361 (12%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D T+ V + +P ++ + ++ ++
Sbjct: 780 NAEVGQEIDVHNLLVID-QNTFEV----------------LHSHQFVPPETISSLMSAKL 822
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
+ T + T+ V ++ + GR+++F N +T+V ++ G
Sbjct: 823 -------GDDPNTYYVVATSLVFPDEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L D N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G + F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
+ + + + L L+++L + V ++ +R + N K + +D +L+
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKVE--PSEGFIDGDLIES 1097
Query: 1395 Y 1395
+
Sbjct: 1098 F 1098
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 107/263 (40%), Gaps = 49/263 (18%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
+D V D F+HG P ++++H+ GR H + I W
Sbjct: 159 -MDELTVYDVEFLHGCQNPTVIVIHKDN---DGRHVKSHEINLRDKEFIKVA-------W 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A L+ VPS IGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 208 KQDNVETEATMLIPVPSSIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDI 415
++ +D+ + LL G L +L T G V+ + + + + I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGTTETSKGTTVKDIKVEQLGEISIPECI 313
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
T + N ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336
>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
Length = 1140
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 75/338 (22%), Positives = 136/338 (40%), Gaps = 42/338 (12%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D + +L + P + + Q ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVSPETISSLMSAQLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
V T+ V E+ + GR+++F N +T+V ++ G
Sbjct: 832 V----------------ATSLVIPEEPEPKVGRIIIFHYNENK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
+ + + + L L+++L + V + + +R F
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKLVGKIGHKFYRNF 1077
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251
Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
+F +A N + LL G L +L + G V+ + + + +
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
IT + N ++G+R GDS LV+
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335
>gi|449488592|ref|XP_004158102.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Cucumis sativus]
Length = 570
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 108/504 (21%), Positives = 194/504 (38%), Gaps = 117/504 (23%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + ++A L+ + ++G + +L + G +D + +A E K VL++D
Sbjct: 39 RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94
Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
L +M + GR + + G + +DP R G+ +Y GL +I
Sbjct: 95 ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVI----- 142
Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
F + + N+R L+ V D F++G P +V+L++
Sbjct: 143 ----------------PFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSRPTIVVLYQDNK 185
Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
H + + P WS NL + A L+ VP P+ GV+++G T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I Y S +A A+ + + R+ V+ D + LL G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279
Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
L + ++ V L + + + S I+ + N+ ++GS GDS LV+
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328
Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
+++ DA + S + L+ VN + + + + T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375
Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
D G L+ G+ IN AS VEL G KG+W++
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405
Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
++ DD + +L++S + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428
>gi|357496593|ref|XP_003618585.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355493600|gb|AES74803.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 702
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 100/444 (22%), Positives = 185/444 (41%), Gaps = 63/444 (14%)
Query: 998 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV-GHQIDNHNLSSVDLHR 1056
IPL+ TP + ++ L +I S +Q ++E G + ++ + D H
Sbjct: 306 IPLRYTPMKFVLQPKRKLLVVIES-------DQGAFTAEEREANGGEDEDKDDPLSDEHY 358
Query: 1057 TYTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
Y E + +RIL+P T + +Q +E A + V N KE TLL
Sbjct: 359 GYPKAESDKWASCIRILDPKTG----NTTCLLELQDNEAAFSGCTV---NFHDKEYGTLL 411
Query: 1112 AIGTAYVQGEDVAARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTA +G R + R ++ ++L ++ +++G AL+ QG LL
Sbjct: 412 DVGTA--KGLQFTPRRSLTAGFIHIYRFLEDGRSLEL-LHKTQVEGVPLALSQFQGRLLA 468
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
GP + + L P +VS+ ++ I +GD +S ++ ++ QL
Sbjct: 469 GIGPVLRFYDLGKRRLLRKYENKLFPNTIVSIQTYRDRIYVGDTQESFHYCKYRWDENQL 528
Query: 1230 NLLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
+ A D C + ID T++ + D +I + K++ + K+
Sbjct: 529 YIFADD-----CVPRWLTASYHIDFDTMAGIEEDPTGG-RIKWEQGKLNGA--PNKVEEI 580
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
+FHVG ++ + ++ PG + +L GT+ GSIG + D++
Sbjct: 581 VQFHVGDVISCLQKASLI----------PGGGE----CILNGTVMGSIGALHAFTSRDDV 626
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 627 DF--FSHLEMHMRQDNPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 677
Query: 1403 QLEIAHQTGTTRSQILSNLNDLAL 1426
Q +IA + TR +IL L + +
Sbjct: 678 QRKIADELDRTRGEILKKLEEYKI 701
>gi|409075182|gb|EKM75565.1| hypothetical protein AGABI1DRAFT_64324 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1213
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 115/466 (24%), Positives = 196/466 (42%), Gaps = 76/466 (16%)
Query: 998 IPLKATPHQITYFAEKNLYPLIV-------SVPVLKPLNQVL--SLLIDQEVGHQIDNHN 1048
IPL TP + + NL+ LI V K LN++ + IDQEV N
Sbjct: 788 IPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELRQQNRAIDQEV------LN 841
Query: 1049 LSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
LS R +RI++P +T + +P+ +E+A ++ VV + K
Sbjct: 842 LSPEVFGRPKAANGTWASNIRIIDPVEG----KTISVVPLDGNESAFSLAVVPF---SAK 894
Query: 1106 ENETLLAIGTAYVQGEDVAARG------RVLLF-STGRNADNPQNLVTEVYSKELKGAIS 1158
NE L +GTA ++ R RV F GR + V+ E+
Sbjct: 895 GNELHLVVGTA--ADTKLSPRTCSTGFLRVYKFLEDGRQLE-------LVHKTEIDDVPL 945
Query: 1159 ALASLQGHLLIASGPKIILH----KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
AL + QG L+ G + ++ K ++ F A +V+L+ + IL+GD+
Sbjct: 946 ALMAFQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSA----IVTLSTQGSRILVGDMQ 1001
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+SI+F +K +L + A D + ++D +T +V +D NI + P++S
Sbjct: 1002 ESIFFAVYKAPENRLLIFADD-SQPRWISAATMVDYNT--VVAADRFGNIFVNRLDPRVS 1058
Query: 1275 ----ESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLF 1326
E G +L ++GA H TK + + L TS + G R LL+
Sbjct: 1059 DQVDEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLVAG----GREVLLY 1114
Query: 1327 GTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
L G+IG + P +++ F + +L++ + + G + +R ++ KA
Sbjct: 1115 TGLHGTIGILVPFVTKEDVDF--ISTLEQHMRTEQVSLVGRDHLGWRGYYVPVKA----- 1167
Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+VD +L Y LP +Q IA + + ++L L L + S
Sbjct: 1168 --VVDGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQLRVTAS 1211
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
S TS SSG+ E +A P + R + AL D + + +
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423
Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
N N+++ Q F+ R + + L+ +GL + S+ +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467
Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
+ T+A + G ++QV G R I+ +
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKTIVAATTNKRQVVV 571
Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
Y DL G N + STVL++SI D PY+ +G D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630
Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
++ DP + T+S+Q P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655
>gi|124359136|gb|ABD32504.2| CPSF A subunit, C-terminal; WD40-like [Medicago truncatula]
Length = 632
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 98/443 (22%), Positives = 184/443 (41%), Gaps = 65/443 (14%)
Query: 998 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
IPL+ TP + ++ L +I S +Q ++E + ++ + D H
Sbjct: 240 IPLRYTPMKFVLQPKRKLLVVIES-------DQGAFTAEEREAAKKDEDKDDPLSDEHYG 292
Query: 1058 YTVEEYE-----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1112
Y E + +RIL+P T + +Q +E A + V N KE TLL
Sbjct: 293 YPKAESDKWASCIRILDPKTG----NTTCLLELQDNEAAFSGCTV---NFHDKEYGTLLD 345
Query: 1113 IGTAYVQGEDVAARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
+GTA +G R + R ++ ++L ++ +++G AL+ QG LL
Sbjct: 346 VGTA--KGLQFTPRRSLTAGFIHIYRFLEDGRSLEL-LHKTQVEGVPLALSQFQGRLLAG 402
Query: 1171 SGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLN 1230
GP + + L P +VS+ ++ I +GD +S ++ ++ QL
Sbjct: 403 IGPVLRFYDLGKRRLLRKYENKLFPNTIVSIQTYRDRIYVGDTQESFHYCKYRWDENQLY 462
Query: 1231 LLAKDFGSLDC----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+ A D C + ID T++ ++ +I + K++ + K+
Sbjct: 463 IFADD-----CVPRWLTASYHIDFDTMA----EDPTGGRIKWEQGKLNGA--PNKVEEIV 511
Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELT 1343
+FHVG ++ + ++ PG + +L GT+ GSIG + D++
Sbjct: 512 QFHVGDVISCLQKASLI----------PGGGE----CILNGTVMGSIGALHAFTSRDDVD 557
Query: 1344 FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQ 1403
F L+ + P + G + ++R A+ P D ++D +L + LP++ Q
Sbjct: 558 F--FSHLEMHMRQDNPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDLQ 608
Query: 1404 LEIAHQTGTTRSQILSNLNDLAL 1426
+IA + TR +IL L + +
Sbjct: 609 RKIADELDRTRGEILKKLEEYKI 631
>gi|225448823|ref|XP_002282354.1| PREDICTED: splicing factor 3B subunit 3-like [Vitis vinifera]
Length = 1214
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 162/392 (41%), Gaps = 77/392 (19%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RIL+P A T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 863 IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 907
Query: 1125 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
+ L F R+ D + ++ +++G AL QG LL G
Sbjct: 908 ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 964
Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
+ L+ L P +VS++ ++ I +GDI +S ++ ++ QL + A
Sbjct: 965 LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 1024
Query: 1235 DFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL 1282
D S+ + T + ID T++ +D+ NI +S+ W+ KL
Sbjct: 1025 D--SVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKL 1080
Query: 1283 LSR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
+FHVG VT + ++ PG + +++GT+ GS+G
Sbjct: 1081 NGAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIIYGTVMGSLGA 1126
Query: 1336 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+ D++ F L+ + P + G + ++R A+ P D ++D +L
Sbjct: 1127 LLAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLC 1177
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ LPL+ Q +IA + T +IL L ++
Sbjct: 1178 EQFPTLPLDLQRKIADELDRTPGEILKKLEEV 1209
>gi|426192113|gb|EKV42051.1| hypothetical protein AGABI2DRAFT_229642 [Agaricus bisporus var.
bisporus H97]
Length = 1213
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 115/466 (24%), Positives = 196/466 (42%), Gaps = 76/466 (16%)
Query: 998 IPLKATPHQITYFAEKNLYPLIV-------SVPVLKPLNQVL--SLLIDQEVGHQIDNHN 1048
IPL TP + + NL+ LI V K LN++ + IDQEV N
Sbjct: 788 IPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELRQQNKAIDQEV------LN 841
Query: 1049 LSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTK 1105
LS R +RI++P +T + +P+ +E+A ++ VV + K
Sbjct: 842 LSPEVFGRPKAANGTWASNIRIIDPVEG----KTISVVPLDGNESAFSLAVVPF---SAK 894
Query: 1106 ENETLLAIGTAYVQGEDVAARG------RVLLF-STGRNADNPQNLVTEVYSKELKGAIS 1158
NE L +GTA ++ R RV F GR + V+ E+
Sbjct: 895 GNELHLVVGTA--ADTKLSPRTCSTGFLRVYKFLEDGRQLE-------LVHKTEIDDVPL 945
Query: 1159 ALASLQGHLLIASGPKIILH----KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
AL + QG L+ G + ++ K ++ F A +V+L+ + IL+GD+
Sbjct: 946 ALMAFQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSA----IVTLSTQGSRILVGDMQ 1001
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+SI+F +K +L + A D + ++D +T +V +D NI + P++S
Sbjct: 1002 ESIFFAVYKAPENRLLIFADD-SQPRWISAATMVDYNT--VVAADRFGNIFVNRLDPRVS 1058
Query: 1275 ----ESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLF 1326
E G +L ++GA H TK + + L TS + G R LL+
Sbjct: 1059 DQVDEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLVAG----GREVLLY 1114
Query: 1327 GTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
L G+IG + P +++ F + +L++ + + G + +R ++ KA
Sbjct: 1115 TGLHGTIGILVPFVTKEDVDF--ISTLEQHMRTEQVSLVGRDHLGWRGYYVPVKA----- 1167
Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+VD +L Y LP +Q IA + + ++L L L + S
Sbjct: 1168 --VVDGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQLRVTAS 1211
Score = 45.4 bits (106), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
S TS SSG+ E +A P + R + AL D + + +
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423
Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
N N+++ Q F+ R + + L+ +GL + S+ +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467
Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
+ T+A + G ++QV G R I+ +
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKIIVAATTNKRQVVV 571
Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
Y DL G N + STVL++SI D PY+ +G D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630
Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
++ DP + T+S+Q P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655
>gi|395333071|gb|EJF65449.1| hypothetical protein DICSQDRAFT_178021 [Dichomitus squalens LYAD-421
SS1]
Length = 1213
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 117/495 (23%), Positives = 199/495 (40%), Gaps = 72/495 (14%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTY--DNYWPVQKIPLKATPHQ---ITYFAEKNLYPLI 1019
C G I + S +L+I Q+P T + P+ P K PH + Y E + + ++
Sbjct: 759 CQEGLIGI-SGSLLRIFQIPKLGTKLKQDSIPLSYTPRKLIPHPHNGLLYLIEGD-HRVM 816
Query: 1020 VSVPVLKPLNQVLSLLIDQEVGHQIDNH--NLSSVDLHRTYTVE---EYEVRILEPDRAG 1074
K L Q+ +E G +D NL R +RI+ P A
Sbjct: 817 SEEAAAKQLQQL------RESGRAVDEEMVNLPPEQFGRPKAPAGTWASCIRIISPLDA- 869
Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF-S 1133
QT I + ++E A ++ +V K NE L +GTA Q +A R F
Sbjct: 870 ---QTVNVIHLDNNEAAFSLAIVPF---AAKNNELHLVVGTA--QDTFLAPRSCTSGFLR 921
Query: 1134 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
T R D+ +NL ++ E A+ + QG L+ G + L+ +L
Sbjct: 922 TYRFTDDGRNLEL-LHKTETNDVPLAIMAFQGKLVAGVGKALRLYDIGKKKLLRKVENKT 980
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
+V+LN + I++GD+ +S+++ +K +L + A D AT L D +T
Sbjct: 981 LGSTIVTLNTQGSRIIIGDMQESVFYAVYKPPENRLLVFADDVQPRWVTATTML-DYNT- 1038
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLLSRAEFHVGAHVT 1295
+V SD N+ + K+S+ KG K A FH+G VT
Sbjct: 1039 -VVASDRFGNVFVNRLDAKISDQIDDDPTGAGILHEKGVLFGAPHKTAMLAHFHIGDIVT 1097
Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKL 1354
++ ++A R +L+ L G+IG + P + + L +L++ +
Sbjct: 1098 SLNKISLVAGG--------------REVILYTCLHGTIGILVPFVSKEDVDLLTTLEQHM 1143
Query: 1355 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
+ G + ++R ++ KA +VD +L + LP +Q IA + T
Sbjct: 1144 RTEQLSLVGRDHLAWRGYYVPVKA-------VVDGDLCESFAKLPANKQSSIAGELDRTV 1196
Query: 1415 SQILSNLNDLALGTS 1429
++L L L + S
Sbjct: 1197 GEVLKKLEQLRVTAS 1211
>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1156
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 78/345 (22%), Positives = 145/345 (42%), Gaps = 46/345 (13%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ ++G +++ HNL +D H + ++ MQ+ E A+++
Sbjct: 795 NDQLGQEVEIHNLLIIDQHTFEVLHAHQF-------------------MQT-EYAMSIVS 834
Query: 1097 VTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
L N + T +GTA V E +GR+++F D V E +E+KG
Sbjct: 835 TRLGN----DPNTYYIVGTANVLPDESDPKQGRIVVFHW---VDGKLEHVAE---QEIKG 884
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGD 1212
A ++ G LL A + L +W EL F + LY L +F+L+GD
Sbjct: 885 APYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALY---LRAKGDFVLVGD 941
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ +S+ L++K +A+D+ + + E L D + L ++ N+ +
Sbjct: 942 LMRSMSLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTFLG---AESTTNLFVCQKDSA 998
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ + Q L +FH+G V F ++ T + + ++LFGT+ G+
Sbjct: 999 ATTDEERQHLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSP------TQGSVLFGTIHGA 1052
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IG ++ L + L +Q+KL + V ++ +R F + K
Sbjct: 1053 IGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTERK 1097
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 90/401 (22%), Positives = 154/401 (38%), Gaps = 81/401 (20%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + +LK + W
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P G L++G +I YH+ + Y V + L R S V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259
Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
++DA + +L D+A G L +L + + ++ V+ L L +
Sbjct: 260 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 313
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
IT + N + ++GSRLGDS L++ + E F ++
Sbjct: 314 ITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFTNL---------------GP 358
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
+ DM + T S A ++ G L+ G+ I+ AS
Sbjct: 359 IVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS----- 401
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KG+W + + R E L++S +T VL +
Sbjct: 402 -------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQTRVLMLSG 454
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
E TE + +T GN+ +++IQV R++D
Sbjct: 455 EEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 494
>gi|399216895|emb|CCF73582.1| unnamed protein product [Babesia microti strain RI]
Length = 1232
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 98/406 (24%), Positives = 164/406 (40%), Gaps = 90/406 (22%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
VRI++P T A + + + E A++ L E LA+GT V G ++A
Sbjct: 866 VRIIDPT----TLSTAAKLLLDTDEAAISCCACDL------EGYRCLAVGT--VTGWNLA 913
Query: 1125 ARG----RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
+ +++ G N + +T ++S ++ G AL + +G LL GP +IL+
Sbjct: 914 NSNSNSCHIRMYAYGPNFE-----ITFLHSTKVTGIPRALLAYEGRLLAGVGPDVILYAL 968
Query: 1181 TGTELNGIAFYD-----------APP--------LYVVSLNIVKNFILLGDIHKSIYFLS 1221
+L A Y A P V+ L N I +GDI +SI L
Sbjct: 969 GKRQLLKKAEYRGGVIDIQGYGVATPRTIGNGGLFGVMWLGASGNRIFVGDIRESITVLK 1028
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES----- 1276
+ EQ A+L+L+ D ++D T++LV D+ + + S S
Sbjct: 1029 FDEQMAKLSLICDDIRP-RWITGATVLDHHTVALV--DKFDTFAVCRVPSEASASNLSSA 1085
Query: 1277 --------------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRF 1322
G K A+FH+G L+T D+ G +
Sbjct: 1086 LNSGSLEAVMPTILGVGNKFEQEAQFHLGD----------LSTCIDKVTLCSGCTE---- 1131
Query: 1323 ALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
A+++ T+ GSIG + P DEL LQ L+ + + P ++G +R ++
Sbjct: 1132 AVVYATILGSIGALIPFISSDELD--TLQHLELLMANENPPLSGREHSIYRSYY------ 1183
Query: 1380 RPGP-DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
GP ++D +L +E L Q IA + T ++I+ L D+
Sbjct: 1184 --GPVQHVIDGDLCEEFESLDSITQSRIAAKIDKTVTEIIKKLRDI 1227
>gi|296814646|ref|XP_002847660.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
gi|238840685|gb|EEQ30347.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
Length = 1235
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 101/462 (21%), Positives = 188/462 (40%), Gaps = 52/462 (11%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C G + + Q + ++ S DN + IPL T + E YP+
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNLLQESIPLAYTSRSLVRHPE---YPIFY--- 808
Query: 1024 VLKPLNQVLS-----LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGP 1076
V+ N VLS L+ + DN L D + +++++P A
Sbjct: 809 VIGSDNNVLSPSTKAKLLSESTTVNGDNAELPPEDFGYPRGTNHWASCIQVVDPINAKA- 867
Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
+ I ++ +E A+++ V+ T++E+ET L +GT +G V+ R F
Sbjct: 868 --VMSRIELEDNEAAVSIAAVSF---TSQEDETFLVVGTG--KGMVVSPRSFTCGFIHIY 920
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
+ ++ +++ AL QG LL GP + ++ +L P
Sbjct: 921 RFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQITPR 980
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
+V L + I++ D+ +S+ ++ +K Q L A D T ++D T++
Sbjct: 981 VIVGLQTQGSRIIVSDVQESVTYVVYKYQENNLIPFADDIIPRWTTCTT-MVDYETVA-- 1037
Query: 1257 VSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSD 1308
D+ NI + PK SE G L+ ++ GA + Q + TS
Sbjct: 1038 GGDKFGNIWLLRCPPKASEEADEDGSGAHLIHERQYLQGAPNRLSLVAHFYSQDIPTSIQ 1097
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
+T G R L++ L G++G + P D++ F Q+L+ +L P +AG +
Sbjct: 1098 KTQLVAG----GRDILVWTGLQGTVGMLIPFVTRDDVDF--FQTLEMQLTSQNPPLAGRD 1151
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+R +++ K ++D +L + +LP +++ IA
Sbjct: 1152 HLIYRGYYAPCKG-------VIDGDLCETFFLLPNDKKQAIA 1186
>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1181
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 78/345 (22%), Positives = 145/345 (42%), Gaps = 46/345 (13%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ ++G +++ HNL +D H + ++ MQ+ E A+++
Sbjct: 820 NDQLGQEVEIHNLLIIDQHTFEVLHAHQF-------------------MQT-EYAMSIVS 859
Query: 1097 VTLFNTTTKENETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
L N + T +GTA V E +GR+++F D V E +E+KG
Sbjct: 860 TRLGN----DPNTYYIVGTANVLPDESDPKQGRIVVFHW---VDGKLEHVAE---QEIKG 909
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGD 1212
A ++ G LL A + L +W EL F + LY L +F+L+GD
Sbjct: 910 APYSMLEFNGKLLAAINSTVRLFEWNAERELRNECSHFNNILALY---LRAKGDFVLVGD 966
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ +S+ L++K +A+D+ + + E L D + L ++ N+ +
Sbjct: 967 LMRSMSLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTFLG---AESTTNLFVCQKDSA 1023
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+ + Q L +FH+G V F ++ T + + ++LFGT+ G+
Sbjct: 1024 ATTDEERQHLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSP------TQGSVLFGTIHGA 1077
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
IG ++ L + L +Q+KL + V ++ +R F + K
Sbjct: 1078 IGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTERK 1122
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 92/420 (21%), Positives = 156/420 (37%), Gaps = 94/420 (22%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + +LK + W
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P G L++G +I YH+ + Y V + L R S V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259
Query: 363 ---ELDAAHATWLQNDVA-------------------LLSTKTGDLVLLTVVYDGRV--- 397
++DA + +L D+A LL G L +L + + ++
Sbjct: 260 CYGKVDANGSRYLLGDMAGRLFMLLLEREDKMDGTXYLLGDMAGRLFMLLLEREDKMDGT 319
Query: 398 --VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
V+ L L + IT + N + ++GSRLGDS L++ + E F
Sbjct: 320 TTVKDLKLEFLGEITIAECITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFT 379
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
++ + DM + T S A ++ G L
Sbjct: 380 NL---------------GPIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSL 412
Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
+ G+ I+ AS ++LPG KG+W + + R E
Sbjct: 413 RIIRNGIGIHEHAS------------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAER 460
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
L++S +T VL + E TE + +T GN+ +++IQV R++D
Sbjct: 461 DNTLVLSFVRQTRVLMLSGEEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 519
>gi|296086939|emb|CBI33172.3| unnamed protein product [Vitis vinifera]
Length = 934
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 92/392 (23%), Positives = 162/392 (41%), Gaps = 77/392 (19%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RIL+P A T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 583 IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 627
Query: 1125 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
+ L F R+ D + ++ +++G AL QG LL G
Sbjct: 628 ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 684
Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
+ L+ L P +VS++ ++ I +GDI +S ++ ++ QL + A
Sbjct: 685 LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 744
Query: 1235 DFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKL 1282
D S+ + T + ID T++ +D+ NI +S+ W+ KL
Sbjct: 745 D--SVPRWLTASYHIDFDTMA--GADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKL 800
Query: 1283 LSR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
+FHVG VT + ++ PG + +++GT+ GS+G
Sbjct: 801 NGAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CIIYGTVMGSLGA 846
Query: 1336 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+ D++ F L+ + P + G + ++R A+ P D ++D +L
Sbjct: 847 LLAFTSRDDVDF--FSHLEMHMRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLC 897
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ LPL+ Q +IA + T +IL L ++
Sbjct: 898 EQFPTLPLDLQRKIADELDRTPGEILKKLEEV 929
>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
Length = 1140
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 114/267 (42%), Gaps = 51/267 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+++Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L + G V+ + + K +
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
IT + N ++G+R GDS LV+ + S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 78/360 (21%), Positives = 142/360 (39%), Gaps = 42/360 (11%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D T+ V L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID-QNTFEV-------LHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
V E E + GR+++F + +T+V ++ G
Sbjct: 832 VATSLVIPDEPEPKV---------------GRIIIFHYHDSK------LTQVAETKVDGT 870
Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +S
Sbjct: 871 CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
I L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 931 ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987
Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 988 EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040
Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ + + + L L+++L + V + +R F N K + +D +L+ +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098
>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
Length = 1104
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 138/323 (42%), Gaps = 51/323 (15%)
Query: 1106 ENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
+N IGTA+V E +RGR+L+ +N + + V+ E G++ L +
Sbjct: 779 DNTEYFIIGTAFVDPTETQPSRGRILI----SKLENKKEIAI-VHECEAAGSVYCLTKMC 833
Query: 1165 GH----LLIASGPKIILHKWTGT---------ELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G L+ +++ K+ T ++G + A VVSL+ + +L+G
Sbjct: 834 GKDTDDLVAGINNQVVHFKYDATGQDAAKKLRAVSGNQNFGA----VVSLDSCDDIVLVG 889
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA----TEFLIDGSTLSLVVSDEQKNIQIF 1267
D+ +++ + + QL ++ + A T FL+ SL V + F
Sbjct: 890 DMLNAVFVMQKAQDKLQLVAGSQTANWVSSCALVNETVFLVASHAHSLSVCQRE-----F 944
Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT----NRFA 1323
M Q L ++ E ++G VT F+R + G+A D + N F
Sbjct: 945 EPGSTM------QTLNAKFEIYLGETVTSFVRAAL--------GSAAAVDSSMPLRNTF- 989
Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
+FGT+ G + C+ PL L +L+ ++ + + + GL+ R FR + +
Sbjct: 990 FVFGTMGGGLACLLPLTPPQTELLTALECRMEEKIGGLGGLDHREFRTARDEQRMAQQVN 1049
Query: 1384 DSIVDCELLSHYEMLPLEEQLEI 1406
+VD +L+ + LP EEQ E+
Sbjct: 1050 PRLVDGDLVETFLQLPEEEQKEL 1072
>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
Length = 1140
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 79/360 (21%), Positives = 142/360 (39%), Gaps = 42/360 (11%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D T+ V L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID-QNTFEV-------LHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
V E E + GR+++F N +T+V ++ G
Sbjct: 832 VATSLVIPDEPEPKV---------------GRIIIFHYHDNK------LTQVAETKVDGT 870
Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +S
Sbjct: 871 CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALYLKAKGDFILVGDLMRS 930
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
I L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 931 ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987
Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 988 EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040
Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ + + + L L+++L + V + +R F N K + +D +L+ +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 113/267 (42%), Gaps = 51/267 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+++Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELSVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D + LL G L +L + G V+ + + K +
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
IT + N ++G+R GDS LV+ + S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339
>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
Length = 1140
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/360 (21%), Positives = 141/360 (39%), Gaps = 42/360 (11%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D + +L + P + + + ++ T V
Sbjct: 780 NAEVGQEIDVHNLLVID--------QNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYV 831
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGA 1156
V E E + GR+++F N +T+V ++ G
Sbjct: 832 VATSLVIPDEPEPKV---------------GRIIIFHYHDNK------LTQVAETKVDGT 870
Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
AL G +L G + L++WT + + + + L +FIL+GD+ +S
Sbjct: 871 CYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMRS 930
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
I L K+ +A+D A E L D + L S+ N+ + +
Sbjct: 931 ITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTD 987
Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
+ Q L A FH+G V F ++ + +RT G +L+GT +G+IG
Sbjct: 988 EERQLLPELARFHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGI 1040
Query: 1336 IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ + + + L L+++L + V + +R F N K + +D +L+ +
Sbjct: 1041 VTQIPQDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINTKVE--PSEGFIDGDLIESF 1098
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 113/264 (42%), Gaps = 51/264 (19%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKEASEL--------------------KATNLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
+D +V D F+HG + P ++++H+ GR H I+ K+ I
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 207 WKQDNVETEATMLITVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
++ +D+ + LL G L +L + G V+ + + + +
Sbjct: 260 CYA-RVDSKGFRY------LLGNMDGQLYMLFLGTSETSKGITVKDIKVEQLGEISIPEC 312
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
IT + N ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336
>gi|322707263|gb|EFY98842.1| Pre-mRNA-splicing factor rse-1 [Metarhizium anisopliae ARSEF 23]
Length = 1212
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 84/367 (22%), Positives = 160/367 (43%), Gaps = 38/367 (10%)
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
Q T+ ++++E A++ +V +++NE+ L +GT G+DV R
Sbjct: 870 QVVQTVDLENNEAAVSAAIVPF---ASQDNESFLIVGT----GKDVVVNPRNFSEAYIYV 922
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
R + + L ++ +++ AL QG LL G + ++ ++ A +
Sbjct: 923 YRFQEEGREL-EFIHKTKIEEPALALIPFQGKLLAGVGKTLRVYDLGMRQMLRKAQAEVA 981
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
P +VSLN + I++GDI + + ++++K +L A D + T ++D S
Sbjct: 982 PQQIVSLNTQGSRIIVGDIQQGVTYVTYKPTTNKLIPFADDTIARWTTCTT-MVDYE--S 1038
Query: 1255 LVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGA----HVTKFLRLQMLATS 1306
+ D+ N+ I PK S E G L++ ++ G + Q + TS
Sbjct: 1039 VAGGDKFGNMFIVRCPPKASEEADEEQSGLHLMNARDYLHGTSQRLDLMCHFYTQDIPTS 1098
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T G LL+ L G+IG PL ++ F QSL+ L P +AG
Sbjct: 1099 MAKTSLVVGGQD----VLLWSGLMGTIGVFIPLISREDADF--FQSLESHLRTEDPPLAG 1152
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+ +R +++ K I+D +L Y +LP +++ IA + + +I ++D
Sbjct: 1153 RDHLMYRSYYAPVKG-------IIDGDLCERYTLLPNDKKQMIAGELDRSVREIERKISD 1205
Query: 1424 LALGTSF 1430
+ ++F
Sbjct: 1206 IRTRSAF 1212
>gi|384080885|dbj|BAM11105.1| damage-specific DNA binding protein 1, 127kDa, partial
[Siebenrockiella crassicollis]
Length = 364
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 107/235 (45%), Gaps = 30/235 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELKGAISALA 1161
K+ T +GTA V E+ + GR+++F S G+ + + KE+KGA+ ++
Sbjct: 152 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFHYSDGK--------LQSLAEKEVKGAVYSMV 203
Query: 1162 SLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
G LL + + L++WT TE N + + LYV + +FIL+GD+ +S
Sbjct: 204 EFNGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALYVKTKG---DFILVGDLMRS 258
Query: 1217 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ L++K +A+DF A E L D + L ++ N+ + +
Sbjct: 259 VLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTD 315
Query: 1277 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG 1331
+ Q L FH+G V F ++ + T + P D ++LFGT++G
Sbjct: 316 EERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STPTQD-----SVLFGTVNG 364
>gi|429961863|gb|ELA41407.1| hypothetical protein VICG_01512 [Vittaforma corneae ATCC 50505]
Length = 1153
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 165/385 (42%), Gaps = 69/385 (17%)
Query: 1062 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1121
+Y + + PD ++ ++ M+ EN L + +FN L + T++ +GE
Sbjct: 804 QYSIELRSPD-----FKLISSTTME--ENELICDIKVIFNN-------FLVVCTSFPEGE 849
Query: 1122 DVAARGRVLLFSTGRNADNPQNL-VTE----VYSKELKGAISALASLQGHLLIASGPKII 1176
D +G+++++S +P NL +T+ + S+ LK ++ + + G K++
Sbjct: 850 DKMTKGKLIVYSLVNIVPDPDNLHITKKLKLICSETLKNPCLFCEEVRSLISVCVGTKLM 909
Query: 1177 LHKWT-GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA-QLNLLAK 1234
++++ T L + ++ L SL + KN I + DI IYF + + +L+LL +
Sbjct: 910 IYEFNENTGLAAVGRHEL-SLLCTSLFVTKNLIAVSDIMNGIYFFFLRPRDPLKLHLLGR 968
Query: 1235 D--------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
G +D F F D S+V + ++IF Y+P S G +L+ RA
Sbjct: 969 SCLVPNCRFLGGID-FCPSFETDALQFSIVSVCKYGIVRIFTYSPYDPVSKNGNQLVKRA 1027
Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDG------SIGCIAPLD 1340
E VTK A P + ++FG ++ S + L
Sbjct: 1028 EI-----VTKL--------------ANP------LYKVVFGQINEFESILLSSNVMVLLR 1062
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE--LLSHYEML 1398
+ F +LQ++Q + + + G+N R++ + + P S++ CE LL +
Sbjct: 1063 AINFPKLQAIQHCISIFISNRCGINVRNYLE---TEEFVNPECKSVI-CEKILLEFFYFK 1118
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLND 1423
PL ++ +I G I+ + D
Sbjct: 1119 PLVQE-KICKLVGLDYFNIVELIED 1142
>gi|237837399|ref|XP_002367997.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
gi|211965661|gb|EEB00857.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
gi|221488748|gb|EEE26962.1| splicing factor 3B subunit, putative [Toxoplasma gondii GT1]
gi|221509241|gb|EEE34810.1| splicing factor 3B subunit, putative [Toxoplasma gondii VEG]
Length = 1233
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 119/299 (39%), Gaps = 49/299 (16%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V+S ++ AL + +G LL G K+ L+ L Y P V + + +
Sbjct: 957 VHSTPVEDYPMALTAFRGMLLAGVGHKLRLYALGRKRLLKKCEYKNLPCGVAFIRVAGDR 1016
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL---SLVVSDEQKNI 1264
+ +GD+ +S++ + ++ +LA D +L G L + V +D+ ++
Sbjct: 1017 LFVGDVRESVHVMRYRLSENLFYVLADDV------VPRWLTKGEVLDYHTFVAADKFDSV 1070
Query: 1265 QIFYYAPKMSESWKGQ------------------KLLSRAEFHVGAHVTKFLRLQMLATS 1306
I + E G KL S FH+G VT R + + +
Sbjct: 1071 FICRVPSEAKEDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEIVTALERATLTSAA 1130
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLN 1365
S+ ++++GT+ GSIG +P L + L+ + P +AG
Sbjct: 1131 SE--------------SIVYGTIMGSIGSFSPFLTKHELDLFTHLEMVMRSEKPPLAGRE 1176
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
FR ++ K + VD +L Y +LP E+Q IA T + IL +L D+
Sbjct: 1177 HIMFRSYYHPAK-------NTVDGDLCESYALLPYEDQKRIAQDFEKTPADILKHLEDI 1228
>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
Length = 1209
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 123/569 (21%), Positives = 231/569 (40%), Gaps = 61/569 (10%)
Query: 887 LRNLRFSRTPLDAY------TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
L+N RT LD TR P + I + N+ G SR ++
Sbjct: 674 LQNGVLLRTVLDPVNGQLTDTRTRFLGSRPVRLIRV--NVHGLPSILALSSRSWLNYTYQ 731
Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPL 1000
L P + D A++ + C G I + S +L+I Q+P+ IPL
Sbjct: 732 NLLHFTPLIFDPLEYAWSFSAEL-CPDGLIGI-SGNVLRIFQVPN---LGQKLKQDVIPL 786
Query: 1001 KATPHQITYFAEKNLYPLIVSV-PVLKPLNQVLSLLIDQEVGHQIDNHNLS-SVDLHRTY 1058
TP ++ + L+ +I S VL P L + G ++D + D+
Sbjct: 787 SYTPRKMLQHPTERLFYVIESDHRVLSPEAADKKLQKLKSTGQRLDQEVIDLPADIFGRP 846
Query: 1059 TVEEYE----VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG 1114
+ ++I++P ++ +P+ ++E A ++ + T + E L +G
Sbjct: 847 RADAGTWASCIQIIDPANV----RSVLEVPLDNNEAAFSLAITTFI---ARPGELFLVVG 899
Query: 1115 TAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
TA +DV + T + +++ ++L ++ E+ AL S QG L+
Sbjct: 900 TA----QDVIVSPKSCKSGFLRTYKISEDGRSL-EFLHKTEVDDVPLALLSFQGRLVAGI 954
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
G + + L + +V+L+ + I++GD+ +SIYF ++K +L +
Sbjct: 955 GKALRIFDMGKKRLLRKCENKSFATAIVTLSTQGSRIIVGDMAESIYFATYKPPENRLLI 1014
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAE 1287
A D ++D T+ D+ N+ + PK+ E G +L
Sbjct: 1015 FADD-SQPRWITASAMVDYDTVC--AGDKFGNVFVNRLPPKVGEQVDEDPTGAGVLHEKG 1071
Query: 1288 FHVGA-HVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
+GA H T L + + TS + G R +L+ L G+IG + P +
Sbjct: 1072 LFMGAPHKTNMLAHYYVGDIITSMHKVALVTG----GRDIVLYTGLHGTIGVLIPFISKE 1127
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
++ F R +L++ + P + G + ++R ++ K +VD +L + +LP
Sbjct: 1128 DVDFIR--TLEQHMRTEAPSLVGRDHLTYRGYYVPVKG-------VVDGDLCELFSLLPT 1178
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
++Q IA + T S++L L L + T+
Sbjct: 1179 QKQQSIAGELDRTYSEVLKKLEQLRVTTT 1207
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 150/376 (39%), Gaps = 83/376 (22%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ ++ V+ + + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIDHEDEEVKTMKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 -TCGSGTSML--SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN-- 492
G + SS + G + + R R + L D +N + + +N
Sbjct: 368 QKLGDDDDEIEYSSVSYPDNGMADPIPQAYFRPRPLENLVLADELNSFDPIVDAKVTNLL 427
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
NT++ Q F+ R + + L+ +GL + S+ ELPG +W
Sbjct: 428 NTDTPQ-IFAACGRGARSSFRMLR---HGLDVEETVSS------------ELPGIPNAVW 471
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV K+ DD+Y AY+I+S T+VL + + EV+++ + T
Sbjct: 472 TVKLKA--------------DDQYDAYIILSFVNGTLVLSIGETIEEVSDT-GFLSSSPT 516
Query: 612 IAAGNLFGRRRVIQVFERGAR------------------ILDGS---------------- 637
IA + G ++QV+ G R I+ +
Sbjct: 517 IAVQQI-GEDSLLQVYPHGIRHVLSDRRVNEWRCPQHTTIVAATTNSRQVAIALSSAQLV 575
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDPST 689
Y DL G N S S VL++SIA+ PY+ +G D ++R++ DP T
Sbjct: 576 YFELDLE-GQLNEYQDRKSLGSGVLAMSIAEVPEGRQRTPYLAVGCEDQTVRIISLDPDT 634
Query: 690 C--TVSVQTPAAIESS 703
+S+Q A SS
Sbjct: 635 TLENISLQALTAPPSS 650
>gi|195996829|ref|XP_002108283.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
gi|190589059|gb|EDV29081.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
Length = 1208
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/390 (21%), Positives = 167/390 (42%), Gaps = 61/390 (15%)
Query: 1069 EPDRAGGPWQT-------RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1121
EP G W + ++ + E AL+V + K +ET + +G A
Sbjct: 852 EPKAGNGQWASCIQLLAPDQSLELDQDEAALSVAICRF---AYKPDETFVVVGVAKELNL 908
Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
+ ++ L+ +T R A+ LV + +E+ A+A+ QG LL+ +G + ++
Sbjct: 909 NPSSSSGGLM-NTYRMANGQLELVHKTVVEEVP---RAMAAFQGRLLVGTGRILRVYDLG 964
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
+L P +V+++ + + +++GD+ +S++F+ ++ + +L + A D
Sbjct: 965 RKKLLRKCENKNFPYRIVTISSMGSRVIVGDVQESVHFVKYRAKENRLVVFADDVSPRYV 1024
Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLL 1283
AT FL D T++ V D+ +I I + +++ W QK
Sbjct: 1025 TATCFL-DYDTIA--VGDKFGSIAILRLSDDINDEIEEDPTGAKAFWDRGLLNGASQKAN 1081
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
A F++G V + LQ +T PG ++ L++ TL GSIG + P +
Sbjct: 1082 LEASFYIGETV---MSLQ-------KTTIIPGGSES----LIYTTLSGSIGVLLPFTSRE 1127
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
E+ F Q L+ L + G + ++R ++ K +++D ++ + L
Sbjct: 1128 EVDF--FQHLEMHLRSENAPICGRDHLAYRSYYFPAK-------NVIDGDMCEQFNALDG 1178
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
++ +A + T +I L D+ +F
Sbjct: 1179 SKRRTLAMELDRTPPEISKKLEDMRTRYAF 1208
>gi|374095609|gb|AEY85032.1| spliceosomal-like protein [Camellia sinensis]
Length = 1212
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/388 (21%), Positives = 162/388 (41%), Gaps = 69/388 (17%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA---YVQGE 1121
+R+L+P A T + +Q +E A ++ L N KE TLLA+GTA +
Sbjct: 861 IRVLDPRTA----NTTCLLELQDNEAAFSI---CLVNFHDKEYGTLLAVGTAKGLQFWPK 913
Query: 1122 DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
+ G + ++ R ++ ++L ++ ++ AL QG LL G + L+
Sbjct: 914 RSISSGYIHIY---RFVEDGKSLEL-LHKTQVDDVPLALCQFQGKLLAGVGSVLRLYDLG 969
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
+L P + S++ ++ I +GDI +S ++ ++ QL + A D C
Sbjct: 970 KRKLLRKCENKLFPNTITSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFADD-----C 1024
Query: 1242 ----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR- 1285
+ ID T++ +D+ NI A +S+ W+ KL
Sbjct: 1025 VPRWLTASYHIDFDTMA--GADKFGNIYFVRLAQDVSDEIEEDPTGGKIKWEQGKLNGAP 1082
Query: 1286 ------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
+FHVG VT + ++ + + +++GT+ GS+G +
Sbjct: 1083 NKVEEIVQFHVGDVVTCLQKASLIPSGGE--------------CVIYGTVMGSLGALLAF 1128
Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
D++ F L+ + P + G + ++R A+ P D ++D +L +
Sbjct: 1129 TSRDDVDF--FSHLEMHMRQENPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQFP 1179
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
LP++ Q +IA + T +IL L ++
Sbjct: 1180 TLPMDMQRKIADELDRTPGEILKKLEEV 1207
>gi|167998730|ref|XP_001752071.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697169|gb|EDQ83506.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 172
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 8/96 (8%)
Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
E+ A+ ++ A RR S+ + +L F + R +HCFE PE+ +L R
Sbjct: 28 ETAALRTEAAAPGIHRRPSLTMRLR----IILAFTEC----RCLLIHCFEYPEYQYLNRS 79
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
RE FA V+ D GRC VL+Y Q++ LKA G
Sbjct: 80 RERFAMDLSVRADLVGRCASVLIYNSQLVTLKAGHG 115
>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
Length = 1140
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/344 (20%), Positives = 138/344 (40%), Gaps = 42/344 (12%)
Query: 1037 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1096
+ EVG +ID HNL +D T+ V +P ++ ++ ++
Sbjct: 780 NAEVGQEIDVHNLLIID-QNTFEV----------------LHAHQFVPPETISALMSAKL 822
Query: 1097 VTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKG 1155
+ T + T+ V ++ + GR+++F N +T+V ++ G
Sbjct: 823 -------GDDPNTYYVVATSLVFPDEPEPKVGRIIIFHYHENK------LTQVAETKVDG 869
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL G +L G + L++WT + + + + L +FIL+GD+ +
Sbjct: 870 TCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMR 929
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI L K+ +A+D A E L D + L D N+ + +
Sbjct: 930 SITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGCETHD---NLFVCQKDSAATT 986
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
+ Q L A FH+G + F ++ + +RT G +L+GT +G+IG
Sbjct: 987 DEERQLLPELARFHLGDTINVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIG 1039
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+ + + + L L+++L + V ++ +R + N K
Sbjct: 1040 IVTQIPQDFYDFLHGLEERLKKIIKSVGKIDHTYYRNYQINTKV 1083
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/263 (24%), Positives = 114/263 (43%), Gaps = 49/263 (18%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G + +DP+ R G+ +Y I+ + S L +LR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATSLR 158
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
+D V D F+HG + P ++++H+ +H C L +K L W
Sbjct: 159 -MDELIVYDVEFLHGCLNPTVIVIHKDN-------DGRHVKCHEINLRDKEFMK---LAW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ + ++ +
Sbjct: 208 KQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRV--VQRLDLSKTNPSVLTSDI 415
++ +D+ + LL G L +L + + G+V V+ + + + + I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGINETGKVPTVKDIKVEQLGEISIPECI 313
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
T + N ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336
>gi|330790247|ref|XP_003283209.1| CPSF domain-containing protein [Dictyostelium purpureum]
gi|325086890|gb|EGC40273.1| CPSF domain-containing protein [Dictyostelium purpureum]
Length = 1233
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/289 (21%), Positives = 128/289 (44%), Gaps = 29/289 (10%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
VY E + + A+A QG L G I ++ +L P +V+++ + +
Sbjct: 956 VYKTEAEEPVYAMAPFQGRLCAGVGKNIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDR 1015
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
+++GDI +SI+F+ +K+ L + A D + ++D T++ +D+ NI I
Sbjct: 1016 LVVGDIQESIHFIKYKKLENMLYVFADDLAP-RWITSSVMLDYDTVA--GADKFGNIFIL 1072
Query: 1268 YYAPKMSESWKGQKLLSRAEFHVG---------AHVTKFLRLQMLATSSDRTGAAPGSDK 1318
+S+ + S+ +F G H+ + T + + GSD
Sbjct: 1073 RLPSNVSDEVEEDPTGSKLKFESGLLNGAPHKLEHIANIFAGDAITTLNKTSLVVGGSD- 1131
Query: 1319 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
LL+ T+ G+IG + P +++ F SL+ +L + + G + ++R ++
Sbjct: 1132 ----VLLYTTISGAIGALIPFVSREDVDF--FSSLELQLRNEHAPLCGRDHLAYRSYYFP 1185
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
K +I+D +L + L ++Q +IA + + S++L L D+
Sbjct: 1186 VK-------NIIDGDLCEQFITLDPQKQRQIAEELSRSPSEVLKKLEDI 1227
>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
Length = 1294
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 114/275 (41%), Gaps = 26/275 (9%)
Query: 1157 ISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
I+ +AS QGHLL+A G + ++ + L A P + + + I L D+ +
Sbjct: 1002 ITTMASFQGHLLVAVGTSLRMYDLGKKQLLKKTQHPRATPHKITCIETCYDRIFLSDVQE 1061
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK--- 1272
S++ + +A D+ C T L+D T++ + D+ NI I P+
Sbjct: 1062 SVFLYRYSAADNLFLCIADDYLPKWC-TTMCLLDYDTVA--IGDKMGNISILRLPPEAGT 1118
Query: 1273 -MSESWKGQKLLSRAEFHVGAHVTKFL--RLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
+ + G L A H ++ +Q L+ TG P L +GTL
Sbjct: 1119 FIEQDPTGGLLSKEAPHHFQLEACYYVGSVIQCLSKVEWTTGDVP--------LLFYGTL 1170
Query: 1330 DGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
DG+IG + PL L Q+L+ +L + + G + ++R + + ++D
Sbjct: 1171 DGAIGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLAYRSYFFPVR-------HVID 1223
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+L + L LE+Q +I + + + L D
Sbjct: 1224 GDLCEEFYRLSLEQQEKIVKELDRSIVDVHRKLED 1258
>gi|336371417|gb|EGN99756.1| hypothetical protein SERLA73DRAFT_88390 [Serpula lacrymans var.
lacrymans S7.3]
gi|336384183|gb|EGO25331.1| hypothetical protein SERLADRAFT_355643 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1216
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 165/379 (43%), Gaps = 39/379 (10%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RI++P A +T + I + S+E A ++ +V + + NE L +GTA +A
Sbjct: 861 IRIIDPVEA----KTLSMITLDSNECAFSLAIVPF---SARGNELHLVVGTA--ADTFLA 911
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
R F + ++ E AL + QG L+ G + ++ +
Sbjct: 912 PRSCSSGFLRTYKFTEDGTGLELLHKTETDDVPLALMAFQGRLVAGVGKALRIYDIGKKK 971
Query: 1185 LNGIAFYDAPPLY---VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
L A + +V+LN + I++GD+ +SI F+++K +L + A D
Sbjct: 972 LLRKVENKARATFSTAIVTLNTQGSRIIVGDMQESISFVAYKAPENRLLVFADDNQPRWI 1031
Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTK 1296
AT ++D +T++ D NI + PK+SE G +L +GA H TK
Sbjct: 1032 TATT-MVDYTTIA--AGDRFGNIFVNRLDPKVSEQVDDDPTGAGILHEKGLLMGAPHKTK 1088
Query: 1297 FL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSL 1350
L + L TS ++ G R LL+ L G+IG + P +++ F + +L
Sbjct: 1089 MLAHFHIGDLVTSINKVSLVAG----GREVLLYTGLHGTIGILVPFVSKEDVDF--ISTL 1142
Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
++ + + G + S+R +++ K S+VD +L Y LP +Q IA +
Sbjct: 1143 EQHMRTEQGSLVGRDQLSWRGYYTPVK-------SVVDGDLCETYARLPGTKQSAIAGEL 1195
Query: 1411 GTTRSQILSNLNDLALGTS 1429
T ++L L L + S
Sbjct: 1196 DRTVGEVLKKLEQLRVTAS 1214
>gi|413935524|gb|AFW70075.1| hypothetical protein ZEAMMB73_605375 [Zea mays]
Length = 1229
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 121/554 (21%), Positives = 213/554 (38%), Gaps = 91/554 (16%)
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+S Q SRP + + + P CD ++ + + C+ G + V +
Sbjct: 713 VSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCD-TLESAASFSSDQCSEGVVAVAGDALRI 771
Query: 980 ICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVL-------KPLNQVL 1032
G T++ IPL+ TP + +K +I S + L
Sbjct: 772 FTIERLGETFNE----TAIPLRYTPRKFVILPKKKYIAVIESDKGAFSAEEREAAKKECL 827
Query: 1033 SLLIDQEVGHQIDNHNLSSVDLH------RTYTVEEYE------------VRILEPDRAG 1074
E G+ + + + D T+ E+Y +RIL+P
Sbjct: 828 DASGAAENGNANNGDPMENGDGQDGAEEGNTFPDEQYGYPKAESERWVSCIRILDPRSR- 886
Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---L 1131
T + +Q +E A+++ V N KE+ TLLAIGTA +G R R L
Sbjct: 887 ---DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAIGTA--KGLQFWPR-RTLAGGF 937
Query: 1132 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1191
+ D ++L ++ +++ AL QG LL G + L+ +L
Sbjct: 938 IHVYKFVDEGRSLEL-LHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLGKRKLLRKCEN 996
Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
P +VS++ ++ I +GD+ +S ++ ++ QL + A D T ID
Sbjct: 997 KLFPRTLVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADD-SVPRWLTTAQHIDFD 1055
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR-------AEFHVGAH 1293
T++ +D+ NI +S+ W+ KL +FHVG
Sbjct: 1056 TMA--GADKFGNIYFARLPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDV 1113
Query: 1294 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSL 1350
VT + ++ PG + L++GT+ GS+G + +++ F L
Sbjct: 1114 VTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSREDVDF--FSHL 1157
Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQT 1410
+ L P + G + ++R A+ P D ++D +L Y LP + Q +IA +
Sbjct: 1158 EMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQYPSLPADMQRKIADEL 1210
Query: 1411 GTTRSQILSNLNDL 1424
T +IL L D+
Sbjct: 1211 DRTPGEILKKLEDI 1224
>gi|326483043|gb|EGE07053.1| pre-mRNA-splicing factor rse1 [Trichophyton equinum CBS 127.97]
Length = 1209
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 106/488 (21%), Positives = 201/488 (41%), Gaps = 58/488 (11%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + Q + ++ S DN + IPL TP E L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT 1082
+L P + + L+ + D+ L D + I D P T++
Sbjct: 815 NILSPATK--AKLLSESTAVNGDSAELPPEDFGYPRGTNHWASSIQVVD----PIHTKSV 868
Query: 1083 I---PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG---- 1135
+ ++ +E A+++ V+ T++E+ET L +GT G+D+ R F+ G
Sbjct: 869 LSNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHI 919
Query: 1136 -RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
R + + L ++ +++ AL QG LL GP + ++ +L
Sbjct: 920 YRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQIT 978
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
P +V L + I++ D+ +S+ ++ +K Q L A D T ++D T++
Sbjct: 979 PRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETVA 1037
Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLAT 1305
D+ NI + K SE G L+ ++ GA V F Q + T
Sbjct: 1038 --GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPT 1094
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
S +T G R L++ L G++G P D++ F Q+L+ +L P +A
Sbjct: 1095 SIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLA 1148
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
G + +R +++ K ++D +L + +LP +++ IA + + +I ++
Sbjct: 1149 GRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKIS 1201
Query: 1423 DLALGTSF 1430
D+ ++
Sbjct: 1202 DMRTKVAY 1209
>gi|326469377|gb|EGD93386.1| splicing factor 3B subunit 3 [Trichophyton tonsurans CBS 112818]
Length = 1188
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 106/488 (21%), Positives = 201/488 (41%), Gaps = 58/488 (11%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + Q + ++ S DN + IPL TP E L+ +I S
Sbjct: 738 QCVEGMVGIQGQNL----RIFSIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDN 793
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT 1082
+L P + + L+ + D+ L D + I D P T++
Sbjct: 794 NILSPATK--AKLLSESTAVNGDSAELPPEDFGYPRGTNHWASSIQVVD----PIHTKSV 847
Query: 1083 I---PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG---- 1135
+ ++ +E A+++ V+ T++E+ET L +GT G+D+ R F+ G
Sbjct: 848 LSNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHI 898
Query: 1136 -RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
R + + L ++ +++ AL QG LL GP + ++ +L
Sbjct: 899 YRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQIT 957
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
P +V L + I++ D+ +S+ ++ +K Q L A D T ++D T++
Sbjct: 958 PRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETVA 1016
Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLAT 1305
D+ NI + K SE G L+ ++ GA V F Q + T
Sbjct: 1017 --GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPT 1073
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
S +T G R L++ L G++G P D++ F Q+L+ +L P +A
Sbjct: 1074 SIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLA 1127
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
G + +R +++ K ++D +L + +LP +++ IA + + +I ++
Sbjct: 1128 GRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKIS 1180
Query: 1423 DLALGTSF 1430
D+ ++
Sbjct: 1181 DMRTKVAY 1188
>gi|427798971|gb|JAA64937.1| Putative damage-specific dna binding complex subunit ddb1, partial
[Rhipicephalus pulchellus]
Length = 1259
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 84/372 (22%), Positives = 156/372 (41%), Gaps = 56/372 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1123
+R+L P + QT + ++ +E AL+V +V ++ +E + +G A + +
Sbjct: 861 IRVLNPADS----QTLCKVALEQNEAALSVALVKF---ASQPDEQYVVVGAARELSLQPW 913
Query: 1124 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
AR LL T R + + + V++ ++ A +AL QG LL G + L+
Sbjct: 914 HARSGGLLL-TYRLSHAGETRLELVHATSVEEAPTALCPFQGRLLAGVGKCLRLYDLGRK 972
Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFA 1243
+L P +VS+ + N +++GD+ +S +FL +K Q QL + A D
Sbjct: 973 KLLRKCENKYIPSAIVSIQSMGNRVVVGDVQESFFFLRYKRQENQLVIFADD-AVPRWIT 1031
Query: 1244 TEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES---------------WKG---QKLLSR 1285
++D T++ +D+ N+ I +S+ W G QK
Sbjct: 1032 ASCMLDYDTVA--GADKFGNVSIIRLPNSVSDEVDEDPTGIKSLWDRGWLGGSSQKAEVI 1089
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
+ FH+G V + ++ PG ++ L++ TL G+IG + P ++
Sbjct: 1090 SNFHIGETVLSLQKATLI----------PGGSES----LVYVTLSGTIGVLVPFTAHEDH 1135
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
F Q L+ + P + G + SFR + K +++D +L + L +
Sbjct: 1136 DF--FQHLEMHMRSENPPLCGRDHLSFRSSYFPVK-------NVIDGDLCEQFNSLDPSK 1186
Query: 1403 QLEIAHQTGTTR 1414
Q IA + R
Sbjct: 1187 QKSIAEELDRNR 1198
Score = 40.8 bits (94), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 60/265 (22%), Positives = 102/265 (38%), Gaps = 51/265 (19%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ + G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFLFVAAEFGNHCLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E GD AP AL++++ EEL A T
Sbjct: 362 ARLGEEDEEPEFSSAIPLEEGDTFFFAPR----------ALRNLLPVEELDSLSPAMGCT 411
Query: 495 ------ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC- 547
E + + R I L+ +GL + S + ELPG
Sbjct: 412 IADLANEDTPQLYVACGRGPRSCIRVLR---HGLEV------------SEMAVSELPGNP 456
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+WTV K+ D++Y AY+I+S T+VL + + EVT+S +
Sbjct: 457 NAVWTVKRKA--------------DEDYDAYIIVSFVNATLVLSIGETVEEVTDS-GFLG 501
Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
T++ + G ++QV+ G R
Sbjct: 502 TTPTLSCAQI-GDDALVQVYPEGIR 525
>gi|302807210|ref|XP_002985318.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
gi|300147146|gb|EFJ13812.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
Length = 1207
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 83/380 (21%), Positives = 159/380 (41%), Gaps = 52/380 (13%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+R+L+P A T + +Q +E A ++ V + K T+LA+GTA
Sbjct: 855 IRVLDPKAAA----TTCLLELQENEAAFSICCVNFHDN--KNLGTVLAVGTAKDLEWWPK 908
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
R R ++ ++L V+ + G +AL QG LL G + ++ +
Sbjct: 909 RRSMGGFIHIYRFVEDGRSLEL-VHKTPIDGVPTALCQFQGRLLAGIGQILRIYDLGKRK 967
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------- 1235
L P + S++ + I +GDI +S +++ ++ QL A D
Sbjct: 968 LLRKCENKNFPNTITSIHSYGDRIYVGDIQESFHYVKYRRDENQLYAFADDSSPRWLTAS 1027
Query: 1236 ----FGSL---DCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
F ++ D F F + LS + D+ +I + +++ + K+ +
Sbjct: 1028 LHIDFDTMAAGDKFGNLFFVRLPQDLSEEIEDDPTGGKIKWEQGRLNGA--PNKVEEIIQ 1085
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1344
FHVG VT + ++ PG ++ +++GT+ GS+G + P +++ F
Sbjct: 1086 FHVGEVVTCMQKASLI----------PGGGES----VIYGTVMGSVGALLPFSSREDVDF 1131
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
L+ + P + G + +FR A+ P D ++D +L Y LP + Q
Sbjct: 1132 --FSHLEMHMRQEHPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYPTLPPDLQR 1182
Query: 1405 EIAHQTGTTRSQILSNLNDL 1424
+IA + T +++ L D+
Sbjct: 1183 KIAEELDRTPGEVMKKLEDI 1202
>gi|302773427|ref|XP_002970131.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
gi|300162642|gb|EFJ29255.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
Length = 1207
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 83/380 (21%), Positives = 159/380 (41%), Gaps = 52/380 (13%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+R+L+P A T + +Q +E A ++ V + K T+LA+GTA
Sbjct: 855 IRVLDPKAAA----TTCLLELQENEAAFSICCVNFHDN--KNLGTVLAVGTAKDLEWWPK 908
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
R R ++ ++L V+ + G +AL QG LL G + ++ +
Sbjct: 909 RRSMGGFIHIYRFVEDGRSLEL-VHKTPIDGVPTALCQFQGRLLAGIGQILRIYDLGKRK 967
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------- 1235
L P + S++ + I +GDI +S +++ ++ QL A D
Sbjct: 968 LLRKCENKNFPNTITSIHSYGDRIYVGDIQESFHYVKYRRDENQLYAFADDSSPRWLTAS 1027
Query: 1236 ----FGSL---DCFATEFLID-GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
F ++ D F F + LS + D+ +I + +++ + K+ +
Sbjct: 1028 LHIDFDTMAAGDKFGNLFFVRLPQDLSEEIEDDPTGGKIKWEQGRLNGA--PNKVEEIIQ 1085
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1344
FHVG VT + ++ PG ++ +++GT+ GS+G + P +++ F
Sbjct: 1086 FHVGEVVTCMQKASLI----------PGGGES----VIYGTVMGSVGALLPFSSREDVDF 1131
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
L+ + P + G + +FR A+ P D ++D +L Y LP + Q
Sbjct: 1132 --FSHLEMHMRQEHPPLCGRDHMAFR------SAYFPVKD-VIDGDLCEQYPTLPPDLQR 1182
Query: 1405 EIAHQTGTTRSQILSNLNDL 1424
+IA + T +++ L D+
Sbjct: 1183 KIAEELDRTPGEVMKKLEDI 1202
>gi|301124447|ref|XP_002909707.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106897|gb|EEY64949.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 328
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/280 (23%), Positives = 110/280 (39%), Gaps = 74/280 (26%)
Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGS-------------------IVAFTVLHNVNC 965
G F G+ P W + R P S +++FT H+ +C
Sbjct: 3 GAFFRGAHPMWILGDRGHASFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSC 62
Query: 966 NHGFIYVTSQGILKICQLPSGST-----YDNYWPVQKIPLKATPHQITYFA--------- 1011
+GFIY S+G L++C+LPS T + +QK AT H + Y
Sbjct: 63 PNGFIYFHSRGALRVCELPSSKTSTILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAE 122
Query: 1012 --EKNLYPLIVSVPVLKPLNQVLSLL------------IDQEVGHQIDNHNLSSVDL--- 1054
E Y ++ S LKP + + +D N + ++
Sbjct: 123 ALEAPTYAVVCSA-RLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFAD 181
Query: 1055 ----HRTYTVEE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT---- 1103
H +T E+ YE+R+++ D G W R + + E L+V+++ L++++
Sbjct: 182 YETDHMAHTEEDVYELRLVQTDEF-GEWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKE 240
Query: 1104 ---------TKENETLLAIGTAYV--QGEDVAARGRVLLF 1132
K+ L +GT +V GED + RGR+LL+
Sbjct: 241 EVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLY 280
>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
Length = 1148
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 66/259 (25%), Positives = 113/259 (43%), Gaps = 26/259 (10%)
Query: 1113 IGTAYV-QGEDVAARGRVLLFS--TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTA V E +GR+++F G+ + +V KE+KGA +L G LL
Sbjct: 831 VGTAIVLPDESDPKQGRIIIFHWVDGK--------LQQVAEKEIKGAPYSLLEFNGKLLA 882
Query: 1170 ASGPKIILHKWTGT-ELNGIA--FYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
+ + L +W EL+ F + LY L +FIL+GD+ +S+ L++K
Sbjct: 883 SINSTVRLFEWNAERELHNECSHFNNILALY---LKTKGDFILVGDLMRSMSLLAYKPLE 939
Query: 1227 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1286
+A+D+ + A E L D + L ++ N+ + + Q L
Sbjct: 940 GSFEEIARDYQTNWMCAVEILDDDTFLG---AESTTNLFVCQKDSAATTDEDRQHLQEVG 996
Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
+FH+G V F R L ++P + ++LFGT+ G+IG +A L +
Sbjct: 997 QFHLGEFVNIF-RHGSLVMQHPGEASSP-----TQGSVLFGTIHGAIGLVAQLPSDFYNF 1050
Query: 1347 LQSLQKKLVDSVPHVAGLN 1365
L +Q L + V ++
Sbjct: 1051 LLEVQGNLTKVIKSVGKID 1069
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 94/402 (23%), Positives = 155/402 (38%), Gaps = 95/402 (23%)
Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
V+D F+HG P +V+LH+ S H + IS LK + W
Sbjct: 166 VQDMEFLHGCKTPTIVLLHQD--------SQARH---MKTYEIS--LKDKEFVKGPWKQD 212
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ +A ++AVP P +G +I YH+ + + L R S V
Sbjct: 213 HVESEATIVIAVPEPFCDARCIGQESITYHNGDQDVVI-----------TPHLIRQSTIV 261
Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
++DA + +L D+A G L +L + + ++ V+ L L +
Sbjct: 262 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 315
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + N + ++GSRLGDS L++ L+S E+ +E T
Sbjct: 316 MTYLDNGVVYVGSRLGDSQLIK---------LNSERNEQGSYVEVMEVFTN--------- 357
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
L +V+ + L + F G L+ G+ I+ AS
Sbjct: 358 LGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS----- 403
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
++LPG KGIW + N DSSR L++S +T VL +
Sbjct: 404 -------IDLPGIKGIWPLR------VNTDSSR--------DNTLVLSFVGQTRVLMLSG 442
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
E TE + + +T GN+ ++IQV R++DG
Sbjct: 443 EEVEETELAGFDISQQTFFCGNV-RNNQLIQVTAAAVRLVDG 483
>gi|242060436|ref|XP_002451507.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
gi|241931338|gb|EES04483.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
Length = 1232
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 92/384 (23%), Positives = 162/384 (42%), Gaps = 61/384 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RIL+P T + +Q +E A+++ V N KE+ TLLA+GTA +G
Sbjct: 881 IRILDPRSR----DTTCLLELQENEAAVSICTV---NFHDKEHGTLLAVGTA--KGLQFW 931
Query: 1125 ARGRVL---LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
R R L + D ++L ++ +++ AL QG LL G + L+
Sbjct: 932 PR-RTLAGGFIHIYKFVDEGRSLEL-LHKTQVEEVPLALCQFQGRLLAGVGSVLRLYDLG 989
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1241
+L P +VS++ ++ I +GD+ +S ++ ++ QL + A D
Sbjct: 990 KRKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWL 1049
Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLLSR----- 1285
A + ID T++ +D+ NI +S+ W+ KL
Sbjct: 1050 TAAQH-IDFDTMA--GADKFGNIYFARLPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVE 1106
Query: 1286 --AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
+FHVG VT + ++ PG + L++GT+ GS+G + +
Sbjct: 1107 EIVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGALLAFTSRE 1152
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
++ F L+ L P + G + ++R A+ P D ++D +L Y LP
Sbjct: 1153 DVDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCEQYPSLPA 1203
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDL 1424
+ Q +IA + T +IL L D+
Sbjct: 1204 DMQRKIADELDRTPGEILKKLEDI 1227
>gi|342320507|gb|EGU12447.1| Pre-mRNA-splicing factor RSE1 [Rhodotorula glutinis ATCC 204091]
Length = 1212
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 114/543 (20%), Positives = 211/543 (38%), Gaps = 89/543 (16%)
Query: 920 ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
+ G SRP +R L+ P + D A++ + C G I + L+
Sbjct: 716 VQGSPAILALSSRPWLNYAYRGILQFTPLIFDALDYAWSFSAEL-CPEGLIGIVGNS-LR 773
Query: 980 ICQLPSGSTYDNYWPVQK--IPLKATPHQI-TYFAEKNLYPLIVSVPVLKP--LNQVLS- 1033
I P VQ+ I L TP Q+ T + LY + P + + +S
Sbjct: 774 IFTFPRLGQ-----KVQQTVIDLSYTPRQLLTSPHSRLLYTVEADHRTFSPSAIQKTISD 828
Query: 1034 -LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-VRILEPDRAGGPWQTRATIPMQSSENA 1091
+ + EV ++ N + L R + VR+++P A +T + ++ +E A
Sbjct: 829 MRMAEMEVDEEVLNLDPKEFGLPRGPAGQWASCVRVIDPVTA----ETVFKVDLEQNEAA 884
Query: 1092 LTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---------LFSTGRNADNPQ 1142
+ +VT + NE L +GT G+D + R L GR
Sbjct: 885 FSAAIVTFH---SHPNEVFLVVGT----GQDTSLAPRACKQAYLHTYKLLEEGRQ----- 932
Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1202
+ ++ E+ AL + QG L+ G + L+ +L A +++LN
Sbjct: 933 --LELLHKTEVDDIPKALIAFQGRLVAGVGKALRLYDLGKKKLLRKAENKGFATMIMTLN 990
Query: 1203 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST---------- 1252
I++GD +S+Y+ +K +L + A D A+ ++D T
Sbjct: 991 TQGTRIIVGDAQESVYYALYKAPENRLLIFADDISPRWTTAS-IMVDYETVAAGDKFGNF 1049
Query: 1253 --------LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1304
+S V D+ I + P + + LL A +H+G +T ++ ++A
Sbjct: 1050 FVNRLPKGVSSDVDDDPTGAGIMHEKPYLMGAPHRTHLL--AHYHIGDIITSLHKVALVA 1107
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
D L++ L G++G + P +++ F +L+ L P +
Sbjct: 1108 GGRD--------------LLVYTGLMGTVGVLVPFVSNEDVDF--FTTLEMHLRSEAPSL 1151
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
G ++R ++ KA VD +L Y LP+ +Q +IA + T S+++ L
Sbjct: 1152 CGREHLAYRSAYTPVKA-------TVDGDLCEVYRSLPMAKQGQIAGELERTVSEVIKKL 1204
Query: 1422 NDL 1424
+++
Sbjct: 1205 DNV 1207
>gi|403411971|emb|CCL98671.1| predicted protein [Fibroporia radiculosa]
Length = 1212
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 121/494 (24%), Positives = 199/494 (40%), Gaps = 70/494 (14%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLI----- 1019
C G I + S +L+I Q+P T IPL TP + L+ LI
Sbjct: 758 CPEGLIGI-SGSVLRIFQIPKLGTK---LKQDAIPLSYTPRKFISHPTNGLFYLIEGDHR 813
Query: 1020 -----VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE---EYEVRILEPD 1071
S L L Q ++ D+ V NL R +RI+ P
Sbjct: 814 VRSDEASAKALGELRQQGKMVDDELV-------NLPPETFGRQKAPAGTWASCIRIINPV 866
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1131
A +T IP+ ++E A ++ VV+ + + E L +GTA Q +A R
Sbjct: 867 DA----KTVNIIPLDNNEAAFSLAVVSF---SARSGELHLVVGTA--QDTFLAPRSCTSG 917
Query: 1132 F-STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL----N 1186
F T R D+ NL ++ E A+ QG L+ G + L+ +L
Sbjct: 918 FLRTYRFTDDGTNLEL-LHKTETNDVPLAVLGFQGRLVAGVGKALRLYDMGKKKLLRKVE 976
Query: 1187 GIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF 1246
F A +VSL + IL+GD+ +S+ F +K +L + A D A
Sbjct: 977 NKTFASA----IVSLATQGSRILVGDMQESVSFAVYKPPENKLLVFADDTQPRWTSAMT- 1031
Query: 1247 LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL--- 1298
++D +T++ +D NI + PK+SE G +L GA H T+ L
Sbjct: 1032 MVDYNTVA--SADRFGNIYVNRLDPKVSEQVDDDPTGAGILHEKGLLAGAPHKTELLSHF 1089
Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLV 1355
+ + TS ++ G R LL+ L G+IG + P +++ F + +L++ +
Sbjct: 1090 HVGDIVTSINKVSLVAG----GREVLLYTGLHGTIGILVPFVSKEDVDF--ISTLEQHMR 1143
Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
+ G + ++R ++ KA +VD +L + LP +Q IA + T
Sbjct: 1144 TEQLSLVGRDHLTWRGYYVPVKA-------VVDGDLCETFARLPASKQSAIAGELDRTVG 1196
Query: 1416 QILSNLNDLALGTS 1429
++L L L + S
Sbjct: 1197 EVLKKLEQLRVTAS 1210
>gi|380490733|emb|CCF35810.1| pre-mRNA-splicing factor rse-1 [Colletotrichum higginsianum]
Length = 1212
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 94/399 (23%), Positives = 168/399 (42%), Gaps = 55/399 (13%)
Query: 1064 EVRILEPDRAGGP-----WQTRATIPMQSSENALTVRVVTLFNT-----------TTKEN 1107
+ RIL PD G P W + ++ SE ++ V L N +++N
Sbjct: 837 DARILPPDEFGYPKGKGRWASCISVIDPLSEEQRVLQTVDLDNNEAAVSAAIVSFASQDN 896
Query: 1108 ETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALAS 1162
E+ L +GT G+D+ R FS G R +++ L ++ +++ SAL
Sbjct: 897 ESFLIVGT----GKDMIVNPR--QFSEGYIHVYRFSEDGHEL-EFIHKTKVEEPPSALLG 949
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
QG LL G + ++ ++ A D P +VSL+ + I++GD+ I ++ +
Sbjct: 950 FQGRLLAGIGQTLRIYDLGLRQMLRKAQADVAPQLIVSLSTQGSRIIVGDVQHGITYVVY 1009
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWK 1278
K +L D S T ++D S+V D+ NI + K S E
Sbjct: 1010 KPTTNKLIPFVDDTISRWVTCTT-MVDYE--SVVGGDKFGNIFLVRCPEKASQEADEESG 1066
Query: 1279 GQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
G LL +R H H L Q + TS +T G LL+ ++G+IG
Sbjct: 1067 GLHLLNTRDYLHGTPHRLSLLGHSYTQDVPTSITKTSLVVGGQD----VLLWSGINGTIG 1122
Query: 1335 CIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
P +++ F Q+L++ + +AG + +R ++ K ++D +L
Sbjct: 1123 VFIPFVTREDVDF--FQNLEQHMRTEDAPLAGRDHLMYRSYYVPVKG-------VIDGDL 1173
Query: 1392 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
Y +LP E++ IA + + +I ++D+ ++F
Sbjct: 1174 CERYTLLPSEKKQMIAGELDRSVREIERKISDIRTRSAF 1212
Score = 45.8 bits (107), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 137/603 (22%), Positives = 226/603 (37%), Gaps = 129/603 (21%)
Query: 74 QEEGSKESK--NSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
Q G+KE + ++ +L S + V + + G + S+A G++ +D
Sbjct: 27 QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITVLSHDIFGIIRSMAAFRLAGSN----KD 82
Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVDP 187
+ILA + +I+++E+ I + + F+ LHL K G G + DP
Sbjct: 83 YLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACDP 131
Query: 188 QGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+GR V L ++ + SQ E T S A V+++ LD+
Sbjct: 132 KGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV- 181
Query: 245 HVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLIW 299
GY PV L E E T A G + + T ++ + L W
Sbjct: 182 ---------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRKW 230
Query: 300 SAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHY-HSQSASCALAL--NNYAVSLDSSQ 353
S P A L VP GVLV G I Y HS + + + A S +
Sbjct: 231 SESVDP-TASMLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRK 289
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDLSKT 406
S +L + + LL T+ GDL T+ DG V+RL +
Sbjct: 290 RHAVSGVMHKLKGSAGAFF----FLLQTEDGDLFKATLDMVEDTDGNPTGEVKRLKIKYF 345
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
+ ++S + + + + S+ G+ QF E+ GD + +
Sbjct: 346 DTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDDE------ 385
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
L SS D D G + + + + A+ +S+ ++ PL D
Sbjct: 386 LEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANLTGE 437
Query: 527 DA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSRMA 569
DA +A G +S + ++ ELPG +WT+ K +RG
Sbjct: 438 DAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG--------- 486
Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
D+Y AY+++S T+VL + + EV++S F+ A L G +IQV +
Sbjct: 487 ---DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQVHPK 541
Query: 630 GAR 632
G R
Sbjct: 542 GIR 544
>gi|358391805|gb|EHK41209.1| hypothetical protein TRIATDRAFT_135379 [Trichoderma atroviride IMI
206040]
Length = 1212
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 103/481 (21%), Positives = 189/481 (39%), Gaps = 107/481 (22%)
Query: 998 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1057
IPL TP ++ E L+ +I + DNH LS DL
Sbjct: 791 IPLTYTPKKMVKHPEHPLFYVI-----------------------EADNHTLSP-DLCAK 826
Query: 1058 YTVEEYEV----RILEPDRAGGP-----W--------------QTRATIPMQSSENALTV 1094
+ V ++L P+ G P W Q I + +E A+++
Sbjct: 827 LLADPARVNGDAKVLSPEEFGHPRGNRRWASCISVVDPLAEDGQALQKIDLDENEAAVSL 886
Query: 1095 RVVTLFNTTTKENETLLAIGT-------------AYVQGEDVAARGRVLLFSTGRNADNP 1141
+VT +++NET L +GT AYV GR L+F
Sbjct: 887 AIVTF---ASQDNETFLVVGTGKDMVVNPRSFSDAYVHIYRFEQEGRGLVF--------- 934
Query: 1142 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1201
++ +++ A+ QG +L+ G + ++ +L + P + SL
Sbjct: 935 ------IHKTKVEEPPMAIIPFQGRVLVGIGKILRIYDLGMRQLLRKTQAEVAPQLINSL 988
Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDE 1260
+ N I++GD+ + I ++ +K+ +L D ++ + T ++D T++ D+
Sbjct: 989 STQGNRIIVGDVQQGITYVVYKQTTNKLIPFVDD--TVARWTTCSTMVDYETVA--GGDK 1044
Query: 1261 QKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKF---LRLQMLATSSDRTGA 1312
NI + K SE ++ L +R H +H L Q + TS +T
Sbjct: 1045 FGNIFVVRSPQKASEEADEEQAGLHLLNARDYLHGRSHRLDLMCHLYTQDIPTSITKTSL 1104
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
G LL+ L G+IG + P ++ F QSL+ L P +AG + +
Sbjct: 1105 VVGGQD----VLLWSGLMGTIGVLIPFVTREDTDF--FQSLELHLRAEDPPLAGRDHLMY 1158
Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
R +++ K ++D +L Y +LP +++ IA + + +I ++D+ ++
Sbjct: 1159 RSYYAPVKG-------VIDGDLCERYTLLPNDKKQMIAAELDRSVREIERKISDIRTRSA 1211
Query: 1430 F 1430
F
Sbjct: 1212 F 1212
>gi|401407861|ref|XP_003883379.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
gi|325117796|emb|CBZ53347.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
Length = 1233
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 117/299 (39%), Gaps = 49/299 (16%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V+S ++ ALA +G LL G K+ L+ L Y P V + + +
Sbjct: 957 VHSTPVEDYPMALAPFRGMLLAGVGHKLRLYALGKKRLLKKCEYKNLPCGVAFIRVAGDR 1016
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL---SLVVSDEQKNI 1264
+ +GD+ +S++ + ++ +LA D +L G L + V +D+ ++
Sbjct: 1017 LFVGDLRESVHVMRYRLSENLFYVLADDV------VPRWLTKGEVLDYHTFVAADKFDSV 1070
Query: 1265 QIFYYAPKMSESWKGQ------------------KLLSRAEFHVGAHVTKFLRLQMLATS 1306
I + + G KL S FH+G VT R + A +
Sbjct: 1071 FICRVPSEAKQDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEVVTALERATLTAGA 1130
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLN 1365
S+ ++++GT+ GSIG +P L + L+ L P + G
Sbjct: 1131 SE--------------SIIYGTIMGSIGAFSPFLTKHELDLFTHLEMVLRSEKPPLGGRE 1176
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
FR ++ K + VD +L Y +LP + Q IA T + IL +L D+
Sbjct: 1177 HIMFRSYYHPAK-------NTVDGDLCESYALLPYDVQKRIAQDFEKTPADILKHLEDI 1228
>gi|440792421|gb|ELR13643.1| splicing factor 3b subunit 3, putative [Acanthamoeba castellanii str.
Neff]
Length = 1227
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/293 (20%), Positives = 118/293 (40%), Gaps = 44/293 (15%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V+ +++G +A+ QG LL+ G + ++ +L P + S+
Sbjct: 952 VHKTQVEGVPTAVCGFQGRLLVGIGKMLRIYDLGKRKLLRKCENKGFPHCIQSITTQGER 1011
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FGSLDCFATEFLIDGS 1251
I++GD+ +S +F+ +++ QLN+ A D D F F++
Sbjct: 1012 IIVGDLAESFHFVKYRKAENQLNVYADDSNPRWLTASQMLDYDTMAGADKFGNVFIV--R 1069
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTG 1311
S V + + N + K S + KL + FHVG + + + +D
Sbjct: 1070 LPSEVNEELEDNPMGNFLMSKQSLNGAAFKLQTLINFHVGDTINSMTKASLFTGGAD--- 1126
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
L++ TL G +G + P +++ F L+ + +P + G + +
Sbjct: 1127 -----------VLVYTTLMGGMGALLPFVSREDVDF--FSHLEMHMRSELPPLCGRDHLA 1173
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
+R ++ K ++D +L + +LP E+Q IA + T ++L L
Sbjct: 1174 YRSYYFPVK-------DVIDGDLCEQFSLLPPEKQRTIAEELDRTPGEVLKKL 1219
>gi|294875343|ref|XP_002767276.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
gi|239868839|gb|EEQ99993.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
Length = 1258
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/383 (21%), Positives = 152/383 (39%), Gaps = 54/383 (14%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQGEDV 1123
+R+++P A ++ + + E A + V + K+N L +GTA V +
Sbjct: 902 IRVVDPLTASTSFK----LDLDVDEAATAMTVCYFYQL--KDNRPCLVVGTATGVDPHNP 955
Query: 1124 AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA------SGPKIIL 1177
+ + D NL ++ L+G SA+ +G LL+A P + +
Sbjct: 956 SRSAHGKCYIKTYLYDESYNLQL-IHVTPLEGVPSAMYPFEGRLLVALRGSPTVAPVLRI 1014
Query: 1178 HKWTGTELNGIAFYDAPPLY--VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
++ L Y P ++ L++ K+ I D SI L W+ Q+ +++ D
Sbjct: 1015 YELGKKRLLKKCEYKFLPESGGIMWLDVNKDRIFAADSRDSILVLRWRYSDNQMQVISDD 1074
Query: 1236 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA--PKMSESWKGQ---------KLLS 1284
C ++D +T+ VV D+ NI + K + +W K+
Sbjct: 1075 TYP-RCITAAAVLDYNTI--VVGDKFDNIAVLRVPGDAKDAGAWGRDNDYASGNTFKMDL 1131
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD---E 1341
FHVG +T R+ M+A ++ +++ T+ G+IG + P E
Sbjct: 1132 IGHFHVGETITSLQRVTMVAGGAE--------------IVIYSTVLGTIGALYPFSSKRE 1177
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
F + + + + P + G +R F+ K + VD +L Y LP E
Sbjct: 1178 HGFLQALEMHMRNTAASPSLTGREHVMYRSFYHPIK-------NFVDADLCEVYYQLPAE 1230
Query: 1402 EQLEIAHQTGTTRSQILSNLNDL 1424
+Q +IA T +++ L D+
Sbjct: 1231 KQRQIAVDMDKTPQEVMKKLEDI 1253
>gi|308504990|ref|XP_003114678.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
gi|308258860|gb|EFP02813.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
Length = 270
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/260 (21%), Positives = 123/260 (47%), Gaps = 37/260 (14%)
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDG 1250
P+ +V++ I++ D +S++FL +++ QL + A D + S C ++D
Sbjct: 24 PVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVSCVC-----VLDY 78
Query: 1251 STLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA------ 1304
T++ V+D+ N+ + +++E + +S++ + G +++++A
Sbjct: 79 HTVA--VADKFGNLAVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVELVANFFIGD 136
Query: 1305 --TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVP 1359
TS +T PG+++ AL++ T+ G+IGC+ DE+ F +L+ + P
Sbjct: 137 TITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNLEMHVRSEYP 190
Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGP---------DSIVDCELLSHYEMLPLEEQLEIAHQT 1410
+ G + ++R +++ K S++D ++ + ++ L +Q E+A +
Sbjct: 191 PLCGRDHLAYRSYYAPCKVCFNFLLFRSIVSLFQSVIDGDICEQFSLMDLSKQKEVAEEL 250
Query: 1411 GTTRSQILSNLNDLALGTSF 1430
G T S+I L D+ +F
Sbjct: 251 GKTVSEISKKLEDIRTRYAF 270
>gi|301630307|ref|XP_002944263.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Xenopus (Silurana) tropicalis]
Length = 92
Score = 59.7 bits (143), Expect = 1e-05, Method: Composition-based stats.
Identities = 36/93 (38%), Positives = 54/93 (58%), Gaps = 1/93 (1%)
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ E T+RRL LQ L +PH AGLNPR+FR +S+ + + +++D ELL+ Y L
Sbjct: 1 MQEKTYRRLLMLQNALT-VLPHHAGLNPRAFRMLNSSRRMLQNPVRNVLDGELLNRYLYL 59
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1431
E+ E+A + GTT IL +L ++ TS
Sbjct: 60 SNMERSELARKIGTTTDIILDDLLEIDRVTSLF 92
>gi|401883281|gb|EJT47496.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
Length = 1216
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 128/285 (44%), Gaps = 30/285 (10%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
A+A+ QG+LL G + L++ L + P V ++N+V I++GD+ +S +
Sbjct: 949 AVAAFQGYLLAGVGKSLRLYEMGKKALLRKCENNGFPTGVATINVVGARIIVGDLQESTF 1008
Query: 1219 FLSWKE-QGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ ++ QL + A D S F T +D T+ +D+ NI + ++SE
Sbjct: 1009 YCVYRSIPSRQLLIFADD--SQPRFLTAVCNVDYDTV--CCADKFGNIFVNRLEERVSEK 1064
Query: 1277 WK----GQKLLSRAEFHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
G +L F +G A+ T + + + TS + APG R +++ T
Sbjct: 1065 VDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPG----GRDVVVYTT 1120
Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
+ G++G + P D++ F + +L+ + + G + ++R +++ KA
Sbjct: 1121 ISGAVGALVPFISNDDVEF--MTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKA------- 1171
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+VD +L + MLP +Q IA ++L L L G+ F
Sbjct: 1172 VVDGDLCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQLRTGSVF 1216
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 130/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
G T+ +L S L+ +C G V ++A G +D I+L+ + ++S+
Sbjct: 34 GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89
Query: 145 LEFDDSIHGLRITSMHCFES-PEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
+E L I+ FES + ++ K G G + VDP+GR G V L
Sbjct: 90 IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
I+ + ++G A + V+N+ D GY P+
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183
Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
L EL + A R + KH T L ++ +++ WS P D
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233
Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
L V P GGVLV + + H +++ + ++
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+SS + + AA ++ LL ++ GDL T+ ++G V+ L + + + +
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
+ + + F+ S GD L QF S T GL EE P
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
L + +L +++ + ++L G+AS+ + ++ R GP + +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458
Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
GL IN LVE LPG +WT+ + DDEY +
Sbjct: 459 GLDINV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
Y+++S T+VL + + EV ++ + G T+A L G ++QV G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543
>gi|406698009|gb|EKD01256.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
Length = 1216
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 128/285 (44%), Gaps = 30/285 (10%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
A+A+ QG+LL G + L++ L + P V ++N+V I++GD+ +S +
Sbjct: 949 AVAAFQGYLLAGVGKSLRLYEMGKKALLRKCENNGFPTGVATINVVGARIIVGDLQESTF 1008
Query: 1219 FLSWKE-QGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ ++ QL + A D S F T +D T+ +D+ NI + ++SE
Sbjct: 1009 YCVYRSIPSRQLLIFADD--SQPRFLTAVCNVDYDTV--CCADKFGNIFVNRLEERVSEK 1064
Query: 1277 WK----GQKLLSRAEFHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
G +L F +G A+ T + + + TS + APG R +++ T
Sbjct: 1065 VDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPG----GRDVVVYTT 1120
Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
+ G++G + P D++ F + +L+ + + G + ++R +++ KA
Sbjct: 1121 ISGAVGALVPFISNDDVEF--MTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKA------- 1171
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+VD +L + MLP +Q IA ++L L L G+ F
Sbjct: 1172 VVDGDLCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQLRTGSVF 1216
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 129/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)
Query: 85 GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
G T+ +L S L+ +C G V ++A G +D I+L+ + ++S+
Sbjct: 34 GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89
Query: 145 LEFDDSIHGLRITSMHCFES-PEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
+E L I+ FES + ++ K G G + VDP+GR G V L
Sbjct: 90 IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142
Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
I+ + ++G A + V+N+ D GY P+
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183
Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
L EL + A R + KH T L ++ +++ WS P D
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233
Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
L V P GGVLV + + H +++ + ++
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289
Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+SS + + AA ++ LL ++ GDL T+ ++G V+ L + + + +
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
+ + + F+ S GD L QF S T GL EE P
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
L + +L +++ + ++L G+AS+ + ++ R GP + +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458
Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
GL +N LVE LPG +WT+ + DDEY +
Sbjct: 459 GLDVNV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
Y+++S T+VL + + EV ++ + G T+A L G ++QV G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543
>gi|42409127|dbj|BAD10377.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa Japonica
Group]
gi|42409258|dbj|BAD10521.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa Japonica
Group]
gi|125538000|gb|EAY84395.1| hypothetical protein OsI_05771 [Oryza sativa Indica Group]
Length = 1234
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 75/391 (19%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RIL+P T + +Q +E A+++ V N KE+ TLLA+GTA
Sbjct: 883 IRILDPKSR----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-------- 927
Query: 1125 ARGRVLLFSTGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPK 1174
+ L F RN + V E S EL K + AL QG LL G
Sbjct: 928 ---KGLQFWPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSV 984
Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
+ L+ +L P +VS++ ++ I +GD+ +S ++ ++ QL + A
Sbjct: 985 LRLYDLGKRKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFAD 1044
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLL 1283
D A ID T++ +D+ NI +S+ W+ KL
Sbjct: 1045 DSVPRWLTAANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLN 1101
Query: 1284 SR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
+FHVG VT + ++ PG + L++GT+ GS+G +
Sbjct: 1102 GAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGAL 1147
Query: 1337 APL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
+++ F L+ L P + G + ++R A+ P D ++D +L
Sbjct: 1148 LAFTSREDVDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCE 1198
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ LP + Q +IA + T +IL L D+
Sbjct: 1199 QFPSLPADMQRKIADELDRTPGEILKKLEDI 1229
>gi|347829304|emb|CCD45001.1| similar to pre-mRNA-splicing factor rse1 [Botryotinia fuckeliana]
Length = 1212
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/369 (23%), Positives = 168/369 (45%), Gaps = 48/369 (13%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG----- 1135
+TI ++ +E A+++ VV ++E+ET L +GT G+D+ R FS G
Sbjct: 873 STIHLEDNECAVSIAVVAF---ASQEDETFLCVGT----GKDMVVSPRS--FSAGFIHVY 923
Query: 1136 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
R ++ + L ++ +++ AL + QG LL G + ++ +L A + P
Sbjct: 924 RFHEDGKEL-EFIHKTKVEEPPMALLAFQGRLLAGVGKDLRIYDLGMRQLLRKAQSEVAP 982
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+V L + I++ D+ +SI + +K Q +L D + T ++D T++
Sbjct: 983 NMIVGLQTQGSRIIVSDVQESITMVVYKFQENRLIPFVDDTIARWTSCTT-MVDYETVA- 1040
Query: 1256 VVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA-HVTKFLRLQMLATSSDRT 1310
D+ N+ + K SE G LL ++ GA H RL ++A + +
Sbjct: 1041 -GGDKFGNLWLLRCPAKASEEADEEGSGAHLLHERQYLAGAPH-----RLTLMAHNFSQD 1094
Query: 1311 GAAPGS-DKTN-----RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
P S KTN R LL+ L G++G + P +++ F Q+L++ L P +
Sbjct: 1095 --IPMSIQKTNLVAGGRDCLLWSGLQGTLGILIPFVSREDVDF--FQTLEQHLRSEDPPL 1150
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
AG + +R ++ K ++D +L Y +LP +++ IA + + +I +
Sbjct: 1151 AGRDHLIYRSYYVPVKG-------VIDGDLCERYTLLPTDKKQMIAGELDRSVREIERKI 1203
Query: 1422 NDLALGTSF 1430
+D+ +++
Sbjct: 1204 SDIRTRSAY 1212
Score = 41.2 bits (95), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 85/377 (22%), Positives = 151/377 (40%), Gaps = 52/377 (13%)
Query: 320 GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDV 376
GVLV G + I Y HS + +A+ + + Q V +L A +
Sbjct: 253 GVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF---- 308
Query: 377 ALLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
LL T GDL +T+ V D V+RL + + + + + + + F+ S
Sbjct: 309 FLLQTDDGDLFKITIEMVEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEF 368
Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLY 487
G+ QF + + ++F G E+ P R + + +L + ++ +
Sbjct: 369 GNHQFYQFEKLGDDDEETEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMD 428
Query: 488 GSASNNT-ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
+N T E A + +S + LK +GL ++ + ELPG
Sbjct: 429 CKVANLTDEDAPQIYSICGTGARSTFRTLK---HGLEVSEIVES------------ELPG 473
Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+WT K +RG D Y AY+I+S T+VL + + EVT++ +
Sbjct: 474 VPSAVWTT--KLTRG------------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GF 518
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
T+A L G +IQV +G R + + + + P + + + N ++V+
Sbjct: 519 LSSAPTLAVQQL-GEDSLIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVA 576
Query: 666 IAD-PYVLLGM-SDGSI 680
++ V M SDGS+
Sbjct: 577 LSSGEIVYFEMDSDGSL 593
>gi|327309050|ref|XP_003239216.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
gi|326459472|gb|EGD84925.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
Length = 1209
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 105/491 (21%), Positives = 202/491 (41%), Gaps = 64/491 (13%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C G + + Q + ++ S DN + IPL TP E YPL
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNLLQEPIPLAYTPRNFVRHPE---YPLFY--- 808
Query: 1024 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH-------RTYTVEEYEVRILEPDRAGGP 1076
V+ N +LS ++ + N S +L R +++++P
Sbjct: 809 VIGSDNNILSPATKAKLLSESTTVNGDSAELPPEGFGYPRGTNHWASSIQVVDPIHTKS- 867
Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG- 1135
+++ ++ +E A+++ V+ T++E+ET L +GT G+D+ R F+ G
Sbjct: 868 --VLSSLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGF 916
Query: 1136 ----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1191
R + + L ++ +++ AL QG LL GP + ++ +L
Sbjct: 917 IHIYRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQA 975
Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
P +V L + I++ D+ +S+ ++ +K Q L A D T ++D
Sbjct: 976 QITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALISFADDIIPRWTTCTT-MVDYE 1034
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQM 1302
T++ D+ NI + K SE G L+ ++ GA V F Q
Sbjct: 1035 TVA--GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QD 1091
Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVP 1359
+ TS +T G R L++ L G++G P D++ F Q+L+ +L P
Sbjct: 1092 IPTSIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNP 1145
Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
+AG + +R +++ K ++D +L + +LP +++ IA + + +I
Sbjct: 1146 PLAGRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIAGELDRSVREIER 1198
Query: 1420 NLNDLALGTSF 1430
++D+ ++
Sbjct: 1199 KISDMRTKVAY 1209
>gi|297598550|ref|NP_001045829.2| Os02g0137400 [Oryza sativa Japonica Group]
gi|255670583|dbj|BAF07743.2| Os02g0137400 [Oryza sativa Japonica Group]
Length = 845
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 75/391 (19%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RIL+P T + +Q +E A+++ V N KE+ TLLA+GTA
Sbjct: 494 IRILDPKSR----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-------- 538
Query: 1125 ARGRVLLFSTGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPK 1174
+ L F RN + V E S EL K + AL QG LL G
Sbjct: 539 ---KGLQFWPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSV 595
Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
+ L+ +L P +VS++ ++ I +GD+ +S ++ ++ QL + A
Sbjct: 596 LRLYDLGKRKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFAD 655
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLL 1283
D A ID T++ +D+ NI +S+ W+ KL
Sbjct: 656 DSVPRWLTAANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLN 712
Query: 1284 SR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
+FHVG VT + ++ PG + L++GT+ GS+G +
Sbjct: 713 GAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGAL 758
Query: 1337 APL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
+++ F L+ L P + G + ++R A+ P D ++D +L
Sbjct: 759 LAFTSREDVDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCE 809
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ LP + Q +IA + T +IL L D+
Sbjct: 810 QFPSLPADMQRKIADELDRTPGEILKKLEDI 840
>gi|342885857|gb|EGU85809.1| hypothetical protein FOXB_03657 [Fusarium oxysporum Fo5176]
Length = 1189
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 103/480 (21%), Positives = 205/480 (42%), Gaps = 64/480 (13%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQK-IPLKATPHQITYFAEKNLYPLIV 1020
C G + + Q I I +L G T +QK IPL TP ++ ++ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFNIDRL--GDTL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813
Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1078
+ P LL D ++ + D+ L D + + +++P G Q
Sbjct: 814 ADNNTLPPELRAQLLADPKIVNG-DSRVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870
Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1138
TI ++++E A++ +V+ ++++NE+ L +GT G+D+ R +S G
Sbjct: 871 VVQTIDLENNEAAVSAAIVSF---SSQDNESFLVVGT----GKDMVVNPRS--YSEG--- 918
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
+Y + G L + QG + +A G ++ ++ ++ + + +
Sbjct: 919 ------YLHIYRFQDGGENLTLLAFQGRVAVAVGTQLRIYDLGMRQMLRKSQAEVAAQQI 972
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
VSLN + I++GD+ + + ++ +K +L D + T ++D S+
Sbjct: 973 VSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE--SVAGG 1029
Query: 1259 DEQKNIQIFYYAPKMSES----WKGQKLLSRAEF-HVGAHVTKFLRLQMLATSSDRTGAA 1313
D+ N+ I K SE G L++ E+ H H + + TS +T
Sbjct: 1030 DKFGNMFIVRCPEKASEEADEEQTGLHLINAREYLHGTPH-------RDIPTSITKTSLV 1082
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
G + LL+ + G+IG P ++ F Q+L++ L P +AG + +R
Sbjct: 1083 VGGQEI----LLWSGIMGTIGVFIPFISREDADF--FQNLEQHLRTEDPPLAGRDHLMYR 1136
Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+++ K ++D +L Y +LP +++L IA + + +I ++D+ ++F
Sbjct: 1137 GYYAPVKG-------VIDGDLCERYNLLPNDKKLMIAGELDRSVREIERKISDIRTRSAF 1189
>gi|388582014|gb|EIM22320.1| hypothetical protein WALSEDRAFT_60013 [Wallemia sebi CBS 633.66]
Length = 1208
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/357 (23%), Positives = 144/357 (40%), Gaps = 72/357 (20%)
Query: 1106 ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE------VYSKELKGAISA 1159
E+ET L +GTA + R+++ + + A VT+ ++ ++ A
Sbjct: 892 EDETHLVVGTA---------KDRMMMPQSHKEAYLRVYKVTQDSQLELLHKTDIDDVPYA 942
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ + +G LL G + L+ L + +V+LN+V + I +GD+ +S+ F
Sbjct: 943 IHAFKGRLLAGVGKALRLYDLGKKRLLRKCENKSFAAGIVNLNVVGSRIYVGDMQESVSF 1002
Query: 1220 LSWKEQGAQLNLLAKDFGSL----------------DCFATEFL--IDGSTLSLVVSDEQ 1261
+K +L + A D S D F F+ +D ST V DE
Sbjct: 1003 AVYKAPENRLLVFADDIMSRWTTTATPVDYDTVAGGDKFGNIFITRVDKSTSEWVDEDES 1062
Query: 1262 KN-----IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
+++ AP S KLL A F+VG VT + Q+ A D
Sbjct: 1063 GGGLLHARGLYHGAPNRS------KLL--AHFYVGDIVTSITKSQLSAGGRD-------- 1106
Query: 1317 DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
L++ L G++G I P D++ F + +L+ + P + G + FR ++
Sbjct: 1107 ------VLVYTCLHGTVGMIIPFASKDDIEF--MSTLELHMRQESPSLVGRDHLGFRSYY 1158
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
KA VD +L Y LP+ +Q IA++ T ++L + L F
Sbjct: 1159 IPCKA-------FVDGDLCELYASLPVTKQQAIANELDRTSGEVLKKIESLRSAAGF 1208
>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
occidentalis]
Length = 1154
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/304 (21%), Positives = 134/304 (44%), Gaps = 30/304 (9%)
Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA++ E+ + GR+ + R D + E KE GA ++ L IA
Sbjct: 846 VGTAFINQEESEPKVGRIFVL---RWHDGKLETIAE---KEAAGAPYSIREFHQKLAIAI 899
Query: 1172 GPKIILHKWTGT---ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
+ L+ W + F++ + ++ L + ++IL+GD+ +S+ L++
Sbjct: 900 NSTVRLYSWNAEKDLQSECTPFFN---IVILHLKCLGDYILVGDLMRSMTLLNYNADITS 956
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
L + +D+ + A E L + + L+ ++ N+ + P ++ + Q + A +
Sbjct: 957 LEEIGRDYQTNWTTAVEILDEDTFLA---AESNLNLYVCKRDPSAADDTR-QHMHEVALY 1012
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKT--NRFALLFGTLDGSIGCIAPLDELTFRR 1346
H+G V ++ ++ A PG N+ + L+G+L G++G I P+ + +
Sbjct: 1013 HLGEMVNVIVKGSLVM-------AQPGDMPLPLNK-SFLYGSLHGAVGVIVPIKQELYAI 1064
Query: 1347 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1406
L +Q L ++ V + +R F + K P +D +L+ LP +E LE
Sbjct: 1065 LNQIQTNLAKTIKSVGKIEHGFWRTFLAERKIE-PAT-GFIDGDLIEQLLDLP-KEALES 1121
Query: 1407 AHQT 1410
Q+
Sbjct: 1122 VSQS 1125
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 99/428 (23%), Positives = 165/428 (38%), Gaps = 82/428 (19%)
Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
+PV+ I++E + T +H + AL L + P W NL +A L+ V
Sbjct: 178 DPVLAIVYEEQQT-------RHMKTHVIALR-DKELMKGP--WGQRNLDLEADMLIPVED 227
Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
GV++VG TI YH +Y S + S +D V
Sbjct: 228 TETGVIIVGGETIVYHYG--------QDYICIQPSFLRTTKISCYCRIDNNRL------V 273
Query: 377 ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
+L G L +LT+ + + V L + ++ + N + F+GSRLGDS L++
Sbjct: 274 FILGGICGRLFILTLRRENKKVVSHSLDLLGSVSIPECLSYLDNGVVFVGSRLGDSQLIR 333
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSASNNTE 495
+ A P + L ++ A+ DM+ +L G T
Sbjct: 334 --------------------MHAQEPFIEVLESYTNLGAILDMI-VVDLEKQGQDQLITC 372
Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYH 555
S Q G L+ G+ I+ A VEL G KGIW +
Sbjct: 373 SGQGA-----------CGSLRIIRNGIGIHELAC------------VELSGIKGIWAL-- 407
Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--TADLLTEVTESVDYFVQGRTIA 613
R + A DD L++S +T V + + L +VT + + +T
Sbjct: 408 ---RMNTAQLEEDTPTDDT----LVLSFVGQTRVFNCSSTEELEQVTLPAAFDIDSQTFC 460
Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQ-DLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
A N+ G +VIQV ++ ++ + T+ D F P + N +++++ + V
Sbjct: 461 ARNVLG-NQVIQVTDKRVNLISVTSKTRVDQWFPPEGEIITQCACNDVQVALALKNVLVY 519
Query: 673 LGMSDGSI 680
L + DGS+
Sbjct: 520 LEIRDGSL 527
>gi|125580741|gb|EAZ21672.1| hypothetical protein OsJ_05303 [Oryza sativa Japonica Group]
Length = 1224
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 75/391 (19%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RIL+P T + +Q +E A+++ V N KE+ TLLA+GTA
Sbjct: 873 IRILDPKSR----DTTCLLELQDNEAAVSICTV---NFHDKEHGTLLAVGTA-------- 917
Query: 1125 ARGRVLLFSTGRNAD----NPQNLVTEVYSKEL--KGAIS----ALASLQGHLLIASGPK 1174
+ L F RN + V E S EL K + AL QG LL G
Sbjct: 918 ---KGLQFWPKRNLSAGFIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSV 974
Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
+ L+ +L P +VS++ ++ I +GD+ +S ++ ++ QL + A
Sbjct: 975 LRLYDLGKRKLLRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFAD 1034
Query: 1235 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE-----------SWKGQKLL 1283
D A ID T++ +D+ NI +S+ W+ KL
Sbjct: 1035 DSVPRWLTAANH-IDFDTMA--GADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLN 1091
Query: 1284 SR-------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCI 1336
+FHVG VT + ++ PG + L++GT+ GS+G +
Sbjct: 1092 GAPNKVEEIVQFHVGDVVTCLQKASLI----------PGGGE----CLIYGTVMGSVGAL 1137
Query: 1337 APL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLS 1393
+++ F L+ L P + G + ++R A+ P D ++D +L
Sbjct: 1138 LAFTSREDVDF--FSHLEMHLRQEHPPLCGRDHMAYR------SAYFPVKD-VIDGDLCE 1188
Query: 1394 HYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ LP + Q +IA + T +IL L D+
Sbjct: 1189 QFPSLPADMQRKIADELDRTPGEILKKLEDI 1219
>gi|346971485|gb|EGY14937.1| pre-mRNA-splicing factor RSE1 [Verticillium dahliae VdLs.17]
Length = 1230
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 103/486 (21%), Positives = 199/486 (40%), Gaps = 53/486 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C G + + Q + G T + IPL TP ++ E ++ I S
Sbjct: 779 QCEEGVVGIQGQSLRIFAIEKLGDTLTQ----KSIPLTYTPRRMVKHPEHPMFYTIESDN 834
Query: 1024 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI 1083
P LL D V + D L + + I D QT T+
Sbjct: 835 NTLPPELRAQLLADPSVVNG-DARTLPPAEFGYPRAKGRWASCISVIDPLSEELQTLQTV 893
Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RNA 1138
+ ++E A++ +V T+++NE+ L +GT G+D+ R F+ G R +
Sbjct: 894 DLDNNEAAVSAAIVPF---TSQDNESFLVVGT----GKDMIVNPR--QFTEGYIHIYRFS 944
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ +AL + QG L+ G + ++ ++ A D P +
Sbjct: 945 EDGREL-EFIHKTKVEEPPTALLAFQGRLVAGVGKTLRIYDLGQKQMLRKAQADVAPQLI 1003
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLV 1256
VSL+ + I++GD+ + + ++ +K +L D + + C ++D S+
Sbjct: 1004 VSLSTQGSRIVVGDVQQGVTYVVYKPLSNKLIPFVDDTVARWMTCTT---MVDYE--SVA 1058
Query: 1257 VSDEQKNIQIFYYAPKMS----ESWKGQKLL-SRAEFHVGAH----VTKFLRLQMLATSS 1307
D+ NI I K S E G L +R H H V+ F +L + +
Sbjct: 1059 GGDKFGNIFIVRAPEKASQEADEEGAGLHLTNTRDYLHGTPHRLSLVSHFYSQDVLTSIT 1118
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGL 1364
+ G D LL+ + G+IG P ++ F Q+L++ L +AG
Sbjct: 1119 KTSLVVGGQD-----VLLWSGISGTIGVFIPFVTREDADF--FQTLEQHLRTEDAPLAGR 1171
Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ +R +++ K ++D +L Y +LP +++ IA + + +I ++D+
Sbjct: 1172 DHLMYRGYYAPVKG-------VIDGDLCERYTLLPNDKKQMIAGELDRSVREIERKISDI 1224
Query: 1425 ALGTSF 1430
++F
Sbjct: 1225 RTRSAF 1230
Score = 46.2 bits (108), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 122/548 (22%), Positives = 203/548 (37%), Gaps = 125/548 (22%)
Query: 128 RRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDP 187
R + +ILA + +I+++E+ + + + + F K G G + DP
Sbjct: 97 RGFNYLILATDSGRIAIIEYLPAQNRFQRLHLETFG-------KSGIRRVVPGEFLACDP 149
Query: 188 QGRCGGVLVYGLQ-----MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
+GR L+ L+ ++ + SQ E T S A HV+++ LD
Sbjct: 150 KGRA--CLIASLEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVHVLSMVALD 198
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL----- 297
+ GY PV L E + T A + +AL + T L + L
Sbjct: 199 V----------GYANPVFAAL-ETDYTEADQDPTGQ-----AALDVETQLVYYELDLGLN 242
Query: 298 -IWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVS 348
+ + P D L P G GVLV G I Y HS + + + A
Sbjct: 243 HVVRKWSEPVDNTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQEAFRVPVPRRRGATE 302
Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRL 401
S + + +L + + LL T+ GDL +T+ DG V+RL
Sbjct: 303 DPSRKRCIVAGVMHKLKGSAGAFF----FLLQTEDGDLFKITIDMIEDRDGNPTGEVKRL 358
Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
+ + + S + + + ++ S+ G+ QF E+ GD
Sbjct: 359 KIKYFDTIPVASSLCILKSGFLYVASQFGNYQFYQF--------------EKLGD----- 399
Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ L SS D D E + ++ + A+ +S+ ++ PL D
Sbjct: 400 -DDEELEFSSDDFPTDPKQSYEAVFF--------HPRELENLALVESIDSMNPLIDCKVA 450
Query: 522 LRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNAD 564
DA +A G +S + ++ ELPG +WT+ K SRG
Sbjct: 451 NLTGEDAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAVWTL--KLSRG---- 504
Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
D+Y AY+++S T+VL + + EV +S F+ A L G +I
Sbjct: 505 --------DQYDAYIVLSFTNATLVLSIGETVEEVNDS--GFLTSVPTLAAQLLGGEGLI 554
Query: 625 QVFERGAR 632
QV +G R
Sbjct: 555 QVHPKGIR 562
>gi|322693432|gb|EFY85292.1| Pre-mRNA-splicing factor RSE1 [Metarhizium acridum CQMa 102]
Length = 1221
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 150/346 (43%), Gaps = 38/346 (10%)
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
Q T+ ++++E A++ +V +++NE+ L +GT G+D+ R
Sbjct: 870 QVVQTVDLENNEAAVSAAIVPF---ASQDNESFLIVGT----GKDIVVNPRNFSEAYIYV 922
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
R + + L ++ +++ AL QG LL G + ++ ++ A +
Sbjct: 923 YRFQEEGREL-EFIHKTKIEEPALALIPFQGKLLAGVGKTLRVYDLGMRQMLRKAQAEVA 981
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
P +VSLN + I++GD+ + + ++++K +L A D + T ++D S
Sbjct: 982 PQQIVSLNTQGSRIIVGDVQQGVTYVTYKPTTNKLIPFADDIIARWITCTT-MVDYE--S 1038
Query: 1255 LVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGA----HVTKFLRLQMLATS 1306
+ D+ N+ I PK S E G L++ ++ G + Q + TS
Sbjct: 1039 VAGGDKFGNMFIVRCPPKASEEADEEQSGLHLMNARDYLHGTSQRLDLMCHFYTQDIPTS 1098
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T G LL+ L G+IG PL ++ F QSL+ L P +AG
Sbjct: 1099 MAKTSLVVGGQDV----LLWSGLMGTIGVFIPLISREDADF--FQSLESHLRTEDPPLAG 1152
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+ +R +++ K I+D +L Y +LP +++ IA +
Sbjct: 1153 RDHLMYRSYYAPVKG-------IIDGDLCERYTLLPNDKKQMIAGE 1191
>gi|410045300|ref|XP_508472.4| PREDICTED: DNA damage-binding protein 1 [Pan troglodytes]
Length = 1107
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 90/199 (45%), Gaps = 20/199 (10%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 830 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 883
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 884 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 938
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 939 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 995
Query: 1279 GQKLLSRAEFHVGAHVTKF 1297
Q L FH+G V F
Sbjct: 996 RQHLQEVGLFHLGEFVNVF 1014
>gi|258570355|ref|XP_002543981.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
gi|237904251|gb|EEP78652.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
Length = 1209
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/365 (21%), Positives = 162/365 (44%), Gaps = 44/365 (12%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR------GRVLLFSTGR 1136
I ++ +E A++V V +++++ET L +GT G+D+ G + ++ R
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVNPPSSSCGYIHIY---R 921
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
++ + L ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 922 FQEDGKEL-EFIHKTKVESPPQALLAFQGRLLAGIGTNLRIYDLGMKQLLRKCQAEVVPR 980
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
+V L + I++ D+ +S+ ++ +K Q +L A D + T ++D T++
Sbjct: 981 MIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDIIARWTTCTT-MVDYETVA-- 1037
Query: 1257 VSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSD 1308
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 1038 GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMIHFYPQDIPTSIQ 1097
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
+T G R L++ L G+IG + P +++ F QSL+ +L P +AG +
Sbjct: 1098 KTQLVAG----GRDILVWTGLQGTIGMLIPFVSREDVDF--FQSLEMQLTSQTPPIAGRD 1151
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
+R +++ K +D +L Y LP +++L IA + + +I ++D+
Sbjct: 1152 HLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISDMR 1204
Query: 1426 LGTSF 1430
++
Sbjct: 1205 TKVAY 1209
Score = 41.2 bits (95), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 40/265 (15%)
Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
LL T+ GDL +T+ V D V+RL L + + S + + N F+ S G
Sbjct: 307 LLQTEDGDLFKVTIDMVEDDNGQPTGEVRRLKLKYFDTVPIASSLCILKNGFLFVASENG 366
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGD--IEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
+ QF + ++F +E AP R R + + L + +N +
Sbjct: 367 NHHFYQFEKLGDDDEETEFTSDDFSSDPLEPLAPVYFRPRPAENLNLVESINSVNPLMSC 426
Query: 489 SASNNTES-AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
+N TE A + ++ + LK +GL + S+ EL +P
Sbjct: 427 KVANLTEDDAPQLYTLCGTGARSTFRTLK---HGLEV---------SEIVESELPSVPS- 473
Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
+WT K +R +D+Y AY+I+S T+VL + + EVT++ +
Sbjct: 474 -AVWTT--KLTR------------NDQYDAYIILSFTNGTLVLSIGETVEEVTDT-GFLS 517
Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
T+A L G +IQV +G R
Sbjct: 518 SAPTLAVQQL-GEDSLIQVHPKGIR 541
>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
Length = 2870
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/218 (22%), Positives = 103/218 (47%), Gaps = 16/218 (7%)
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
P +++++++ ++I++GDI S+ L + + +QL + +D+ ++ C + L S
Sbjct: 2631 TPSSWIMTVDVYGDYIVVGDIMTSVTILQYDYENSQLFEVCRDYSNIWCTS---LCALSK 2687
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL---QMLATSSDR 1309
+VVSD N I + KL S + F+ G+ + K L L ++ D+
Sbjct: 2688 SHIVVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPLSNTNLIEEDYDK 2747
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL-TFRRLQSLQKKLVDSVPHVAGLNPRS 1368
+D +L + +GSI + P F++ ++ + D++ + L+ +
Sbjct: 2748 RNILTKND-----GILCASSEGSISVLIPFSSFANFKKALCIEIAITDNISSIGNLSHNA 2802
Query: 1369 FRQFHSNGKA-HRPGPDSIVDCELLSHYEMLPLEEQLE 1405
+R++ N ++ H G IVD ELL + + E+Q +
Sbjct: 2803 YREYKVNFRSKHCKG---IVDGELLKMFFHMSFEKQYK 2837
>gi|167390599|ref|XP_001739420.1| DNA damage-binding protein [Entamoeba dispar SAW760]
gi|165896898|gb|EDR24200.1| DNA damage-binding protein, putative [Entamoeba dispar SAW760]
Length = 1088
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/349 (22%), Positives = 158/349 (45%), Gaps = 44/349 (12%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQN 1143
++ +E+AL++ + + + + + IGTA+ + +V + GR+L+ D
Sbjct: 756 LKENEHALSIEEIVV------DEKEMFVIGTAFAKPNEVEPSSGRILIVQI---KDGKLE 806
Query: 1144 LVTEVYSKELKGAISALASL-QGHLLIASGPKIILHKWTGTELNG---IAFYDAPP---- 1195
+V + K++ GA+ ++ +L + +L ++ K+++ ++ NG + +
Sbjct: 807 IV---FEKDVNGAVYSIKTLLKKYLAMSIEKKLVIFEYQRIITNGEFEVKLQEKGSCNVK 863
Query: 1196 ---LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL---LAKDFGSLDCFATEFLID 1249
LYV ++ N IL+GD+ KSI S+ G N +++DF + A EF+ +
Sbjct: 864 LIGLYVKTMG---NKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYTTAIEFVDE 920
Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
LS SD N+ +F +ES + +L + A HVG + + + T S
Sbjct: 921 NCYLS---SDSNSNLLVFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHSTY 976
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN-PRS 1368
+ + +LFG + G IG I + + L +Q +++ + + P
Sbjct: 977 -------ETIQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDE 1029
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+++ + K R +I+D ++ Y + E+Q EIAH +G QI
Sbjct: 1030 WKKVIDDWK--RMPSSNIIDGNIVESYLEMSKEKQCEIAHLSGVNEEQI 1076
>gi|255081708|ref|XP_002508076.1| predicted protein [Micromonas sp. RCC299]
gi|226523352|gb|ACO69334.1| predicted protein [Micromonas sp. RCC299]
Length = 1199
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 86/388 (22%), Positives = 162/388 (41%), Gaps = 72/388 (18%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
VRI++P A T+ + M +E AL V + +E LA+G+A
Sbjct: 851 VRIVDPASA----STKQIVEMTGNEAALCCCHVYF----PQADELFLAVGSA-------- 894
Query: 1125 ARGRVLLFSTGRNADN--------PQNLVTEVYSKE-LKGAISALASLQGHLLIASGPKI 1175
V L + R+++ Q+ E++ K L G A+ +G LL+ G +
Sbjct: 895 ----VSLTFSPRDSEGGFIHLYRYTQDGGIELFHKTPLDGVPGAMCGFKGRLLVGVGNTL 950
Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
L+ + +L P ++ +++ I +GD+ +S +++ +K + + ++A D
Sbjct: 951 RLYDFGKKKLLRKVENRNFPNFIKTIHAQGERIYVGDVQESFHYVRYKREDGSMYIVADD 1010
Query: 1236 ----------------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
D F F+ S L+ VSDE + P ++ G
Sbjct: 1011 VQPRHVTAACPLDYDTIAGGDRFGNVFV---SRLAQDVSDEIEE------DPTGGKTAYG 1061
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
Q L+ A + VT+F + + + T A G + ++++ TL G++G + P
Sbjct: 1062 QGALNGASHKIN-QVTQFHVGETVCALTKGTLQAGGLE-----SMIYATLMGTLGALMPF 1115
Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
+++ F L+ + +P + G + +FR ++ P D ++D +L Y
Sbjct: 1116 GNREDVDF--CTHLEMHMRQELPPLLGRDHLAFR------SSYFPVKD-VIDGDLCEMYT 1166
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+LP E Q +A T S++L L DL
Sbjct: 1167 VLPHEAQRRVAEDMDRTVSEVLKKLEDL 1194
>gi|242772631|ref|XP_002478075.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
10500]
gi|218721694|gb|EED21112.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
10500]
Length = 1209
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 106/484 (21%), Positives = 204/484 (42%), Gaps = 62/484 (12%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C G + + Q + ++ S DN + IPL TP E+ L+ +I S
Sbjct: 759 QCLEGMVGIQGQNL----RIFSIEKLDNNVLQESIPLAYTPRHFVKHPEQPLFYVIESEN 814
Query: 1024 -VLKPLNQVLSLLIDQEV--GHQI----DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1076
VL P Q LL D ++ G + + H +E +++P +
Sbjct: 815 NVLAPATQT-RLLEDSKLQNGEAVIPPAETFGFPRATGHWASCIE-----VVDPINSKS- 867
Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG- 1135
+ + ++ +E+A++V V+ +++NET L +GT G+DV R FS G
Sbjct: 868 --VLSRLELEENESAVSVAAVSF---ASQDNETFLVVGT----GKDVVTYPRS--FSAGF 916
Query: 1136 ----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1191
R ++ + L ++ +++ AL + QG L+ G + ++ ++
Sbjct: 917 IHIYRFQEDGREL-EFIHKTKIEEPPLALLAFQGRLVAGIGKNLRVYDLGMKQMLRKCQV 975
Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
+A P +V L + I++ D+ +S+ ++ +K Q QL D + AT ++D
Sbjct: 976 EASPNLIVGLQTQGSRIIVSDVQESVTYVVYKYQENQLIPFVDDVIARWTTATT-MVDYE 1034
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSES----WKGQKLLSRAEFHVGA----HVTKFLRLQML 1303
T + D+ N+ + K+SE G L+ + G + Q +
Sbjct: 1035 TTA--GGDKFGNLWLVRCPKKVSEESDEDGSGAHLIHERSYLQGTPNRLDLMVHFYTQDI 1092
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1360
TS +T G R L++ L G+IG + P +++ F Q+L+ +L P
Sbjct: 1093 PTSLHKTNLVVG----GRDILVWTGLQGTIGMMIPFISREDVDF--FQNLEMQLASQNPP 1146
Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
+AG +R ++ K ++D +L Y +LP +++L IA + + +I
Sbjct: 1147 LAGREHLIYRSYYVPVKG-------VIDGDLCESYFLLPNDKKLMIAGELDRSVREIERK 1199
Query: 1421 LNDL 1424
++D+
Sbjct: 1200 ISDM 1203
>gi|298713790|emb|CBJ27162.1| spliceosomal protein sap, putative [Ectocarpus siliculosus]
Length = 1256
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/378 (22%), Positives = 153/378 (40%), Gaps = 55/378 (14%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+R+L+P T + + +E AL+V V N + E +A+GTA +
Sbjct: 909 IRLLDPVEG----TTVECLDLDDNEAALSVAPVAFHN---RNGEAFVAVGTA--KSLTFH 959
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
RG F +N + ++ E+ A+ QG LL+ G + ++ +
Sbjct: 960 PRGHEGCFVHVYRI--LENRLVLLHKTEVPDVPLAMKEFQGRLLVGVGQSLRMYDLGRKK 1017
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----FGSLD 1240
L P VVSL + + + GD +S + ++ +L A D F +
Sbjct: 1018 LLRKCENKRMPSMVVSLTVTGDRVFAGDQMESCHCFKYRRAENRLVEFADDQVPRFMTKT 1077
Query: 1241 CFATEFLIDGST---------LSLVVSDEQKNI---QIFYYAPKMSESWKGQKLLSRAEF 1288
C I G+ + L VSD+ N ++ + + +S + K+ + +F
Sbjct: 1078 CLLDYDSIAGADKFGNIFVLRVPLDVSDDVDNPTGNRLLWDSGHLSGA--PNKVQQQLQF 1135
Query: 1289 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFR 1345
HVG V+ S RT PG + LL+ T++GSIG + P D++ F
Sbjct: 1136 HVGEVVS----------SLRRTTLVPGGAEV----LLYSTINGSIGALLPFKSRDDVDF- 1180
Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
++ + P + G + S+R ++ K ++D +L + LP E+Q
Sbjct: 1181 -FTHMEMYMRQEKPTLCGRDHISYRSYYLPAK-------DVIDGDLCEQFSSLPFEKQKL 1232
Query: 1406 IAHQTGTTRSQILSNLND 1423
+A+ T +++ L D
Sbjct: 1233 VANGLDRTVGEVVKKLED 1250
>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1409
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 137/321 (42%), Gaps = 40/321 (12%)
Query: 1113 IGTAYV----QGEDVAARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALASLQGHL 1167
+GT+Y+ E+ +GR+L+ G ++D NP +V S ELKGA +LA + L
Sbjct: 826 VGTSYLADPEMDENSEVKGRILVL--GVDSDKNPYQIV----SHELKGACRSLAVMGDKL 879
Query: 1168 LIASGPKIILHKW------TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
+ ++++ + +G+ L F P + V L++ N I + D+ +S+ +
Sbjct: 880 VAGLSKTVVVYDYAEESSTSGSLLKLATFR--PSTFPVDLDVNGNMIGVADLMQSMTLIE 937
Query: 1222 W--KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
+ + G + L+ + +AT L + +D Q N+ + P
Sbjct: 938 FIPAQDGNKARLVERARHFQYIWATAVCHLEQDL-WIEADAQGNLMVLRRNPNAPTEHDK 996
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
+++ +EFH+G + K L +++ +D P K T++GSI A +
Sbjct: 997 KQMEVISEFHLGEQINKIRPLDVVSGEND-----PIEPKA-----FLATIEGSIYVFADI 1046
Query: 1340 DE------LTFR-RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
L F+ RL + K L + AGL+ S+R F N K GP VD EL+
Sbjct: 1047 KPEYQSLLLQFQERLAGVIKTLGQADEPGAGLSFMSWRGFR-NAKRSADGPFRFVDGELI 1105
Query: 1393 SHYEMLPLEEQLEIAHQTGTT 1413
+ L Q + G T
Sbjct: 1106 ERFLDLDAGRQEAVVQGLGPT 1126
>gi|389638952|ref|XP_003717109.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
gi|148887431|sp|Q52E49.2|RSE1_MAGO7 RecName: Full=Pre-mRNA-splicing factor RSE1
gi|351642928|gb|EHA50790.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
Length = 1216
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/369 (22%), Positives = 164/369 (44%), Gaps = 52/369 (14%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
I + ++E AL++ VV+ +++ E+ L +GT G+D+ R F+ G R
Sbjct: 879 IDLDNNEAALSMAVVSF---ASQDGESFLVVGT----GKDMVVNPR--RFTEGYIHVYRF 929
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
+++ + L ++ +++ +AL QG L+ G + ++ +L A + P
Sbjct: 930 SEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLRQLLRKAQAEVAPQL 988
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
+VSLN + I++GD+ + ++++K + +L A D + T + ST
Sbjct: 989 IVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAG--- 1045
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RLQMLA-------- 1304
+D+ N+ I K S+ +E H+ H +L RL ++A
Sbjct: 1046 ADKFGNLWILRCPEKASQESDEPG----SEVHL-VHSRDYLHGTSNRLALMAHVYTQDIP 1100
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
TS +T G + LL+G G+IG + P ++ F QSL++ L P +
Sbjct: 1101 TSICKTNLVVGGQE----VLLWGGFQGTIGVLIPFVSREDADF--FQSLEQHLRSEDPPL 1154
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
AG + +R + K ++D +L Y MLP +++ IA + + +I +
Sbjct: 1155 AGRDHLMYRGCYVPVKG-------VIDGDLCERYTMLPNDKKQMIAGELDRSVREIERKI 1207
Query: 1422 NDLALGTSF 1430
+D+ ++F
Sbjct: 1208 SDIRTRSAF 1216
>gi|212531303|ref|XP_002145808.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
18224]
gi|210071172|gb|EEA25261.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
18224]
Length = 1209
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 105/483 (21%), Positives = 201/483 (41%), Gaps = 60/483 (12%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C G + + Q + ++ S DN + IPL TP E+ L+ +I S
Sbjct: 759 QCLEGMVGIQGQNL----RIFSIEKLDNNVLQESIPLAYTPRHFVKHPEQPLFYVIESEN 814
Query: 1024 -VLKPLNQVL-----SLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1077
VL P Q L + V + H +E +++P A
Sbjct: 815 NVLAPATQTRLLEESKLQNGEAVIPPAETFGYPRATGHWASCIE-----VVDPVNAKS-- 867
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-- 1135
+ + ++ +E+A+++ V+ +++NET L +GT G+DV R FS G
Sbjct: 868 -VLSRLELEENESAVSIAAVSF---ASQDNETFLVVGT----GKDVVTYPRS--FSAGFI 917
Query: 1136 ---RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
R ++ + L ++ +++ AL + QG L+ G + ++ ++ +
Sbjct: 918 HIYRFQEDGREL-EFIHKTKIEEPPLALLAFQGRLVAGIGKNLRIYDLGMKQMLRKCQVE 976
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
A P +V L + I++ D+ +S+ ++ +K Q QL D + AT ++D T
Sbjct: 977 AVPNLIVGLQTQGSRIIVSDVQESVTYVVYKYQENQLIPFVDDVIARWTTATT-MVDYET 1035
Query: 1253 LSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLRL----QMLA 1304
+ D+ N+ + K S E G L+ + G L + Q +
Sbjct: 1036 TA--GGDKFGNLWLVRCPQKASDDSDEDGSGAHLIHERSYLQGTANRLNLMIHYYTQDIP 1093
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
TS +T G R L++ L G+IG + P +++ F Q+L+ +L P +
Sbjct: 1094 TSLHKTNLVVG----GRDILVWTGLQGTIGIMVPFISREDVDF--FQNLETQLASQNPPL 1147
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
AG + +R ++ K ++D +L Y +LP +++L IA + + +I +
Sbjct: 1148 AGRDHLIYRSYYVPSKG-------VIDGDLCESYFLLPNDKKLMIAGELDRSVREIERKI 1200
Query: 1422 NDL 1424
+D+
Sbjct: 1201 SDM 1203
>gi|312370905|gb|EFR19207.1| hypothetical protein AND_22901 [Anopheles darlingi]
Length = 1287
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 68/311 (21%), Positives = 126/311 (40%), Gaps = 43/311 (13%)
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
DN N + V+ E+ A AL + QG LL G + ++ +L P +
Sbjct: 1001 DNQTNELEHVHRTEIDDAPGALCAFQGRLLAGIGKVLRMYDLGKKKLLRKCENKHIPNQI 1060
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V++ + + + D+ +S+Y L +K QL + A D + L+D T++
Sbjct: 1061 VNIQGMGQRVYVSDVQESVYCLKYKRPENQLIIFADDTHP-RWVTSATLLDYDTVA--TG 1117
Query: 1259 DEQKNIQIFYYAPKMS----ESWKGQKLL----------SRAE----FHVGAHVTKFLRL 1300
D+ NI + +S E G K L +AE FH+G V +
Sbjct: 1118 DKFGNIAVLRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGEIVMSLQKA 1177
Query: 1301 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVP 1359
++ PG ++ L++ T+ G++G + P + Q L+ + + P
Sbjct: 1178 TLI----------PGGSES----LIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENP 1223
Query: 1360 HVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILS 1419
+ G + S+R ++ K +++D +L + L +Q IA G T S++
Sbjct: 1224 PLCGRDHLSYRSYYYPVK-------NVMDGDLCEQFTSLDPAKQKSIASDLGRTPSEVAK 1276
Query: 1420 NLNDLALGTSF 1430
L D+ +F
Sbjct: 1277 KLEDIRTRYAF 1287
>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
queenslandica]
Length = 1214
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 76/389 (19%), Positives = 160/389 (41%), Gaps = 59/389 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
+R++ P++ +T + + +E A ++ V + + E + +GTA +
Sbjct: 862 LRVMHPNQG----KTLDIVQFEQNEAAFSLAVCQF--VSKGDLEWFVVVGTAKDMIITPR 915
Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
+ G +++F + + V++ +L A+A QG LL+ G + ++
Sbjct: 916 AISSGSLIVFRLSPDGSK----LEHVHTTQLDDVPIAMAPFQGRLLVGVGKLLRIYDIGK 971
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
++ P VV + ++ + +GD+ ++++FL ++ QL + A + C
Sbjct: 972 KKMLRKCENKHLPYLVVDIKVMGRRVYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFC- 1030
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1284
T ++D +T++ +D+ NI I +++ W QK
Sbjct: 1031 TTSCILDYNTVA--SADKFGNITILRLPSDVTDQVDEDPSGSRSLWDRGFLNGATQKANV 1088
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
+HVG + ++ ++ PG + L++ TL GSIG + P ++
Sbjct: 1089 MTSYHVGEGINTLHKVSLI----------PGGSE----VLVYTTLSGSIGILVPFSSKED 1134
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
F Q L+ + ++ G + SFR ++ K S++D +L Y L
Sbjct: 1135 SDF--FQHLEMHMRSEWSNLVGRDHLSFRSYYVPVK-------SVIDGDLCEVYNSLDPS 1185
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
++ EIA + S++ L DL +F
Sbjct: 1186 KRREIALDLDRSPSEVAKKLEDLRTRYAF 1214
>gi|301110252|ref|XP_002904206.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
gi|262096332|gb|EEY54384.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
Length = 1197
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 126/298 (42%), Gaps = 48/298 (16%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V++ E+ A+ QG LL++ G + ++ ++ P +V L +
Sbjct: 922 VHTTEIDDIPHAMCEFQGRLLVSVGRALRIYDLGKKKMLRKCENRNFPSILVELKAAGDR 981
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE-FLIDGSTLSLVVSDEQKNIQI 1266
I D+H+S +F+ +K+ QL + A D + F T L+D TL +D+ N+ +
Sbjct: 982 IYASDMHESFHFVKYKKDENQLVIFADD--CVPRFITSSVLLDYDTL--CGADKFGNVFV 1037
Query: 1267 FYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
++S+ W KL A+FHVG VT +R ++
Sbjct: 1038 SRLPSEVSDEIDNPTGNRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMVRSSLV------ 1091
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
PG + A+++ T+ G IG + P +++ F L+ + P + G +
Sbjct: 1092 ----PGGTE----AVIYATIMGRIGALIPFTSREDVDF--YTHLEMYMRQEQPPLCGRDH 1141
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
S+R ++ K +I D +L + L +E+Q +A T +++L L D+
Sbjct: 1142 LSYRSYYIPVK-------NITDGDLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192
Score = 46.6 bits (109), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 68/306 (22%), Positives = 114/306 (37%), Gaps = 86/306 (28%)
Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVE--LDAAHATWLQND 375
VLV+G NT+ Y ++ +CA+ PR + + AT Q D
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAI---------------PRREGEHRDIIIVSAATHKQRD 291
Query: 376 V--ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
+ LL ++ GDL +++ Y G VV+ + + + + S + L F S +
Sbjct: 292 LFFVLLQSELGDLYKISLDYSGNVVEEIKIQFFDTIPVASSMCITKTGLLFCASEFSNHY 351
Query: 434 LVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA---------------LQD 477
L QF + G G K ++ ST LR+ ++ A + D
Sbjct: 352 LFQFLSIGEG----DDAAKCSSLAMDPTEFSTFPLRKLTNLALASSSASLSPVTQLLVDD 407
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ N + +Y NN S+ L+ +GL I A++
Sbjct: 408 LANEQTPQMYALCGNNNRSS-----------------LRVLRHGLPITEMAASA------ 444
Query: 538 NYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
LPG K +W + +Y D Y Y+++S E T+VLE + +
Sbjct: 445 ------LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETV 484
Query: 597 TEVTES 602
EV +S
Sbjct: 485 EEVAQS 490
>gi|164656549|ref|XP_001729402.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
gi|159103293|gb|EDP42188.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
Length = 1207
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 85/361 (23%), Positives = 143/361 (39%), Gaps = 35/361 (9%)
Query: 1074 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS 1133
G QT A ++ E AL++ +V NE L +G+A + R S
Sbjct: 859 GTTMQTCAEYALEKDEAALSMALVPF---AACGNELFLVVGSA-LGVTHAPLTWRAAFLS 914
Query: 1134 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
T R DN L V+ E+ AL + G LL +GP + + +L
Sbjct: 915 TYRLTDNGCGLAL-VHKTEVDHVPLALRAFHGRLLAGTGPYVRIFDMGTKKLLRKCQSRP 973
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
P VVSL + +++GD+ +S+++ +K L A D + ++D T+
Sbjct: 974 FPSKVVSLQVQGYRVIVGDMQESVHYSVYKPATNTLVAFADDIMPRWTTSALLMLDYDTV 1033
Query: 1254 SLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLRLQMLA----- 1304
+ D+ N+ + S E G L + + +GA R Q+LA
Sbjct: 1034 --MAGDKFGNVFVLRIDSSASLSADEDPTGLMLQNERSYLMGAA----HRAQLLAHYHVG 1087
Query: 1305 ---TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPH 1360
TS PG R +L+ ++G+IG + P + R +L+ +
Sbjct: 1088 DIITSLSMESLVPG----GRPVVLYTCVNGTIGALVPFISREDVRLFTTLEMHMRQENLS 1143
Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
+ G + ++R ++ KA +VD +L Y LP E+Q IA + T + I
Sbjct: 1144 LTGRDHLAYRGHYTPVKA-------VVDADLCELYTALPHEKQESIADELDRTPADIAKK 1196
Query: 1421 L 1421
L
Sbjct: 1197 L 1197
>gi|358060450|dbj|GAA93855.1| hypothetical protein E5Q_00501 [Mixia osmundae IAM 14324]
Length = 1153
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 107/510 (20%), Positives = 192/510 (37%), Gaps = 106/510 (20%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPV 1024
C G I VT L+I LP T + IPL TP + + L +I
Sbjct: 703 CPEGIIGVTGD-TLRIFTLPKIGTK---VKMDSIPLSLTPRRTAFHPAGTLLYMI----- 753
Query: 1025 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA----------- 1073
Q D+ LS + T EE ++EP A
Sbjct: 754 ------------------QSDHRTLSPI------TQEEKAKDLMEPSEAMWTAEINGLMR 789
Query: 1074 --GGPWQTRATI--PMQSSENALTVRV----------VTLFNTTTKENETLLAIGTAYVQ 1119
G W + +I P + ENA ++ V + + + L +GTA Q
Sbjct: 790 AEAGQWSSCISIIDPTEP-ENATVTQIYLDNNEAAFSVAVAQFAERPGKWFLLVGTA--Q 846
Query: 1120 GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK 1179
V+ R F + ++ EL ++A+ QG ++ G + L+
Sbjct: 847 DTTVSPRTCTHGFIRTYEITEAGRSLELLHKTELDDVPLSIAAFQGRAVVGVGRALRLYT 906
Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD---- 1235
+ L + + P VVSL + + I D S+YF+++K +L + A D
Sbjct: 907 MGKSRLLRKSENKSFPAAVVSLQVQGSRIYASDAQDSVYFVAYKAADNRLLIFADDTQQR 966
Query: 1236 ------------FGSLDCFATEFLIDGSTL-SLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
S D F F+ L S V ++Q I + P + +L
Sbjct: 967 WITCNTVVDYDTVASGDKFGNVFVSRVDKLVSEDVDEDQTGAGILHEKPLFMGAPHRLQL 1026
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL--- 1339
L+ F+VG +T ++ ++A R LL+ L G++G + P
Sbjct: 1027 LT--HFNVGDILTCIQKVSLVAG--------------GREILLYTCLGGTVGMLIPFISK 1070
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
+++ F +L+ + P + G + ++R ++ KA VD +L + +LP
Sbjct: 1071 EDVEFS--STLEMHMRAENPSIVGRDHLAYRGYYVPQKA-------TVDGDLCETFALLP 1121
Query: 1400 LEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+++Q +IA + + S++L ++ + + +S
Sbjct: 1122 MQKQAQIAGELDRSVSEVLKKIDSMRILSS 1151
>gi|408400551|gb|EKJ79630.1| hypothetical protein FPSE_00190 [Fusarium pseudograminearum CS3096]
Length = 1212
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 105/488 (21%), Positives = 208/488 (42%), Gaps = 57/488 (11%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQK-IPLKATPHQITYFAEKNLYPLIV 1020
C G + + Q I I +L G T +QK IPL TP ++ ++ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFNIDRL--GETL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813
Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1078
+ P LL D V + D+ L D + + +++P G Q
Sbjct: 814 ADNNTLPPELRAQLLADPGVVNG-DSKVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870
Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1135
TI ++++E A++ +V+ ++++NE+ L IGT G+D+ R FS G
Sbjct: 871 VLQTIDLENNEAAVSAAIVSF---SSQDNESFLVIGT----GKDMVVNPRS--FSEGYLH 921
Query: 1136 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
R + + L ++ +++ AL + QG +L+A G + ++ ++ + +
Sbjct: 922 IYRFLEGGREL-EFIHKTKVEEPPLALLAFQGRVLVAVGTSLRIYDLGMRQMLRKSQAEV 980
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
+VSLN + I++GD+ + + ++ +K +L D + T ++D
Sbjct: 981 ATQQIVSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE-- 1037
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1305
S+ D+ N+ I K SE ++ + +R H H + Q + T
Sbjct: 1038 SVAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRVSLMCHFYTQDIPT 1097
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
S + G + LL+ + G+IG P ++ F Q+L++ L P +A
Sbjct: 1098 SITKASLVVGGQE----VLLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
G + +R +++ K ++D +L Y +LP +++ IA + + +I ++
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREIERKIS 1204
Query: 1423 DLALGTSF 1430
D+ ++F
Sbjct: 1205 DIRTRSAF 1212
>gi|302831461|ref|XP_002947296.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
gi|300267703|gb|EFJ51886.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
Length = 1221
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 135/344 (39%), Gaps = 60/344 (17%)
Query: 1108 ETLLAIGTA----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL--KGAISALA 1161
E LL +G A Y+ + AA RV R AD + L E+ K + G AL
Sbjct: 906 EKLLVVGCAKGLRYMPTDCEAAYIRVY-----RLADGGKRL--ELVHKTIVDGGVPGALC 958
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
+G LL GP + L+ +L Y+ P ++++ + I +GD +S++ +
Sbjct: 959 GFKGRLLAGVGPTLRLYDMGKKKLLRKCEYNRLPHQIMNITVQGPRIYVGDAQESVHMMR 1018
Query: 1222 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG-- 1279
+K+ + A D T +D TL+ D+ N + + S+ +
Sbjct: 1019 YKKADNAFYIFADDIAP-RYLTTILPLDYDTLA--AGDKFGNFVVLRLPREASQQVEDDP 1075
Query: 1280 ----------------QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
KL +FHVG +T R +M A +
Sbjct: 1076 TGGKMAAASGRLNGAPHKLEEVVKFHVGDTITSLQRAEMQAGGQE--------------V 1121
Query: 1324 LLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
LL+ T+ G+IG + P +++ F L+ L P + G + SFR A+
Sbjct: 1122 LLYSTVMGAIGVLYPFTNREDVDF--FGHLEMHLRQEHPPLCGRDHLSFR------SAYF 1173
Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
P S VD +L Y +P ++Q IA T ++L L D+
Sbjct: 1174 P-VRSCVDGDLCGQYASIPAKKQQMIAEAMDRTPGEMLKKLEDI 1216
Score = 42.0 bits (97), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 71/353 (20%), Positives = 130/353 (36%), Gaps = 76/353 (21%)
Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGAN 327
A ++ KH T L ++ L++ W+ + + A L+AVP GGVLV N
Sbjct: 199 AASMAQKHLTFYEMDLGLNNVLRK----WTE-PIDNGANLLVAVPGGADGPGGVLVCAEN 253
Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
I Y +Q A+ L + + S++ A++ +L + ++ GD+
Sbjct: 254 FIIYKNQDHEEVRAVIPRRSDLPGDRGVLIVSYATHKKKAYSFFL------VQSEYGDIY 307
Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
+T+ Y+G V L + + + I + F S G+ L QF
Sbjct: 308 KVTLAYEGEAVTELKIKYFDTIPPCTSIAVLKTGFLFAASEYGNHALYQFV--------- 358
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
G E+ D+E SSS AL G + + + + + D
Sbjct: 359 -GTGEDDEDVE-----------SSSAALVQTEEGFQPVFF--------EPRPLKNLLLID 398
Query: 508 SLVNIGPLKDFS-----------------YGLRINADASATGISKQSNYELVELPGCK-G 549
+ ++ P+ D +G R + G++ + + LPG
Sbjct: 399 EMASLMPITDMKVANLLNEEIPQIYALCGHGPRASLSVLRPGLAV-TELAVSPLPGAPTA 457
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
+WTV ++ DE+ A++++S T+V + + E ES
Sbjct: 458 VWTVRRNAT--------------DEFDAFIVVSFANATLVFSIGEEVKETNES 496
>gi|348667612|gb|EGZ07437.1| hypothetical protein PHYSODRAFT_565381 [Phytophthora sojae]
Length = 1197
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 124/298 (41%), Gaps = 48/298 (16%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V++ E+ A+ QG LL++ G + ++ ++ P +V L +
Sbjct: 922 VHTTEIDDIPHAMCEFQGRLLVSVGRALRIYDLGKKKMLRKCENRNFPSILVELKAAGDR 981
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE-FLIDGSTLSLVVSDEQKNIQI 1266
I D+H+S +F+ +K+ QL + A D + F T L+D TL +D+ N+ +
Sbjct: 982 IYASDMHESFHFVKYKKDENQLVIFADD--CVPRFITSSVLLDYDTL--CGADKFGNVFV 1037
Query: 1267 FYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
++S+ W KL A+FHVG VT + R
Sbjct: 1038 SRLPSEVSDEIDNPTANRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMV----------R 1087
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
T PG + A+++ T+ G IG + P ++ F L+ + P + G +
Sbjct: 1088 TSLVPGGIE----AIIYATIMGRIGALIPFTSRQDVDF--YTHLEMYMRQEQPPLCGRDH 1141
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
S+R ++ K ++ D +L + L +E+Q +A T +++L L D+
Sbjct: 1142 LSYRSYYIPVK-------NVTDGDLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 116/301 (38%), Gaps = 76/301 (25%)
Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV- 376
VLV+G NT+ Y ++ +CA+ Q PR V + AT Q D+
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAIP---------RRQGEPRDIVIV----SAATHKQRDLF 293
Query: 377 -ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
LL ++ GDL +++ Y G V+ + + + + S + L F S + L
Sbjct: 294 FVLLQSELGDLYKISLDYSGNAVEEIKIQFFDTVPVASSMCITKTGLLFCASEFSNHYLF 353
Query: 436 QF-TCGSGT------------SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
QF + G G + LS+ + +++ A S L + + D+ N +
Sbjct: 354 QFLSIGEGDDTAKCSSLAMDPTELSTFPLRKLTNLQL-ASSMPSLSPVTQLLVDDLANEQ 412
Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
+Y N+ S+ L+ +GL I A++
Sbjct: 413 TPQMYALCGNSNRSS-----------------LRVLRHGLPITEMAASA----------- 444
Query: 543 ELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
LPG K +W + +Y D Y Y+++S E T+VLE + + EVT+
Sbjct: 445 -LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETVEEVTQ 489
Query: 602 S 602
S
Sbjct: 490 S 490
>gi|443918546|gb|ELU38987.1| CPSF A subunit region domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 1037
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 79/348 (22%), Positives = 146/348 (41%), Gaps = 53/348 (15%)
Query: 1096 VVTLFNTTTKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1154
+VT T + + GTA + GE+ GR++LF + +N++ SK+++
Sbjct: 704 MVTSIGVYTHGGNSYILAGTAIINPGENEPLAGRIILF-----GQDEENMIKFKASKDVE 758
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTEL---NGIAFYDAPPLYVVSLNIVK-NFILL 1210
G +S++ L ++ A G I L+ E+ + +A ++ Y+V IV+ N I++
Sbjct: 759 GGVSSIKQLGARIIAAIGHGIYLYNLGRGEVTISDPVARWERG--YIVHDIIVRPNMIVV 816
Query: 1211 GDIHKSIYFLSWKEQGA-----------------QLNLLAKDFGSLDCFATEFLIDGSTL 1253
D +S+ L + E+ + Q +A D ++ + E L D T
Sbjct: 817 SDRLRSVSVLRFIERTSTPESHEEIETEEDSTILQFETVAMDMHAVWPTSVEVLPDNKT- 875
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
++ S NI ++ + L RA FH G + KF+ S+ ++ A
Sbjct: 876 -IIASQTDGNI--------LTWELEDGNLEPRAAFHTGEIIHKFI------ASTAKSSAG 920
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
P R +F T G IG ++ +D+ +L L+ KL D++ + + +R
Sbjct: 921 P------RTVAIFVTNTGRIGTLSTVDDADALQLTRLEMKLGDAIKGLGNIKHPEWRAPK 974
Query: 1374 SNGKAHRPGP-DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
+P P + D + + + L EE I +G+ I SN
Sbjct: 975 LLHTGTKPPPRRGVTDGDFIKKFLELSSEEAKRIL-SSGSAAETIGSN 1021
>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
Length = 1116
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 126/309 (40%), Gaps = 30/309 (9%)
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
+E + +GT Y D GR+L+F + D+ L+ E ++G+I + +
Sbjct: 810 EEQSEYIVVGTTY-HCHDRKECGRILVF---KMIDSRLILLDET---TVRGSIFCMIAFN 862
Query: 1165 GHLLIASGPKIILHKWTGT----ELNGIAFY--DAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL+A + + W+G +L G Y LY+ +F+L+GD+ KS+
Sbjct: 863 GQLLVAINKSVHRYTWSGDSSSGKLTGEEIYGGHTASLYLAGRG---DFVLVGDMMKSMA 919
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
L + G + L++ F+ D + L SD N+ + + +
Sbjct: 920 LL--QASGKDVKELSRSSQPFWLTGLTFIDDDTYLG---SDNSYNLILMKKNTETANEVD 974
Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
Q L + H G + +F LAT +D P S ++F T+ G IG I+
Sbjct: 975 SQLLDNIGHIHTGEFINRF-HHGTLATLTDVDSPKPNS-------IIFATISGCIGVIST 1026
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ + + LQ L + + G + +R F N + +D +L+ + L
Sbjct: 1027 ISKQDYDFFSKLQVGLNRVIRGIGGFSHDRWRSFQ-NEHISNIESRNFIDGDLVEQFLHL 1085
Query: 1399 PLEEQLEIA 1407
++ LE+
Sbjct: 1086 RHDKMLEVT 1094
Score = 42.7 bits (99), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 41/196 (20%)
Query: 504 AVRDSLVNIGPLKDF----------------SYGLR---INADASATGISKQSNYELVEL 544
V D+ N+GP+ DF S G + + + GI++Q++ ++L
Sbjct: 335 TVLDTFANLGPIPDFCLVDIEKQGQNQIVACSGGFKEGSLRVIRNGIGITEQAS---IDL 391
Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
PG K IW++ S R YLI+S + T VLE E TE
Sbjct: 392 PGIKAIWSLARGSDR------------------YLILSFISSTKVLEFQGEDIEETEIAG 433
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
+ +Q T+ GN+ ++++Q+ G ++D + PS+ S + +
Sbjct: 434 FDLQSPTLYCGNV-ADKQILQISTSGIYLVDHETNLNYDVWKPSSGSINLASHQGNQILI 492
Query: 665 SIADPYVLLGMSDGSI 680
S + + D I
Sbjct: 493 SFGKTLIYFEIKDQKI 508
>gi|169848339|ref|XP_001830877.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
gi|116508046|gb|EAU90941.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
Length = 1213
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 102/484 (21%), Positives = 199/484 (41%), Gaps = 73/484 (15%)
Query: 977 ILKICQLPS-GSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
+L+I +P GS +PL TP ++ E N + LI S + + L
Sbjct: 770 VLRIFTIPKLGSKLKQ----DTLPLSYTPRKLITHPENNYFYLIESDHRVYSEEATKAKL 825
Query: 1036 ID-QEVGHQIDNH--NLSSVDLHRTYTVE---EYEVRILEPDRAGGPWQTRATIPMQSSE 1089
+ Q+ G +ID +L + R +RI++P +T A P+ ++E
Sbjct: 826 DELQKKGKKIDEEIISLPPSEFGRPKAPAGTWASNIRIIDPVEN----KTVAVFPLDNNE 881
Query: 1090 NALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVT 1146
A ++ +V + + E L +GTA +D R T + +N L
Sbjct: 882 AAFSIAIVPF---SARNGELHLVVGTA----KDTTVSPRTCESGFLRTYKFTENGTGLEL 934
Query: 1147 EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
++ E AL + QG L G + ++ +L + +V+L +
Sbjct: 935 -LHKTETDDVPMALLAFQGRLAAGVGKALRIYDIGKKKLLRKVENKSFTTAIVTLTTQGS 993
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAK-------------DFGSL---DCFATEFL--I 1248
IL+GD+ +S+ ++ +K+ +L A D+ ++ D F F+ +
Sbjct: 994 RILVGDMQESVQYVVYKQPENRLLTFADDTQPRWVTAITMVDYNTIVAGDRFGNIFVNRL 1053
Query: 1249 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1308
D S +S V ++ I + P + + K++ A FHVG +T ++ ++A
Sbjct: 1054 D-SKVSDQVDEDPTGAGILHEKPILMGAPHKTKMI--AHFHVGDIITSLHKVSLVA---- 1106
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
R +++ L G+IG + P +++ F + +L++ + P + G +
Sbjct: 1107 ----------GGREVIVYTGLHGTIGILMPFISKEDVDF--ISTLEQHMRTEQPSLVGRD 1154
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
++R ++ KA +VD +L Y LP +Q IA++ T ++L L +
Sbjct: 1155 QLAYRGYYVPVKA-------VVDGDLCETYAHLPASKQSSIANELDRTVGEVLKKLEQMR 1207
Query: 1426 LGTS 1429
+ +S
Sbjct: 1208 VTSS 1211
>gi|409045147|gb|EKM54628.1| hypothetical protein PHACADRAFT_210427 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1213
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 92/391 (23%), Positives = 165/391 (42%), Gaps = 66/391 (16%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RI+ P A QT IP+ ++E A ++ VV T + E L +GTA Q +A
Sbjct: 861 IRIVSPVDA----QTVNFIPLDNNEAAFSIAVVPF---TARGGELTLVVGTA--QDTFLA 911
Query: 1125 ARGRVLLF-STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
R F T R D+ ++ + ++ E A+ + QG L G + ++
Sbjct: 912 PRSCTSGFLRTYRFLDDGRD-IELLHKTETNDVPLAIMAFQGRLAAGIGKALRIYDIGKK 970
Query: 1184 EL----NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1239
+L F +A +V+LN + I++GD+ +SI++ +K +L + A D
Sbjct: 971 KLLRKVESKNFSNA----IVTLNTQGSRIIVGDMQESIFYAVYKPPENRLLIFADDAQPR 1026
Query: 1240 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW------------KG------QK 1281
A +ID +T++ D N+ + PK+S+ KG K
Sbjct: 1027 WITAVT-MIDYNTVA--AGDRFGNVFVNRLDPKISDQVDDDPTGAGILHEKGILMGAPHK 1083
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
A FHVG VT ++ ++A R LL+ L G++G + P
Sbjct: 1084 TAMIAHFHVGDIVTSIHKVSLVA--------------GGRELLLYTGLHGTVGILVPFVS 1129
Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+++ F + +L++ + + G + ++R ++ KA +VD +L + L
Sbjct: 1130 KEDVDF--ISTLEQHMRTEQLSLVGRDHLAWRGYYVPVKA-------VVDGDLCEMFARL 1180
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
P +Q IA + T ++L L L + S
Sbjct: 1181 PASKQSSIATELDRTVGEVLKKLEQLRVTAS 1211
>gi|432089478|gb|ELK23419.1| DNA damage-binding protein 1 [Myotis davidii]
Length = 1047
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 75/155 (48%), Gaps = 17/155 (10%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
L++K +A+DF A E L D + L
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFL 966
>gi|389740093|gb|EIM81285.1| hypothetical protein STEHIDRAFT_86633 [Stereum hirsutum FP-91666 SS1]
Length = 1213
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 91/389 (23%), Positives = 167/389 (42%), Gaps = 62/389 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RI++P ++ A IP+ ++E A ++ +V + + NE L +GTA Q +A
Sbjct: 861 IRIVDPAEG----KSVAEIPIDNNEAAFSLAIVPF---SVRNNEYHLVVGTA--QDTFLA 911
Query: 1125 ARGRVLLF-STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT 1183
R F T + D+ L ++ E +L + QG L+ G + ++
Sbjct: 912 PRSCTSGFLRTYKFVDDGAGLEL-LHKTETDDIPMSLLAFQGRLVAGIGKALRIYDIGKK 970
Query: 1184 ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDC 1241
+L A ++SLN + I++GD+ +SI + +K +L + A D + + C
Sbjct: 971 KLLRKAESKTFASAIISLNTQGSRIIVGDMQESIAYAVYKAPENKLLVFADDTQARWVTC 1030
Query: 1242 FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW------------KG------QKLL 1283
++D +T++ D NI I K+S+ KG K
Sbjct: 1031 ---STMVDYTTVA--AGDRFGNIFINRLDSKVSDQVDDDPTGAGILHEKGILMGAPHKTA 1085
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
A FHVG VT ++ ++A R LL+ L G+IG + PL +
Sbjct: 1086 MLAHFHVGDLVTSIHKVSLVA--------------GGREVLLYTGLHGTIGMLVPLVSKE 1131
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
++ F + +L++ + + G + ++R ++ KA +VD +L + LP
Sbjct: 1132 DVDF--ISTLEQHIRTEQTSLVGRDHLAWRGYYVPVKA-------VVDGDLCETFARLPA 1182
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+Q IA + T S++L L+ L + S
Sbjct: 1183 AKQSMIAGELDRTVSEVLKKLDQLRVTAS 1211
>gi|402223178|gb|EJU03243.1| hypothetical protein DACRYDRAFT_115454 [Dacryopinax sp. DJM-731 SS1]
Length = 1175
Score = 57.4 bits (137), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 110/440 (25%), Positives = 180/440 (40%), Gaps = 80/440 (18%)
Query: 926 FFLSGSRPCWCMVFRERLRVHP-QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
F G RP + +RL P +L D I A +VLH FI+ ++ +L I Q+
Sbjct: 718 IFACGDRPALLFLKNDRLTASPIKLRD--IHAGSVLHIPQFPSSFIFASASTLL-IGQIR 774
Query: 985 SGSTYDNYWPVQKIPLKA-TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQ 1043
D V+ I L TP ++TY Y ++ K LN+ D+E+
Sbjct: 775 ESQKID----VRTISLGLDTPIRLTYHRGLRAYGVVCQ---RKELNRE----DDREIYS- 822
Query: 1044 IDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTT 1103
SS L T E PD + V T+ ++T
Sbjct: 823 ------SSFKLFDDITFEYLNNFTARPDEQ-------------------MMCVTTIPDST 857
Query: 1104 TKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
+E+ L +GT G E+ ++GR+L+F + P + V S ++ G + A+ +
Sbjct: 858 GEEDSDFLVVGTYEATGAEEDVSKGRILIFE-----EVPNRKLKLVVSHDVGGCVYAVTN 912
Query: 1163 LQGHLLIASGPKI---ILHK-WTGTELNGIAFYDAPPLYVVSLNIVK-NFILLGDIHKSI 1217
+ +L A + LH+ + +A + + YV S I + N +L+GD +++
Sbjct: 913 VGANLAAAINGTLQVFSLHRSHDDIRIESVAKWSSA--YVASSLICRGNTLLVGDAMRAV 970
Query: 1218 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW 1277
L W GA+L L D+ SL E + +G V+ E N + +W
Sbjct: 971 CILRWT--GAKLETLYHDYASLWIQTLESIDEGG----VIGAELNNNIV---------TW 1015
Query: 1278 KGQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIG 1334
+ L R ++ G + +F R + A AAPG+ N L+F T G IG
Sbjct: 1016 RKDGKLERDGMWYFGEGINRFRRASLNA-------AAPGAGGNNAGRGNLVFCTNTGRIG 1068
Query: 1335 CIAPLDELTFRRLQSLQKKL 1354
+A LDE +L +LQ+ +
Sbjct: 1069 IVASLDEDLSMQLSNLQRNI 1088
>gi|398019848|ref|XP_003863088.1| CPSF-domain protein, putative [Leishmania donovani]
gi|322501319|emb|CBZ36398.1| CPSF-domain protein, putative [Leishmania donovani]
Length = 1347
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 123/308 (39%), Gaps = 37/308 (12%)
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
+ +E + LL IG+++ ++ AR + + T R Q L + SK++ GA+
Sbjct: 913 GVSEEEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRL-HLIASKDIGGALQCC 971
Query: 1161 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1208
A++ +G + + + L++W + G+ PLY SL + +
Sbjct: 972 AAVPHYKGRIALGVNGCVCLYQWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVL 1029
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+ D+ S +F+ L +L +D D L L D+ N
Sbjct: 1030 VALDVRHSAFFIEVDTLQGNLKVLCRDADLRGIMDGHVGSDAENLCLF--DDSLNFTALK 1087
Query: 1269 YAPKMSESWKGQ-----------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
P E+ G + RA+ H+G VT +R A +S A S
Sbjct: 1088 VVPLPVEARDGDAAAAARATAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAHCSS 1146
Query: 1318 KTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
N+ L+F T G G + P+ T+ L++L+ LV +VP V GL+ ++F
Sbjct: 1147 AQNQLLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAF 1206
Query: 1370 RQFHSNGK 1377
R+ G+
Sbjct: 1207 REVLRAGQ 1214
>gi|219110831|ref|XP_002177167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411702|gb|EEC51630.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1303
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 126/289 (43%), Gaps = 47/289 (16%)
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1214
G + +LA QG LL+ G + L++ +L + P +V ++ V +GD+
Sbjct: 1036 GPVLSLAHFQGRLLVGIGTTLRLYEMGKRQLLRKSELRNFPTFVKTVQTVGERAYIGDMM 1095
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+SI + + +L L+A D E L+D +T++ V D+ NI + P+ +
Sbjct: 1096 QSIQIVRYDVSANRLVLIANDASPRPIVCQE-LLDWNTVA--VGDKFGNISVMRL-PRGA 1151
Query: 1275 ES----WKGQKLL---SR----------AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
++ GQ+ L SR +++VG VT R ++A ++
Sbjct: 1152 DTSAIDVTGQRALWDSSREDMIPKLELLCQYYVGEVVTSMTRSSLVAGGAE--------- 1202
Query: 1318 KTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS 1374
+L++ T+ G IG P +++ F L+ +L G +P+S+R +++
Sbjct: 1203 -----SLIYVTVSGRIGAFVPFTNRNDVDF--YSQLESELRGDASRPTGRDPQSYRSYYA 1255
Query: 1375 NGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
H +VD +L + L E+Q +IA + T +I+ L D
Sbjct: 1256 P-MMH------VVDGDLCDAFNSLGPEKQNKIAEKLDRTVGEIMKKLED 1297
>gi|146094112|ref|XP_001467167.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
gi|134071531|emb|CAM70220.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
Length = 1347
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 70/304 (23%), Positives = 122/304 (40%), Gaps = 37/304 (12%)
Query: 1105 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL- 1163
+E + LL IG+++ ++ AR + + T R Q L + SK++ GA+ A++
Sbjct: 917 EEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRL-HLIASKDIGGALQCCAAVP 975
Query: 1164 --QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFILLGD 1212
+G + + + L++W + G+ PLY SL + ++ D
Sbjct: 976 HYKGRIALGVNGCVCLYQWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVLVALD 1033
Query: 1213 IHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ S +F+ L +L +D D L L D+ N P
Sbjct: 1034 VRHSAFFIEVDTLQGNLKVLCRDADLRGIMDGHVGSDAENLCLF--DDSLNFTALKVVPL 1091
Query: 1273 MSESWKGQ-----------KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1321
E+ G + RA+ H+G VT +R A +S A S N+
Sbjct: 1092 PVEARDGDAAAAARATAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAHCSSAQNQ 1150
Query: 1322 F--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
L+F T G G + P+ T+ L++L+ LV +VP V GL+ ++FR+
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAFREVL 1210
Query: 1374 SNGK 1377
G+
Sbjct: 1211 RAGQ 1214
>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
Length = 1169
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 105/489 (21%), Positives = 191/489 (39%), Gaps = 87/489 (17%)
Query: 965 CNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
CN G++ ++ I + C+L G T+ + ++PL TP ++ +
Sbjct: 734 CNDGYVAISGSNLRIFRCCRL--GETFSEH----RLPLDYTPRKLVMMPNE--------A 779
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT 1082
P + LN +++++ + +N S L ++ D G A
Sbjct: 780 PNVGGLNYMVAVVESDHNAYGPENVAEISKALGD-----------IKLDNEVGDLLPLAN 828
Query: 1083 IPMQSSENALTVRVVTLFNTTT------KENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1136
+ A VR+V N TT + NE A + G G + + +
Sbjct: 829 YKAGTGRWASCVRIVNPLNLTTAAKLLFETNEAATAAAVVVLDGMQCLCIGTTVGYDL-K 887
Query: 1137 NADNPQNLVTEVYS------------KELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
N D+ ++ + VY + G + A +G LL + G +I L+ +
Sbjct: 888 NTDDVESYI-RVYCYGANFEIRLLHVTRVGGVVRAFTGYEGRLLASVGKRIRLYALGKKQ 946
Query: 1185 LNGIAFYDAPPLY-VVSLNIVKNFILLGDIHKSIYFLSWK---EQGAQLNLLAKDFGSLD 1240
L A + + + LN V + I GDI + I L K E+ A+ + G
Sbjct: 947 LLLKAEHRTCSDHGFIWLNAVGSRIFAGDIREGIQILRIKFYSEEAAEFEWVGGATGP-- 1004
Query: 1241 CFATEFLIDGSTL--SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1298
+L + L S V++ ++ + IF ES + +L + +FH+G
Sbjct: 1005 ----RWLTSCAQLDYSTVIAGDKFD-SIFVTRVPQEESTRHIQLENVCQFHLGD------ 1053
Query: 1299 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLV 1355
L T+ D+ + + +L+GT+ GSIG + P DEL F LQ L+ +
Sbjct: 1054 ----LPTAMDKAALSQSTH-----VVLYGTVMGSIGALVPFQSKDELDF--LQHLEMLMA 1102
Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
P + G +R ++ + +VD +L + L +Q ++A Q TT +
Sbjct: 1103 TEAPPLCGREHSFYRSYYVPVQ-------QVVDGDLCEQFRHLTEAQQRKVAQQLDTTVN 1155
Query: 1416 QILSNLNDL 1424
+L L+D+
Sbjct: 1156 NVLRKLDDI 1164
>gi|328770812|gb|EGF80853.1| hypothetical protein BATDEDRAFT_29900 [Batrachochytrium dendrobatidis
JAM81]
Length = 1213
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/295 (22%), Positives = 127/295 (43%), Gaps = 29/295 (9%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
++ +KG + S QG LL+ G + ++ ++ P +V+L+ N
Sbjct: 936 LHKTPIKGIPKVMCSFQGRLLVGVGSLLRIYDLGKKKMLRKCECKGFPTTIVTLHTQGNR 995
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
I+LGD +S+++ ++ ++ + A D AT ++D T +V D+ NI +
Sbjct: 996 IILGDAQESVHYAMYRAFDNRIVIFADDTIPRWVTAT-CMVDYDT--VVGGDKMGNIFVN 1052
Query: 1268 YYAPKMS-----ESWKGQKLLSRAEF----HVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
+ ++S ++ Q + R H H F + L TS +T PG
Sbjct: 1053 RLSAEVSKGIDEDTTGNQAIFDRGYLQGAPHKVHHEADFFLGETL-TSLTKTSLVPG--- 1108
Query: 1319 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
R LL+ TL G IG + P D++ F Q+L+ + P + G + ++R F++
Sbjct: 1109 -GREILLYTTLMGGIGLLIPFISKDDVDF--FQTLEMTMRSECPPLCGRDHLAYRSFYTP 1165
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
A I+D +L + ++ +++ IA + + + L D+ +F
Sbjct: 1166 VHA-------IIDGDLCEMFNVMVGDKKRGIAESVDRSVADVGKKLEDMRTRVAF 1213
>gi|393217872|gb|EJD03361.1| hypothetical protein FOMMEDRAFT_108572 [Fomitiporia mediterranea
MF3/22]
Length = 1213
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 114/512 (22%), Positives = 194/512 (37%), Gaps = 106/512 (20%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTY--DNYWPVQKIPLKATPHQITYFAEKNLYPLIVSV 1022
C G I + S +L+I Q+P T + P+ P K PH
Sbjct: 759 CPEGLIGI-SGSVLRIFQIPRLGTKLKQDSMPLTYTPRKFIPH----------------- 800
Query: 1023 PVLKPLNQVLSLLIDQEVGHQI---DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1079
P+NQ ++ E H++ D +L + + EV L P+ G P
Sbjct: 801 ----PMNQYFYMI---EADHRVMGDDAAKEKLAELRQRGVKYDQEVVDLPPEVFGRPKAP 853
Query: 1080 RAT----IPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQGEDVA 1124
T I + N TV+VV L N + +E L +GTA +A
Sbjct: 854 AGTWGSCIRILDPINKATVKVVHLDNNEAAFSIAIVPFAARNSELFLCVGTA--SSTFLA 911
Query: 1125 ARG------RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1178
R R F+ G AD + V+ E AL + QG L G + ++
Sbjct: 912 PRSCSSGFIRTYAFTNG-GAD-----LELVHKTEADDVPMALMAFQGRLCAGVGKSLRIY 965
Query: 1179 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1238
+ +L +V+LN + I++GD+ +SI + +K +L + A D
Sbjct: 966 EIGKKKLLRKVETKTYGSAIVTLNTQGSRIIVGDMQESIVYAVFKPPENRLLIFADD-SQ 1024
Query: 1239 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSR--------- 1285
+ ++D +T++ D+ N+ I K+S E G +L
Sbjct: 1025 PRWTTSAVMVDYTTIA--AGDKFGNVFINRLDSKISDQVDEDPTGAGILHEKGLLMGAPH 1082
Query: 1286 -----AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL- 1339
A FHVG VT ++ ++A R LL+ L G+IG + P
Sbjct: 1083 KTGMIAHFHVGDIVTSIHKISLVA--------------GGREVLLYTCLHGTIGILVPFV 1128
Query: 1340 --DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
+++ F + +L++ + + G + ++R ++ KA +VD +L +
Sbjct: 1129 SKEDVDF--ISTLEQHMRSEKLSLVGRDHLAWRGYYVPVKA-------VVDGDLCEQFAR 1179
Query: 1398 LPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
LP +Q IA + T ++L L L + S
Sbjct: 1180 LPANKQSAIAVELDRTVGEVLKKLEQLRVTAS 1211
>gi|393243160|gb|EJD50676.1| hypothetical protein AURDEDRAFT_112250 [Auricularia delicata
TFB-10046 SS5]
Length = 1140
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 124/281 (44%), Gaps = 33/281 (11%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQN 1143
MQ +N V +L E + +GTAY++ ++ +RGR+L+F + ++ +
Sbjct: 797 MQLDDNEEITSVASLPIMPESRTE-MFVVGTAYIKDSEMEPSRGRILVFGSLEDSGTGGS 855
Query: 1144 LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL------- 1196
+T ++ GA+ +L S+ G ++ +IL++ L+ L
Sbjct: 856 WLTAFL--QVTGAVLSLTSVDGLIVAGVNTAVILYELRRNTLSEAERASHLTLRQKKEWN 913
Query: 1197 --YVV-SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
YVV SL + I +GD SI L WK + L+ +A+ FG + A + + G
Sbjct: 914 HNYVVTSLAARGDTIYIGDSVASIAILRWKHE--TLHTIARHFGPIFPLALDVMSSG--- 968
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
S++ ++ N+ F+ ES +KL +H+G V KF+ ++ +A
Sbjct: 969 SVITANIDYNLHTFH-----QESPTDRKLEIDGSYHLGDQVNKFIPGRL---------SA 1014
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
P + +F T G IG +A D+ L +L++ +
Sbjct: 1015 PTVGASIVLEQVFVTSLGRIGIVAEADKDASWALSALERNI 1055
>gi|16197726|emb|CAC94909.1| damaged-DNA recognition protein 1 [Mus musculus]
Length = 994
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 75/155 (48%), Gaps = 17/155 (10%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
L++K +A+DF A E L D + L
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFL 966
>gi|70992271|ref|XP_750984.1| UV-damaged DNA binding protein [Aspergillus fumigatus Af293]
gi|66848617|gb|EAL88946.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
Af293]
gi|159124553|gb|EDP49671.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
A1163]
Length = 1140
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 129/314 (41%), Gaps = 40/314 (12%)
Query: 1113 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTAY+ E D + RGR+L+F DN + L T+V +KGA ALA L ++ A
Sbjct: 834 VGTAYLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGDKIVAAL 888
Query: 1172 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
++++K + L +A Y AP V + + N I + D+ KS+ + +K
Sbjct: 889 VKTVVVYKVINNNFGAMRLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSMCLVEYK 944
Query: 1224 E----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
E + +A+ F ++ + + L SD + N+ + + E
Sbjct: 945 EGENGTPDTMTEVARHFQTVWATGVANIAPDTFLE---SDAEGNLIVLHRNTTGVEEDDK 1001
Query: 1280 QKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
++L E +G V + + +Q LA+ P + GT++GSI A
Sbjct: 1002 RRLEVTGEISLGEMVNRIRPVNIQQLAS----VAVTPRA--------FLGTVEGSIYLFA 1049
Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
++ L LQ + V V + FR F S + + P VD EL+ +
Sbjct: 1050 IINPDHQDFLMRLQATIAGKVELVGNIPFNEFRGFRSMVREAKE-PYRFVDGELIERFLT 1108
Query: 1398 LPLEEQLEIAHQTG 1411
Q EI G
Sbjct: 1109 CEPSLQEEIVSTVG 1122
>gi|392593521|gb|EIW82846.1| hypothetical protein CONPUDRAFT_81012 [Coniophora puteana RWD-64-598
SS2]
Length = 1213
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 85/385 (22%), Positives = 166/385 (43%), Gaps = 58/385 (15%)
Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR 1126
I++P A +T T+ + ++E+A +V VV ++NE L +GTA + R
Sbjct: 863 IIDPIEA----RTIHTVELDNNESAFSVAVVPF---AARDNELHLVVGTA--ADTLLTPR 913
Query: 1127 G-RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1185
R T R D ++L ++ E A+ + QG L+ G + L++ +L
Sbjct: 914 SCRSGYLRTYRFTDEGRSLEL-LHKTETDDVPLAVMAFQGRLIAGVGKSLRLYEIGKKKL 972
Query: 1186 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATE 1245
A + +V+LN + I++GD+ +S++F ++K +L + A D A
Sbjct: 973 LRKAENKSFASAIVTLNTQGSRIIVGDMQESVHFAAYKAPENRLLIFADDMQPRWVTALT 1032
Query: 1246 FLIDGSTLSL------------------VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1287
++D +T+++ V D+ I + ++S + KLL
Sbjct: 1033 -MVDYTTIAVGDRFGNVFINRLDMRVSDQVDDDPTGAGILHEKGQLSGAPHKTKLL--CH 1089
Query: 1288 FHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1344
FHVG +T ++ ++A R LL+ + G+IG + P +++ F
Sbjct: 1090 FHVGDLITSIHKVSLVA--------------GGREVLLYTGIHGTIGILVPFVSKEDVDF 1135
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
+ +L++ + + G + S+R +++ KA +VD +L + L +Q
Sbjct: 1136 --ISTLEQHMRSEQSSLVGRDQLSWRGYYTPVKA-------VVDGDLCEAFARLTGSKQS 1186
Query: 1405 EIAHQTGTTRSQILSNLNDLALGTS 1429
IA + T ++L L L + +S
Sbjct: 1187 AIAGELDRTVGEVLKKLEQLRVTSS 1211
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 87/384 (22%), Positives = 146/384 (38%), Gaps = 97/384 (25%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL ++ GDL +T+ +D V+ L + + + S + + + F+ S G+ L QF
Sbjct: 308 LLQSEDGDLFKVTIDHDEDEVKSLKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367
Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
S TS S G+ E F + + R R + AL D + + L
Sbjct: 368 QKLGDDDDEPEFSSTSFPSFGMAESFIPLPH---AHFRPRGLDNLALADEIESLDPILDA 424
Query: 489 SASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
N N+++ Q F+ R S L+ +GL + S+ ELP
Sbjct: 425 KVMNILPNSDTPQ-IFTACGRGSRSTFRMLR---HGLEVEESVSS------------ELP 468
Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
G +WT DD Y +Y+I+S T+VL + + EV ++
Sbjct: 469 GIPNAVWTTKRTE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT-G 513
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGA------------RILDGS--------------- 637
+ T+A + G ++QV +G R+ G
Sbjct: 514 FLSSAPTLAVQQI-GSDALLQVHPQGIRHVLSDRRVNEWRVPQGKTIVCATTNKRQVVVA 572
Query: 638 -------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
Y DL G N + STVL++S+ + PY+ +G D ++R+
Sbjct: 573 LSSAELVYFELDLD-GQLNEYQDWKAMGSTVLALSVGEVPEGRQRTPYLAVGCEDQTVRI 631
Query: 683 LVGDPSTC--TVSVQT----PAAI 700
+ DP + T+S+Q P+AI
Sbjct: 632 ISLDPESTLETISLQALTAPPSAI 655
>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
Length = 1515
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 111/266 (41%), Gaps = 37/266 (13%)
Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD--EDTFGSGGGFSARIESSHVINLRD 240
V VDP C V +Y + I+ + G L D E + ++ RIE +++
Sbjct: 101 VLVDPGANCVVVRLYHGLLRIIPLNGIGEKLTTDSLEVNQYAANTYNVRIEEGNIV---- 156
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
D F+HGY P +++E EL H L+ L
Sbjct: 157 -------DMAFLHGYTLPTFAMIYEDELVL--------HMKTYEISGREPALRNVQLTLD 201
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
++ D+ L+ VP P GGV++VG N I+YH++ ++ Y +SQ L ++
Sbjct: 202 SIE--PDSKLLIPVPKPFGGVILVGDNIIYYHTKDGP---HISQYIPQAKASQVLCYAAV 256
Query: 361 SVEL----DAAHATWLQNDVALLSTKTGDLVLLT----VVYDGRVVQ-RLDLSKTNPSVL 411
+ D A ++ + +A T +G+ +L + V R+ R++L +
Sbjct: 257 DAQRYLLGDMAGRLYMVHLLAEDHTPSGNGLLGSTSSAAVPSARIGSIRIEL--LGETAT 314
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+G LGDS L++
Sbjct: 315 PESIAYVDNGVVFIGCTLGDSQLIRL 340
>gi|71004436|ref|XP_756884.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
gi|74704394|sp|Q4PGM6.1|RSE1_USTMA RecName: Full=Pre-mRNA-splicing factor RSE1
gi|46095609|gb|EAK80842.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
Length = 1221
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 88/374 (23%), Positives = 150/374 (40%), Gaps = 54/374 (14%)
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
QT + M +E A ++ VV + E E +L +G+A DV R +T
Sbjct: 878 QTTHRLEMDDNEAAFSIAVVPF---ASAEKEVMLVVGSAV----DVVLSPRSCKKAYLTT 930
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
R DN + L ++ E+ L + QG LL G + ++ +L +
Sbjct: 931 YRLLDNGRELEL-LHKTEVDDIPLVLRAFQGRLLAGIGKALRIYDLGKKKLLRKCENRSF 989
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FGS 1238
P VVSL+ + I++GD+ +SI F S+K +L A D +
Sbjct: 990 PTAVVSLDAQGSRIVVGDMQESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAA 1049
Query: 1239 LDCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTK 1296
D F ++ +DG+T S V ++ + I + P + + L+ A F VG +T
Sbjct: 1050 ADKFGNIYVLRLDGNT-SRSVDEDPTGMTIVHEKPVLMGAAHKASLV--AHFFVGDIITS 1106
Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLV 1355
R M+A R LL+ L GSIG + P + + L +L+ L
Sbjct: 1107 LHRTAMVA--------------GGREVLLYTGLSGSIGALVPFVSKEDVDTLSTLESHLR 1152
Query: 1356 DSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRS 1415
+ G + ++R ++ K S++D +L + +L +Q IA +
Sbjct: 1153 QENNSIVGRDHLAYRSSYAPVK-------SVIDGDLCETFGLLSPAKQNAIAGELDRKPG 1205
Query: 1416 QILSNLNDLALGTS 1429
+I L L G +
Sbjct: 1206 EINKKLAQLREGAT 1219
>gi|121699866|ref|XP_001268198.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
1]
gi|119396340|gb|EAW06772.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
1]
Length = 1140
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/317 (24%), Positives = 129/317 (40%), Gaps = 46/317 (14%)
Query: 1113 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTA++ E D + RGR+L+F DN + L T+V +KGA ALA L ++ A
Sbjct: 834 VGTAFLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGNRIVAAL 888
Query: 1172 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
++++K + +L +A Y AP V + + N I + D+ KS+ + +K
Sbjct: 889 VKTVVVYKAVSNNFGAMKLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSVCLVEYK 944
Query: 1224 EQ----GAQLNLLAKDFGS-----LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
E L +A+ F + + C A + ++ SD + N+ I
Sbjct: 945 EGEDGLPDTLTEVARHFQTVWATGVACIAQDTFLE--------SDAEGNLIILCRNTTGV 996
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1334
E ++L E +G V + + + +S P + T++GSI
Sbjct: 997 EEDDKRRLEVTGEISLGEMVNRIRPVNIQQLTS--VAVTPRA--------FLATVEGSIY 1046
Query: 1335 CIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSH 1394
A ++ L LQ + V V + FR FHS + + P VD EL+
Sbjct: 1047 LFAMINPDHQDFLMRLQATIAGKVELVGNMPFNEFRGFHSMVREAQE-PYRFVDGELIER 1105
Query: 1395 YEMLPLEEQLEIAHQTG 1411
+ Q EI G
Sbjct: 1106 FLACEPSVQEEIVSIVG 1122
Score = 41.6 bits (96), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+L++G +I Y V D+++ + R LD A
Sbjct: 248 LIPVPAPLGGLLILGETSIKY---------------VDADNNEIISRP-----LDEATIF 287
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D V+ L + S + +G + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDNQVESWKLDLLGKTSRASVLVYLGGGVLFL 347
Query: 426 GSRLGDSLLVQFTCGS 441
GS GDS +++ + GS
Sbjct: 348 GSHQGDSQVLRISNGS 363
>gi|302654423|ref|XP_003019019.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
gi|291182709|gb|EFE38374.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
Length = 460
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 150/344 (43%), Gaps = 44/344 (12%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG----- 1135
+ + ++ +E A+++ V+ T++E+ET L +GT G+D+ R F+ G
Sbjct: 102 SNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHIY 152
Query: 1136 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
R + + L ++ +++ AL QG LL GP + ++ +L P
Sbjct: 153 RFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQITP 211
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+V L + I++ D+ +S+ ++ +K Q L A D T ++D T++
Sbjct: 212 RVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETVA- 269
Query: 1256 VVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLATS 1306
D+ NI + K SE G L+ ++ GA V F Q + TS
Sbjct: 270 -GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPTS 327
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T G R L++ L G++G P D++ F Q+L+ +L P +AG
Sbjct: 328 IQKTQLVAGG----RDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLAG 381
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+ +R +++ K ++D +L + +LP +++ IA
Sbjct: 382 RDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIA 418
>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
Length = 986
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 97/222 (43%), Gaps = 43/222 (19%)
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1098
E+G +I+ HNL +D H + + + ++E AL+
Sbjct: 778 EIGQEIEVHNLLIIDQHTFEVLHAH--------------------TLMATEYALS----- 812
Query: 1099 LFNTTTKENET-LLAIGTAYVQGEDVAAR-GRVLLF--STGRNADNPQNLVTEVYSKELK 1154
L +T E+ T +GTA++ ++ + GR+LL+ S G+ T+V KE+K
Sbjct: 813 LISTRLGEDPTSYFVVGTAFINPDETEPKMGRILLYHWSEGK--------FTQVAEKEIK 864
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGTE---LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
G+ +L G LL + + L +WT + L F + LY L +F+L+G
Sbjct: 865 GSCYSLVEFNGKLLASINSTVRLFEWTAEKELRLECSHFNNIIALY---LKTKGDFVLVG 921
Query: 1212 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
D+ +S+ L +K +A+D+ + E L D + L
Sbjct: 922 DLMRSLTLLQYKTMEGSFEEIARDYNPNWMTSIEILDDDTFL 963
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 31/203 (15%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
+D + V+D F+HG P ++++H+ ++ +H + I+ K+ I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFSKIPW 207
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++ VPSP+ G +++G +I YH N+Y + + +
Sbjct: 208 RQDNVEREAMMVIPVPSPMCGAIIIGQESILYHDG--------NSYVAVVPPIIKQSTIT 259
Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV-VQRLDLSKTNPSVLTSD 414
++D +L D+A G L +L + DG + V+ L + +
Sbjct: 260 CYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTLSVKDLKVELLGEISIPEC 313
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
IT + N + ++GSRLGDS L++
Sbjct: 314 ITYLDNGVIYVGSRLGDSQLIKL 336
>gi|300122534|emb|CBK23104.2| unnamed protein product [Blastocystis hominis]
Length = 172
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 65/128 (50%), Gaps = 15/128 (11%)
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
+ ++ +A+FH+ + +T L + + P N + T +G++G +
Sbjct: 26 RNVVRQADFHLASQITSILPISL-----------PDGQCIN----VILTAEGAMGVFLFV 70
Query: 1340 DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLP 1399
+ +L SLQK+L++++P A LN +FR++ S+G P ++D ++ Y ML
Sbjct: 71 TGEEYTKLSSLQKRLIEALPQNAALNNFNFRKYMSDGMMKYPRRKGVLDMGVIRKYLMLS 130
Query: 1400 LEEQLEIA 1407
+EQ +IA
Sbjct: 131 TQEQEDIA 138
>gi|302504585|ref|XP_003014251.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
gi|291177819|gb|EFE33611.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
Length = 460
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 150/344 (43%), Gaps = 44/344 (12%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG----- 1135
+ + ++ +E A+++ V+ T++E+ET L +GT G+D+ R F+ G
Sbjct: 102 SNLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRT--FTCGFIHIY 152
Query: 1136 RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPP 1195
R + + L ++ +++ AL QG LL GP + ++ +L P
Sbjct: 153 RFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIYDLGMRQLLRKCQAQITP 211
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+V L + I++ D+ +S+ ++ +K Q L A D T ++D T++
Sbjct: 212 RVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTT-MVDYETVA- 269
Query: 1256 VVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLATS 1306
D+ NI + K SE G L+ ++ GA V F Q + TS
Sbjct: 270 -GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVIHFYS-QDIPTS 327
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T G R L++ L G++G P D++ F Q+L+ +L P +AG
Sbjct: 328 IQKTQLVAGG----RDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNPPLAG 381
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+ +R +++ K ++D +L + +LP +++ IA
Sbjct: 382 RDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIA 418
>gi|148709424|gb|EDL41370.1| damage specific DNA binding protein 1 [Mus musculus]
Length = 968
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 73/151 (48%), Gaps = 17/151 (11%)
Query: 1105 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1163
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1164 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLID 1249
L++K +A+DF A E L D
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDD 962
>gi|119471789|ref|XP_001258220.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
181]
gi|119406372|gb|EAW16323.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
181]
Length = 1140
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 129/314 (41%), Gaps = 40/314 (12%)
Query: 1113 IGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTAY+ E D + RGR+L+F DN + L T+V +KGA ALA L ++ A
Sbjct: 834 VGTAYLDDEGDESIRGRILIF----EVDNGRKL-TQVAELPVKGACRALAMLGDKIVAAL 888
Query: 1172 GPKIILHK-----WTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
+++++ + L +A Y AP V + + N I + D+ KS+ + +K
Sbjct: 889 VKTVVVYRVINNNFGAMRLEKLASYRTSTAP----VDVTVTGNLIAVSDLMKSMCLVEYK 944
Query: 1224 E----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
E + +A+ F ++ + + L SD + N+ + + E
Sbjct: 945 EGENGTPDTMTEVARHFQTVWATGVANIAPDTFLE---SDAEGNLIVLHRNTTGVEEDDK 1001
Query: 1280 QKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1337
++L E +G V + + +Q LA+ P + GT++GSI A
Sbjct: 1002 RRLEVTGEISLGEMVNRIRPVNIQQLAS----VAVTPRA--------FLGTVEGSIYLFA 1049
Query: 1338 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEM 1397
++ L LQ + V V + FR F S + + P VD EL+ +
Sbjct: 1050 IINPDHQDFLMRLQATIAGKVELVGNMPLNEFRGFRSMVREAKE-PYRFVDGELIERFLT 1108
Query: 1398 LPLEEQLEIAHQTG 1411
Q EI G
Sbjct: 1109 CEPSLQEEIVSTVG 1122
>gi|320037168|gb|EFW19106.1| pre-mRNA-splicing factor rse1 [Coccidioides posadasii str. Silveira]
Length = 970
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/367 (20%), Positives = 162/367 (44%), Gaps = 48/367 (13%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1136
I ++ +E A++V V +++++ET L +GT G+D+ ++ G + ++
Sbjct: 633 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQE 685
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
+ + ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 686 DGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 741
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1254
+V L + I++ D+ +S+ ++ +K Q +L A D + C A ++D T++
Sbjct: 742 LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 798
Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1306
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 799 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 856
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T G R L++ L G++G + P +++ F QSL+ +L P +AG
Sbjct: 857 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 910
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+ +R +++ K +D +L Y LP +++L IA + + +I ++D
Sbjct: 911 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISD 963
Query: 1424 LALGTSF 1430
+ ++
Sbjct: 964 MRTKVAY 970
>gi|300120114|emb|CBK19668.2| unnamed protein product [Blastocystis hominis]
Length = 1240
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 124/542 (22%), Positives = 208/542 (38%), Gaps = 102/542 (18%)
Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV-------------LHNVNCNHGFIYV- 972
L+G PC L V P LC + T+ N +C+ G + V
Sbjct: 715 ILAGGNPCVLA-----LSVKPWLCYCANNTLTLTSLVSDPLDLAAPFCNEDCSEGIVCVA 769
Query: 973 -TSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV 1031
T+ I++I L T IPL TP ++ +YP + +L+ +
Sbjct: 770 GTNLNIIRIDDLTQPFT------ATSIPLSYTPRELV------VYPGQPRLLLLETDHNA 817
Query: 1032 LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRI---LEPDRAGG--------PWQTR 1080
S L Q Q HN+S V+ EY+ EPD+ QT
Sbjct: 818 YSELEKQSFYQQ---HNVSYVN--------EYDCGAPIPAEPDKWASCIRVVDAISLQTL 866
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG------RVLLFST 1134
+ + +E A ++ V +K +E + IGTA + + R V F
Sbjct: 867 ERLELADNEAAFSMCVCRF---ASKGDEPFVVIGTA--KNLKIHPRSCSQGFISVFRFVE 921
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
G + + ++ E+ +AL G L G + ++ +L A
Sbjct: 922 GHS-------LQLLHRTEVDEVPAALCEFDGKLAAGIGRSVRVYDLGKKKLLRKCENKAM 974
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
P +V L + + GD+ ++ F+ +++ QL A D G ++D +T+
Sbjct: 975 PHFVTKLRAMGERLYAGDLTDNVSFVKFRKGTNQLVEFA-DGGIPRSITALDVLDYNTV- 1032
Query: 1255 LVVSDEQKNIQIFYYAPKM---------SESWKGQKLLSRAEFHVGAHVTKFLRLQMLAT 1305
V D+ N+ + PK+ S S LLS A A + L + T
Sbjct: 1033 -VCGDKGGNLFVERVDPKVDDDIANPTGSRSLWNSGLLSAAPNK--AEQAASIYLGEIVT 1089
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
S +T PG D+ +L+GT+ G+IG + P+ D+L L ++ + P +
Sbjct: 1090 SVQKTVLIPGGDEV----VLYGTIFGTIGALLPMPSRDDL--HHLMHIEMYIRKQEPSLV 1143
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
G + S+R ++ K I+D L + MLP +Q EIA+ + S I+ +
Sbjct: 1144 GRDILSWRSAYTPMKG-------IIDGNLCETFSMLPQIKQEEIANALVLSVSSIVKKME 1196
Query: 1423 DL 1424
DL
Sbjct: 1197 DL 1198
>gi|336469942|gb|EGO58104.1| pre-mRNA splicing factor RSE1 [Neurospora tetrasperma FGSC 2508]
Length = 1192
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
RA G W + +I SE ++ + L N ++E E+ L +GT G
Sbjct: 830 RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 885
Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
+D+ R F+ G R ++ ++L ++ ++ AL QG LL G +
Sbjct: 886 KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 942
Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
++ +L A D P +VSL N I++GD+ + I ++ +K +G +L A D
Sbjct: 943 RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1002
Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
+L+ + T ++D S+ D+ NI I ++S+ +E H+ H
Sbjct: 1003 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1053
Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
+L RL Q L TS +T G LL+ L G++G P
Sbjct: 1054 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1109
Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+++ F Q+L+ + P +AG + +R +++ K ++D +L + +L
Sbjct: 1110 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1160
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
P +++ IA + + +I ++D+ ++F
Sbjct: 1161 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1192
>gi|164429062|ref|XP_957282.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
gi|157072394|gb|EAA28046.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
Length = 1192
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
RA G W + +I SE ++ + L N ++E E+ L +GT G
Sbjct: 830 RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 885
Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
+D+ R F+ G R ++ ++L ++ ++ AL QG LL G +
Sbjct: 886 KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 942
Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
++ +L A D P +VSL N I++GD+ + I ++ +K +G +L A D
Sbjct: 943 RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1002
Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
+L+ + T ++D S+ D+ NI I ++S+ +E H+ H
Sbjct: 1003 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1053
Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
+L RL Q L TS +T G LL+ L G++G P
Sbjct: 1054 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1109
Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+++ F Q+L+ + P +AG + +R +++ K ++D +L + +L
Sbjct: 1110 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1160
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
P +++ IA + + +I ++D+ ++F
Sbjct: 1161 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1192
>gi|391341057|ref|XP_003744848.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Metaseiulus
occidentalis]
Length = 1211
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 104/247 (42%), Gaps = 27/247 (10%)
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
P +V++N V N I++GD+ +S +F+ ++ QL + A DF A ++D T
Sbjct: 981 PNLIVTINAVGNRIVVGDVQESFFFIRYRMLENQLIIFADDFTPRWTTAA-CMVDYRT-- 1037
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--------AHVTKFLRLQMLATS 1306
+V D+ N+ I S+ R+ + G A V + L S
Sbjct: 1038 VVGGDKFGNVYILRLPGNTSDDVDEDPTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVS 1097
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T PG + A+++ T+ G +G + P D+ F Q L+ + P + G
Sbjct: 1098 LQKTALIPGGPE----AIVYTTIAGGVGALIPFSSKDDHEF--FQHLEMYMRTEHPPICG 1151
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+ SFR ++ KA ++D +L Y L +Q +IA + ++ L D
Sbjct: 1152 RDHLSFRSYYFPVKA-------VIDGDLCEQYNSLDANKQKQIADELERLPHEVAKKLED 1204
Query: 1424 LALGTSF 1430
+ +F
Sbjct: 1205 IRTKFAF 1211
>gi|303324325|ref|XP_003072150.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
delta SOWgp]
gi|240111860|gb|EER30005.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
delta SOWgp]
Length = 1209
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/367 (21%), Positives = 165/367 (44%), Gaps = 48/367 (13%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1136
I ++ +E A++V V +++++ET L +GT G+D+ ++ G + ++ R
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIY---R 921
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
++ + L ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 922 FQEDGKEL-EFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1254
+V L + I++ D+ +S+ ++ +K Q +L A D + C A ++D T++
Sbjct: 981 LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037
Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1306
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T G R L++ L G++G + P +++ F QSL+ +L P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+ +R +++ K +D +L Y LP +++L IA + + +I ++D
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISD 1202
Query: 1424 LALGTSF 1430
+ ++
Sbjct: 1203 MRTKVAY 1209
>gi|391341059|ref|XP_003744849.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Metaseiulus
occidentalis]
Length = 1223
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 104/247 (42%), Gaps = 27/247 (10%)
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1254
P +V++N V N I++GD+ +S +F+ ++ QL + A DF A ++D T
Sbjct: 993 PNLIVTINAVGNRIVVGDVQESFFFIRYRMLENQLIIFADDFTPRWTTAA-CMVDYRT-- 1049
Query: 1255 LVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG--------AHVTKFLRLQMLATS 1306
+V D+ N+ I S+ R+ + G A V + L S
Sbjct: 1050 VVGGDKFGNVYILRLPGNTSDDVDEDPTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVS 1109
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T PG + A+++ T+ G +G + P D+ F Q L+ + P + G
Sbjct: 1110 LQKTALIPGGPE----AIVYTTIAGGVGALIPFSSKDDHEF--FQHLEMYMRTEHPPICG 1163
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+ SFR ++ KA ++D +L Y L +Q +IA + ++ L D
Sbjct: 1164 RDHLSFRSYYFPVKA-------VIDGDLCEQYNSLDANKQKQIADELERLPHEVAKKLED 1216
Query: 1424 LALGTSF 1430
+ +F
Sbjct: 1217 IRTKFAF 1223
>gi|17541566|ref|NP_502299.1| Protein DDB-1 [Caenorhabditis elegans]
gi|74965443|sp|Q21554.2|DDB1_CAEEL RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|5824558|emb|CAA92824.2| Protein DDB-1 [Caenorhabditis elegans]
Length = 1134
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/281 (19%), Positives = 119/281 (42%), Gaps = 17/281 (6%)
Query: 1104 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
T ++ T +GT + ++ + GR+++F D ++ + V+ ++G+ A+
Sbjct: 814 TNDSSTYYVVGTGLIYPDETETKIGRIVVFEVD---DVERSKLRRVHELVVRGSPLAIRI 870
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
L G L+ A I L +WT + + + + L ++ + + D+ +S+ LS+
Sbjct: 871 LNGKLVAAINSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSY 930
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
+ +AKD+ S EF+ S L +++ P + G+ +
Sbjct: 931 RMLEGNFEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDD---GRYV 987
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLD 1340
L + + K + L + D +++ ++FGT G+IG I +D
Sbjct: 988 LEPTGYWYLGELPKVMTRSTLVIQPE--------DSIIQYSQPIMFGTNQGTIGMIVQID 1039
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
+ + L +++K + DSV + + S+R F +A P
Sbjct: 1040 DKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAEPP 1080
Score = 47.8 bits (112), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 96/396 (24%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP IGGV+V+G+N++ Y + Y SL L ++F+ +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262
Query: 365 DAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
DA+ +L LS G L++L T G V+ + + + + I I N
Sbjct: 263 DASGERFL------LSDTDGRLLMLLLNVTESQSGYTVKEMRIDYLGETSIADSINYIDN 316
Query: 421 SLFFLGSRLGDSLLVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
+ F+GSRLGDS L++ T +G S S + E + +I ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTEPNGGSY--SVILETYSNI---------------GPIRDMV 359
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
E ++ + T + A +D G L+ G+ I+ AS
Sbjct: 360 MVE---------SDGQPQLVTCTGADKD-----GSLRVIRNGIGIDELAS---------- 395
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
V+L G GI+ + S NAD+ Y+I+SL T VL+ E
Sbjct: 396 --VDLAGVVGIFPIRLDS----NADN------------YVIVSLSDETHVLQITGEELED 437
Query: 600 TESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
+ ++ TI A LFG ++Q E+ R++ S +++ + P+N E S
Sbjct: 438 VKLLEINTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK--FWEPTNGEIISK- 494
Query: 657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
+SV+ A+ ++L D ++ LL TC V
Sbjct: 495 -----VSVNAANGQIVLAARD-TVYLL-----TCIV 519
>gi|71413926|ref|XP_809084.1| cleavage and polyadenylation specificity factor-like protein
[Trypanosoma cruzi strain CL Brener]
gi|70873410|gb|EAN87233.1| cleavage and polyadenylation specificity factor-like protein,
putative [Trypanosoma cruzi]
Length = 499
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 54/260 (20%)
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
+++V+D F+ EP++ L ER TWAGRV W+ LS S
Sbjct: 250 IRYVRDMQFIDSSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309
Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
T+ ++ LI ++P++ + +P+G GV+ G NT+ + + + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365
Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQNDV---AL 378
NN + D E + F V L A+ T + N++ +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFMVNLSLANCTASFMSIVNEMLHLLV 425
Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
+S + G ++ L++ VQ + ++ S I IG+ + FLGS GDS
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479
Query: 439 CGSGTSMLSSGLKEEFGDIE 458
C + M S + F IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499
>gi|392869416|gb|EJB11761.1| pre-mRNA-splicing factor rse1 [Coccidioides immitis RS]
Length = 1209
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/367 (20%), Positives = 162/367 (44%), Gaps = 48/367 (13%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1136
I ++ +E A++V V +++++ET L +GT G+D+ ++ G + ++
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQE 924
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
+ + ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 925 DGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1254
+V L + I++ D+ +S+ ++ +K Q +L A D + C A ++D T++
Sbjct: 981 LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037
Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1306
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T G R L++ L G++G + P +++ F QSL+ +L P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+ +R +++ K +D +L Y LP +++L IA + + +I ++D
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIAGELDRSVREIERKISD 1202
Query: 1424 LALGTSF 1430
+ ++
Sbjct: 1203 MRTKVAY 1209
>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
Length = 1104
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 112/268 (41%), Gaps = 30/268 (11%)
Query: 1113 IGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAI-SALASLQGHLLIA 1170
+GTA++ E+ +GR+L+ R LVTE KE+KGA + L ++ +L +
Sbjct: 835 VGTAFIVPEEPEPTKGRILVLEHVR-------LVTE---KEVKGAAYNVLPFVKDKILAS 884
Query: 1171 SGPKIILHKWTGTELNGIAFYDAPP------LYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
K+ +G +L G+ A + + L N +++GD+ +S+ LS+
Sbjct: 885 VNSKV---PASGCDLGGVRVELASECSYLGNILALYLATRGNLVVVGDLMRSVSLLSYNV 941
Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
+ L A D+ S + E L D + L D N+ + + + +L
Sbjct: 942 EQGVLEHRAADYNSGWTTSVEALDDDTYLE---GDNHLNLVVLRRNADSATDEERARLQV 998
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
E+H G V +F ++ D + LLFG DG +G IA L +
Sbjct: 999 VGEYHTGTFVNRFRHGSLVMRPPDSEFV------SLPVPLLFGGTDGRLGVIARLPPGLY 1052
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQF 1372
L LQ L V V GL+ ++ F
Sbjct: 1053 EMLTKLQSALRQVVRGVGGLSHEAWIAF 1080
Score = 47.0 bits (110), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 94/242 (38%), Gaps = 55/242 (22%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL + G + LL + +DG V L + S + + + L F+GSR GDS LV+
Sbjct: 281 LLGNRQGGMQLLVLAHDGSRVSGLRTEPLGYTCAPSCLAYLDSGLTFVGSRSGDSQLVRI 340
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
+ + P T S +L +V+ + L
Sbjct: 341 SAQP-----------------VNQPPTYLELVDSFPSLAPIVDFVVMDL---------ER 374
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
Q + + + G L+ G+ IN A+ VELPG KG+W++
Sbjct: 375 QGQGQLVMCSGIDSDGSLRVVRNGIGINRQAT------------VELPGIKGVWSL---- 418
Query: 558 SRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
R H YDDEY YL+++ E R + L T + L E E + +T+ G
Sbjct: 419 -RSH---------YDDEYDKYLLLTFVGETRLLALNTEEELDE-AELPGFDSGSQTLWCG 467
Query: 616 NL 617
N+
Sbjct: 468 NM 469
>gi|189044515|sp|Q7RYR4.2|RSE1_NEUCR RecName: Full=Pre-mRNA-splicing factor rse-1
Length = 1209
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
RA G W + +I SE ++ + L N ++E E+ L +GT G
Sbjct: 847 RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 902
Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
+D+ R F+ G R ++ ++L ++ ++ AL QG LL G +
Sbjct: 903 KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 959
Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
++ +L A D P +VSL N I++GD+ + I ++ +K +G +L A D
Sbjct: 960 RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1019
Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
+L+ + T ++D S+ D+ NI I ++S+ +E H+ H
Sbjct: 1020 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1070
Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
+L RL Q L TS +T G LL+ L G++G P
Sbjct: 1071 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1126
Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+++ F Q+L+ + P +AG + +R +++ K ++D +L + +L
Sbjct: 1127 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1177
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
P +++ IA + + +I ++D+ ++F
Sbjct: 1178 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209
>gi|350290373|gb|EGZ71587.1| Pre-mRNA-splicing factor rse-1 [Neurospora tetrasperma FGSC 2509]
Length = 1209
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 89/392 (22%), Positives = 168/392 (42%), Gaps = 62/392 (15%)
Query: 1072 RAGGPWQTRATIPMQSSENALTVRVVTLFNT-----------TTKENETLLAIGTAYVQG 1120
RA G W + +I SE ++ + L N ++E E+ L +GT G
Sbjct: 847 RAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEGESFLVVGT----G 902
Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
+D+ R F+ G R ++ ++L ++ ++ AL QG LL G +
Sbjct: 903 KDMVLDPR--QFTEGYIHVYRFHEDGRDL-EFIHKTRVEEPPLALIPFQGRLLAGVGKTL 959
Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
++ +L A D P +VSL N I++GD+ + I ++ +K +G +L A D
Sbjct: 960 RIYDLGLKQLLRKAQADVTPTLIVSLQSQGNRIIVGDLQQGITYVVYKAEGNRLIPFADD 1019
Query: 1236 FGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1294
+L+ + T ++D S+ D+ NI I ++S+ +E H+ H
Sbjct: 1020 --TLNRWTTCTTMVDYE--SVAGGDKFGNIYIVRCPERVSQETDEPG----SEIHL-MHA 1070
Query: 1295 TKFL-----RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-- 1339
+L RL Q L TS +T G LL+ L G++G P
Sbjct: 1071 RNYLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVS 1126
Query: 1340 -DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+++ F Q+L+ + P +AG + +R +++ K ++D +L + +L
Sbjct: 1127 REDVDF--FQNLENHMRAEDPPLAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLL 1177
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
P +++ IA + + +I ++D+ ++F
Sbjct: 1178 PNDKKQMIAGELDRSVREIERKISDIRTRSAF 1209
>gi|302680006|ref|XP_003029685.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
gi|300103375|gb|EFI94782.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
Length = 1213
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/376 (23%), Positives = 157/376 (41%), Gaps = 36/376 (9%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RI++P + T A IP+ ++E A ++ VV + + E L +GTA V+
Sbjct: 861 IRIIDPTQN----STVAVIPLDNNEAAFSIAVVPF---SARNGELFLVVGTA--ANTRVS 911
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
R + N + + E AL + QG L G + ++ +
Sbjct: 912 PRTCSSGYLRTYQFTNDGAGLELHHKTETDDVPLALLAFQGRLAAGVGKALRIYDIGKKK 971
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
L A +V+LN + I+ GD+ +S+++ +K +L + A D +
Sbjct: 972 LLRKAENKGFGTTIVTLNTQGSRIIAGDMQESLFYAVYKAPENRLLVFADD-SQPRWISA 1030
Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL- 1298
++D T++ D N+ + K+SE G +L +GA H TK L
Sbjct: 1031 ATMVDYYTVA--AGDRFGNVFVNRLDYKVSEQVDDDPTGAGILHEKGILMGAPHKTKLLC 1088
Query: 1299 --RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKK 1353
+ L TS + G R LL+ L G+IG + P +++ F + +L++
Sbjct: 1089 HFHVGDLITSIHKVALVAG----GREVLLYTGLHGTIGMLVPFVSKEDVDF--ISTLEQH 1142
Query: 1354 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
+ + G + S+R ++ KA +VD +L + LP +Q IA++ T
Sbjct: 1143 MRSEQSSLVGRDHLSWRGYYVPVKA-------VVDGDLCETFAKLPASKQSAIANELDRT 1195
Query: 1414 RSQILSNLNDLALGTS 1429
++L L+ L TS
Sbjct: 1196 VGEVLKKLDSLRTTTS 1211
>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
Length = 1284
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/286 (23%), Positives = 121/286 (42%), Gaps = 30/286 (10%)
Query: 1110 LLAIGTAYVQ-GEDVAARGRVLLFST----GRNADNPQNLVTEVYSKELKGAISALASL- 1163
L +GTAY ED +RGR+L++S V ++ +G + ++
Sbjct: 931 FLLVGTAYAMPDEDEPSRGRILVYSCQADEASGTPTSTRAVRQITEMSTQGGVYSICQFY 990
Query: 1164 QGHLLIASGPKIILHKWTGT------ELNGIAFYDAPPLYVVSLNI---VKNFILLGDIH 1214
G+ L K + + E GI + ++VSL + K ++GD+
Sbjct: 991 DGNFLCTVNSKTHVVQIVADCGVLRLEYVGIGHHG----HIVSLFVKSRAKPLAIVGDLM 1046
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+S+ + + Q L +A+DF A E L D + + E N K +
Sbjct: 1047 RSVSLMQYYPQHETLEEVARDFNPNWTTAVEMLTD----DVYIGAENWNNLFCLRRNKAA 1102
Query: 1275 ESWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
S + + +L + EFH+G KF+ ++ + S ++R A LFGT++GS+
Sbjct: 1103 TSEEIRCRLDNIGEFHLGEMCNKFMSGSLVMP------VSSNSTTSSRRATLFGTVEGSL 1156
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
G I LD T +L++ + ++ V G + + +R + + H
Sbjct: 1157 GVILGLDGRTAAFFITLERAIAKTIQPVGGFSHQLYRSCQAELRVH 1202
>gi|392580116|gb|EIW73243.1| hypothetical protein TREMEDRAFT_37240 [Tremella mesenterica DSM 1558]
Length = 1214
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/390 (20%), Positives = 160/390 (41%), Gaps = 61/390 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA-------- 1116
+RIL+P QT +TI + E A ++ + N E L +GTA
Sbjct: 862 IRILDPLTN----QTVSTIELDEDEAAFSLTIAYFENMA---GEPSLVVGTAVKTTLTPR 914
Query: 1117 -----YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+++ + GR L F ++ +L +A QG+LL+ +
Sbjct: 915 GCKEGWLRVYAIKENGRTLEF---------------MHKTKLDEIPLCVAGFQGYLLVGA 959
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
G + L++ L ++ P + ++N++ I++GD+ +S +F ++ + L
Sbjct: 960 GKSLRLYEAGKKALLRKCENNSFPTVIATINVIGARIIVGDMQESTFFCVYRSIPTRQLL 1019
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAE 1287
+ D +D T++ D+ N+ + +SE G +L
Sbjct: 1020 VFGDDTQPRFLTCVTNVDYDTVA--CGDKFGNVFVNRMDQAVSEKVDDDPTGAGILHEKG 1077
Query: 1288 FHVG-AHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP---LD 1340
F +G AH T + ++ + TS + PG R L++ T+ G++G + P +D
Sbjct: 1078 FLMGAAHKTTLIAHYQVGSVVTSLTKVSLVPG----GRDVLVYTTISGAVGALVPFISMD 1133
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
++ F + +L+ + + G + ++R +++ +VD +L Y LP
Sbjct: 1134 DVEF--MTTLEMHMRSQNISLVGRDHLAYRGYYAPVMG-------VVDGDLCDAYSSLPY 1184
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+Q IA++ + +L L + ++F
Sbjct: 1185 TKQSSIANELDRSVGDVLKKLEQMRTSSAF 1214
>gi|388855100|emb|CCF51231.1| probable splicing factor 3B subunit 3 [Ustilago hordei]
Length = 1221
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 129/581 (22%), Positives = 227/581 (39%), Gaps = 84/581 (14%)
Query: 887 LRNLRFSRTPLDAYTREETPH-----GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
L N RT LDA T + T G+ R+ I + G +R ++
Sbjct: 685 LSNGVLLRTVLDAMTGQLTDTRTRFLGSKAVRL-IRTKVHGQSAVMALSTRTWLSFTYQS 743
Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLK 1001
RL+ P + D A++ + C G I + L+I +PS ++ + L
Sbjct: 744 RLQFTPLIFDALDHAWSFSAEL-CPEGLIGIVG-STLRIFTIPSLASK---LKQDSVALS 798
Query: 1002 ATPHQITYFAEKN--LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT-Y 1058
TP +I + ++ Y + L P Q + + + ++ H +DL +
Sbjct: 799 YTPRKIAHHPDEQGLFYVVEADRRTLSPGAQRRRV---EALEKELKPHQRGVLDLKPAEF 855
Query: 1059 TVEEYE-------VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1111
+ E VR+++ G QT I + +E A +V +V + E ++ L
Sbjct: 856 GLIRGEAGNWASCVRVVD----GPQSQTTHKIELDDNEAAFSVAIVPF---ASAEKQSFL 908
Query: 1112 AIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSK-ELKGAISALASLQGHL 1167
+G+A DV R +T R + + L EV+ K E+ L QG L
Sbjct: 909 VVGSAV----DVVLSPRSFKKAYLTTYRLINGGREL--EVHHKTEIDDIPLVLRPFQGRL 962
Query: 1168 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
L G + ++ +L + P +VSL+ + I++GD+ +SI F S+K
Sbjct: 963 LAGVGKALRIYDLGKKKLLRKCENKSFPTAIVSLDAQGSRIVVGDMQESIVFTSYKPLEN 1022
Query: 1228 QLNLLAKD----------------FGSLDCFATEFL--IDGSTLSLVVSDEQKNIQIFYY 1269
+L A D + D F ++ ID T S V ++ + I +
Sbjct: 1023 RLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNLYVLRIDADT-SRSVDEDPTGMTIVHE 1081
Query: 1270 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
P + + LL A + VG + TS +RT PG R L++ +
Sbjct: 1082 KPVLMGAAHKATLL--AHYFVGD----------IITSLNRTVMVPG----GREVLMYTGI 1125
Query: 1330 DGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
G+IG + P + + L +LQ +L + G + ++R ++ K S++D
Sbjct: 1126 SGTIGALVPFVSKEDVDTLSTLQTQLRQENNSLVGRDHLAYRSSYAPVK-------SVID 1178
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+L + +L +Q IA + S+I L L G +
Sbjct: 1179 GDLCETFGLLQPAKQNAIAQELDRKPSEINKKLAQLREGAT 1219
>gi|295666353|ref|XP_002793727.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278021|gb|EEH33587.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 1209
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 77/362 (21%), Positives = 156/362 (43%), Gaps = 38/362 (10%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNAD 1139
I ++ +E A++V V+ +++++ET L +GT G+D+ R R +
Sbjct: 872 IELEENEAAVSVAAVSF---SSQDDETFLVVGT----GKDMVVNPRSCSAGFIHIYRFQE 924
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
+ + L ++ +++ AL QG LL G + ++ ++ P VV
Sbjct: 925 DGKEL-EFIHKTKVEQPPVALLGFQGRLLAGIGTDVRIYDLGMRQMLRKCQASVVPHLVV 983
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
L + I++ D+ +S+ ++ +K Q +L D S T ++D T++ D
Sbjct: 984 GLQTQGSRIIVSDVQESVTYVVYKSQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGD 1040
Query: 1260 EQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTG 1311
+ N+ + K SE G L+ ++ GA ++ Q L TS +
Sbjct: 1041 KFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQ 1100
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
G R L++ L G++G + P +E+ F QSL+ +L P +AG +
Sbjct: 1101 LVTG----GRDILVWTGLQGTVGMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLI 1154
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
+R +++ K +D +L Y +LP +++ +IA + + +I + D+
Sbjct: 1155 YRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADMRTKV 1207
Query: 1429 SF 1430
++
Sbjct: 1208 AY 1209
>gi|390342012|ref|XP_793599.3| PREDICTED: uncharacterized protein LOC588842 [Strongylocentrotus
purpuratus]
Length = 1161
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 111/281 (39%), Gaps = 46/281 (16%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
GP+ +DP+ R G+ +Y I+ + L F+ R+E +VI+++
Sbjct: 50 GPIGIIDPECRMIGLRLYDGLFKIIPLDRDNKEL----------KAFNIRLEELNVIDVQ 99
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
F++G +P +V LH+ GR H + P W
Sbjct: 100 -----------FLYGCHQPTIVFLHQDP---HGR-----HVKTYEVNLREKEFNRGP--W 138
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
N+ +A ++AVP P GG L++G +I YH A+A + S+
Sbjct: 139 KQDNVETEATMVIAVPQPYGGALIIGQESITYHKGDNYVAIA----------PPTIKNST 188
Query: 360 FSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDIT 416
LD + +L D L L+ DG V+ L L + + +T
Sbjct: 189 LVCYGRLDNNGSRYLLGD--LTGRLFLLLLDKEESMDGAATVKDLKLEFLGETSIAECLT 246
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+ N + F+GSRLGDS LV+ S S + E F ++
Sbjct: 247 YLDNGVVFIGSRLGDSQLVRLNTESDESGSYVTMMETFTNL 287
>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
Length = 1064
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 76/336 (22%), Positives = 141/336 (41%), Gaps = 38/336 (11%)
Query: 1102 TTTKENE---TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAIS 1158
T+TK ++ T LA+GT+ + D GRVLLF N + + LV + + +
Sbjct: 749 TSTKFDDDPCTYLAVGTS-INIPDRQTSGRVLLF----NINEAKKLVL-LEEISFRSGVL 802
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----NFILLGDI 1213
L G L+ A ++ +++ ++ + ++ I+K +F+L+GD+
Sbjct: 803 YLHQFNGRLIAAVLKRLYSIRYSYSKEKNCKVISSENVHKGHTMILKLASRGHFMLVGDM 862
Query: 1214 HKSIYFLSWKEQGAQLNLLAKD-----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
KS+ L E G+ L +AK+ S+ ++ I T + V ++ N
Sbjct: 863 MKSMSLLGQSENGS-LVQIAKNPQPIWIRSIAMINDDYFIGSETSNNFVVVKKNN----- 916
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+ + + L S +H+G + ML S R P SD +L+ +
Sbjct: 917 ---DSTNELERELLDSVGHYHIGESIN-----SMLCGSLVR---LPDSDAPPIPTILYAS 965
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
++GSIG IA + + + LQK L V + G S+R F ++ H + +D
Sbjct: 966 VNGSIGVIASISKEDYEFFSKLQKGLNRVVNGIGGFTHESWRAFSND--HHTVESRNFID 1023
Query: 1389 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+L+ + L +E ++ T + L + L
Sbjct: 1024 GDLIEMFPDLKIESMAKVIQDMNVTLDETLKRIESL 1059
Score = 42.4 bits (98), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 128/565 (22%), Positives = 214/565 (37%), Gaps = 117/565 (20%)
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
++G + L + S GG ++D + ++ E K +L +D + + E
Sbjct: 14 IYGRISVLKLFSAGG-----KQDYLFISTESFKFCILAYDSEKKEIVTKASGNAED---- 64
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
GR + A G L +DP GR +I L +G L+ E G +
Sbjct: 65 --TIGRPTEA-GQLGIIDPDGR----------LIALHLYEGLLKLINIEK------GLNN 105
Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
I+ + N R L+ V D F++G P + +L + KH I +
Sbjct: 106 PIQKTAA-NTR-LEELQVMDMTFLYGCKIPTIAVL------FKDTKDEKH----IVTYEV 153
Query: 289 STTLKQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
S ++ P WS N+ Y + V P+GGVLVV N I Y + + ++A
Sbjct: 154 SQKDQELCPGPWSQSNV--GVYSSMLVAVPLGGVLVVADNGITYMNGRTTRSIA------ 205
Query: 348 SLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
+P + F +D + +L D G L +L ++ + V L
Sbjct: 206 -------IPYTKFLAYDRVDKDGSRYLFGD------HFGRLSVLVLLNHQQRVTELKFET 252
Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
+ + S I+ + + + F+GS GDS L++ + E D P+T
Sbjct: 253 LGRTSIPSSISYLDSGVVFIGSSSGDSQLIRL------------------NTEKD-PATD 293
Query: 466 RLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVNIGPLKDFSYGLR 523
S L++ N G + + AQ T S RD G L+ G+
Sbjct: 294 ----SYISHLENFTNIGPIVDFCLVDTEKQGQAQIVTCSGTYRD-----GTLRVIRNGI- 343
Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
GI++++ L+EL G KG+W + N S + D YLI+S
Sbjct: 344 --------GIAEKA---LIELEGVKGLWPI------KENDPSDPLNPKD----QYLIVSF 382
Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
T VL+ E TE TI N+ ++QV + +++ ++ D
Sbjct: 383 IGYTKVLQFQGEEIEETEFEGLDSNSSTILCSNIDKENVIVQVTNQAINLINPITFKRVD 442
Query: 643 LSFGPSNSESGSGSENSTVLSVSIA 667
PS S S N + +++SI
Sbjct: 443 QWKSPSGSPINLVSSNQSQIALSIG 467
>gi|336257679|ref|XP_003343663.1| hypothetical protein SMAC_08834 [Sordaria macrospora k-hell]
gi|380091896|emb|CCC10625.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1209
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 82/370 (22%), Positives = 163/370 (44%), Gaps = 54/370 (14%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
I + ++E A++ +V ++E E+ L +GT G+D+ R F+ G R
Sbjct: 872 IDLDNNEAAVSAAIVPF---ASQEGESFLVVGT----GKDMVLNPR--QFTEGYIHVYRF 922
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
++ ++L ++ ++ AL QG LL G + ++ +L A D P
Sbjct: 923 HEDGRDL-EFIHKTRVEEPPMALIPFQGRLLAGVGKTLRIYDLGLKQLLRKAQADVTPTL 981
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLV 1256
+VSL N I++GD+ + + ++ +K +G +L D +L+ + T ++D S+
Sbjct: 982 IVSLQSQGNRIIVGDLQQGVTYVVYKAEGNRLIPFVDD--TLNRWTTCTTMVDYE--SVA 1037
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RL--------QML 1303
D+ NI I ++S+ +E H+ H +L RL Q L
Sbjct: 1038 SGDKFGNISIVRCPERVSQDTDEPG----SEIHL-MHARNYLHGTPNRLSLQVHFFTQDL 1092
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1360
TS +T G LL+ L G++G P +++ F Q+L+ + P
Sbjct: 1093 PTSICKTSLVVGGQD----VLLWSGLQGTVGVFIPFVSREDVDF--FQNLENHMRAEDPP 1146
Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
+AG + +R +++ K ++D +L + +LP +++ IA + + +I
Sbjct: 1147 LAGRDHLIYRGYYTPVKG-------VIDGDLCERFSLLPNDKKQMIAGELDRSVREIERK 1199
Query: 1421 LNDLALGTSF 1430
++D+ ++F
Sbjct: 1200 ISDIRTRSAF 1209
>gi|315053737|ref|XP_003176243.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
gi|311338089|gb|EFQ97291.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
Length = 1181
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 102/466 (21%), Positives = 187/466 (40%), Gaps = 60/466 (12%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C G + + Q + ++ S DN + IPL TP E YPL
Sbjct: 725 QCVEGMVGIQGQNL----RIFSIEKLDNNLLQEPIPLAYTPRNFVRHPE---YPLFY--- 774
Query: 1024 VLKPLNQVLS-----LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGP 1076
V+ N VLS L+ + D+ L D + + +++P
Sbjct: 775 VIGSDNNVLSPSTKAKLLGESTAVNGDSAELPPEDFGYPRGTNHWASCIEVVDPINTKS- 833
Query: 1077 WQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFS 1133
+ + ++ +E A+++ V+ T++E+ET L +GT G+D+ R
Sbjct: 834 --VLSKLELEDNEAAVSIAAVSF---TSQEDETFLVVGT----GKDMVVSPRTYTCGFIH 884
Query: 1134 TGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
R + + L ++ +++ AL QG LL GP + ++ +L
Sbjct: 885 IYRFQEEGKEL-EFIHKTKVEQPPLALLGFQGRLLAGVGPDLRIYDLGMRQLLRKCQAQI 943
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
P +V L + I++ D+ +S+ ++ +K Q L A D S T ++D T+
Sbjct: 944 TPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIISRWTTCTT-MVDYETV 1002
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAH-----VTKFLRLQMLA 1304
+ D+ NI + K SE G L+ ++ GA V F Q +
Sbjct: 1003 A--GGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRLSLVVHFYS-QDIP 1059
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
TS +T G R L++ L G++G P D++ F Q+L+ +L +
Sbjct: 1060 TSIQKTQLVAG----GRDILVWTGLQGTVGMFVPFITRDDVDF--FQTLEMQLASQNAPL 1113
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
AG + +R +++ K ++D +L + +LP +++ IA
Sbjct: 1114 AGRDHLIYRGYYAPCKG-------VIDGDLCETFLLLPNDKKQAIA 1152
>gi|225683909|gb|EEH22193.1| pre-mRNA-splicing factor rse1 [Paracoccidioides brasiliensis Pb03]
Length = 1209
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 77/362 (21%), Positives = 156/362 (43%), Gaps = 38/362 (10%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNAD 1139
I ++ +E A++V V+ +++++ET L +GT G+D+ R R +
Sbjct: 872 IELEENEAAVSVAAVSF---SSQDDETFLVVGT----GKDMVVNPRSCSAGFIHIYRFQE 924
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
+ + L ++ +++ AL QG LL G + ++ ++ P VV
Sbjct: 925 DGKEL-EFIHKTKVEQPPVALLGFQGRLLAGIGTDVRIYDLGMRQMLRKCQASVVPHLVV 983
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
L + I++ D+ +S+ ++ +K Q +L D S T ++D T++ D
Sbjct: 984 GLQTQGSRIIVSDVQESVTYVVFKSQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGD 1040
Query: 1260 EQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTG 1311
+ N+ + K SE G L+ ++ GA ++ Q L TS +
Sbjct: 1041 KFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQ 1100
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
G R L++ L G++G + P +E+ F QSL+ +L P +AG +
Sbjct: 1101 LVTG----GRDILVWTGLQGTVGMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLI 1154
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
+R +++ K +D +L Y +LP +++ +IA + + +I + D+
Sbjct: 1155 YRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADMRTKV 1207
Query: 1429 SF 1430
++
Sbjct: 1208 AY 1209
>gi|440478305|gb|ELQ59147.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae P131]
Length = 1223
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 85/368 (23%), Positives = 162/368 (44%), Gaps = 56/368 (15%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
I + ++E AL++ VV+ +++ E+ L +GT G+D+ R F+ G R
Sbjct: 879 IDLDNNEAALSMAVVSF---ASQDGESFLVVGT----GKDMVVNPR--RFTEGYIHVYRF 929
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
+++ + L ++ +++ +AL QG L+ G + ++ +L A + P
Sbjct: 930 SEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLRQLLRKAQAEVAPQL 988
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
+VSLN + I++GD+ + ++++K + +L A D + T + ST
Sbjct: 989 IVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAG--- 1045
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RLQMLA-------- 1304
+D+ N+ I K S+ +E H+ H +L RL ++A
Sbjct: 1046 ADKFGNLWILRCPEKASQESDEPG----SEVHL-VHSRDYLHGTSNRLALMAHVYTQDIP 1100
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
TS +T G + LL+G G+IG + P ++ F QSL++ L P +
Sbjct: 1101 TSICKTNLVVGGQE----VLLWGGFQGTIGVLIPFVSREDADF--FQSLEQHLRSEDPPL 1154
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT----RSQI 1417
AG + +R + K ++D +L Y MLP +++ IA + + +I
Sbjct: 1155 AGRDHLMYRGCYVPVKG-------VIDGDLCERYTMLPNDKKQMIAGELDRSVREIERKI 1207
Query: 1418 LSNLNDLA 1425
+N D+A
Sbjct: 1208 STNFVDIA 1215
>gi|340721347|ref|XP_003399083.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus terrestris]
gi|350406701|ref|XP_003487854.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus impatiens]
Length = 1217
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/389 (20%), Positives = 153/389 (39%), Gaps = 60/389 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
+RI+ P T T + E L ++L + + ++ L +G A +
Sbjct: 866 IRIIAP-------TTGQTFEVHRLEQNLAALCLSLVKFSNQGDQLFLIVGIAKEFQLNPR 918
Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
V++ G + + N + V+ L A+ QG +L+ G + L+
Sbjct: 919 VSSGGFLYTYRVNSECTN----LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLYDMGK 974
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
+L P VVS+N + I + D+ +S+Y + +K Q QL + A D
Sbjct: 975 KKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHP-RWI 1033
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1284
T ++D T++ +D+ NI + A +++ W QK +
Sbjct: 1034 TTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVDEDPTGNKALWDRGLLNGASQKADT 1091
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
A FHVG V + ++ PG ++ L++ TL G++G + P ++
Sbjct: 1092 VACFHVGETVMSLQKATLI----------PGGSES----LVYTTLSGTVGVLVPFTSHED 1137
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
F Q L+ + P + G + SFR ++ K +++D +L + +
Sbjct: 1138 HDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQFNSIEPT 1188
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+Q I+ T S++ L D+ +F
Sbjct: 1189 KQKSISGDLERTASEVSKKLEDIRTRYAF 1217
Score = 42.4 bits (98), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPELYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|425768510|gb|EKV07031.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum PHI26]
gi|425775700|gb|EKV13954.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum Pd1]
Length = 1209
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 100/462 (21%), Positives = 186/462 (40%), Gaps = 66/462 (14%)
Query: 990 DNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
DN + IPL TP + +++L+ +I S VL P + LID + +
Sbjct: 781 DNNMLQESIPLSYTPRRFVKHPDQHLFYVIESDNNVLSPATR--QRLIDDSQAQNGEVAD 838
Query: 1049 LSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1106
L D + V+I++P +T+ ++ +E A+++ V+ ++++
Sbjct: 839 LPPADFGYPRATGHWASCVQIVDPITTKS---VISTLDLEDNEAAVSLAAVSF---SSQD 892
Query: 1107 NETLLAIGTA-------------YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKEL 1153
+ET L +GTA ++ GR L F D P
Sbjct: 893 DETFLVVGTAKDMTVSPPSSSCGFIHIYRFQEDGRELEFIHKTQVDEPP----------- 941
Query: 1154 KGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
AL QG LL GP + ++ +L P +V L + I++ DI
Sbjct: 942 ----LALLGFQGRLLAGIGPVLRVYDLGMKQLLRKCQAPVVPKTIVGLQTQGSRIIVSDI 997
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+S+ ++ +K Q L A D + +T ++D T + D+ N+ + K+
Sbjct: 998 RESVTYVVYKYQDNVLIPFADDSIARWTSSTT-MVDYETTA--GGDKFGNLWLVRCPSKI 1054
Query: 1274 SESW----KGQKLL-SRAEFHVGAHVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALL 1325
SE G L+ + H H + + Q + TS +T G R ++
Sbjct: 1055 SEQADEDGSGAHLIHEKGYLHGTPHRLELMVHFFAQDIPTSLHKTQLVAG----GRDIVV 1110
Query: 1326 FGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPG 1382
+ L G+IG P +++ F Q L+ +L P +AG + +R +++ K
Sbjct: 1111 WTGLQGTIGMFVPFVSREDVDF--FQLLETQLASQQPPLAGRDHLMYRGYYAPVKG---- 1164
Query: 1383 PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +L Y +LP + +L IA + + +I ++D+
Sbjct: 1165 ---VIDGDLCEMYLLLPNDTKLMIAGELDRSVREIERKISDM 1203
>gi|323508292|emb|CBQ68163.1| probable splicing factor 3B subunit 3 [Sporisorium reilianum SRZ2]
Length = 1221
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 90/375 (24%), Positives = 152/375 (40%), Gaps = 56/375 (14%)
Query: 1078 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFST 1134
QT + + +E A +V VV + E E +L +G+A DV R +T
Sbjct: 878 QTTHKLELDDNEAAFSVAVVPF---ASAEKEAMLVVGSAV----DVVLSPRSFKKAYLTT 930
Query: 1135 GRNADNPQNLVTEVYSK-ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
R +N + L EV K E+ L QG LL G + ++ +L +
Sbjct: 931 YRLTNNGREL--EVLHKTEVDDIPLVLRPFQGRLLAGIGKALRIYDLGKKKLLRKCENKS 988
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD----------------FG 1237
+VSL+ + I++GD+ +SI F S+K +L A D
Sbjct: 989 FATAIVSLDAQGSRIVVGDMQESIIFTSYKPLENRLVTFADDVMPKFVTRCAMLDYDTVA 1048
Query: 1238 SLDCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1295
+ D F ++ ID T S V ++ + I + P + + L+ A F VG
Sbjct: 1049 AADKFGNVYVLRIDADT-SRSVDEDPTGMTIVHEKPVLMGAAHKATLV--AHFFVGD--- 1102
Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKL 1354
+ TS +RT PG R LL+ + G+IG + P + + L +L+ L
Sbjct: 1103 -------IVTSLNRTVMVPG----GREVLLYTGVSGTIGALVPFVSKEDVDTLSTLESHL 1151
Query: 1355 VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTR 1414
+ G + ++R ++ K S++D +L + +LP +Q IA +
Sbjct: 1152 RQENSSLVGRDHLAYRSSYAPVK-------SVIDGDLCETFGLLPPAKQNAIATELDRKP 1204
Query: 1415 SQILSNLNDLALGTS 1429
S+I L L G++
Sbjct: 1205 SEINKKLAQLREGST 1219
>gi|310793065|gb|EFQ28526.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1212
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 105/506 (20%), Positives = 199/506 (39%), Gaps = 93/506 (18%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C G + + Q + G T + IPL TP ++ E YP+ ++
Sbjct: 761 QCEEGVVGIQGQSLRIFAIENLGDTITQ----KSIPLSYTPRRLLKHPE---YPMFYTI- 812
Query: 1024 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE----EYEVRILEPDRAGGP--- 1076
+ DN+ L DL E + RIL PD G P
Sbjct: 813 -------------------EADNNTLPP-DLRAKLIAEPGVVNGDARILPPDEFGYPKGK 852
Query: 1077 --W--------------QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG 1120
W + TI + ++E A++ +V+ ++++E+ L +GT G
Sbjct: 853 GRWASCISVIDPLAEDQRVLQTIDLDNNEAAVSAAIVSF---ASQDSESFLIVGT----G 905
Query: 1121 EDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKI 1175
+D+ R FS G R ++ L ++ +++ SAL QG LL G +
Sbjct: 906 KDMVVNPR--QFSEGYIHVYRFGEDGHEL-EFIHKTKVEEPPSALLGFQGRLLAGIGKTL 962
Query: 1176 ILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD 1235
++ ++ A D P +VSL+ + I++GD+ I ++ +K +L D
Sbjct: 963 RIYDLGLRQMLRKAQADVTPQLIVSLSTQGSRIIVGDVQHGITYVVYKPTTNKLIPFVDD 1022
Query: 1236 FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHV 1290
S T ++D S+ D+ N+ + + K ++ + + +R H
Sbjct: 1023 TVSRWVTCTT-MVDYE--SVAGGDKFGNMFLVRCSEKATQEADDESGGLHLINTRDYLHG 1079
Query: 1291 GAHVTKFLR---LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTF 1344
H L Q + TS +T G LL+ ++G+IG P +++ F
Sbjct: 1080 TPHRLSLLAHSYTQDVPTSITKTSLVVGGQD----VLLWSGINGTIGVFIPFVTREDVDF 1135
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
Q+L++ + +AG + +R ++ K ++D +L Y +LP E++
Sbjct: 1136 --FQNLEQHMRTEDAPLAGRDHLMYRGYYVPVKG-------VIDGDLCERYTLLPSEKKQ 1186
Query: 1405 EIAHQTGTTRSQILSNLNDLALGTSF 1430
IA + + +I ++D+ ++F
Sbjct: 1187 MIAGELDRSVREIERKISDIRTRSAF 1212
Score = 44.3 bits (103), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 137/606 (22%), Positives = 225/606 (37%), Gaps = 135/606 (22%)
Query: 74 QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
Q G+KE + R+ + D + L+ H + G + S+A G++ +
Sbjct: 27 QFSGTKEQNIITASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
D +ILA + +I+++E+ I + + F+ LHL K G G + D
Sbjct: 82 DYLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130
Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
P+GR V L ++ + SQ E T S A V+++ LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
GY PV L E E T A G + + T ++ + L
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229
Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALAL--NNYAVSLD 350
WS P D L P G GVLV G I Y HS + + + A
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDL 403
S + S +L + + L+ T+ GDL T+ DG V+RL +
Sbjct: 287 SRKRHVVSGVMHKLKGSAGAFF----FLIQTEDGDLFKATIDMVEDADGNPTGEVKRLKI 342
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
+ ++S + + + + S+ G+ QF E+ GD
Sbjct: 343 KYFDTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGD------D 382
Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR 523
+ L SS D D G + + + + A+ +S+ ++ PL D
Sbjct: 383 DEELEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANL 434
Query: 524 INADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSS 566
DA +A G +S + ++ ELPG +WT+ K +RG
Sbjct: 435 TGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG------ 486
Query: 567 RMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
D+Y AY+++S T+VL + + EV++S F+ A L G +IQV
Sbjct: 487 ------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQV 538
Query: 627 FERGAR 632
+G R
Sbjct: 539 HPKGIR 544
>gi|66553024|ref|XP_623333.1| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Apis mellifera]
gi|380015815|ref|XP_003691890.1| PREDICTED: splicing factor 3B subunit 3-like [Apis florea]
Length = 1217
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/389 (20%), Positives = 152/389 (39%), Gaps = 60/389 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
+RI+ P T T + E L ++L + ++ L +G A +
Sbjct: 866 IRIIAP-------STGQTFEVHRLEQNLAALCLSLVKFANQGDQLFLIVGIAKEFQLNPR 918
Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
V++ G + + N + V+ L A+ QG +L+ G + L+
Sbjct: 919 VSSGGFLYTYKVNSECTN----LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLYDMGK 974
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
+L P VVS+N + I + D+ +S+Y + +K Q QL + A D
Sbjct: 975 KKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHP-RWI 1033
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1284
T ++D T++ +D+ NI + A +++ W QK +
Sbjct: 1034 TTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVDEDPTGNKALWDRGLLNGASQKADT 1091
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
A FHVG V + ++ PG ++ L++ TL G++G + P ++
Sbjct: 1092 VACFHVGETVMSLQKATLI----------PGGSES----LVYTTLSGTVGVLVPFTSHED 1137
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
F Q L+ + P + G + SFR ++ K +++D +L + +
Sbjct: 1138 HDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPIK-------NVIDGDLCEQFNSIEPA 1188
Query: 1402 EQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+Q I+ T S++ L D+ +F
Sbjct: 1189 KQKSISSDLERTASEVSKKLEDIRTRYAF 1217
Score = 42.4 bits (98), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|190345965|gb|EDK37945.2| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
6260]
Length = 1206
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 123/611 (20%), Positives = 236/611 (38%), Gaps = 102/611 (16%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
+ ++ +CH ++ G ++++ + +GG++ D +++ + ++S+LEFD
Sbjct: 55 SGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD------- 103
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
+ F + H K G G + VDPQ R + ++ KA + L
Sbjct: 104 ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL---- 159
Query: 218 DTFGSGGGFSARIES----SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
S+ +ES + I + LD GY P++ + E +A
Sbjct: 160 -------ELSSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHY 199
Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANT 328
+ + S L++ + L + A + +P + L+ +P+PIGGV+V G++
Sbjct: 200 DASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSF 259
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
I YH+ + L L S S +P ++V H N LL + GD
Sbjct: 260 IFYHNPTIDQQLYLP--IPSRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFR 312
Query: 389 LTVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSM 445
+ + Y D V L + + ++ I F D +L Q G S
Sbjct: 313 VLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSY 372
Query: 446 LSSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
+SS D+ + + + R + AL +++ G+ +ES + +
Sbjct: 373 ISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMI 432
Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
S LK ++G+ I+ LV P +V+
Sbjct: 433 TGHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF---------- 463
Query: 565 SSRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
++R++A + + Y++IS A +T+VL +++ EV +S FV + G +
Sbjct: 464 TTRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKS 520
Query: 623 VIQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
+IQ++ G R + +G + + P T++S S VL+G+S+
Sbjct: 521 LIQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSN 572
Query: 678 GSIRLLVGDPS 688
+ DP+
Sbjct: 573 RELCYFEIDPT 583
>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
Length = 1205
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 89/389 (22%), Positives = 167/389 (42%), Gaps = 61/389 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
+ I +P A +T AT+P++++E A ++ VV +T E L +GTA ++
Sbjct: 852 IHIFDPMEA----KTVATLPLKANEAAFSIAVVPFASTG---GEYHLVVGTAMHHLVTPP 904
Query: 1123 VAARGRVLLFSTGRNADNPQNL-VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1181
A+ + ++ + L T + EL AL + QG LL G + ++
Sbjct: 905 QASASYLKVYKIVNEGTGLELLHETPIQDSELP---RALLAFQGRLLAGVGKALRIYDLG 961
Query: 1182 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK------- 1234
+L A +P +VSL + I++GD+ +S F +KE +L +
Sbjct: 962 KKKLLRKAETKSP-TAIVSLATQGSRIVIGDMQESTLFAVYKEAENRLLIFGDDTQPRWV 1020
Query: 1235 ------DFGSL---DCFATEFL--IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
D+ ++ D F F+ +D ST+S V ++ I + ++ + K+L
Sbjct: 1021 SAMTMVDYNTVAVGDKFGNIFVNRLD-STISDQVDEDPTGAGILHEKATLNGAPHKTKML 1079
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---D 1340
A FHVG +T ++ ++ R LL+ L G+IG + PL +
Sbjct: 1080 --AHFHVGDIITSIHKVSLVV--------------GGREVLLYTGLQGTIGILVPLTSKE 1123
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
++ F L L++ + + + G + S+R ++ KA ++D +L Y L
Sbjct: 1124 DIEF--LTMLEQHIRNEQGSLVGRDHLSWRGYYVPVKA-------VIDGDLCETYGGLSS 1174
Query: 1401 EEQLEIAHQTGTTRSQILSNLNDLALGTS 1429
+Q IA + T +L L+ + + +S
Sbjct: 1175 SKQSAIASELDRTVGDVLKKLDQMRVASS 1203
>gi|302916981|ref|XP_003052301.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256733240|gb|EEU46588.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1212
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 105/488 (21%), Positives = 206/488 (42%), Gaps = 57/488 (11%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQK-IPLKATPHQITYFAEKNLYPLIV 1020
C G + + Q I I +L G T +QK IPL TP ++ E+ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFNIDRL--GDTL-----IQKSIPLTYTPKKLVKHPEQPLFYTIE 813
Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1078
+ P LL D V + D+ L D + + +++P G Q
Sbjct: 814 ADNNTLPPELRAQLLADPNVVNG-DSQVLPPEDFGYPRANRRWASCINVVDPLSEEG--Q 870
Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1135
T+ +++E A++ VV+ +++NE L +GT G+D+ + +S G
Sbjct: 871 VLQTVHFENNEAAVSATVVSF---ASQDNENFLVVGT----GKDMIVNPQS--YSEGYLY 921
Query: 1136 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
R ++ + L ++ +++ AL QG + +A G ++ ++ ++ A +
Sbjct: 922 IYRFVEDGREL-EFIHKTKIEEPPLALLPFQGKVAVAVGTQLRIYDLGMRQMLRKAQAEV 980
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
+VSLN + I++GD+ + + + +K +L A D + T ++D
Sbjct: 981 SAQRIVSLNTQGSRIVVGDVQQGVTLVVYKSATNKLIPFADDTVARWTTCTT-MVDYE-- 1037
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1305
S+ D+ N+ I K SE ++ + +R H H + Q + T
Sbjct: 1038 SIAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRLGLMCHFYTQDVPT 1097
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
S +T G + LL+ + G+IG P ++ F Q+L++ L P +A
Sbjct: 1098 SITKTSLVVGGQEI----LLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
G + +R +++ K ++D +L Y +LP +++ IA + + +I ++
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREIERKIS 1204
Query: 1423 DLALGTSF 1430
D+ ++F
Sbjct: 1205 DIRTRSAF 1212
>gi|19114492|ref|NP_593580.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe 972h-]
gi|46395602|sp|O13807.1|DDB1_SCHPO RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|2330717|emb|CAB11219.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe]
Length = 1072
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 95/492 (19%), Positives = 190/492 (38%), Gaps = 97/492 (19%)
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RES GPL+ VDP R + VY + I+ + + + FS RI+
Sbjct: 111 RES-QSGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V+ D ++ P + +L++ + ++K ++
Sbjct: 170 NVV-----------DIAMLYNSSRPSLAVLYKDSKSIVHLSTYK------------INVR 206
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
+ + + + HD + +PS GGV V G ++Y S+ + L Y
Sbjct: 207 EQEIDEDDV-VCHDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY-------- 257
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P ++FS + T L + + +++ ++G L ++ V ++L K S + S
Sbjct: 258 --PITAFSPSISNDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIAS 314
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+ + ++ F+GS +S+L+Q PS + +
Sbjct: 315 CLIALPDNHLFVGSHFNNSVLLQL------------------------PSITK-NNHKLE 349
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
LQ+ VN +S + + T S+ T S A +D G L+ + I
Sbjct: 350 ILQNFVNIAPISDFIIDDDQTGSSIITCSGAYKD-----GTLRIIRNSINI--------- 395
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
N L+E+ G K ++V S A YD+ + +L + E R +++
Sbjct: 396 ---ENVALIEMEGIKDFFSV------------SFRANYDN--YIFLSLICETRAIIVSPE 438
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
+ + + D + TI ++G +++Q+ + R+ DG + +S P + G
Sbjct: 439 GVF---SANHDLSCEESTIFVSTIYGNSQILQITTKEIRLFDGKKLHSWIS--PMSITCG 493
Query: 654 SGSENSTVLSVS 665
S ++ ++V+
Sbjct: 494 SSFADNVCVAVA 505
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 64/290 (22%), Positives = 124/290 (42%), Gaps = 38/290 (13%)
Query: 1111 LAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+ +GT + +D GR+++F +DN + E +++G+++ L L HL++
Sbjct: 770 VVVGTGFNFPDQDAPDSGRLMVFEM--TSDNNIEMQAE---HKVQGSVNTLV-LYKHLIV 823
Query: 1170 A--SGPKIILHKWTGTE--LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
A + I GT N I P Y + +++ ++ I+ D+ KSI L + +
Sbjct: 824 AGINASVCIFEYEHGTMHVRNSIR----TPTYTIDISVNQDEIIAADLMKSITVLQFIDD 879
Query: 1226 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY---YAPKMSESWKGQKL 1282
QL +A+D+ L + E L S V++ N I +P++S+ +KL
Sbjct: 880 --QLIEVARDYHPLWATSVEIL---SERKYFVTEADGNAVILLRDNVSPQLSDR---KKL 931
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
+F++G + K R D++ P LL T+DGS+ +
Sbjct: 932 RWYKKFYLGELINK-TRHCTFIEPQDKSLVTP--------QLLCATVDGSLMIVGDAGMS 982
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
L LQ + +P GL+ + ++++ + P ++D L+
Sbjct: 983 NTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET---SPSDLIDGSLI 1029
>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1256
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/289 (18%), Positives = 127/289 (43%), Gaps = 29/289 (10%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
+Y E++ + A+A QG L+ G I ++ +L P +V+++ + +
Sbjct: 979 LYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDR 1038
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
+++GDI +SI+F+ +K L + A D + ++D T++ +D+ NI +
Sbjct: 1039 LVVGDIQESIHFIKYKRSENMLYVFADDLAP-RWMTSSVMLDYDTVA--GADKFGNIFVL 1095
Query: 1268 YYAPKMSESWKGQKLLSRAEFHVGA---------HVTKFLRLQMLATSSDRTGAAPGSDK 1318
+S+ + ++ +F G H+ F + T + + G +
Sbjct: 1096 RLPLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLVVGGPE- 1154
Query: 1319 TNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
+L+ T+ G+IG + P +++ F +L+ + + G + ++R ++
Sbjct: 1155 ----VILYTTISGAIGALIPFTSREDVDF--FSTLEMNMRSDCLPLCGRDHLAYRSYYFP 1208
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
K +I+D +L + L ++QL I+ + + S+++ L ++
Sbjct: 1209 VK-------NIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250
Score = 43.9 bits (102), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 75/338 (22%)
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQN 374
GGVLV + I Y +Q + + +PR S L +H++ Q
Sbjct: 256 GGVLVASEDYIVYRNQDHA------------EVRSRIPRRYGSDPNKGVLIISHSSHKQK 303
Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
+ L+ ++ GDL +T+ Y G V ++++ + VL + +T + N F S GD
Sbjct: 304 GMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDH 363
Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL----RRSSSDALQDMVNGEELSLYG 488
L F S G +EE G + L R S ++++ N E S
Sbjct: 364 TLYFFK--------SIGDEEEEGQAKRLEDKDGHLWFTPRNSCGTKMEELKNLEPTSHLS 415
Query: 489 SASNNTESAQKTFSFAVRDSLVNIGP-------------LKDFSYGLRINADASATGISK 535
S S F V D + P LK +GL + +A
Sbjct: 416 SLS-------PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTAN---- 464
Query: 536 QSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
LPG GIWTV +S NA D+ Y+++S T VL D
Sbjct: 465 --------LPGVPSGIWTVPKSTS--PNA--------IDQTDKYIVVSFVGTTSVLSVGD 506
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ E ES ++ T G +IQVF G R
Sbjct: 507 TIQENHES--GILETTTTLLVKSMGDDAIIQVFPTGFR 542
>gi|331238007|ref|XP_003331659.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|309310649|gb|EFP87240.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1213
Score = 54.3 bits (129), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 91/398 (22%), Positives = 169/398 (42%), Gaps = 68/398 (17%)
Query: 1070 PDRAGGPWQT--RATIPMQSS----------ENALTVRVVTL-FNTTTKENETLLAIGTA 1116
P G W + R + P++ + E A ++ +V+ F E LL +G+A
Sbjct: 847 PRAEAGKWASCIRISDPIEKTTLVREDLGDNEAATSLAIVSFAFAAHHPATEPLLVVGSA 906
Query: 1117 ---YVQGEDVAARG--RVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+VQ G RV F N ++ V+ E+ SAL QG L
Sbjct: 907 KDAFVQPR-TCKNGFLRVYRFV------NDGKVIELVHKTEVDEMPSALVGFQGRLAAGV 959
Query: 1172 GPKIILHKWTGTEL----NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
G + ++ +L +F A ++SL++ + IL+GD S+ + +K
Sbjct: 960 GKALRIYDLGKKKLLRKVENKSFGSA----IISLSVQGSRILVGDSQDSVSYAVYKPAEN 1015
Query: 1228 QLNLLAKDF-GSLDCFATEFLIDGSTLS-------LVVSDEQKNIQIFYYAPKMSESWKG 1279
+L + A D AT ++D T++ L VS KN+ + ++ E G
Sbjct: 1016 RLIVFADDVVPRWTTCAT--MVDYDTVAGGDRFGNLWVSRLPKNV-----SDEVDEDPTG 1068
Query: 1280 QKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGC 1335
++ + +GA H K L L + TS +T PG R LL+ + GSIG
Sbjct: 1069 AGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTSLVPG----GREVLLYTGVQGSIGI 1124
Query: 1336 IAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+ P +++ F Q+L+ + + +P + G + ++R ++ K + VD +L
Sbjct: 1125 LVPFISKEDVDF--FQTLEMHMRNEMPSLVGRDHLAYRGYYFPVK-------NCVDGDLC 1175
Query: 1393 SHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+ +LP +QL++A + + S +L + + + + +
Sbjct: 1176 ESFALLPSAKQLQVASELDRSVSDVLKKIEAVRVSSGY 1213
>gi|440473070|gb|ELQ41892.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae Y34]
Length = 1229
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 82/356 (23%), Positives = 158/356 (44%), Gaps = 52/356 (14%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
I + ++E AL++ VV+ +++ E+ L +GT G+D+ R F+ G R
Sbjct: 879 IDLDNNEAALSMAVVSF---ASQDGESFLVVGT----GKDMVVNPR--RFTEGYIHVYRF 929
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
+++ + L ++ +++ +AL QG L+ G + ++ +L A + P
Sbjct: 930 SEDGREL-EFIHKTKVEEPPTALLPFQGRLVAGIGRMLRIYDLGLRQLLRKAQAEVAPQL 988
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1257
+VSLN + I++GD+ + ++++K + +L A D + T + ST
Sbjct: 989 IVSLNTQGSRIIVGDVQHGLIYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAG--- 1045
Query: 1258 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL-----RLQMLA-------- 1304
+D+ N+ I K S+ + +E H+ H +L RL ++A
Sbjct: 1046 ADKFGNLWILRCPEKASQ----ESDEPGSEVHL-VHSRDYLHGTSNRLALMAHVYTQDIP 1100
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
TS +T G + LL+G G+IG + P ++ F QSL++ L P +
Sbjct: 1101 TSICKTNLVVGGQE----VLLWGGFQGTIGVLIPFVSREDADF--FQSLEQHLRSEDPPL 1154
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
AG + +R + K ++D +L Y MLP +++ IA + + +I
Sbjct: 1155 AGRDHLMYRGCYVPVKG-------VIDGDLCERYTMLPNDKKQMIAGELDRSVREI 1203
>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
3D7]
gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium falciparum
3D7]
Length = 1329
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 67/286 (23%), Positives = 113/286 (39%), Gaps = 45/286 (15%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
S G L+ + G K+ ++ +L Y P +VS+ I N I DI +S+
Sbjct: 1064 CFCSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVL 1123
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
+ L L++ D +E L + ++ L V +E K +
Sbjct: 1124 IFFYDPNQNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFILRVPEEAKQDEYGIT 1183
Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+Y M+ S K +KL FH+G VT ++++ TSS+
Sbjct: 1184 NKCWYGGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTSSE--------------C 1229
Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+++ T+ G+IG P D ELT Q L+ L P + G FR ++ +
Sbjct: 1230 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKPPLCGREHIFFRSYYHPVQ- 1284
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++VD +L + L + Q +IA+ T IL L D+
Sbjct: 1285 ------NVVDGDLCEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324
Score = 48.9 bits (115), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 80/379 (21%), Positives = 147/379 (38%), Gaps = 64/379 (16%)
Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQEL 355
LP D L +P P G GVL+ N + Y + CA Y L+ Q+
Sbjct: 261 LPIDITAHLLIPLPGGQQGPSGVLICCENFLVYKKVDHEDIYCA-----YPRRLEIGQDK 315
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
S + + L+ ++ GDL + V ++ +V+ + + + + I
Sbjct: 316 NISIICWTMHRIKKFFF----ILIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSI 371
Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+ + + F+ + G+ QF+ G G A T +L+
Sbjct: 372 SVLKSGSLFVAAEFGNHYFYQFSGIGDDNKQFMCTSNHPLGKNAIIAFKTNKLKNL---Y 428
Query: 475 LQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADAS 529
L D + L + + NT + Q ++ R GP L+ +GL I A
Sbjct: 429 LVDQIYSLSPILDMKIIDAKNTHTPQ-IYTLCGR------GPRSSLRILQHGLSIEELAD 481
Query: 530 ATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
ELPG K IWT+ + EY Y+++S E T+
Sbjct: 482 N------------ELPGKPKYIWTIKKDNL--------------SEYDGYIVVSFEGNTL 515
Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
+LE + + EV++++ + T N+ IQV++ G R ++G + + ++ P
Sbjct: 516 ILEIGESVEEVSDTL--LLNNVTTLHINILYDNSFIQVYDTGIRHINGKVVQEWVA--PK 571
Query: 649 NSESGSGSENSTVLSVSIA 667
N + + S NS+ + +S++
Sbjct: 572 NKQIKAASSNSSQIVISLS 590
>gi|402077250|gb|EJT72599.1| pre-mRNA-splicing factor RSE1 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1216
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 111/505 (21%), Positives = 214/505 (42%), Gaps = 87/505 (17%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK-IPLKATPHQI----------TYFAE 1012
C G + V Q L+IC + G D+ +QK I L TP ++ T AE
Sbjct: 761 QCEEGMVGVQGQ-FLRICAI--GKLGDSM--IQKSISLAYTPKKLIKNPTHPIFYTIEAE 815
Query: 1013 KNLYPLIVSVPVLKP---LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1069
N P + +L +N +L +E G+ N +S + +++
Sbjct: 816 NNTLPPELREQLLAAPTAVNGDTKVLPPEEFGYPRGNGRWASC------------ISVVD 863
Query: 1070 PDRAGGPWQTRA--TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG 1127
P G + I ++++E ++V VV ++++E+ L +GT G+D+
Sbjct: 864 PLGDGEEREPSVLQQIHLENNEATVSVAVVPF---ASQDSESFLVVGT----GKDMVLNP 916
Query: 1128 RVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
R F+ G R ++ + L ++ +++ AL + QG L+ G + ++
Sbjct: 917 RC--FTEGYIHVYRFLEDGREL-EFIHKTKVEEPPMALLAFQGKLVAGVGRSLRIYDLGL 973
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
+L A + P +VSL + I++GD + ++++K++ +L A D S+ +
Sbjct: 974 RQLLRKAQSEVAPRVIVSLQTQGSRIVVGDSQHGLIYVAYKQEANKLIAFADD--SIQRW 1031
Query: 1243 AT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL--- 1298
T ++D S D+ NI I K S+ +E H+ H +L
Sbjct: 1032 TTCSTMVDYE--STAGGDKFGNIWILRCPEKASQEADQPG----SEVHL-MHARDYLHGT 1084
Query: 1299 --RL--------QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFR 1345
RL Q +ATS +T G + LL+G + G+IG + P ++ F
Sbjct: 1085 SNRLALMAHVYTQDIATSICKTNLVVGGQE----VLLWGGIQGTIGVLIPFVSREDADF- 1139
Query: 1346 RLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
Q+L++ + P +AG + +R ++ K ++D +L + MLP +++
Sbjct: 1140 -FQTLEQHMRSEDPPLAGRDHLMYRSYYVPVKG-------VIDGDLCERFTMLPNDKKQM 1191
Query: 1406 IAHQTGTTRSQILSNLNDLALGTSF 1430
IA + + +I ++D+ ++F
Sbjct: 1192 IAGELDRSVREIERKISDIRTRSAF 1216
Score = 43.5 bits (101), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 60/279 (21%), Positives = 110/279 (39%), Gaps = 68/279 (24%)
Query: 378 LLSTKTGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
LL T+ GDL +T+ +G VQRL + + ++S++ + + F+ S G
Sbjct: 310 LLQTEDGDLFKVTIDMLEDAEGNTTGEVQRLKIKYFDTIPVSSNLCILKSGFLFVASEFG 369
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
+ QF E+ GD + L SS + D E + +
Sbjct: 370 NHHFYQF--------------EKLGD------DDEELEFSSENFPSDPAEPYEPAYF--- 406
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV---- 542
+ T + A+ +S+ ++ PL D + DA + +G +S + ++
Sbjct: 407 -----YPRPTENLALVESVESMNPLMDLKVANLTDEDAPQIYTVSGNGARSTFRMLKHGL 461
Query: 543 --------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
+LPG +WT A DD+Y +Y+++S T+VL
Sbjct: 462 EVNEIVASQLPGTPSAVWTT--------------KIARDDQYDSYIVLSFTNGTLVLSIG 507
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ + EV+++ F+ + A G ++QV RG R
Sbjct: 508 ETVEEVSDT--GFLSSVSTLAVQQLGEDGLVQVHPRGIR 544
>gi|336369683|gb|EGN98024.1| hypothetical protein SERLA73DRAFT_109335 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382464|gb|EGO23614.1| hypothetical protein SERLADRAFT_449959 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1257
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 121/269 (44%), Gaps = 26/269 (9%)
Query: 1109 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
T + GT E ++GR+L+F + + +T S E++G + A+ S+ G ++
Sbjct: 940 TYICAGTYKYVDEVEPSQGRLLVFD-AEDGSLLREKITMAVSLEVRGCVYAVGSVNGMII 998
Query: 1169 IASGPKIILHKW---TGTEL---NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
A +++++ T+L + I ++ L V +L + IL+GD SI FL
Sbjct: 999 AAINSSVVVYRPEIDASTQLLALHKITEWNHNYL-VTNLVCRGDKILVGDAINSISFLRM 1057
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
E +Q+ LA+D+GSL E L S + + +F +A + E+ + L
Sbjct: 1058 VE--SQIQCLARDYGSLWPVCVEMLDQSSIIG-----ANSDYNLFTFA--LQETELRKSL 1108
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DE 1341
+++G V KF+ + T+ D + P K LF T G IG I + DE
Sbjct: 1109 ERDGSYYIGDMVNKFIPGAL--TAHDVSVDMPLEPKQ-----LFFTSTGCIGVIVDMGDE 1161
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
L+ + +LQ+ + + G+ FR
Sbjct: 1162 LSL-HMTALQRNMSTYLSQTKGVTHTKFR 1189
>gi|310796681|gb|EFQ32142.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1163
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 73/322 (22%), Positives = 136/322 (42%), Gaps = 41/322 (12%)
Query: 1113 IGTAYVQGEDVA----ARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALASLQGHL 1167
+GT+Y+ D+ +GR+L+ G ++D NP +V S ELKGA +L + L
Sbjct: 847 VGTSYLADPDIDESGDTKGRILVL--GVDSDKNPYLIV----SHELKGACRSLGVMGEKL 900
Query: 1168 LIASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
+ ++++ + T L +A + P + V +++ N I + D+ +S+ + +
Sbjct: 901 VAGLSKTVVVYDYVEESTTSGALRKLATF-RPSTFPVDIDVHGNMIGIADLMQSLTLVEF 959
Query: 1223 --KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG 1279
+ G + L+ + +AT ++G S + +D Q N+ + P
Sbjct: 960 VPAQDGNKAKLVERARHFQYIWATSVCHLEGH--SWLEADAQGNLMVLRRNPNAPTEHDR 1017
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
+++ EFH+G V K L + +D P K T++GS+ A +
Sbjct: 1018 KQMEVTGEFHLGEQVNKIRSLDITPNEND-----PIIPKA-----FLATVEGSLYVFADI 1067
Query: 1340 DELTFRRLQSLQKKLVDSVPHV--------AGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
L Q++L + V + +GL+ ++R F N K GP VD EL
Sbjct: 1068 KSEYQSLLIQFQERLAEVVRALGQADGEPGSGLSFTTWRGFR-NAKRAAEGPFRFVDGEL 1126
Query: 1392 LSHYEMLPLEEQLEIAHQTGTT 1413
+ + L +Q + G T
Sbjct: 1127 IERFLDLDEAKQEAVVQGLGPT 1148
>gi|159489018|ref|XP_001702494.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
reinhardtii]
gi|158280516|gb|EDP06273.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
reinhardtii]
Length = 1147
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 105/248 (42%), Gaps = 28/248 (11%)
Query: 1144 LVTEVYSKELKGAISALASLQGHLLIAS-GPKIILHKWTGTELNGI-----AFYDAPPLY 1197
LVTE KE+KGA + G ++AS K+ +++W E +G A+ A +
Sbjct: 895 LVTE---KEVKGAAYNVRPFAGDKILASVNNKVTVYRWVVREGSGGPGGCGAYELASECH 951
Query: 1198 ----VVSLNIVK--NFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
V++L + +++GD+ +S+ LS+ + L A D+ S A E L D +
Sbjct: 952 HLGNVLALYLAARGGLVVVGDLMRSVSLLSYNAEQGVLEHRAADYNSGWTTAVEILDDDN 1011
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR--LQMLATSSDR 1309
++ +D N+ + + + +L EFH G + + L M S+
Sbjct: 1012 YIA---ADNHCNLYVVRRNADSATDEERARLQVVGEFHTGTFINQMRNGSLVMRLPDSEH 1068
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
G P LLF DG +G +A L + LQ + V V GL+ +
Sbjct: 1069 AGLPP--------PLLFAGTDGRLGVVARLPPALYEWATKLQTAMRSVVRGVGGLDHEQW 1120
Query: 1370 RQFHSNGK 1377
R F ++ +
Sbjct: 1121 RAFANDRR 1128
>gi|47230701|emb|CAF99894.1| unnamed protein product [Tetraodon nigroviridis]
Length = 953
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 92/216 (42%), Gaps = 34/216 (15%)
Query: 1174 KIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
++ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 707 QVRLYEWTAEKELRTECN--HYNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPMEGN 761
Query: 1229 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1288
+A+DF A E L D + L ++ N+ + + + Q L F
Sbjct: 762 FEEIARDFNPNWMSAIEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVGVF 818
Query: 1289 HVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
H+G V F L LQ L +S T ++LFGT++G IG + L E +
Sbjct: 819 HLGEFVNVFCHGSLVLQNLGETSTPTQG----------SVLFGTVNGMIGLVTSLSEGWY 868
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
L LQ +L + ++R FH+ K +
Sbjct: 869 SLLLDLQNRLNKVI-------KTTWRSFHTERKTEQ 897
Score = 48.1 bits (113), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 71/341 (20%), Positives = 130/341 (38%), Gaps = 64/341 (18%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A S
Sbjct: 68 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 123
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG V ++ L + + +
Sbjct: 124 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 173
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
+T + N + F+GSRLGDS LV+ S L +++++ + +
Sbjct: 174 CLTYLDNGVVFVGSRLGDSQLVKVRVTHSLSEL---------NVDSNDQGSFVTVMETFT 224
Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
L +V+ + L + F G L+ G+ I+ AS
Sbjct: 225 NLGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS---- 271
Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
++LPG KG+W + ++ R E L++S +T VL +
Sbjct: 272 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 309
Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
E TE + +T GN+ ++IQ+ R++
Sbjct: 310 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 349
>gi|383847297|ref|XP_003699291.1| PREDICTED: splicing factor 3B subunit 3-like [Megachile rotundata]
Length = 1217
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
A+ QG +L+ G + L+ +L P +VS+N + I + D+ +S+Y
Sbjct: 951 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAIVSINAIGQRIYVSDVQESVY 1010
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1276
+ +K Q QL + A D T ++D T++ +D+ NI + A +++
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLASGINDDVD 1067
Query: 1277 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
W QK + A FHVG V + ++ PG ++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116
Query: 1321 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
L++ TL G++G + P ++ F Q L+ + P + G + SFR ++ K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+++D +L + + +Q IA T S++ L D+ +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPAKQKSIAGDLERTPSEVSKKLEDIRTRYAF 1217
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
SS + E GD AP R + D+L ++ + L A+
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
+T T R +L + +GL + S + ELPG +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460
Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
TV + D+EY AY+I+S T+VL + + EVT+S F+
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504
Query: 612 IAAGNLFGRRRVIQVFERGAR 632
+ + G ++QV+ G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525
>gi|307166104|gb|EFN60356.1| Splicing factor 3B subunit 3 [Camponotus floridanus]
Length = 1201
Score = 53.5 bits (127), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 65/297 (21%), Positives = 122/297 (41%), Gaps = 55/297 (18%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
A+ QG +L+ G + L+ +L P VVS+N + I + D+ +S+Y
Sbjct: 935 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 994
Query: 1219 FLSWKEQGAQLNLLAKD----FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ +K Q QL + A D F + C ++D T++ +D+ NI + A ++
Sbjct: 995 AVRYKRQENQLIVFADDTHPRFITTTC-----VLDYDTVA--TADKYGNIAVIRLATGIN 1047
Query: 1275 ES-----------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
+ W QK + A FHVG V + ++ PG
Sbjct: 1048 DDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGG 1097
Query: 1317 DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFH 1373
++ L++ TL G++G + P ++ F Q L+ + P + G + SFR ++
Sbjct: 1098 SES----LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYY 1151
Query: 1374 SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
K +++D +L + + +Q I+ T S++ L D+ +F
Sbjct: 1152 YPVK-------NVIDGDLCEQFNSIEPVKQKSISGDLERTPSEVSKKLEDIRTRYAF 1201
>gi|268536658|ref|XP_002633464.1| C. briggsae CBR-DDB-1 protein [Caenorhabditis briggsae]
Length = 1134
Score = 53.5 bits (127), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 53/271 (19%), Positives = 115/271 (42%), Gaps = 15/271 (5%)
Query: 1113 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GT + ++ + GR+++F D + + V+ ++G+ AL L G L+ A
Sbjct: 823 VGTGLIYPDESDTKLGRIIVFEVD---DVERTKLRRVHELVVRGSPLALRILNGKLVAAI 879
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
+ L +WT ++ + + + + L ++ + + D+ +S+ LS++
Sbjct: 880 NSSVRLFEWTADKVLRLECSNFNHIVALDLKVMNEEVAVADLMRSVSLLSYRMMEGNFEE 939
Query: 1232 LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF-HV 1290
+AKD+ S EF+ S L +++ P + G+ +L + ++
Sbjct: 940 VAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPITDD---GRYVLEPTGYWYL 996
Query: 1291 GAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSL 1350
G +R ++ D T ++FGT G+IG + +D+ + L S+
Sbjct: 997 GELPKVMVRASLVVQPEDST-------IEYSHPIMFGTNQGTIGMLVQIDDKWKKFLVSI 1049
Query: 1351 QKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
+K + DSV + + ++R F + P
Sbjct: 1050 EKAISDSVKNCMQIEHSTYRSFIFQKRIEPP 1080
Score = 48.9 bits (115), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 71/137 (51%), Gaps = 19/137 (13%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP+P+GGV+V+GAN+ Y + + + Y+ SL L + F+ +
Sbjct: 210 DSQVLIPVPAPVGGVIVLGANSALYKASDVNGDVV--PYSCSL-----LKNTIFTCHGIV 262
Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV-VYDGR---VVQRLDLSKTNPSVLTSDITTIGN 420
DA+ D LL+ G L++L + + +GR V+ + + + + + + N
Sbjct: 263 DAS------GDRFLLADTDGRLLMLLLNIGEGRSGTTVKEMRIEYLGETSVADSVNYVDN 316
Query: 421 SLFFLGSRLGDSLLVQF 437
+ F+GSRLGDS L++
Sbjct: 317 GVVFVGSRLGDSQLIRL 333
>gi|325189950|emb|CCA24429.1| splicing factor putative [Albugo laibachii Nc14]
Length = 1644
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 72/315 (22%), Positives = 129/315 (40%), Gaps = 45/315 (14%)
Query: 299 WSAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQS---ASCALALNNYAVSLDSS 352
WS + +P A KL+AVP GGVLV+ I Y +++ SC+ L + +
Sbjct: 660 WSQV-VPRSANKLVAVPGGNDGPGGVLVIAQGLIQYQNENHPPLSCSFPLRSTG-GPNPV 717
Query: 353 QELPRSSFSVELDAAHATWLQNDV--ALLSTKTGDLVLLTVVYDGRVVQRLDLS--KTNP 408
Q+ + + + + + AT Q D+ L+ ++ GDL +++ Y G VQ+L + T P
Sbjct: 718 QDERKQGYPMMI-VSTATHKQRDLFFVLMQSEWGDLFKISLEYAGSSVQKLRIQYFDTIP 776
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
L IT G L F S + L QF + + D E + PS +
Sbjct: 777 VALALCITKTG--LLFAASEFSNHYLFQFLSIGEDDDAAQCVSAAENDQEPEIPSFSVRK 834
Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA 528
+ + ++ + ++ E + ++ + N L+ +GL + A
Sbjct: 835 LKNLAMISNIPSISPITQLLVDDFANEQTPQLYALCGQG---NRSSLRILRHGLPVMEMA 891
Query: 529 SATGISKQSNYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
++ LPG K +W + ++ D Y+++S E T
Sbjct: 892 ASA------------LPGVAKAVWCLKE--------------SFTDTCDKYIVVSFEDAT 925
Query: 588 MVLETADLLTEVTES 602
+VLE D + E+T+S
Sbjct: 926 LVLEIGDTVEEITDS 940
Score = 47.0 bits (110), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 66/300 (22%), Positives = 123/300 (41%), Gaps = 54/300 (18%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIA---FYDAPPLYVVSLNIV 1204
V++ + G A+ QG LL++ G + ++ +L ++ +P ++ L
Sbjct: 1369 VHTTPVDGIPYAMIEFQGRLLVSVGKVLRIYDLGKRKLLRKCENRYFTSP---MIDLKSA 1425
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKN 1263
+ I D+H+SI+F+ +K + QL A D + F T L+D T++ D+ N
Sbjct: 1426 GDRIYASDVHESIHFVKYKAEDNQLITFADD--CVPHFMTSSTLLDYDTIA--GGDKFGN 1481
Query: 1264 IQIFYYAPKMSES----------WKG-------QKLLSRAEFHVGAHVTKFLRLQMLATS 1306
+ + ++S+ W KL A+FHVG +T L
Sbjct: 1482 VFVTRLPAEVSDEIDNPTGNRMLWDTGLLNGAPHKLEQIAQFHVGEVITSVL-------- 1533
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
RT PG + +L+ T+ G IG + P D++ F L+ + + G
Sbjct: 1534 --RTSLVPGGMEV----ILYTTILGRIGALVPFTSRDDVDF--YTHLEMYMRQEKAPLCG 1585
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
+ S+R + K ++ D +L + L ++Q IA T ++++ L D
Sbjct: 1586 RDHLSYRSYFIPVK-------NVTDGDLCEQFSSLGPDKQKNIAEDLDRTPTEVVKKLQD 1638
>gi|405117821|gb|AFR92596.1| pre-mRNA-splicing factor RSE1 [Cryptococcus neoformans var. grubii
H99]
Length = 1217
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/285 (22%), Positives = 120/285 (42%), Gaps = 32/285 (11%)
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
LA QG LL G + L++ L + P VV++N+ I++GD+ +S ++
Sbjct: 951 LAGFQGFLLAGVGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010
Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
++ QL + A D + C + +D T++ D+ NI I P +SE
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065
Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
G +L F +GA H T+ + + + TS + G R L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121
Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
+ G++G + P D++ F + + D P G + ++R ++ K
Sbjct: 1122 ISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISP--VGRDHIAYRGYYVPIKG------- 1172
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+VD +L + +LP +Q IA + +L L + ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIATDLDRSVGDVLKKLEQMRTSSAF 1217
>gi|332026090|gb|EGI66238.1| Splicing factor 3B subunit 3 [Acromyrmex echinatior]
Length = 1217
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
A+ QG +L+ G + L+ +L P VVS+N + I + D+ +S+Y
Sbjct: 951 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1276
+ +K Q QL + A D T ++D T++ +D+ NI + A +++
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067
Query: 1277 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
W QK + A FHVG V + ++ PG ++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116
Query: 1321 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
L++ TL G++G + P ++ F Q L+ + P + G + SFR ++ K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+++D +L + + +Q I+ T S++ L D+ +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217
>gi|307205956|gb|EFN84082.1| Splicing factor 3B subunit 3 [Harpegnathos saltator]
Length = 1217
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
A+ QG +L+ G + L+ +L P VVS+N + I + D+ +S+Y
Sbjct: 951 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1276
+ +K Q QL + A D T ++D T++ +D+ NI + A +++
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067
Query: 1277 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
W QK + A FHVG V + ++ PG ++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116
Query: 1321 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
L++ TL G++G + P ++ F Q L+ + P + G + SFR ++ K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+++D +L + + +Q I+ T S++ L D+ +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPAKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217
>gi|340960602|gb|EGS21783.1| hypothetical protein CTHT_0036510 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1100
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 133/308 (43%), Gaps = 48/308 (15%)
Query: 1113 IGTAYVQGEDVA---------ARGRVLLFSTGRNAD-NPQNLVTEVYSKELKGAISALAS 1162
+GT+Y+ D + ARGR+L+ G ++D NP ++ S +LKGA LA
Sbjct: 783 VGTSYLPDPDYSPAPSHGNPEARGRILVL--GIDSDRNPY----QILSYQLKGACRCLAV 836
Query: 1163 LQ-GHLLIASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1216
+ G +++ + + ++ T +L +A Y P Y V + I I + D+ KS
Sbjct: 837 MDDGKVVVGLTKAVTVCEYKETSSTTAQLTKLASY-RPSTYPVEIAIHGRTIAVADLMKS 895
Query: 1217 IYFLSW--KEQGAQLNLL--AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
I + + E+G Q L+ A+ + S A ++ +G L +D Q N+Q+
Sbjct: 896 ISLVDYIPAEEGGQAKLVERARHYQSAWSTAVGYVQNGLWLE---ADAQGNLQVLRQNVD 952
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
+++ AE ++G V + + + +S P + GT++G
Sbjct: 953 GITEDDRKRMELTAEINLGEMVNRIRSITV--ETSPEALIIPRA--------FLGTVEGG 1002
Query: 1333 I---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP--GPDSIV 1387
I G IAP L Q+K+ D + V + +FR + + A R GP +
Sbjct: 1003 IYMFGTIAP---HALDLLLRFQEKVADVIKAVGDSDNANFRSYRAFKNAERVGHGPFRFL 1059
Query: 1388 DCELLSHY 1395
D ELL +
Sbjct: 1060 DGELLERF 1067
>gi|331221690|ref|XP_003323519.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|309302509|gb|EFP79100.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1213
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 134/302 (44%), Gaps = 43/302 (14%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL----NGIAFYDAPPLYVVSLNI 1203
V+ E+ SAL QG L G + ++ +L +F A ++SL++
Sbjct: 936 VHKTEVDEMPSALVGFQGRLAAGVGKALRIYDLGKKKLLRKVENKSFGSA----IISLSV 991
Query: 1204 VKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDF-GSLDCFATEFLIDGSTLS-------L 1255
+ IL+GD S+ + +K +L + A D AT ++D T++ L
Sbjct: 992 QGSRILVGDSQDSVSYAVYKPAENRLIVFADDVVPRWTTCAT--MVDYDTVAGGDRFGNL 1049
Query: 1256 VVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTG 1311
VS KN+ + ++ E G ++ + +GA H K L L + TS +T
Sbjct: 1050 WVSRLPKNV-----SDEVDEDPTGAGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTS 1104
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
PG R LL+ + GSIG + P +++ F Q+L+ + + +P + G + +
Sbjct: 1105 LVPG----GREVLLYTGVQGSIGILVPFISKEDVDF--FQTLEMHMRNEMPSLVGRDHLA 1158
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
+R ++ K + VD +L + +LP +QL++A + + S +L + + + +
Sbjct: 1159 YRGYYFPVK-------NCVDGDLCESFALLPSAKQLQVASELDRSVSDVLKKIEAVRVSS 1211
Query: 1429 SF 1430
+
Sbjct: 1212 GY 1213
>gi|221508103|gb|EEE33690.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 1878
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 67/292 (22%), Positives = 119/292 (40%), Gaps = 59/292 (20%)
Query: 1111 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1152
LA G E+V GRV LF G D P E+++
Sbjct: 1472 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 1531
Query: 1153 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
L G ++ + S + +++ + GP++ +H+ G++ AF DA + V S+ ++N
Sbjct: 1532 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 1590
Query: 1207 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1260
F LLGD K + +SW+ ++ +++ F +L A FL + L ++ +D
Sbjct: 1591 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 1650
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
N+++F Y + + F +L +L ++ AA K
Sbjct: 1651 DGNVRLFCY-------------------NADKNSPGFEKLDILQCDAEDRCAAGCVVKLQ 1691
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRR--------LQSLQKKLVDSVPHVAGL 1364
+F + T+ S+G A L FR LQ+LQ ++ +P GL
Sbjct: 1692 QFVVDSETV-ASLGEAADSSTLVFRLLASDSYSFLQTLQDRMAQYLPEPLGL 1742
>gi|401426063|ref|XP_003877516.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
gi|322493761|emb|CBZ29051.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 1347
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/308 (22%), Positives = 120/308 (38%), Gaps = 37/308 (12%)
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
+ +E + LL IG+++ ++ AR + + R Q L + SK++ GA+
Sbjct: 913 GVSKEEWQQLLLIGSSFTFPDEQRARSGRITWCALREEHQRQRL-HLIASKDIGGALQCC 971
Query: 1161 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1208
A++ +G + + + L+ W + G+ PLY SL + +
Sbjct: 972 AAVPHYKGRIALGVNGCVCLYNWNTEDQTFVAEERCRIGLTVTKLIPLYDTSL--AASVL 1029
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+ D+ S +F+ L +L +D D L L D+ N
Sbjct: 1030 VALDVRHSAFFIEVDTLQGSLKVLCRDADLRGVMDGHIGSDAENLCLF--DDSLNFTAMK 1087
Query: 1269 YAPKMSE----------SWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1317
P E S Q + RA+ H+G VT +R A +S A S
Sbjct: 1088 VVPLPVEAGDGDAAAAGSVTAQYRFEVRAQCHLGDLVT-CVRPGCFAATSLMEAPAACSL 1146
Query: 1318 KTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
NR L+F T G G + P+ T+ L++L+ LV ++ V GL+ ++F
Sbjct: 1147 SRNRLLLPGIAGPQLVFATAHGGFGVVTPVQAATYLVLRALEASLVRTLQPVGGLSHQAF 1206
Query: 1370 RQFHSNGK 1377
R+ G+
Sbjct: 1207 REVLRAGQ 1214
>gi|221486318|gb|EEE24579.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 2804
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/286 (23%), Positives = 121/286 (42%), Gaps = 47/286 (16%)
Query: 1111 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1152
LA G E+V GRV LF G D P E+++
Sbjct: 2398 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 2457
Query: 1153 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
L G ++ + S + +++ + GP++ +H+ G++ AF DA + V S+ ++N
Sbjct: 2458 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 2516
Query: 1207 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1260
F LLGD K + +SW+ ++ +++ F +L A FL + L ++ +D
Sbjct: 2517 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 2576
Query: 1261 QKNIQIFYY-APKMSESWKGQKLLS-RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDK 1318
N+++F Y A K S ++ +L AE A ++LQ S+ + +
Sbjct: 2577 DGNVRLFCYNADKNSPGFEKLDILQCDAEDRCAAGCV--VKLQQFVVDSETVASL--GEA 2632
Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGL 1364
+ L+F L D +F LQ+LQ ++ +P GL
Sbjct: 2633 ADSSTLVFRLLAS--------DSYSF--LQTLQDRMAQYLPEPLGL 2668
>gi|322797581|gb|EFZ19622.1| hypothetical protein SINV_00421 [Solenopsis invicta]
Length = 1217
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 47/293 (16%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
A+ QG +L+ G + L+ +L P VVS+N + I + D+ +S+Y
Sbjct: 951 AICPYQGRVLVGVGRMLRLYDMGKKKLLRKCENKHIPNAVVSINAIGQRIYVSDVQESVY 1010
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-- 1276
+ +K Q QL + A D T ++D T++ +D+ NI + A +++
Sbjct: 1011 AVRYKRQENQLIVFADDTHP-RWITTTCVLDYDTVA--TADKFGNIAVIRLATGINDDVD 1067
Query: 1277 ---------WK-------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
W QK + A FHVG V + ++ PG ++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI----------PGGSES- 1116
Query: 1321 RFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
L++ TL G++G + P ++ F Q L+ + P + G + SFR ++ K
Sbjct: 1117 ---LVYTTLSGTVGVLVPFTSHEDHDF--FQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK 1171
Query: 1378 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+++D +L + + +Q I+ T S++ L D+ +F
Sbjct: 1172 -------NVIDGDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDIRTRYAF 1217
>gi|157872916|ref|XP_001684981.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
gi|68128051|emb|CAJ06910.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
Length = 1347
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 126/311 (40%), Gaps = 43/311 (13%)
Query: 1101 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL 1160
+ +E + LL IG+++ ++ AR + + R Q L + SK++ GA+
Sbjct: 913 GVSEEEWQHLLLIGSSFTFPDEQRARSGRITWCALREEHQQQRL-HLIASKDIGGALQCC 971
Query: 1161 ASL---QGHLLIASGPKIILHKWTGTELN---------GIAFYDAPPLYVVSLNIVKNFI 1208
A++ +G + + + L+KW + G+ PLY SL + +
Sbjct: 972 AAVPHYKGRIALGVNGCVCLYKWNTEDQTFVAEERCRVGLTVTKLIPLYHTSL--AASVL 1029
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDF---GSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
+ D+ S +F+ L +L +D G +D I +L + D+ N
Sbjct: 1030 VALDVRHSAFFIEVDTLQGSLKVLCRDAELRGVMDGH-----IGSDAENLCLFDDSLNFT 1084
Query: 1266 IFYYAPKMSE----------SWKGQ-KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP 1314
P E S Q + RA+ H+G VT +R A +S A
Sbjct: 1085 ALRVVPLPVEAGDGDAAAAASVTAQYRFEVRAQCHLGDLVT-CVRQGSFAATSLMEAPAS 1143
Query: 1315 GSDKTNRF--------ALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNP 1366
+ NR L+F T G G + P+ T+ L++L+ LV ++ + GL+
Sbjct: 1144 CASAQNRLLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRTLEASLVRTLQPLGGLSH 1203
Query: 1367 RSFRQFHSNGK 1377
++FR+ +G+
Sbjct: 1204 QAFREVLRSGQ 1214
>gi|237833631|ref|XP_002366113.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
gi|211963777|gb|EEA98972.1| hypothetical protein TGME49_024280 [Toxoplasma gondii ME49]
Length = 2804
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 67/292 (22%), Positives = 119/292 (40%), Gaps = 59/292 (20%)
Query: 1111 LAIGTAYVQGEDVAARGRVLLFS----------TGRNADNPQNLVT-------EVYSK-E 1152
LA G E+V GRV LF G D P E+++
Sbjct: 2398 LAAGVGVPLSENVECGGRVYLFKLPESSLRVVPAGNAGDAPTEEAEFGTPERLELFADIV 2457
Query: 1153 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
L G ++ + S + +++ + GP++ +H+ G++ AF DA + V S+ ++N
Sbjct: 2458 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMEGSKFLRGAFSDAS-VCVTSVANIRN 2516
Query: 1207 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1260
F LLGD K + +SW+ ++ +++ F +L A FL + L ++ +D
Sbjct: 2517 FFLLGDALKGLNLVSWEYHAEADSRKMIRVSRTFPKSNLPVVACSFLTYENLLGMLATDI 2576
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
N+++F Y + + F +L +L ++ AA K
Sbjct: 2577 DGNVRLFCY-------------------NADKNSPGFEKLDILQCDAEDRCAAGCVVKLQ 2617
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRR--------LQSLQKKLVDSVPHVAGL 1364
+F + T+ S+G A L FR LQ+LQ ++ +P GL
Sbjct: 2618 QFVVDSETV-ASLGEAADSSTLVFRLLASDSYSFLQTLQDRMAQYLPEPLGL 2668
>gi|46125735|ref|XP_387421.1| hypothetical protein FG07245.1 [Gibberella zeae PH-1]
Length = 1208
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 103/475 (21%), Positives = 200/475 (42%), Gaps = 57/475 (12%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQK-IPLKATPHQITYFAEKNLYPLIV 1020
C G + + Q I I +L G T +QK IPL TP ++ ++ L+ I
Sbjct: 761 QCEEGIVGIQGQSLRIFNIDRL--GETL-----IQKSIPLTYTPKKLVKHPDQPLFYTIE 813
Query: 1021 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQ 1078
+ P LL D V + D+ L D + + +++P G Q
Sbjct: 814 ADNNTLPPELRAQLLADPGVVNG-DSRVLPPEDFGYPKGTRRWASCINVIDPLSEEG--Q 870
Query: 1079 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG--- 1135
TI ++++E A++ +V ++++NE+ L IGT G+D+ R FS G
Sbjct: 871 VLQTIDLENNEAAVSAAIVPF---SSQDNESFLVIGT----GKDMVVNPRS--FSEGYLH 921
Query: 1136 --RNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1193
R + + L ++ +++ AL + QG +L+A G + ++ ++ + +
Sbjct: 922 IYRFLEGGREL-EFIHKTKVEEPPLALLAFQGRVLVAVGTSLRIYDLGMRQMLRKSQAEV 980
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
+VSLN + I++GD+ + + ++ +K +L D + T ++D
Sbjct: 981 ATQQIVSLNTQGSRIIVGDVQQGVTYVVYKPASNKLIPFVDDTIARWTTCTT-MVDYE-- 1037
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQMLAT 1305
S+ D+ N+ I K SE ++ + +R H H + Q + T
Sbjct: 1038 SVAGGDKFGNMFIVRCPEKASEEADEEQSGLHLINARDYLHGTPHRVSLMCHFYTQDIPT 1097
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
S + G + LL+ + G+IG P ++ F Q+L++ L P +A
Sbjct: 1098 SITKASLVVGGQE----VLLWSGIMGTIGVFIPFVSREDADF--FQNLEQHLRTEDPPLA 1151
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
G + +R +++ K ++D +L Y +LP +++ IA + + +I
Sbjct: 1152 GRDHLMYRGYYAPVKG-------VIDGDLCERYNLLPNDKKQMIAGELDRSVREI 1199
>gi|321249291|ref|XP_003191408.1| U2 snRNA binding protein [Cryptococcus gattii WM276]
gi|317457875|gb|ADV19621.1| U2 snRNA binding protein, putative [Cryptococcus gattii WM276]
Length = 1217
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 117/546 (21%), Positives = 206/546 (37%), Gaps = 80/546 (14%)
Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
N+ G SR +++ L+ P + D A++ L C G I + S L
Sbjct: 718 NVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWS-LSAAMCPDGLIGI-SGNTL 775
Query: 979 KICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1038
+I +P +K+ +TP +TY K + P N V ++
Sbjct: 776 RIFSIPKLG--------EKLKQDSTP--LTYTPRKF---------ISHPFNPVFYMI--- 813
Query: 1039 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV- 1097
+ D+ S + R +E E R ++ P + A VRV+
Sbjct: 814 ----EADHRTYSKGAIERIVKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLD 869
Query: 1098 -----TLFNTTTKENETLLAIGTAYVQ---GEDVAARG---RVLLFSTGRN-------AD 1139
T+ E+E +I AY + GE G + L G A
Sbjct: 870 PLANETIMTFDLDEDEAAFSIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAI 929
Query: 1140 NPQNLVTEVYSKELKGAIS-ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
Q V E K I LA QG LL G + L++ L + P V
Sbjct: 930 KEQGRVLEFLHKTKTDDIPLCLAGFQGFLLAGVGKSLRLYEMGKKALLRKCENNGFPTAV 989
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSL 1255
V++N+ I++GD+ +S ++ ++ QL + A D + C + +D T++
Sbjct: 990 VTINVQGARIIVGDMQESTFYCVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA- 1045
Query: 1256 VVSDEQKNIQIFYYAPKMSESWK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSS 1307
D+ NI I P +SE G +L F +GA H T+ + + + TS
Sbjct: 1046 -CGDKFGNIFINRLDPSISEKVDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVVTSI 1104
Query: 1308 DRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGL 1364
+ G R L++ T+ G++G + P D++ F + +L+ + + G
Sbjct: 1105 TKIPLVAG----GRDVLVYTTISGAVGALVPFVSPDDIEF--MSTLEMHMRTQDISLVGR 1158
Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ ++R ++ K +VD +L + +LP +Q IA + +L L +
Sbjct: 1159 DHIAYRGYYVPIKG-------VVDGDLCESFSLLPYPKQQAIASDLDRSVGDVLKKLEQM 1211
Query: 1425 ALGTSF 1430
++F
Sbjct: 1212 RTSSAF 1217
>gi|302406266|ref|XP_003000969.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
gi|261360227|gb|EEY22655.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
Length = 1059
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 129/574 (22%), Positives = 214/574 (37%), Gaps = 133/574 (23%)
Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
V + + G + S+A G++ +D +ILA + +I+++E+ + + + + F
Sbjct: 59 VLSHDVFGIIRSMAAFRIAGSN----KDYLILATDSGRIAIIEYLPAQNRFQRLHLETFG 114
Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-----MIILKASQGGSGLVGDED 218
K G G + DP+GR L+ L+ ++ + SQ E
Sbjct: 115 -------KSGIRRVVPGEFLACDPKGRA--CLIASLEKNKLVYVLNRNSQA-------EL 158
Query: 219 TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
T S A HV+++ LD+ GY PV L E + T A +
Sbjct: 159 TISSP--LEAHKPGVHVLSMVALDV----------GYANPVFAAL-ETDYTEADQDPTGQ 205
Query: 279 HTCMISALSISTTLKQHPL------IWSAMNLPHDAYKLLAVPSPIG-----GVLVVGAN 327
+AL + T L + L + + P D L P G GVLV G
Sbjct: 206 -----AALDVETQLVYYELDLGLNHVVRKWSEPVDNTASLLFQVPGGNDGPSGVLVCGEE 260
Query: 328 TIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTK 382
I Y HS + + + + E P S+ H L+ LL T+
Sbjct: 261 NITYRHSNQEAFRVPVPRRR----GATEDPSRKRSIVAGVMHK--LKGSAGAFFFLLQTE 314
Query: 383 TGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
GDL +T+ DG V+RL + + + S + + + ++ S+ G+
Sbjct: 315 DGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILKSGFLYVASQFGNYQFY 374
Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
QF E+ GD + L SS D D E +
Sbjct: 375 QF--------------EKLGD------DDEELEFSSDDFPTDPKQSYEAVFF-------- 406
Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV--------- 542
++ + A+ +S+ ++ PL D DA +A G +S + ++
Sbjct: 407 HPRELENLALVESIDSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLEVNEI 466
Query: 543 ---ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
ELPG +WT+ K SRG D+Y AY+++S T+VL + + E
Sbjct: 467 VASELPGIPSAVWTL--KLSRG------------DQYDAYIVLSFTNATLVLSIGETVEE 512
Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
V +S F+ A L G +IQV +G R
Sbjct: 513 VNDS--GFLTSVPTLAAQLLGGEGLIQVHPKGIR 544
Score = 46.6 bits (109), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 60/267 (22%), Positives = 114/267 (42%), Gaps = 24/267 (8%)
Query: 964 NCNHGFIYVTSQG--ILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1021
C G + + Q I I +L T + IPL TP ++ E ++ I S
Sbjct: 761 QCEEGVVGIQGQSLRIFAIEKLSDTLTQ------KSIPLTYTPRRMVKHPEHPMFYTIES 814
Query: 1022 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1081
P LL D V + D L V+ + I D QT
Sbjct: 815 DNNTLPPELRAQLLADPSVVNG-DARTLPPVEFGYPRAKGRWASCISVIDPLSEELQTLQ 873
Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----R 1136
T+ + ++E A++ +V T+++NE+ L +GT G+D+ R F+ G R
Sbjct: 874 TVDLDNNEAAVSAAIVPF---TSQDNESFLVVGT----GKDMIVNPR--QFTEGYIHIYR 924
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
+++ + L ++ +++ +AL + QG L+ G + ++ ++ A D P
Sbjct: 925 FSEDGREL-EFIHKTKVEEPPTALLAFQGRLVAGVGKTLRIYDLGQKQMLRKAQADVAPQ 983
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWK 1223
+VSL+ + I++GD+ + + ++ +K
Sbjct: 984 LIVSLSTQGSRIVVGDVQQGVTYVVYK 1010
>gi|119173562|ref|XP_001239205.1| hypothetical protein CIMG_10227 [Coccidioides immitis RS]
Length = 1208
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 75/344 (21%), Positives = 152/344 (44%), Gaps = 48/344 (13%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------AARGRVLLFSTGR 1136
I ++ +E A++V V +++++ET L +GT G+D+ ++ G + ++
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVYPPSSSCGFIHIYRFQE 924
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
+ + ++ +++ AL + QG LL G + ++ +L + P
Sbjct: 925 DGKE----LEFIHKTKVESPPHALLAFQGRLLAGIGRNLRIYDLGMKQLLRKCQAEVVPR 980
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLS 1254
+V L + I++ D+ +S+ ++ +K Q +L A D + C A ++D T++
Sbjct: 981 LIVGLQTQGSRIIVSDVQESVTYVVYKYQENRLIPFADDVIARWTTCTA---MVDYETVA 1037
Query: 1255 LVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATS 1306
D+ N+ + K SE G L+ ++ GA L + Q + TS
Sbjct: 1038 --GGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQYLQGAPNRLSLMVHFYPQDIPTS 1095
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAG 1363
+T G R L++ L G++G + P +++ F QSL+ +L P +AG
Sbjct: 1096 IQKTQLVAG----GRDILVWTGLQGTVGMLVPFVSREDVDF--FQSLEMQLTSQTPPLAG 1149
Query: 1364 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+ +R +++ K +D +L Y LP +++L IA
Sbjct: 1150 RDHLIYRSYYAPAKG-------TIDGDLCETYFTLPNDKKLMIA 1186
>gi|392578232|gb|EIW71360.1| hypothetical protein TREMEDRAFT_71141 [Tremella mesenterica DSM 1558]
Length = 1250
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 130/309 (42%), Gaps = 46/309 (14%)
Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV------------AARGRVLL 1131
P++S E T+ VTL ++ LLA+G A +D A GR++L
Sbjct: 894 PLESREIVTTIAYVTL------GDQNLLAVGIATFSEDDEDLPDDLDMVTISAQSGRLVL 947
Query: 1132 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK-------WTGTE 1184
+ + D+ + + E+ S L+ A++ + ++ L +A+G + ++K T
Sbjct: 948 YEPVVDQDSAEPNLIELTSVGLESAVNDIKVIKNLLAVATGSNVTIYKHEKASHLLIPTS 1007
Query: 1185 LNGIAFYDAPPLYVVSLNIV--KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
AF A L V + + + +++GD +SI+ L E + +D +
Sbjct: 1008 RFASAFV-AKSLVVAPPDKLHPEERLVVGDGMRSIFVLDIDEGTGMIMGDERDMATHSVM 1066
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
A E L DG +++V+D NI F +++ + A F + ++ F R +
Sbjct: 1067 AMEGLRDGGQ-AVIVADAHSNISTFRLR---------EEIETAATFGLHEDISVFRRGSL 1116
Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVA 1362
SS +P ++F T+DG +G + L R L LQ+ + +
Sbjct: 1117 APASSAEDVLSP--------EIIFATIDGRLGIVGELTPSAARTLDDLQRNMDRYIRGPG 1168
Query: 1363 GLNPRSFRQ 1371
+ RS+R+
Sbjct: 1169 DIAWRSYRR 1177
>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
Length = 1286
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 127/604 (21%), Positives = 225/604 (37%), Gaps = 114/604 (18%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++ +L+F +
Sbjct: 41 LLRADKQGKLNLIASKDVFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF I +S N D K V + + G
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPIFASIEQNYEMYD-KQVTNTNEIDGCTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L E +L ++ +++H LP D+ L +P P
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDSSAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE + S+ A H
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSALHRI 295
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + ++ +V+ + + + + I + + F+ + G
Sbjct: 296 R-KFFFILIQSEFGDLYKIEMDHEDGIVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
+ QF+ G + K G A TK+L ++ L D V L +
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411
Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
+ N S Q +L GP L+ +GL I A EL
Sbjct: 412 ILDAKNANSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN------------EL 452
Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG K IWT+ + NA +Y Y+I+S E T++LE + + EV +S+
Sbjct: 453 PGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDSL 498
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ T N+ +IQV + G R ++G + + + P N + + + NST +
Sbjct: 499 --LLTNVTTIHVNILYDNTLIQVHDTGIRHINGKVVHEWVP--PKNKQIKAATSNSTQIV 554
Query: 664 VSIA 667
+S++
Sbjct: 555 ISLS 558
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 111/286 (38%), Gaps = 45/286 (15%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + G K+ ++ +L Y P ++S+ + + I DI +S+
Sbjct: 1021 CFCPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1080
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
+ L L++ D +E L + ++ L V +E K +
Sbjct: 1081 IFFYDSNMNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1140
Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+Y M+ S K ++L FHVG VT ++++ TSS+
Sbjct: 1141 NKCWYGGEIMAGSNKNRRLEHIMSFHVGEIVTSLQKVKLSPTSSE--------------C 1186
Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+++ T+ G+IG P D ELT Q L+ L P + G FR ++ +
Sbjct: 1187 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1241
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +L + LP + Q ++A T IL L D+
Sbjct: 1242 ------HVIDGDLCEQFSSLPYDVQRKVAADLERTPDDILRKLEDI 1281
>gi|146420838|ref|XP_001486372.1| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
6260]
Length = 1206
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 120/610 (19%), Positives = 237/610 (38%), Gaps = 96/610 (15%)
Query: 96 ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
+ + ++ +CH ++ G ++++ + +GG++ D +++ + ++S+LEFD
Sbjct: 53 LESGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD----- 103
Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
+ F + H K G G + VDPQ R + ++ KA + L
Sbjct: 104 --ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL-- 159
Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
+ +++ I + LD GY P++ + E +A +
Sbjct: 160 -----ELLSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHYDA 201
Query: 276 WKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
+ S L++ + L + A + +P + L+ +P+PIGGV+V G++ I
Sbjct: 202 SLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSFIF 261
Query: 331 YHSQSASCALALNNYAVSLDS-SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
YH+ + L L + L + S +P ++V H N LL + GD +
Sbjct: 262 YHNPTIDQQLYL---PIPLRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFRV 313
Query: 390 TVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSML 446
+ Y D V L + + ++ I F D +L Q G S +
Sbjct: 314 LIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSYI 373
Query: 447 SSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
SS D+ + + + R + AL +++ G+ +ES + +
Sbjct: 374 SSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMIT 433
Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADS 565
S LK ++G+ I+ LV P +V+ +
Sbjct: 434 GHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF----------T 464
Query: 566 SRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
+R++A + + Y++IS A +T+VL +++ EV +S FV + G + +
Sbjct: 465 TRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKSL 521
Query: 624 IQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
IQ++ G R + +G + + P T++S S VL+G+S+
Sbjct: 522 IQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSNR 573
Query: 679 SIRLLVGDPS 688
+ DP+
Sbjct: 574 ELCYFEIDPT 583
>gi|429859776|gb|ELA34542.1| pre-mRNA-splicing factor rse1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1212
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 78/365 (21%), Positives = 159/365 (43%), Gaps = 42/365 (11%)
Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----R 1136
T+ + +E A++ +V+ +++NE L +GT G+D+ R F+ G R
Sbjct: 874 TVDLDDNEAAVSAAIVSF---ASQDNENFLIVGT----GKDMIVNPR--QFTEGYIHVYR 924
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL 1196
++ L ++ +++ +AL + QG LL G + ++ ++ + D P
Sbjct: 925 FGEDGHEL-EFIHKTKVEEPPTALLAFQGRLLAGVGKTLRIYDLGLRQMLRKSQADVAPQ 983
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
+VSL+ + I++GD+ + ++ +K +L D + T ++D S+
Sbjct: 984 LIVSLSTQGSRIVVGDVQHGVTYVVYKPTTNKLIPFVDDTIARWVTCTT-MVDYE--SVA 1040
Query: 1257 VSDEQKNIQIFYYAPKMS----ESWKGQKLL-SRAEFHVGAHVTKFLR---LQMLATSSD 1308
D+ N+ + K S E G LL +R H H L Q + TS
Sbjct: 1041 GGDKFGNMFLVRCPEKASQEADEEQAGLHLLNTRDYLHGAPHRLNLLSHSYTQDVPTSIT 1100
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
+T G LL+ ++G+IG P +++ F Q+L++ + +AG +
Sbjct: 1101 KTSLVVGGQDV----LLWSGINGTIGVFIPFVTREDVDF--FQNLEQHMRTEDAPLAGRD 1154
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
+R ++ K ++D +L Y +LP +++L IA + + +I ++D+
Sbjct: 1155 HLMYRGYYVPVKG-------VIDGDLCERYTLLPNDKKLMIAGELDRSVREIERKISDIR 1207
Query: 1426 LGTSF 1430
++F
Sbjct: 1208 TRSAF 1212
Score = 44.3 bits (103), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 138/607 (22%), Positives = 226/607 (37%), Gaps = 137/607 (22%)
Query: 74 QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
Q G+KE + R+ + D + L+ H + G + S+A G++ +
Sbjct: 27 QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
D +ILA + +I+++E+ I + + F+ LHL K G G + D
Sbjct: 82 DYLILATDSGRITIVEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130
Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
P+GR V L ++ + +Q E T S A V+++ LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNAQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
GY PV L E E T A G + + T ++ + L
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229
Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLD 350
WS P D L P G GVLV G I Y HS + + + A
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286
Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL--VLLTVVYDGR-----VVQRLDL 403
S + S +L + + LL T+ GDL ++ +V D V+RL +
Sbjct: 287 SRKRHIVSGVMHKLKGSAGAFF----FLLQTEDGDLFKAVIDMVEDADGNPTGEVKRLKI 342
Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
+ ++S + + + + S+ G+ QF E+ GD + +
Sbjct: 343 KYFDTVPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDEEK-- 386
Query: 464 TKRLRRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
SS D D G + + Y N A+ +S+ ++ PL D
Sbjct: 387 ----EFSSDDFPADPKAGYDAVYFYPRPLEN---------LALVESIDSMNPLLDCKVAN 433
Query: 523 RINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADS 565
DA +A G +S + ++ ELPG +WT+ K SRG
Sbjct: 434 LTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLSRG----- 486
Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
D+Y AY+++S T+VL + + EV++S F+ A L G +IQ
Sbjct: 487 -------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQ 537
Query: 626 VFERGAR 632
V +G R
Sbjct: 538 VHPKGIR 544
>gi|255946770|ref|XP_002564152.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591169|emb|CAP97395.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1209
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 99/465 (21%), Positives = 185/465 (39%), Gaps = 72/465 (15%)
Query: 990 DNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSLLIDQEVGHQIDNHN 1048
DN + IPL TP + E+ L+ +I S VL P + LID + +
Sbjct: 781 DNNMLQESIPLSYTPRRFVKHPEQPLFYVIESDNNVLSPATR--QRLIDDSQAQNGEATD 838
Query: 1049 LSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTLFNTT 1103
L D + ++++EP T++ I ++ +E A+++ V+ +
Sbjct: 839 LPPADFGYPRATGHWASCIQVVEP------ITTKSVIFNLDLEDNEAAVSLAAVSF---S 889
Query: 1104 TKENETLLAIGTA-------------YVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYS 1150
++++ET L +GTA ++ GR L F D P
Sbjct: 890 SQDDETFLVVGTAKDMTVSPPSSSCGFIHIYRFQEDGRELEFIHKTQVDEPP-------- 941
Query: 1151 KELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILL 1210
AL QG LL GP + ++ +L P +V L + I++
Sbjct: 942 -------LALLGFQGRLLAGIGPILRVYDLGMKQLLRKCQAPVVPKTIVGLQTQGSRIIV 994
Query: 1211 GDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYA 1270
D+ +S+ ++ +K Q L A D + +T ++D T + D+ N+ +
Sbjct: 995 SDVRESVTYVVYKYQDNVLIPFADDSIARWTSSTT-MVDYETTA--GGDKFGNLWLVRCP 1051
Query: 1271 PKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRF 1322
K+SE G L+ + H H + + Q + TS +T G R
Sbjct: 1052 SKVSEQADEDGSGAHLIHEKGYLHGTPHRLELMVHFYAQDIPTSLHKTQLVAG----GRD 1107
Query: 1323 ALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
+++ G+IG P +++ F Q L+ +L P +AG + +R +++ K
Sbjct: 1108 IVVWTGFQGTIGMFVPFASREDVDF--FQLLETQLASQQPPLAGRDHLMYRGYYAPVKG- 1164
Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +L Y +LP + +L IA + + +I ++D+
Sbjct: 1165 ------VIDGDLCEMYLLLPNDTKLMIAGELDRSVREIERKISDM 1203
>gi|402595041|gb|EJW88967.1| hypothetical protein WUBG_00126 [Wuchereria bancrofti]
Length = 621
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 87/361 (24%), Positives = 139/361 (38%), Gaps = 100/361 (27%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A +++VP P+GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEANIVISVPEPVGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLT 412
+ +D +L D+A G+L +L L+L K N +V+
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLL----------LELGKDQEQDENSAVIV 300
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTC--GSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
D+ LG++ + + C +G + S FGD +L R
Sbjct: 301 RDMKV---------ESLGETCIAECMCYLDNGVCFIGS----RFGD--------SQLIRL 339
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-- 528
S++ A T ++ DS N+ P++D + +R N
Sbjct: 340 STEP---------------------RADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQI 377
Query: 529 -SATGISKQSNY----------EL--VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
+ +G K EL VEL G K ++T+ +RG DE+
Sbjct: 378 LTCSGAYKDGTIRIIRNGIGIEELASVELKGIKNMFTL---RTRG------------DEF 422
Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
YLI+S ++ T VL E TE + V G T+ AG LF + ++QV ++D
Sbjct: 423 DDYLILSFDSETHVLFINGEELEDTEITGFAVDGATLWAGCLFHSKTILQVTHGEVILID 482
Query: 636 G 636
G
Sbjct: 483 G 483
>gi|225560964|gb|EEH09245.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus G186AR]
Length = 1209
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 103/486 (21%), Positives = 201/486 (41%), Gaps = 66/486 (13%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVSVP 1023
C G + + Q L+I L +N + IPL+ TP +F + YPL
Sbjct: 759 QCVEGMVGIQGQN-LRIFSL---EKLENNLLQETIPLQYTPR---HFIKHPEYPLFY--- 808
Query: 1024 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE-----------VRILEPDR 1072
V++ N +LS ++ + D N + L EE+ ++I++P
Sbjct: 809 VIEAENNILSPGTRTKLLNDSDAVNGDTTPL----PPEEFGYPRGTGHWASCIQIVDPIN 864
Query: 1073 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL-- 1130
+ + + I ++ +E A++V V +++++ET L +GT G+D+ R
Sbjct: 865 SK---RVISQIELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVNPRSCTA 914
Query: 1131 -LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIA 1189
R + + L ++ +++ AL QG LL G + ++ ++
Sbjct: 915 GFIHIYRFQEEGKEL-EFIHKTKVEQPPMALLGFQGRLLAGIGTDLRIYDLGMKQMLRKC 973
Query: 1190 FYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLID 1249
P VV L + I++ D+ +S+ ++ +K Q +L D S T ++D
Sbjct: 974 QASVVPHLVVGLQTQGSRIIVSDVQESLTYVVYKYQENRLIPFVDDVISRWTTCTT-MVD 1032
Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQ 1301
T++ D+ N+ + K SE G L+ ++ GA ++ Q
Sbjct: 1033 YETVA--GGDKFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQ 1090
Query: 1302 MLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSV 1358
L TS + G R L++ L G++ + P +E+ F QSL+ +L
Sbjct: 1091 DLPTSIQKAQLVTG----GRDILVWTGLQGTVSMLIPFISREEVDF--FQSLEMQLAAQN 1144
Query: 1359 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1418
P +AG + +R +++ K +D +L Y +LP +++ +IA + + +I
Sbjct: 1145 PPLAGRDHLIYRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIE 1197
Query: 1419 SNLNDL 1424
+ D+
Sbjct: 1198 RKIADM 1203
>gi|357606250|gb|EHJ64976.1| putative Splicing factor 3B subunit [Danaus plexippus]
Length = 1216
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 82/392 (20%), Positives = 156/392 (39%), Gaps = 60/392 (15%)
Query: 1064 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV 1123
++RIL+ G T +P++ +E A+++ VV T ++ + +
Sbjct: 860 QIRILDMSGGVGGCSTVCLLPLEQNEAAVSLCVVRWAALTDNTPHLVVGVAKDALLSPRS 919
Query: 1124 AARGRV---LLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
+ G + +++TG+ LV + E G ALA+ G LL G + L+
Sbjct: 920 CSEGSLHVYKIYNTGK-----LELVHKTPIDEYPG---ALAAFNGKLLAGVGRMLRLYDI 971
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
+L P + + ++ I + D+ +S++ + +K++ QL + A D
Sbjct: 972 GRRKLLRKCENRHIPNLIADIKTIRQRIFVSDVQESVFCVKYKKRENQLIIFADDTNP-R 1030
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS----ESWKGQK------LLSRA---- 1286
++D T+++ +D+ N+ + +S E G K LL+ A
Sbjct: 1031 WITNTCILDYDTVAM--ADKFGNVAVLRLPQSVSDDVDEDPTGNKALWDRGLLNGASQKG 1088
Query: 1287 ----EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1342
FHVG VT R ++ PG + ALL+ T+ G++G P
Sbjct: 1089 DITVNFHVGETVTSLQRATLI----------PGGSE----ALLYATVSGALGVFLP---F 1131
Query: 1343 TFRR----LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
T R Q L+ + + G + SFR ++ K +++D +L + L
Sbjct: 1132 TSREDHDFFQHLEMHMRSENSPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQFNSL 1184
Query: 1399 PLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+Q IA T +++ L D+ +F
Sbjct: 1185 EPAKQKAIAGDLERTPAEVSKKLEDIRTRYAF 1216
>gi|440636768|gb|ELR06687.1| pre-mRNA-splicing factor rse1 [Geomyces destructans 20631-21]
Length = 1212
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 80/381 (20%), Positives = 167/381 (43%), Gaps = 43/381 (11%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
++I++P R Q I ++ +E A+++ V+ ++E+E L +GT G+D+
Sbjct: 860 IQIVDPIREKKVLQQ---IDLEDNEAAVSMATVSF---ASQEDEVFLVVGT----GKDMV 909
Query: 1125 ARGRV----LLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW 1180
A R + + D + + ++ +++ AL QG LL+ G ++ ++
Sbjct: 910 ASPRSSSGGFIHVYRFHEDGKE--IEFIHKTKVEEPPLALLGFQGRLLVGIGRELRIYDL 967
Query: 1181 TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1240
+L A + +V L + I++ D+ +SI F+ +K Q +L A D +
Sbjct: 968 GMRQLLRKAQTEIAASLIVGLQTQGSRIIVSDVQESITFVVYKFQENKLIPFADDTIARW 1027
Query: 1241 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA-HVT 1295
T ++D T++ D+ N+ + K SE G L+ ++ GA H
Sbjct: 1028 TTCTT-MVDYETVA--GGDKFGNLWLLRCPTKASEEADEEGSGAHLVHERQYLQGAPHRV 1084
Query: 1296 KFLRLQM---LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQS 1349
+ + TS +T G R LL+ L G+I + P +++ F Q+
Sbjct: 1085 ALMAHNFANDIPTSIQKTNLVAG----GRDCLLWSGLQGTIAIMIPFVSREDVDF--FQT 1138
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
L++ L +AG + +R ++ K ++D +L Y +LP ++++ IA +
Sbjct: 1139 LEQHLRTEDAPLAGRDHLIYRSYYVPVKG-------VIDGDLCERYTLLPTDKKMMIAGE 1191
Query: 1410 TGTTRSQILSNLNDLALGTSF 1430
+ +I ++D+ +++
Sbjct: 1192 FDRSVREIERKISDMRTRSAY 1212
>gi|308477185|ref|XP_003100807.1| CRE-DDB-1 protein [Caenorhabditis remanei]
gi|308264619|gb|EFP08572.1| CRE-DDB-1 protein [Caenorhabditis remanei]
Length = 1154
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/283 (18%), Positives = 121/283 (42%), Gaps = 21/283 (7%)
Query: 1104 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
T +++ +GT + E+ + GR+++F + ++ + V+ +G+ AL
Sbjct: 834 TNDSKVYYIVGTGLIYPEETDTKFGRIVVFEVD---EVERSKLRRVHDLVCRGSPLALRI 890
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
L G L+ A + L +WT + + + + + L ++ + + D+ +S+ LS+
Sbjct: 891 LNGKLVAAINSSVRLFEWTMDKELRLECSNFNHIMALDLKVMNEEVAVADVMRSVSLLSY 950
Query: 1223 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE--SWKGQ 1280
+ +AKD+ S EF+ L + ++ +F S + G+
Sbjct: 951 RMLEGNFEEVAKDWNSEWMVTCEFITAEQILG-----GEAHLNLFTVEVDKSRPITDDGR 1005
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAP 1338
+L ++ + + + L D D + +++ ++FGT GSIG +
Sbjct: 1006 YVLEPTGYYYLGELPRVMVRSSLVAQPD--------DCSIQYSQPIMFGTNQGSIGMVVQ 1057
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1381
+D+ + L +++K + DSV + + ++R F + P
Sbjct: 1058 IDDKWKKFLIAVEKAIADSVKNCMHIEHTTYRSFIFQKRLESP 1100
Score = 45.4 bits (106), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 147/358 (41%), Gaps = 82/358 (22%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
++ DA L+ VP+PI GVLV+ AN+I Y S N V +S L + F+
Sbjct: 206 SIAADASVLIPVPAPISGVLVLAANSILYKSSDV-------NGDVVPYASPLLDNTVFTC 258
Query: 363 E--LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITT 417
+D + ++ +D T+ L+L+ + +GR V+ + + + + I
Sbjct: 259 HGLVDPSGERFILSD-----TEGRLLMLILNIGEGRSGITVKDMRIEYLGETSIADSINY 313
Query: 418 IGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
I + F+GSRLGDS L++ SG S S + E + +I ++
Sbjct: 314 IDAGVVFVGSRLGDSQLIRLMPTPSGGSY--SVVLETYSNI---------------GPIR 356
Query: 477 DMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
DM+ E ++ ++ T S A +D G L+ G+ I AS
Sbjct: 357 DMIMVE---------SDGQAQLVTCSGAEKD-----GSLRVIRNGIGIEELAS------- 395
Query: 537 SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
VEL G GI+ + S+ + Y+I+SL T VL+
Sbjct: 396 -----VELAGVIGIFPIRLNSTTDN----------------YVIVSLAEETHVLQINGEE 434
Query: 597 TEVTESVDYFVQGRTIAAGNLFG---RRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
E + + + TI A +FG ++QV E+ R + S +++ + P N E
Sbjct: 435 LEDVQLLQICTEMPTIFASTIFGPDNSEVLLQVTEKHVRFMAFSGLSK--IWEPPNGE 490
>gi|440300137|gb|ELP92626.1| DNA damage-binding protein, putative [Entamoeba invadens IP1]
Length = 1086
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 79/337 (23%), Positives = 154/337 (45%), Gaps = 39/337 (11%)
Query: 1106 ENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISAL---- 1160
E + L +GTAY + GE +RGR ++F + + EV ++ + GA+ ++
Sbjct: 766 EEKELYIVGTAYAKLGEVEPSRGRFIIFEI------HEEKIIEVSNRYVDGAVYSVKRFE 819
Query: 1161 --------ASLQGHLLIAS-GPKIILHKWTGT-ELNGIAFYDAPPLYVVSLNIVKNFILL 1210
A++Q +++ KI+ K+ T E G A L+V +L + IL+
Sbjct: 820 NDVGNYIAATIQKKVVVYQIERKIVDGKFAVTIEEKGGANVKLIGLFVKTLG---HEILV 876
Query: 1211 GDIHKSIYFLSWKEQGAQLNLL--AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
GD+ KSI + E+ + ++ +DF + A EF+ + +S SD Q N+ +F
Sbjct: 877 GDLMKSISVFKFDEKATRNAVVETCRDFYASYTTAVEFMDEHCFMS---SDSQGNLLVFT 933
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+ + KL + A HVG + + + ++ +T + +LFG
Sbjct: 934 ENTTTTNENEKFKLQNEAHIHVGECINVMCKGSIAVMNN-------AMWETQKKCMLFGG 986
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN-GKAHRPGPDSIV 1387
+ GSIG I ++ T+++L +L+ +++ + V + SF Q+ R +++
Sbjct: 987 ICGSIGGITEINLETYKKLFALESEMLREMKGV--IECESFGQWKMVFDDWKRMEAQNVI 1044
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
D ++ + LP E Q IA + G ++++ L +
Sbjct: 1045 DGNVVELFLDLPKESQKHIAEKIGYAGEELVTVLESM 1081
>gi|121700262|ref|XP_001268396.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
gi|119396538|gb|EAW06970.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
Length = 1209
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 97/477 (20%), Positives = 204/477 (42%), Gaps = 48/477 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + Q + ++ S DN + IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNMLQESIPLSYTPRRLLKHPEQPLFYVIGSDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + LI+ + L + + +++++P A
Sbjct: 815 NVLSPATR--ARLIEDSKARNGEADTLPPEEFGYPRATGHWASCIQVVDPVNAKA---VI 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
+TI ++ +E A+++ V +++++ET L +GTA +A G + ++ R
Sbjct: 870 STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDLTVNPPSSAGGFIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL + QG L+ G + ++ +L P +
Sbjct: 924 EDGKEL-EFIHKTKVEEPPLALLAFQGRLVAGIGSILRIYDLGMKQLLRKCQAPVVPKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 983 VGLQTQGSRIIVSDVRESVTYVVYKYQENVLIPFVDDTVSRWMTSTT-MVDYETVA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
D+ N+ + K+SE G L+ R H + + + Q + TS +T
Sbjct: 1040 DKFGNLWLVRCPKKISEEADEDGSGAHLIHERGYLHGTPNRLELMIHTYTQDIPTSVHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFMSREDVDF--FQNLEMQLASQCPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|317031116|ref|XP_001392900.2| UV-damaged DNA binding protein [Aspergillus niger CBS 513.88]
Length = 1124
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 133/322 (41%), Gaps = 52/322 (16%)
Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTAY+ E+ + RGR+L+F DN + L T+V +KGA ALA L G ++
Sbjct: 816 FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 869
Query: 1170 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
A+ K ++ + + +L +A Y AP V + + N I + D+ KS+ +
Sbjct: 870 AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAVADLMKSVCLV 925
Query: 1221 SWKE--QGA--QLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
+ E G+ L +A+ F ++ C A + ++ +D + N+ +
Sbjct: 926 EYSEGENGSPDSLTEVARHFQTVWATGVSCIAKDTFLE--------TDAEGNLIVLRRNL 977
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
E ++L E +G V + + +Q LA+ + A GT+
Sbjct: 978 TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1025
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
+GSI A ++ L LQ + V + + FR F S + + P VD
Sbjct: 1026 EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKE-PYRFVDG 1084
Query: 1390 ELLSHYEMLPLEEQLEIAHQTG 1411
EL+ + Q EI G
Sbjct: 1085 ELIERFLTCEPSLQEEIVDSVG 1106
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347
>gi|326432370|gb|EGD77940.1| splicing factor 3b subunit 3 [Salpingoeca sp. ATCC 50818]
Length = 1232
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/304 (19%), Positives = 120/304 (39%), Gaps = 47/304 (15%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V+ E++ AL G L+ G + ++ +L P VV + ++
Sbjct: 955 VHRTEVEAMPCALTPFAGRLIAGVGNIVRIYDMGRKKLLRKCENKHLPSRVVDIEVMGTR 1014
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF 1267
+++ D +S++FL +K L++ D C A ++D ST + V+D+ N+ +
Sbjct: 1015 VVVADQRESVFFLKYKPTENVLSVFCDDTTPRWCTAM-LMVDYST--VCVADKFGNVSVL 1071
Query: 1268 YYAPKMSESWK------------------GQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
++++ + QKL+ A F++G V + + + ++
Sbjct: 1072 RCPDDVTDTLQEDPSGAKAFWARGYLNGAPQKLVQVANFYIGEIVQSLHKTTLTPSGTE- 1130
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1366
+ + TL GSIG + P ++ F Q+L+ L P + G +
Sbjct: 1131 -------------CIAYTTLSGSIGALMPFSHKEDAEF--FQTLELHLRQEHPPICGRDH 1175
Query: 1367 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLAL 1426
+FR + K S++D +L Y ML + +IA T ++ L +
Sbjct: 1176 LAFRSAYVPCK-------SVIDGDLCEEYNMLSASLKSDIADGLERTPQEVAKKLEEFRT 1228
Query: 1427 GTSF 1430
+F
Sbjct: 1229 RYAF 1232
>gi|393212467|gb|EJC97967.1| hypothetical protein FOMMEDRAFT_162310 [Fomitiporia mediterranea
MF3/22]
Length = 1161
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 72/136 (52%), Gaps = 10/136 (7%)
Query: 1108 ETLLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGH 1166
+T A+G+ Y + E +RGR+L+ STG + +++ S E+KGA++AL +QG
Sbjct: 849 DTFFAVGSVYFDETEREPSRGRILIISTGSKRNQTPHILA---STEVKGAVNALTCIQGK 905
Query: 1167 LLIASGPKIILHKWT---GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
L++A + + + T L + ++ L + ++ ++ + I++GD S+ L K
Sbjct: 906 LVVAINTSVDVFRLKHGDNTVLTAVTSWNHNYLVITAV-VMDDLIVIGDAVSSLAVL--K 962
Query: 1224 EQGAQLNLLAKDFGSL 1239
+ +L A+D+ L
Sbjct: 963 LEDDKLTTFARDYSPL 978
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 72/331 (21%), Positives = 128/331 (38%), Gaps = 73/331 (22%)
Query: 306 HDAYKLLAVPSPI-------GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
D+ L+ VP I GGVLV+G +TI ++S ++ S+ ++P++
Sbjct: 224 EDSNLLIPVPPQIKSSWNVNGGVLVLGGSTIAFYSIDRKQKKKNSSSQSKS-STSKIPQA 282
Query: 359 SFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
+ A W Q D LL G L LL + + + L + +P +
Sbjct: 283 EVNWPYFDITA-WAQIDEDGLRYLLGDSFGRLALLAINPQYAYLDIVLLGEVSPP---TS 338
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
+T + + ++GS GDS L++ T ++ + + F +I AP + + D+
Sbjct: 339 LTPLASQYIYVGSHFGDSQLIRVTSERSSNGSYLEISDTFKNI---APIMDAVFEDTDDS 395
Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
Q + ++ G S G L+ G N DA GI+
Sbjct: 396 GQPTI----ITCSGGEST--------------------GSLRVIRNGANFNEDARIEGIA 431
Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--T 592
G+W + + YDD +H Y++++ + T +LE
Sbjct: 432 N-----------ITGMWPIRRQ--------------YDDTFHHYMLVTTDTNTHLLELPN 466
Query: 593 ADLLTEVTESVDY---FVQGRTIAAGNLFGR 620
+ T V+ S D+ + RT+ AGN+ R
Sbjct: 467 SQQETAVSRSNDFSDLTIDSRTLVAGNMLTR 497
>gi|350629921|gb|EHA18294.1| damage-specific DNA binding protein [Aspergillus niger ATCC 1015]
Length = 1140
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 131/322 (40%), Gaps = 52/322 (16%)
Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTAY+ E+ + RGR+L+F DN + L T+V +KGA ALA L G ++
Sbjct: 832 FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 885
Query: 1170 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
A+ K ++ + + +L +A Y AP V + + N I + D+ KS+ +
Sbjct: 886 AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAVADLMKSVCLV 941
Query: 1221 SWKE----QGAQLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
+ E L +A+ F ++ C A + ++ +D + N+ +
Sbjct: 942 EYSEGENGMPDSLTEVARHFQTVWATGVSCIAKDTFLE--------TDAEGNLIVLRRNL 993
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
E ++L E +G V + + +Q LA+ + A GT+
Sbjct: 994 TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1041
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
+GSI A ++ L LQ + V + + FR F S + + P VD
Sbjct: 1042 EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVRETKE-PYRFVDG 1100
Query: 1390 ELLSHYEMLPLEEQLEIAHQTG 1411
EL+ + Q EI G
Sbjct: 1101 ELIERFLTCEPSLQEEIVDSVG 1122
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 284
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 285 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 344
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 345 IFVGSHQGDSQVLRIGNGS 363
>gi|346327528|gb|EGX97124.1| pre-mRNA splicing factor RSE1 [Cordyceps militaris CM01]
Length = 1206
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 79/368 (21%), Positives = 160/368 (43%), Gaps = 50/368 (13%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR--------VLLFST 1134
+ +++E A++V VV+ +++NE+ L +GT G+D+ R + F
Sbjct: 869 VDFENNEAAVSVAVVSF---ASQDNESFLVVGT----GKDIVLNPRSSSEAYIYIYRFQQ 921
Query: 1135 GRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1194
G + ++ +++ +AL QG LL G + ++ +L A +
Sbjct: 922 GGRE------LEFIHKTKIEEPATALLPFQGKLLAGIGKTLRMYDLGMRQLLRKAQAEVV 975
Query: 1195 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTL 1253
P +VSLN + I++ D+ + + + +K +L D S+ ++T ++D
Sbjct: 976 PQQIVSLNTQGSRIVVSDVQQGVTLVVYKSASNKLIPFVDD--SIARWSTCTTMVDYE-- 1031
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWK----GQKLL-SRAEFHVGAHVTKFL---RLQMLAT 1305
S+ D+ N+ I K SE G L+ +R H H + + Q + T
Sbjct: 1032 SVAGGDKFGNMFIVRSPAKASEEADEDAAGLHLVNARDYLHGTQHRLELMCHFFTQDILT 1091
Query: 1306 SSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVA 1362
S ++TG G LL+ + G+IG P ++ F QSL++ L +A
Sbjct: 1092 SINKTGLVVGGQDV----LLWSGIMGTIGVFIPFVSREDTDF--FQSLEQHLRTEDGPLA 1145
Query: 1363 GLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1422
G + +R +++ K ++D +L + +LP +++ IA + + +I ++
Sbjct: 1146 GRDHLMYRSYYAPVKG-------VIDGDLCERFSILPNDKKQMIAGELDRSVREIERKIS 1198
Query: 1423 DLALGTSF 1430
D+ ++F
Sbjct: 1199 DIRARSAF 1206
Score = 44.7 bits (104), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 144/639 (22%), Positives = 231/639 (36%), Gaps = 116/639 (18%)
Query: 74 QEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
Q G+KE SG + D + L+ H + G + S+A+ G++ +
Sbjct: 21 QFAGTKEQLIITGSGSQLTLLRPDPAQGKVIALLSH-DIFGILRSIAVFRLAGSN----K 75
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
D IILA + +I++LE+ + M F K G G + DP+GR
Sbjct: 76 DYIILATDSGRITILEYLPGPNRFNRLHMETFG-------KSGIRRVVPGEYLACDPKGR 128
Query: 191 C---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
V L ++ + SQ E T S A VI + LD+
Sbjct: 129 ACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIAMVALDV---- 175
Query: 248 DFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
GY PV L E + G + T ++ + L WS
Sbjct: 176 ------GYANPVFAALEIEYTEVDQDITGEALSEVETQLV-YYELDLGLNHVVRKWSD-- 226
Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPR 357
P D L P G GVLV G I Y HS + + + + E P
Sbjct: 227 -PVDPTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQDALRVPIPRRR----GATEDPS 281
Query: 358 SSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKT 406
++ H L+ LL + GDL +T+ V D VQR+ +
Sbjct: 282 RKRNIVAGVMHK--LKGSAGAFFFLLQSDDGDLFKITIDMVEDEEGAPTGEVQRMKIKYF 339
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF--GDIEADAPST 464
+ + + + + + ++ S+ G+ QF E+F + A P
Sbjct: 340 DTVPVATSLCILKSGFLYVASQFGNYAFYQFEKLGDDDDEVEFSSEDFPVDPLAAYEPVY 399
Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNIGPLKDFSYGLR 523
R + + AL D + L +N T E A + F+ + LK +GL
Sbjct: 400 FYPRLAENLALVDSIPAMNPLLDCKVANLTGEDAPQIFTICGNGARSTFRTLK---HGLE 456
Query: 524 INADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
+N ++ ELPG +WT+ S D++Y AY+++S
Sbjct: 457 VNEIVAS------------ELPGVPSAVWTLKLNS--------------DEQYDAYIVLS 490
Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQ 641
T+VL + + EV++S + TIAA L G +IQV RG R I +G
Sbjct: 491 FTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLLGTDGLIQVHPRGIRHIRNG----- 543
Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
N S ++ ++++ S V + +S G I
Sbjct: 544 -------NVNEWSAPQHRSIVAASTNSHQVAIALSSGEI 575
>gi|58258783|ref|XP_566804.1| U2 snRNA binding protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|338819361|sp|P0CR23.1|RSE1_CRYNB RecName: Full=Pre-mRNA-splicing factor RSE1
gi|338819362|sp|P0CR22.1|RSE1_CRYNJ RecName: Full=Pre-mRNA-splicing factor RSE1
gi|57222941|gb|AAW40985.1| U2 snRNA binding protein, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1217
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
LA QG LL G + L++ L + P VV++N+ I++GD+ +S ++
Sbjct: 951 LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1010
Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
++ QL + A D + C + +D T++ D+ NI I P +SE
Sbjct: 1011 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1065
Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
G +L F +GA H T+ + + + TS + G R L++ T
Sbjct: 1066 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1121
Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
+ G++G + P D++ F + +L+ + + G + ++R ++ K
Sbjct: 1122 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1172
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+VD +L + +LP +Q IA + +L L + ++F
Sbjct: 1173 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1217
>gi|391867503|gb|EIT76749.1| splicing factor 3b, subunit 3 [Aspergillus oryzae 3.042]
Length = 1034
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 77/357 (21%), Positives = 159/357 (44%), Gaps = 36/357 (10%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
+TI ++ +E A++V V T++++ET L +GTA + +A G + ++ R
Sbjct: 695 STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 748
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG L+ GP + ++ +L P +
Sbjct: 749 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 807
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T +
Sbjct: 808 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 864
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
D+ NI + K+SE G L+ R H + + + Q + T+ +T
Sbjct: 865 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 924
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 925 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 978
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 979 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1028
>gi|261329035|emb|CBH12013.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
gambiense DAL972]
Length = 1270
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 122/292 (41%), Gaps = 28/292 (9%)
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNAD--NPQNLVTEVYSKELKGAISALA---SLQ 1164
++ IGT +V ++ +R ++ T A + L+ SK+++GA+ +
Sbjct: 852 VVLIGTTFVFPDEQLSRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYA 911
Query: 1165 GHLLIASGPKIILHKWTGTELNGIA--FYDAPPLYVVSLNIVK---NFILLGDIHKSIYF 1219
G + + G ++L+ W + +A L V + +++ ++I+ D S +F
Sbjct: 912 GRVALGIGGCVVLYSWNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFF 971
Query: 1220 LSWKEQGAQLNLLAKD---FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ LN++A+D G +DC ++ S + + D+ N + ++ S
Sbjct: 972 VRIDTIQGSLNIVARDPELRGVMDCAILQY---ESRHDVCLGDDLFNFFCVSHVEPLANS 1028
Query: 1277 -------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LL 1325
+KL + A++H+G +T + A S P R ++
Sbjct: 1029 SGVSAPAMPTKKLQTSAQYHMGDLIT-VMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIV 1087
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+GT G+ G I P+ TF L+ L+ + VP + G SFR+ G+
Sbjct: 1088 YGTSHGAFGAITPISSETFILLKGLEVSVASVVPPLGGFTHASFREVLRVGQ 1139
>gi|134106833|ref|XP_777958.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260658|gb|EAL23311.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1218
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 32/285 (11%)
Query: 1160 LASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
LA QG LL G + L++ L + P VV++N+ I++GD+ +S ++
Sbjct: 952 LAGFQGFLLAGIGKSLRLYEMGKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFY 1011
Query: 1220 LSWKE-QGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
++ QL + A D + C + +D T++ D+ NI I P +SE
Sbjct: 1012 CVYRSIPTRQLLIFADDSQPRWITCVTS---VDYETVA--CGDKFGNIFINRLDPSISEK 1066
Query: 1277 WK----GQKLLSRAEFHVGA-HVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
G +L F +GA H T+ + + + TS + G R L++ T
Sbjct: 1067 VDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVVTSITKIPLVAG----GRDVLVYTT 1122
Query: 1329 LDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1385
+ G++G + P D++ F + +L+ + + G + ++R ++ K
Sbjct: 1123 ISGAVGALVPFVSSDDIEF--MSTLEMHMRTQDISLVGRDHIAYRGYYVPIKG------- 1173
Query: 1386 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+VD +L + +LP +Q IA + +L L + ++F
Sbjct: 1174 VVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQMRTSSAF 1218
>gi|119473054|ref|XP_001258481.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
181]
gi|119406633|gb|EAW16584.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
181]
Length = 1209
Score = 50.4 bits (119), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 101/479 (21%), Positives = 208/479 (43%), Gaps = 52/479 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + +Q + ++ S DN + IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQAQNL----RIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + LI+ + + L D + ++I++P A
Sbjct: 815 NVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---VI 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
+TI ++ +E A+++ V +++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPL-- 1196
++ + L ++ +++ AL QG LL G + ++ +L + AP +
Sbjct: 924 EDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRVYDLGMKQL--LRKCQAPVVSK 980
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
+V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 981 TIVGLQTQGSRIIVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETVA-- 1037
Query: 1257 VSDEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSD 1308
D+ N+ + K+SE G L+ R H + + Q + TS
Sbjct: 1038 GGDKFGNLWLVRCPKKVSEEADEDGSGAHLIHERGYLHGTPNRLDLMIHTYTQDIPTSLH 1097
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
+T G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1098 KTQLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRD 1151
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1152 HLIYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
Length = 1230
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 127/607 (20%), Positives = 220/607 (36%), Gaps = 120/607 (19%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++++L+F +
Sbjct: 41 LLRADKQGKLNLIASKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLTILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF + +S N LD K V + + Y
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEALD-KQVTNTSEIDSYTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L W + H + P DA L +P P
Sbjct: 209 TLLSL------WEMDLGLNH-------------------VIRKYTFPIDASAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE S L
Sbjct: 244 GQQGPSGVIVCCDNFLVYKKVDHADVYCA-----YPRRLETGQEKNLSIVCSTLHRIRKF 298
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + ++ VV+ + + + + I + + F+ + G
Sbjct: 299 FF----ILIQSELGDLYKIEMEHEDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFTCGSG----TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--L 484
+ QF+ G G +M +S K G A TK+L ++ L D V L
Sbjct: 355 NHFFYQFS-GIGDEDNEAMCTS--KHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPIL 408
Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYEL 541
+ + N S Q +L GP L+ +GL I A
Sbjct: 409 DMKVIDAKNASSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN----------- 450
Query: 542 VELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
ELPG K IWT+ + NA +Y Y+I+S E T++LE + + EV
Sbjct: 451 -ELPGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVV 495
Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
+S+ + T N+ +IQV + G R ++G + + + P N + + + N
Sbjct: 496 DSL--LLTNVTTIHVNILYDNSLIQVHDAGIRHINGKVIHEWVP--PKNKQIKAATSNCA 551
Query: 661 VLSVSIA 667
+ +S++
Sbjct: 552 QIVISLS 558
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 111/286 (38%), Gaps = 45/286 (15%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G LL + G K+ ++ +L Y P ++S+ + + I DI +S+
Sbjct: 965 CFCPFNGRLLASIGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1024
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
+ L L++ D +E L + ++ L V +E K +
Sbjct: 1025 IFFYDANMNTLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1084
Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+Y M+ S K ++L FHVG VT ++++ TSS+
Sbjct: 1085 NKCWYGGEIMAGSNKNRRLEHIMSFHVGEIVTSLQKVKLSPTSSE--------------C 1130
Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+++ T+ G+IG P D ELT Q L+ L P + G FR ++ +
Sbjct: 1131 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1185
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +L + LP + Q ++A T IL L D+
Sbjct: 1186 ------HVIDGDLCEQFSSLPYDVQRKVAADLERTPDDILRKLEDI 1225
>gi|72390667|ref|XP_845628.1| damage-specific DNA binding protein [Trypanosoma brucei TREU927]
gi|62359843|gb|AAX80271.1| damage-specific DNA binding protein, putative [Trypanosoma brucei]
gi|70802163|gb|AAZ12069.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 1270
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 122/292 (41%), Gaps = 28/292 (9%)
Query: 1110 LLAIGTAYVQGEDVAARGRVLLFSTGRNAD--NPQNLVTEVYSKELKGAISALA---SLQ 1164
++ IGT +V ++ +R ++ T A + L+ SK+++GA+ +
Sbjct: 852 VVLIGTTFVFPDEQLSRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYA 911
Query: 1165 GHLLIASGPKIILHKWTGTELNGIA--FYDAPPLYVVSLNIVK---NFILLGDIHKSIYF 1219
G + + G ++L+ W + +A L V + +++ ++I+ D S +F
Sbjct: 912 GRVALGIGGCVVLYSWNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFF 971
Query: 1220 LSWKEQGAQLNLLAKD---FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1276
+ LN++A+D G +DC ++ S + + D+ N + ++ S
Sbjct: 972 VRIDTIQGSLNIVARDPELRGVMDCAILQY---ESRHDVCLGDDLFNFFCVSHVEPLANS 1028
Query: 1277 -------WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA----LL 1325
+KL + A++H+G +T + A S P R ++
Sbjct: 1029 SGVSAPAMPTKKLQTTAQYHMGDLIT-VMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIV 1087
Query: 1326 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
+GT G+ G I P+ TF L+ L+ + VP + G SFR+ G+
Sbjct: 1088 YGTSHGAFGAITPISSETFILLKGLEVSVASVVPPLGGFTHASFREVLRVGQ 1139
>gi|430813298|emb|CCJ29330.1| unnamed protein product [Pneumocystis jirovecii]
Length = 1197
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 119/531 (22%), Positives = 211/531 (39%), Gaps = 82/531 (15%)
Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
SRP + L + P + D S+ + C+ G + + Q LKI + D
Sbjct: 718 SRPWLSYIINASLHLVPLIYD-SLEYCWGFSSEQCSEGIVGIQGQD-LKIFMV---ERLD 772
Query: 991 NYWPVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1050
N I L TP + +++++ +I S + PL++ + G Q +H L
Sbjct: 773 NVLKQDSISLMYTPRRFIKHPDEHIFYIIESDHNVLPLSERQK----RVEGLQNGDHILL 828
Query: 1051 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI------------PMQSSENALTVRVVT 1098
D+ P G W + TI + +E A ++ +VT
Sbjct: 829 PEDIGL-------------PRGLSGNWASCITILDPLSKKILTRIELDDNEAAFSIAMVT 875
Query: 1099 LFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNPQNLVTEVYSKELKG 1155
N + +E LAIG+ G++V + S R D ++ + V+ E+
Sbjct: 876 FKN---QNDEIFLAIGS----GKNVILAPKSFSAAYISIYRFIDQGKS-IELVHKTEVDD 927
Query: 1156 AISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHK 1215
AL QG LL G + +++ + A P +V L+ + I++ DI +
Sbjct: 928 IPLALLGFQGRLLAGLGKMLRIYEMGMKKCLRKCEVRAVPNCIVQLHTQGSRIIIADIQE 987
Query: 1216 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
SI+F +K +L + A D T ++D T++ D+ N I ++SE
Sbjct: 988 SIHFAVYKYLENRLIVFADDVIP-RWTTTSTMLDYETVA--AGDKFGNFWINRCPLEVSE 1044
Query: 1276 SW----KGQKLLSRAEFHVGAHVTKFLRLQMLA--------TSSDRTGAAPGSDKTNRFA 1323
S G +L+ + GA RL+MLA TS + G R
Sbjct: 1045 SADEDPSGAQLIHEKSYLFGAAK----RLKMLAHFYIGDTFTSMHKVQLIAGG----RDI 1096
Query: 1324 LLFGTLDGSIGCIAPL----DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1379
+++ + GSIG P D F++L++L + S+ G + +R ++ K
Sbjct: 1097 IVYTGMMGSIGIFLPFVGREDVDFFQQLEALMRTEDLSL---IGRDHLMYRGYYVPVK-- 1151
Query: 1380 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
S+VD +L + MLP ++ IA++ S+I + D+ + +F
Sbjct: 1152 -----SVVDGDLCERFLMLPYNKKQVIANELDREISEIAKKIEDMRVRVAF 1197
Score = 45.4 bits (106), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 118/544 (21%), Positives = 206/544 (37%), Gaps = 92/544 (16%)
Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
+HG + +L G + +D +I+ + +I++LE+ + +
Sbjct: 65 VHGIIRTLVGFRLAGTN----KDHLIVGSDSGRITILEYKPDSNAFSKVHQETYG----- 115
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
K G G + VDP+GR + ++ ++ + A
Sbjct: 116 --KSGVRRVVPGQYLAVDPKGRATMIASIEKNKLVYVLNRDSA------TNLTISSPLEA 167
Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMIS 284
S V +L +D+ GY PV L E E +G+ +++ +++
Sbjct: 168 HKSCSLVFHLIGMDV----------GYENPVFAALEVDYTEAESDPSGK-AYREIQKVLT 216
Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASC 338
+ L WS P D L V P G G LV +I Y H +
Sbjct: 217 YYELDLGLNHVVRKWSD---PVDRKANLLVTVPGGSDGPSGALVCTEGSIFYKHKGKKTH 273
Query: 339 ALALNNYAVSLDSSQ--ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
+ + SL++SQ ++ SS ++ A LQN+ GDL +T+ +
Sbjct: 274 RIPIPTRIGSLENSQKKQIIVSSVVHKMRGAFFFLLQNE-------DGDLFKVTIDSNDG 326
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF-TCGSGTSMLS-SGLKEEF 454
V+ L + + +++ ++ + + F+ S G+ L QF G + + S +
Sbjct: 327 EVESLKIKYFDTVPVSTGLSILKSGFLFVASEYGNHHLYQFEKLGDDNNEIEFSSVDFPV 386
Query: 455 GDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNI 512
D+ E PS R R + L D +N + N T E A + ++ R
Sbjct: 387 LDLNEGYEPSYFRPRSLENLLLVDDLNSMNPLMDSKILNLTDEDAPQIYALCGR------ 440
Query: 513 GPLKDFS---YGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRM 568
GP F YGL +N + A+G LPG +WT SS
Sbjct: 441 GPRSTFRTLRYGLEVN-EIVASG-----------LPGSPTAVWTTKLTSS---------- 478
Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
D+Y AY+++S T+VL + + EV+++ + T+A L G +IQV
Sbjct: 479 ----DQYDAYIVLSFVNGTLVLSIGETVEEVSDT-GFLSSSPTLAVQQL-GDDALIQVHP 532
Query: 629 RGAR 632
+G R
Sbjct: 533 KGIR 536
>gi|154342093|ref|XP_001566998.1| putative CPSF-domain protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064323|emb|CAM40524.1| putative CPSF-domain protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 1347
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 67/328 (20%), Positives = 132/328 (40%), Gaps = 43/328 (13%)
Query: 1084 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQ 1142
P + + V +T + ++ + LL +G+++ ++ AR GRV+ F+ + +
Sbjct: 896 PAERCGEQIDVPAITSAGVSEEDWQHLLLVGSSFTFPDEQRARSGRVMWFAL--HEERQG 953
Query: 1143 NLVTEVYSKELKGAISALASL---QGHLLIASGPKIILHKWTGTELN---------GIAF 1190
+ + SK++ GA+ A + +G + + + L+KW + G+
Sbjct: 954 QRLRLIASKDIGGALQCCAEVPYYKGRIALGVNGCVCLYKWNTEDQTFVAEERCRVGLTV 1013
Query: 1191 YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD---FGSLDCFATEFL 1247
PLY +L + ++ D+ S +F+ L +L ++ G +D +
Sbjct: 1014 TRLIPLYNTAL--AASVLVALDVRHSAFFIEVDLLQGSLKVLCREGNLRGVMDGY----- 1066
Query: 1248 IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ-----------KLLSRAEFHVGAHVTK 1296
+ +L + D+ N P E G + RA++HVG VT
Sbjct: 1067 VGSDAENLCLFDDNLNFTALKVVPLPVEPGDGDAAAAASGTPQCRFEVRAQYHVGDLVTC 1126
Query: 1297 FLRLQMLATSSDRTGAAPGSDKTNRFA-------LLFGTLDGSIGCIAPLDELTFRRLQS 1349
ATS + S + L+F T G G + PL T+ L++
Sbjct: 1127 VRPGSFAATSLMKAPTPSSSVPSPLLLPGIAGPQLVFATAHGGFGVVTPLHAATYLVLRA 1186
Query: 1350 LQKKLVDSVPHVAGLNPRSFRQFHSNGK 1377
L+ L ++P + GL+ ++FR+ G+
Sbjct: 1187 LEASLERTLPPLGGLSHQAFREVLRAGQ 1214
>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
strain H]
Length = 1276
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 112/286 (39%), Gaps = 45/286 (15%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
+ G LL + G K+ ++ +L Y P ++S+ + + I DI +S+
Sbjct: 1011 CFSPFNGRLLASVGNKLRIYALGKKKLLKKCEYKDIPEAIISIKVSGDRIFASDIRESVL 1070
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
+ L L++ D +E L + ++ L V +E K +
Sbjct: 1071 VFFYDANMNALRLISDDIIPRWITCSEILDHHTIMAADKFDSVFVLRVPEEAKQEEYGIS 1130
Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+Y M+ S K ++L FHVG VT ++++ TSS+
Sbjct: 1131 NKCWYGGEMMAGSNKNRRLEHIMNFHVGEIVTSLQKVKLSPTSSE--------------C 1176
Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+++ T+ G+IG P D ELT Q L+ L P + G FR ++ +
Sbjct: 1177 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTENPPLCGREHIFFRSYYHPVQ- 1231
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +L + LP + Q ++A T IL L D+
Sbjct: 1232 ------HVIDGDLCEQFSSLPYDIQRKVAADLERTPDDILRKLEDI 1271
Score = 47.8 bits (112), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 123/604 (20%), Positives = 222/604 (36%), Gaps = 114/604 (18%)
Query: 92 LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
L+ L L+ + G + L G++ +D +++ + ++ +L+F +
Sbjct: 41 LLRADKQGKLNLIVSKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96
Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
+ +HC + K G G + VDP+GR + VY L
Sbjct: 97 NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDNK 149
Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
Q+ I D G GF + +S N D K V + +
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEMYD-KQVTNTTEIDACTRK 208
Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
++ L E +L ++ +++H LP D L +P P
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDMSAHLLIPIPG 243
Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
G GV+V N + Y CA Y L++ QE + S+ H
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSTVHRI 295
Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ L+ ++ GDL + + + VV+ + + + + I + + F+ + G
Sbjct: 296 R-KFFFILIQSEYGDLYKIEMDHQDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
+ QF+ G + K G A TK+L ++ L D V L +
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411
Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
+ N S Q ++ R GP L+ +GL I A EL
Sbjct: 412 ILDAKNANSPQ-IYALCGR------GPRSSLRILQHGLSIEELADN------------EL 452
Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
PG K IWT+ + NA +Y Y+I+S E T++LE + + EV +++
Sbjct: 453 PGRPKYIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDTL 498
Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
+ T N+ +IQV + G R ++G + + + P N + + + N+T +
Sbjct: 499 --LLTNVTTIHVNILYDNSLIQVHDTGIRHINGKVINEWVP--PKNKQVKAATSNATQIV 554
Query: 664 VSIA 667
+S++
Sbjct: 555 ISLS 558
>gi|225558618|gb|EEH06902.1| DNA damage-binding protein 1a [Ajellomyces capsulatus G186AR]
Length = 1201
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)
Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
L +GT+Y+ GE + RGR+L F N + +V +KGA ALA +Q +
Sbjct: 885 LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 938
Query: 1168 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ A ++++ ++ L+ A Y AP V + + N I + D+ KS+
Sbjct: 939 VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 994
Query: 1220 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ + +QGA L +A+ F +L A + + + L SD + N+ + +
Sbjct: 995 VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 1050
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
++L +E +G V + + + + +P + GT++GSI
Sbjct: 1051 TDDDRRRLEVTSEISLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1102
Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
G I P + RLQS +V + G+ FR F N P VD EL
Sbjct: 1103 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQAEEPYRFVDGEL 1158
Query: 1392 LSHYEMLPLEEQLEI 1406
+ + +E Q EI
Sbjct: 1159 IERFLGCSVELQEEI 1173
Score = 44.7 bits (104), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341
Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
W Q D LL+ G L L +V D VQ +LDL P S +
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416
>gi|317143715|ref|XP_001819645.2| pre-mRNA-splicing factor rse1 [Aspergillus oryzae RIB40]
Length = 1209
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 77/357 (21%), Positives = 159/357 (44%), Gaps = 36/357 (10%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
+TI ++ +E A++V V T++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG L+ GP + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
D+ NI + K+SE G L+ R H + + + Q + T+ +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
Length = 1139
Score = 50.1 bits (118), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 121/296 (40%), Gaps = 36/296 (12%)
Query: 1113 IGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1171
+GTAY+ E + RGR+LLF N ++ +KGA ALA + +++ A
Sbjct: 833 VGTAYLDDETAESIRGRILLFEVDSNRK-----LSLFLEHPVKGACRALAMMGDYIVAAL 887
Query: 1172 GPKIILHKWTGTELNGIAFYDAPPLY-----VVSLNIVKNFILLGDIHKSIYFLSWKEQG 1226
+++ + TG G +Y V + + I++ D+ KSI + +
Sbjct: 888 VKTVVIFEVTGQPQTGKYSLQKAAVYRTSTAPVDIAVTDKTIVVADLMKSISIVESNKTD 947
Query: 1227 AQLNLLAKDFGSLDCFATEF---LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
A L + AK+ FAT + + D + +VSD + N+ + ++L
Sbjct: 948 A-LTMEAKEVAR--HFATVWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDRRRLE 1004
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAPLD 1340
+E +G V + + + TS+ P + GT++GSI I P
Sbjct: 1005 VTSELLLGEMVNRIRPVNIPQTST--MAVTPKA--------FLGTVEGSIYLFALINPEH 1054
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNP-RSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ RLQ+ VDS GL P FR F S + P VD EL+ +
Sbjct: 1055 QDFLMRLQTAISAYVDS----PGLMPFNKFRAFRSTVREAEE-PFRFVDGELIERF 1105
Score = 40.4 bits (93), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G I Y +D ++ + S LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKY-----------------IDDAK---NETISNPLDEA 284
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSL 422
W+Q D LL+ G L L +V D + V+ L + S + +G +
Sbjct: 285 TIFVAWVQVDGQRWLLADDYGRLFFLMLVLDSQNEVEGWKLDYLGEASRASVLIYLGAGM 344
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ + GS
Sbjct: 345 TFIGSHQGDSQVIRISEGS 363
>gi|115397303|ref|XP_001214243.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192434|gb|EAU34134.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1140
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 132/322 (40%), Gaps = 51/322 (15%)
Query: 1111 LAIGTAYV-QGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
+GTAY+ + ED + RGR+L+F DN + L T+V +KGA ALA L ++
Sbjct: 831 FVVGTAYLDEDEDRDSIRGRILMF----EVDNGRKL-TKVAELAVKGACRALAMLGDKVV 885
Query: 1169 IASGPKIILHKWTGT-----ELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
A ++++K TG +L +A Y AP V + + N I + D+ KS +
Sbjct: 886 AALVKTVVIYKVTGNNFGAMKLEKLASYRTSTAP----VDITVTDNVIAVSDLMKSSCLV 941
Query: 1221 SW--KEQGA--QLNLLAKDFGS-----LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
+ E G L +A+ F + + C A ++ SD + N+ I
Sbjct: 942 EYIEGEDGLPDSLKEVARHFQTVWATGIACIAPHTYLE--------SDAEGNLIILRRNL 993
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
E ++L E +G V + + +Q LA+ + A GT+
Sbjct: 994 SGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 1041
Query: 1330 DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDC 1389
+GSI A ++ L LQ + + + + FR F S + + P VD
Sbjct: 1042 EGSIYLYAIINPEHQDFLMRLQATMAGKIESLGDMPFNEFRGFRSMVREAKE-PYRFVDG 1100
Query: 1390 ELLSHYEMLPLEEQLEIAHQTG 1411
EL+ + Q +I + G
Sbjct: 1101 ELIERFLTCEPSVQEDIVNSVG 1122
Score = 40.8 bits (94), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 60/146 (41%), Gaps = 25/146 (17%)
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS--LDSSQELPRS 358
A L A L+ VP+P+GG+L++G +I Y NN VS LD +
Sbjct: 237 AQELDLGASHLIPVPAPLGGLLILGETSIKYVDDD-------NNEIVSRLLDEA------ 283
Query: 359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
W Q D LL+ G L L +V D VQ L + S +
Sbjct: 284 -------TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSENQVQGWQLDHLGNTSRASTL 336
Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS +++ GS
Sbjct: 337 VYLGGGVIFVGSHQGDSQVLRVGDGS 362
>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 435
Score = 50.1 bits (118), Expect = 0.010, Method: Composition-based stats.
Identities = 52/214 (24%), Positives = 99/214 (46%), Gaps = 10/214 (4%)
Query: 1194 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
P +++SL++++N+I++GDI S+ LS+ + L + +D+ ++ C F+ S
Sbjct: 197 PSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSILTEVCRDYSNVWC---TFVCALSKS 253
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA 1313
+VSD + N +F + KL A F+ G V K L + + SS
Sbjct: 254 HFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVVNKMLPVSL---SSLIEEEE 310
Query: 1314 PGSD-KTNRFALLFGTLDGSIGCIAPLDEL-TFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
P ++ + ++L + +GSI I P L F++ ++ L DS+ + +N S
Sbjct: 311 PQNEILRKKESILCASSEGSISSIIPFSNLANFKKALCIELALNDSLSSIGNINDNSNNT 370
Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
+ N +VD E+ + +P E+Q +
Sbjct: 371 YKMN--LSEKSCKGVVDGEVFKMFFSMPFEKQFK 402
>gi|321260749|ref|XP_003195094.1| hypothetical protein CGB_G1120W [Cryptococcus gattii WM276]
gi|317461567|gb|ADV23307.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1276
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 112/266 (42%), Gaps = 42/266 (15%)
Query: 1111 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1157
LA+GTA++ GED V GRVLL + D ++ ++ GA+
Sbjct: 945 LAVGTAFLPPDDGEDSSWDEGNLAVVKEGRVLLLEI-KEGDAGGGWDVKIKAELTTVGAV 1003
Query: 1158 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1208
AL + G L +A+G K+ +H+ EL + + A + SL+ + + +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTMHRLDHNSVELEETSSW-ASAYVISSLSALHPSHTRPEGAL 1062
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD +S+ L+ E + ++ + A L D ++VVSD N+ +
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-AVVVSDAHSNLLTYR 1121
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
QKL A F + VT+F ++ T++ P +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQNGSLVPTTTAPEIIIPD--------VLFAT 1164
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKL 1354
+G +G I L ++ R L LQ+ +
Sbjct: 1165 REGRLGVIGELGTMSSRTLDDLQRNM 1190
>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
Length = 1143
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 133/323 (41%), Gaps = 52/323 (16%)
Query: 1111 LAIGTAYVQGEDVA-ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
IGTAY+ +D + A+GR+L+ D LVTE+ ++GA LA G ++
Sbjct: 834 FVIGTAYLDDQDASNAKGRILVLEV--TEDRRLKLVTEI---SVRGACRCLAVSHGRIVA 888
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAP--PLYV-----------VSLNIVKNFILLGDIHKS 1216
A +I++ + Y+ P P V + + + + I + D+ KS
Sbjct: 889 ALIKTVIIYSFE---------YETPSSPAMVKKAAYRTSTAPIDMCVTGDIIAVTDLMKS 939
Query: 1217 IYFL--SWKEQGAQLNL--LAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPK 1272
+ + + + G NL +A+ F +L A + + L SD + N+ + + K
Sbjct: 940 MSLVQHTLGQAGGPDNLTEVARHFDTLWGTAVANVDENIYLE---SDAEGNLVVLEHDVK 996
Query: 1273 MSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1332
++L +E +G V + R+ + +P + T T++GS
Sbjct: 997 GFSEEDRRRLRVTSEILLGEMVNRIRRIDV----------SPTPNATVIPRAFLATVEGS 1046
Query: 1333 I---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-QFHSNGKAHRPGPDSIVD 1388
I IA + R+Q+ ++V S HV R F+ Q G+ GP VD
Sbjct: 1047 IYLFALIAEGKQDLLIRMQNKMAEMVQSPGHVPFAKFRGFKTQVRDMGEE---GPSRFVD 1103
Query: 1389 CELLSHYEMLPLEEQLEIAHQTG 1411
EL+ + + Q E+A + G
Sbjct: 1104 GELIERFLDCDEDVQAEVAKELG 1126
>gi|358366432|dbj|GAA83053.1| UV-damaged DNA binding protein [Aspergillus kawachii IFO 4308]
Length = 1643
Score = 49.7 bits (117), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 132/325 (40%), Gaps = 58/325 (17%)
Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTAY+ E+ + RGR+L+F DN + L T+V +KGA ALA L G ++
Sbjct: 772 FVVGTAYLDDENEESIRGRILVFEI----DNGRKL-TKVAELPVKGACRALAML-GEKIV 825
Query: 1170 ASGPKIIL------HKWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYFL 1220
A+ K ++ + + +L +A Y AP V + + N I + D+ KS+ +
Sbjct: 826 AALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAP----VDVTVTGNVIAIADLMKSVCLV 881
Query: 1221 SWKE----QGAQLNLLAKDFGSL-----DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1271
+ E L +A+ F ++ C A + ++ +D + N+ +
Sbjct: 882 EYSEGENGMPDSLTEVARHFQTVWATGVVCIAKDTFLE--------TDAEGNLIVLRRNL 933
Query: 1272 KMSESWKGQKLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1329
E ++L E +G V + + +Q LA+ + A GT+
Sbjct: 934 TGVEEDDKRRLEVTGEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTV 981
Query: 1330 DGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1386
+GSI I P + RLQ+ V+S+ ++ R FR K P
Sbjct: 982 EGSIYLFAIINPEHQDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKE----PYRF 1037
Query: 1387 VDCELLSHYEMLPLEEQLEIAHQTG 1411
VD EL+ + Q EI G
Sbjct: 1038 VDGELIERFLTCEPSLQEEIVDSVG 1062
Score = 43.9 bits (102), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 105/264 (39%), Gaps = 29/264 (10%)
Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
+DP GR + VY + ++ Q S G + SG E R +D
Sbjct: 62 IDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTTAR-IDEL 120
Query: 245 HVKDFIFVHGYIEP--VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
V+ F+H P + ++ + + +V H++ S+ ++ L +
Sbjct: 121 FVRSSAFLHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATSSTGADAAFEES-LDGFSQ 179
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y V DS++ + R
Sbjct: 180 ELDLGASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP---- 220
Query: 363 ELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITT 417
LD A W Q D LL+ G L L +V D VQ L + S +
Sbjct: 221 -LDEATIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIY 279
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS +++ GS
Sbjct: 280 LGGGVIFVGSHQGDSQVLRIGNGS 303
>gi|325096432|gb|EGC49742.1| pre-mRNA-splicing factor Rse1 [Ajellomyces capsulatus H88]
Length = 1209
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 76/356 (21%), Positives = 151/356 (42%), Gaps = 38/356 (10%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNAD 1139
I ++ +E A++V V +++++ET L +GT G+D+ R R +
Sbjct: 872 IELEENEAAVSVAAVPF---SSQDDETFLVVGT----GKDMVVNPRSCTAGFIHIYRFQE 924
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
+ L ++ +++ AL QG LL G + ++ ++ P VV
Sbjct: 925 EGKEL-EFIHKTKVEQPPMALLGFQGRLLAGIGTDLRIYDLGMKQMLRKCQASVVPHLVV 983
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
L + I++ D+ +S+ ++ +K Q +L D S T ++D T++ D
Sbjct: 984 GLQTQGSRIIVSDVQESLTYVVYKYQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGD 1040
Query: 1260 EQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTG 1311
+ N+ + K SE G L+ ++ GA ++ Q L TS +
Sbjct: 1041 KFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQ 1100
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
G R L++ L G++ + P +E+ F QSL+ +L P +AG +
Sbjct: 1101 LVTG----GRDILVWTGLQGTVSMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLI 1154
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R +++ K +D +L Y +LP +++ +IA + + +I + D+
Sbjct: 1155 YRSYYAPAKG-------TIDGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203
>gi|330935579|ref|XP_003305038.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
gi|311318228|gb|EFQ86975.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
Length = 1115
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 122/297 (41%), Gaps = 35/297 (11%)
Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
IGTAY+ ++ + RGR+L+ P+ ++ V +KG LA+ +G ++
Sbjct: 809 FVIGTAYLDDQNTTSERGRILILEV-----TPERILKLVTEIAVKGGCRCLATCEGKIVA 863
Query: 1170 ASGPKIILHKWTG-------TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
A I+++ T+L AP + + + + I + D+ KS+ + +
Sbjct: 864 ALIKTIVIYDVEYPTQTPFLTKLATFRCSTAP----IDITVNGSKIAIADLMKSLVVVEY 919
Query: 1223 KEQGAQL-NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+ A L + L + + + T + SD + N+ + Y P ++
Sbjct: 920 TKGEAGLPDKLVEVARHYQITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRR 979
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAP 1338
L +E +G V + R+ +L T+SD P + GT++GSI G I+P
Sbjct: 980 LNVSSEMLLGEMVNRIRRIDVL-TASDAV-VIPRA--------FVGTVEGSIYLFGLISP 1029
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ L +LQ L VP ++ FR F NG P VD E + +
Sbjct: 1030 AHQ---NLLMTLQSNLGALVPAPGDMDFAKFRAFK-NGVREEEEPMRFVDGEFVERF 1082
>gi|358366518|dbj|GAA83139.1| nuclear mRNA splicing factor [Aspergillus kawachii IFO 4308]
Length = 1209
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 100/477 (20%), Positives = 199/477 (41%), Gaps = 48/477 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + Q + ++ S DN Q IPL TP + E+ L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + L++ D L D + +++++P A
Sbjct: 815 NVLSPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---VV 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
TI ++ +E A+++ V T++++ET L +GTA + +A G + ++ R
Sbjct: 870 HTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMSVNPPKSAGGYIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG L+ G + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S AT ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRT 1310
D+ N+ + K SE G L+ + G + + Q + TS +T
Sbjct: 1040 DKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|68531971|ref|XP_723667.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23478038|gb|EAA15232.1| Drosophila melanogaster CG13900 gene product [Plasmodium yoelii
yoelii]
Length = 1235
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 121/295 (41%), Gaps = 43/295 (14%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
+ S +M +S G A T++L+ D + + ++ + + ++N
Sbjct: 362 SGIGNDSNDAMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 419
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWT 552
R SL + +GL I A+ ELPG + IWT
Sbjct: 420 LPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 461
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
V +S EY Y+I+S E T++LE + + EV +S+ + T
Sbjct: 462 VKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 505
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
NL IQV++ G R ++G + + + P N + + + N + + VS++
Sbjct: 506 IHINLLYDNSFIQVYDTGIRHINGKIVQEWIP--PKNKQINAATSNGSQIVVSLS 558
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 60/300 (20%), Positives = 115/300 (38%), Gaps = 59/300 (19%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G ++++ G K+ ++ +L Y P +VS+ + + I DI +S+
Sbjct: 956 CFCPFNGRVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKVSGDRIFASDIRESVL 1015
Query: 1219 FLSWKEQGAQLNLLAKDFG----------------SLDCFATEFLIDGSTLSLVVS---- 1258
+ + L++ D + D F + F++ S L+ ++
Sbjct: 1016 IFFYDSNQNLIRLISDDIIPRWITCSEILDHHTIIAADKFDSVFILRVSLLTFFITPFCH 1075
Query: 1259 -------DEQKNI--QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
E+ I + +Y ++ S K +K+ FH+G VT ++++ TSS+
Sbjct: 1076 LVPEEAKQEEYGIANKCWYGGEVINSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPTSSE- 1134
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGL 1364
+++ T+ G+IG P D ELT Q L+ L + G
Sbjct: 1135 -------------CIIYSTIMGTIGAFIPYDSKEELELT----QHLEIILRTEKHSLCGR 1177
Query: 1365 NPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
FR ++ + ++D +L + LP E Q +I T +IL L D+
Sbjct: 1178 EHIFFRSYYHPVQ-------HVIDGDLCEQFSSLPFEVQRKIGSDLEKTPDEILRKLEDI 1230
>gi|116207186|ref|XP_001229402.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
gi|88183483|gb|EAQ90951.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
Length = 1211
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 99/455 (21%), Positives = 192/455 (42%), Gaps = 48/455 (10%)
Query: 995 VQK-IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1053
+QK IPL TP ++ E+ + I S P ++ + L++Q D L D
Sbjct: 786 IQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTLP-PELRAQLLEQSGAVNGDAAILPPED 844
Query: 1054 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1113
+ I D G I + +E A++ VV +++E E+ L +
Sbjct: 845 FGYPRATGRWASCISVVDPLGEEPSVLQRIDFEGNEAAVSAAVVPF---SSQEGESFLIV 901
Query: 1114 GTAYVQGEDVAARGRVLLFSTG-----RNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
GT G+D+ R FS G R ++ ++L ++ +++ AL QG LL
Sbjct: 902 GT----GKDMVLNPRK--FSEGYIHVYRFHEDGRDL-EFIHKTKVEEPPMALIPFQGRLL 954
Query: 1169 IASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1228
G + ++ +L A + P +VSL + I++GD+ + I ++ +K + +
Sbjct: 955 AGIGKTLRVYDLGLRQLLRKAQGEVAPQLIVSLQTQGSRIIVGDVQQGITYVVYKPESNK 1014
Query: 1229 LNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKG-----QKL 1282
L A D +++ + T ++D S+ D+ NI I + + S+ Q L
Sbjct: 1015 LLPFADD--TINRWTTCTTMVDYE--SVAGGDKFGNIWILRCSERASQESDEPGSEIQLL 1070
Query: 1283 LSRAEFHVGAH----VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
+R H GA Q L TS +T G L++ + G++G + P
Sbjct: 1071 HARNYLH-GAQSRLSAMAHFYTQDLPTSIVKTNLVVGGQD----VLVWSGIQGTVGVLIP 1125
Query: 1339 L---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+++ F Q+L+ + P +AG + +R ++ K ++D +L +
Sbjct: 1126 FVSREDVDF--FQNLESHMRAEDPPLAGRDHLIYRGYYVPVKG-------VIDGDLCERF 1176
Query: 1396 EMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+LP +++ IA + + +I ++D+ ++F
Sbjct: 1177 SLLPNDKKQMIAGELDRSVREIERKISDIRTRSAF 1211
>gi|325094412|gb|EGC47722.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H88]
Length = 1201
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)
Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
L +GT+Y+ GE + RGR+L F N + +V +KGA ALA +Q +
Sbjct: 885 LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 938
Query: 1168 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ A ++++ ++ L+ A Y AP V + + N I + D+ KS+
Sbjct: 939 VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 994
Query: 1220 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ + +QGA L +A+ F +L A + + + L SD + N+ + +
Sbjct: 995 VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 1050
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
++L +E +G V + + + + +P + GT++GSI
Sbjct: 1051 TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1102
Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
G I P + RLQS +V + G+ FR F N P VD EL
Sbjct: 1103 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQTEEPYRFVDGEL 1158
Query: 1392 LSHYEMLPLEEQLEI 1406
+ + +E Q EI
Sbjct: 1159 IERFLNCGVELQEEI 1173
Score = 44.7 bits (104), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341
Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
W Q D LL+ G L L +V D VQ +LDL P S +
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416
>gi|145240731|ref|XP_001393012.1| pre-mRNA-splicing factor rse1 [Aspergillus niger CBS 513.88]
gi|134077536|emb|CAK96680.1| unnamed protein product [Aspergillus niger]
Length = 1209
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 100/477 (20%), Positives = 198/477 (41%), Gaps = 48/477 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + Q + ++ S DN Q IPL TP + E+ L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + L++ D L D + +++++P A
Sbjct: 815 NVLAPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---VV 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
TI ++ +E A+++ V T++++ET L +GTA +A G + ++ R
Sbjct: 870 HTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMTVNPPGSAGGYIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG L+ G + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S AT ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRT 1310
D+ N+ + K SE G L+ + G + + Q + TS +T
Sbjct: 1040 DKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+R +++ K ++D +L Y +LP + ++ IA + + +I ++D+
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|66361481|ref|XP_627314.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
gi|46228697|gb|EAK89567.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
Length = 1317
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 63/295 (21%), Positives = 123/295 (41%), Gaps = 42/295 (14%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V+ ++ + +AL +G LL+ + ++ L + Y P + + +V +
Sbjct: 1040 VHITPIENSATALTGWRGRLLVGINKTLRVYSLGKKRLLRKSEYRNIPQGLTWIKVVNDR 1099
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI- 1266
I GDI + + Q L+AKD + ++D T++ VSD+ NI +
Sbjct: 1100 IFAGDISNGVLVFKFNNTSNQFILVAKDPMPRWLTSACEVLDYHTIA--VSDKFDNIIVS 1157
Query: 1267 ---------FYYAPKMSESWKGQ--------KLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
F + +++ Q ++ + A+FH+G VT + Q+ TS++
Sbjct: 1158 RVPAEASDDFSFVTSFTDNNNSQSSALMRTHQINTVAQFHLGDIVTCLQKSQLTPTSAE- 1216
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
A+++GT+ GSIG ++P L+ L L+ L + +
Sbjct: 1217 -------------AIIYGTVLGSIGSLSPILNNEDIELLSKLEILLRKQKSTLLSRDHLM 1263
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
FR ++S H +++D + + +L + Q EIA + T +I L+D
Sbjct: 1264 FRSYYS--PVH-----NVIDGDFCQTFTILDSQIQSEIASKLDVTVEEIYKKLDD 1311
>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1158
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 74/315 (23%), Positives = 131/315 (41%), Gaps = 44/315 (13%)
Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
L +GT+Y+ GE + RGR+L F N + +V +KGA ALA +Q +
Sbjct: 842 LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 895
Query: 1168 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ A ++++ ++ L+ A Y AP + + + N I + D+ KS+
Sbjct: 896 VAALMKTVVVYTISKGQFADYTLSKTASYRTSTAP----IDIAVTGNLIAVADLMKSVSI 951
Query: 1220 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ + +QG+ L +A+ F +L A + + + L SD + N+ + +
Sbjct: 952 VEY-QQGSNGLPDSLTEVARHFQTLWSTAVAHVAEDTWLE---SDAEGNLVMLHRNVNGV 1007
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
++L +E +G V + + + + +P + GT++GSI
Sbjct: 1008 TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1059
Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
G I P + RLQS +V + G+ FR F N P VD EL
Sbjct: 1060 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQAEEPYRFVDGEL 1115
Query: 1392 LSHYEMLPLEEQLEI 1406
+ + +E Q EI
Sbjct: 1116 IERFLSCSVELQEEI 1130
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 63/136 (46%), Gaps = 25/136 (18%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 258 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 299
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
W Q D LL+ G L L +V D VQ +LDL P S + +G + F+
Sbjct: 300 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 357
Query: 426 GSRLGDSLLVQFTCGS 441
GS GD L++ T GS
Sbjct: 358 GSHQGDPELIRITEGS 373
>gi|240275059|gb|EER38574.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H143]
Length = 1134
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 131/315 (41%), Gaps = 44/315 (13%)
Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
L +GT+Y+ GE + RGR+L F N + +V +KGA ALA +Q +
Sbjct: 818 LFVVGTSYLDDFGEG-SIRGRILAFEVTANRQ-----LAKVAEMPVKGACRALAIVQDKI 871
Query: 1168 LIASGPKIILH-----KWTGTELNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ A ++++ ++ L+ A Y AP V + + N I + D+ KS+
Sbjct: 872 VAALMKTVVVYTLSKGQFADYTLSKTASYRTSTAP----VDIAVTGNLIAVADLMKSVSI 927
Query: 1220 LSWKEQGAQ-----LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+ + +QGA L +A+ F +L A + + + L SD + N+ + +
Sbjct: 928 VEY-QQGANGLPDSLTEVARHFQTLWSTAVAPVAEDTWLE---SDAEGNLVMLHRNVNGV 983
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
++L +E +G V + + + + +P + GT++GSI
Sbjct: 984 TDDDRRRLEVTSEILLGEMVNRIRPVNIQGSQGAEAAISPRA--------FLGTVEGSIY 1035
Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1391
G I P + RLQS +V + G+ FR F N P VD EL
Sbjct: 1036 LFGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTIRQTEEPYRFVDGEL 1091
Query: 1392 LSHYEMLPLEEQLEI 1406
+ + +E Q EI
Sbjct: 1092 IERFLNCGVELQEEI 1106
Score = 44.7 bits (104), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
L A L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341
Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
W Q D LL+ G L L +V D VQ +LDL P S +
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+G + F+GS GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416
>gi|242018509|ref|XP_002429717.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
gi|212514723|gb|EEB16979.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
Length = 1218
Score = 49.3 bits (116), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 77/388 (19%), Positives = 149/388 (38%), Gaps = 58/388 (14%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RI++P +T + ++ +E AL++ +V FN + ++ + Y
Sbjct: 867 IRIIDPVEG----RTDKIVRLEQNEAALSIALVK-FNNHPESLFLVVGVVKEYQLSPRQV 921
Query: 1125 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1184
+ G + F + + + V+ + A +A+ G LL+ G + L+ +
Sbjct: 922 SFGYLYTFRINEDVTD----LELVHKTTVDEAPAAVCPYHGRLLVGVGRMLRLYDLGKKK 977
Query: 1185 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1244
L P +VS+ + + D+ +S+Y + +K Q QL + A D T
Sbjct: 978 LLRKCENKYIPNQIVSICATGQRVFVSDVQESVYMVRYKRQENQLIIFADDTHPRWITCT 1037
Query: 1245 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLSRA 1286
L D T++ +D+ NI I + +++ W QK A
Sbjct: 1038 TIL-DYDTVA--TADKFGNIAIIRLSSIITDDVDEDPTGNKALWDRGLLNGASQKADVLA 1094
Query: 1287 EFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1346
FHVG + ++ PG ++ L++ +L G++G + P T R
Sbjct: 1095 NFHVGETCMSLQKATLI----------PGGSES----LVYTSLSGTVGVLVP---FTSRE 1137
Query: 1347 ----LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
Q L+ + P + G + SFR ++ K +++D +L Y + +
Sbjct: 1138 DHDFFQHLEMHMRSEHPPLCGRDHLSFRSYYYPVK-------NVIDGDLCEQYNSIEPAK 1190
Query: 1403 QLEIAHQTGTTRSQILSNLNDLALGTSF 1430
Q IA S + L D+ +F
Sbjct: 1191 QKSIAEDLDRNPSDVSKKLEDIRTRYAF 1218
Score = 41.6 bits (96), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 68/323 (21%), Positives = 122/323 (37%), Gaps = 65/323 (20%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVMKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E GD AP AL+++V +E+ S +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPR----------ALRNLVQVDEMD-----SLSP 406
Query: 495 ESAQKTFSFAVRDS-----LVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG 546
A + A D+ L GP L+ +GL + S + ELPG
Sbjct: 407 IMACQVADLANEDTPQLYMLCGRGPRSTLRVLRHGLEV------------SEMAVSELPG 454
Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
+WTV + ++EY AY+I+S T+VL + + EVT+S
Sbjct: 455 NPNAVWTVKRR--------------VEEEYDAYIIVSFVNATLVLSIGETVEEVTDS--G 498
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
F+ + + G ++QV+ G R + N G + T++ +
Sbjct: 499 FLGTTPTLSCSALGDDALVQVYPDGIRHIRADKRV--------NEWKAPGKK--TIMKCA 548
Query: 666 IADPYVLLGMSDGSIRLLVGDPS 688
+ V++ ++ G + DP+
Sbjct: 549 VNQRQVVIALTAGELVYFEMDPT 571
>gi|171685748|ref|XP_001907815.1| hypothetical protein [Podospora anserina S mat+]
gi|170942835|emb|CAP68488.1| unnamed protein product [Podospora anserina S mat+]
Length = 1235
Score = 49.3 bits (116), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 77/365 (21%), Positives = 162/365 (44%), Gaps = 44/365 (12%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
+ + ++E A++ VV+ +++ E+ L +GT G+D+ R FS G R
Sbjct: 898 VDLDNNEAAISAAVVSF---ASQDGESFLIVGT----GKDMILSPR--QFSEGYIHVYRF 948
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
D+ ++L ++ +++ AL QG LL G + ++ +L A + P
Sbjct: 949 HDDGRDL-EFIHKTKIEEPPMALIPFQGRLLAGIGKTLRIYDLGLKQLLRKAQAEIAPQL 1007
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLV 1256
+VSL N I++GD+ + I + +K + +L A D +++ + T ++D S+
Sbjct: 1008 IVSLQTQGNRIVVGDVQQGITYAVYKPESNKLLAWADD--TINRWTTCTAMVDYE--SVA 1063
Query: 1257 VSDEQKNIQIFYYAPKMSESWKG-----QKLLSRAEFHVGAHVTKFLR---LQMLATSSD 1308
D+ N+ I + S+ Q + +++ H + T + Q L TS
Sbjct: 1064 GGDKFGNVWILRAPERASQESDEPGSEIQLVHAKSYLHGAPNRTALMAHFYTQDLPTSIT 1123
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
+T G LL+ + G++G + P +++ F QSL+ + P +AG +
Sbjct: 1124 KTNLVVGGQDV----LLWSGIQGTVGVLIPFVSREDVDF--FQSLESHMRAEDPPLAGRD 1177
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
+R ++ K ++D +L + +L +++ IA + + +I ++D+
Sbjct: 1178 HLIYRGYYVPVKG-------VIDGDLCERFALLANDKKQMIAGELDRSVREIERKISDIR 1230
Query: 1426 LGTSF 1430
++F
Sbjct: 1231 TRSAF 1235
>gi|449019082|dbj|BAM82484.1| UV-damaged DNA binding protein [Cyanidioschyzon merolae strain 10D]
Length = 1372
Score = 49.3 bits (116), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 119/302 (39%), Gaps = 58/302 (19%)
Query: 1110 LLAIGTAYV-QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS------ 1162
+ +GTAYV E +RGR+L+FS L+ E+Y+ +SALA
Sbjct: 981 VFVVGTAYVLPSEMEPSRGRILVFSR-----EELLLLNELYTPGAVYTMSALADPSDRTC 1035
Query: 1163 ---LQGHLLIASGPK--IILHKWTGT------ELNGIAFYDAPPLYVVSLNIVKNFILLG 1211
+A+G +IL+ W + EL +A + L V+ L + +L+G
Sbjct: 1036 RFPASAARFLAAGVNNVVILYDWGQSGHGDDYELREVARHLGHVL-VLRLEARGDQLLVG 1094
Query: 1212 DIHKSIYFLSW------KEQGAQ--LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
D+ KS+ L GA L +A D+ + A FL + + L+ +D N
Sbjct: 1095 DLMKSLCVLQLVLPEGETSDGASPCLKAVAWDYETAWITACAFLNEDTYLA---ADNSYN 1151
Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPG-------- 1315
+ P + S L FH+G V F R +++ +S A G
Sbjct: 1152 LLSLQRNPHETRSEFRHALNRAGAFHLGDLVNVFRRGKLVTEASGNEEAGTGNGHSTIDT 1211
Query: 1316 ---------------SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH 1360
+D +R LLF T G+IG I PLD R L ++K L H
Sbjct: 1212 ESTRDVARASTGTTTADNVSRQTLLFATTAGAIGIIVPLDPAQHRMLSRVEKALRSLTDH 1271
Query: 1361 VA 1362
A
Sbjct: 1272 PA 1273
>gi|189205943|ref|XP_001939306.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187975399|gb|EDU42025.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 1115
Score = 49.3 bits (116), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 67/297 (22%), Positives = 121/297 (40%), Gaps = 35/297 (11%)
Query: 1111 LAIGTAYVQGEDVAA-RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
IGTAY+ + + RGR+L+ P+ ++ V +KG LA+ +G ++
Sbjct: 809 FVIGTAYLDDQSTTSERGRILILEV-----TPERILKLVMEIAVKGGCRCLATCEGKIVA 863
Query: 1170 ASGPKIILHKWTG-------TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1222
A I+++ T+L AP + + + I++ D+ KS+ + +
Sbjct: 864 ALIKTIVIYDVEYPTQTPFLTKLATFRCSTAP----IDITVNGPKIVIADLMKSLVVVEY 919
Query: 1223 KEQGAQL-NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQK 1281
+ A L + L + + + T + SD + N+ + Y P ++
Sbjct: 920 TKGEAGLPDKLVEVARHYQITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRR 979
Query: 1282 LLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI---GCIAP 1338
L +E +G V + R+ +L T+SD P + GT++GSI G I+P
Sbjct: 980 LNVSSEMLLGEMVNRIRRIDVL-TASDAV-VIPRA--------FVGTVEGSIYLFGLISP 1029
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1395
+ L +LQ L +P ++ FR F NG P VD E + +
Sbjct: 1030 AHQ---NLLMTLQSNLGALIPAPGDMDFAKFRAFK-NGVRQEEEPMRFVDGEFVERF 1082
>gi|67600754|ref|XP_666354.1| CG13900 gene product [Cryptosporidium hominis TU502]
gi|54657334|gb|EAL36124.1| CG13900 gene product [Cryptosporidium hominis]
Length = 1318
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 63/295 (21%), Positives = 123/295 (41%), Gaps = 42/295 (14%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V+ ++ + +AL +G LL+ + ++ L + Y P + + +V +
Sbjct: 1041 VHITPIENSATALTGWRGRLLVGINKTLRVYSLGKKRLLRKSEYRNIPQGLTWIKVVNDR 1100
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI- 1266
I GDI + + Q L+AKD + ++D T++ VSD+ NI +
Sbjct: 1101 IFAGDISNGVLVFKFNNTSNQFILVAKDPMPRWLTSACEVLDYHTIA--VSDKFDNIIVS 1158
Query: 1267 ---------FYYAPKMSESWKGQ--------KLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
F + +++ Q ++ + A+FH+G VT + Q+ TS++
Sbjct: 1159 RVPVEASDDFSFVTSFTDNNNSQSSALMRTHQINTVAQFHLGDIVTCLQKSQLTPTSAE- 1217
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
A+++GT+ GSIG ++P L+ L L+ L + +
Sbjct: 1218 -------------AIIYGTVLGSIGSLSPILNNEDIELLSKLEILLRKQKSTLLSRDHLM 1264
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
FR ++S H +++D + + +L + Q EIA + T +I L+D
Sbjct: 1265 FRSYYS--PVH-----NVIDGDFCQTFTILDSKIQSEIASKLDVTVEEIYKKLDD 1312
>gi|156086042|ref|XP_001610430.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797683|gb|EDO06862.1| conserved hypothetical protein [Babesia bovis]
Length = 1450
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 5/103 (4%)
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
YVVSL+ ++ I++GD+ S+ L W QG +L + KDF S+ C A ID + S V
Sbjct: 1171 YVVSLDAYQDVIVIGDLMNSMRMLQW--QGTELREVCKDFNSVYCTAAA-AIDQT--SCV 1225
Query: 1257 VSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1299
V+D N +F ++ + K FH G + + R
Sbjct: 1226 VADSSGNFYVFAKRQVVTNDAEAIKAEDVGLFHHGELINRIRR 1268
>gi|70992737|ref|XP_751217.1| nuclear mRNA splicing factor [Aspergillus fumigatus Af293]
gi|74670386|sp|Q4WLI5.1|RSE1_ASPFU RecName: Full=Pre-mRNA-splicing factor rse1
gi|66848850|gb|EAL89179.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus Af293]
Length = 1225
Score = 48.9 bits (115), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 98/462 (21%), Positives = 197/462 (42%), Gaps = 48/462 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + +Q + ++ S DN + IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQAQNL----RIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + LI+ + + L D + ++I++P A
Sbjct: 815 NVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---VI 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
+TI ++ +E A+++ V +++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG LL G + ++ +L +
Sbjct: 924 EDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRIYDLGMKQLLRKCQAQVVSKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQDNILIPFVDDSVSRWTTSTT-MVDYETVA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRT 1310
D+ N+ + K SE G L+ + GA L + Q + TS +T
Sbjct: 1040 DKFGNLWLVRCPKKASEEADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188
>gi|400597418|gb|EJP65151.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1212
Score = 48.9 bits (115), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 87/430 (20%), Positives = 173/430 (40%), Gaps = 72/430 (16%)
Query: 1043 QIDNHNLSSVDLHRTYTVEEYEV----RILEPDRAGGPWQTR------------------ 1080
+ DNH L DL + V ++L P+ G P R
Sbjct: 813 EADNHTLPP-DLQAKLLADPAAVNGDAKVLPPEEFGHPRGNRRWASCISVVDPVSEEPSV 871
Query: 1081 -ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR--------VLL 1131
+ +++E A++ VV+ +++NE+ L +GT G+D+ R +
Sbjct: 872 LQKVDFENNEAAVSAAVVSF---ASQDNESFLVVGT----GKDMILNPRSSSEAYIYIYR 924
Query: 1132 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1191
F G + ++ +++ AL + QG LL G + ++ +L A
Sbjct: 925 FQEGGRE------LEFIHKTKIEEPAMALLAFQGKLLAGIGKTLRMYDLGMRQLLRKAQA 978
Query: 1192 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1251
+ P +VSLN + I++GD+ + + + +K +L A D + T ++D
Sbjct: 979 EVVPQQIVSLNTQGSRIVVGDVQQGVTLVVYKPASNKLIPFADDTIARWTTCTT-MVDYE 1037
Query: 1252 TLSLVVSDEQKNIQIFYYAPKMSESWKGQK-----LLSRAEFHVGAHVTKFL---RLQML 1303
S+ D+ N+ I K SE ++ + +R H H + + Q +
Sbjct: 1038 --SVAGGDKFGNMFIVRSPAKASEEADEEQAGLHLVNARDYLHGAQHRLELMCHFFTQDV 1095
Query: 1304 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1360
TS ++T G LL+ + G+IG P ++ F QSL++ L
Sbjct: 1096 PTSINKTSLVVGGQDV----LLWSGIMGTIGVFIPFVSREDADF--FQSLEQHLRTEDAP 1149
Query: 1361 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1420
+AG + +R +++ K ++D +L + LP +++ +A + + +I
Sbjct: 1150 LAGRDHLMYRSYYAPVKG-------VIDGDLCERFAALPNDKKQMMAGELDRSVREIERK 1202
Query: 1421 LNDLALGTSF 1430
++D+ ++F
Sbjct: 1203 ISDIRTRSAF 1212
Score = 47.8 bits (112), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 143/614 (23%), Positives = 217/614 (35%), Gaps = 133/614 (21%)
Query: 64 NVIEIYVVRVQEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
NV++ V Q G+KE SG + D + L+ H + G + S+A+
Sbjct: 19 NVVQ--AVLGQFAGTKEQLIITGSGSQLTILRPDPAQGKVIPLLSH-DIFGVLRSIAVFR 75
Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
G+ +D IILA + +I+VLE+ S + M F K G G
Sbjct: 76 LAGSS----KDYIILATDSGRITVLEYLPSPNRFSRLHMETFG-------KTGIRRVVPG 124
Query: 181 PLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
+ DP+GR V L ++ + SQ E T S A VI
Sbjct: 125 EYLACDPKGRACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIA 175
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLK 293
L LD+ GY PV L E + G + T ++ + L
Sbjct: 176 LTALDV----------GYANPVFAALEIDYTEVDQDNTGEALSEVETHLVY-YELDLGLN 224
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAV 347
WS P D L P G GVLV G + Y HS + + +
Sbjct: 225 HVVRKWSD---PVDPTASLLFQVPGGNDGPSGVLVCGEENVTYRHSNQDALRVPIPRRR- 280
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR----- 396
+ E P ++ H L+ LL T GDL +T+ V D
Sbjct: 281 ---GATEDPSRKRNIVAGVMHK--LKGSAGAFFFLLQTDDGDLFKITIDMVEDEEGAPTG 335
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
VQR+ + + + + + + + ++ S+ G+ QF E+ GD
Sbjct: 336 EVQRMKIKYFDTVPVATSLCILKSGFLYVASQFGNYAFYQF--------------EKLGD 381
Query: 457 IEADAPSTKRLRRSSSDALQD-MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
L SS D D + E + Y + N A+ DS+ + PL
Sbjct: 382 ------DDDELEFSSDDFPVDPLAAYEPVYFYPRPAEN---------LALVDSIPAMNPL 426
Query: 516 KDFSYGLRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSS 558
D DA S G +S + + ELPG +WT+ S
Sbjct: 427 LDCKVANLTGEDAPQIYSICGNGARSTFRTIKHGLEVNEIVASELPGVPSAVWTLKLNS- 485
Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
D++Y Y+++S T+VL + + EV++S + TIAA L
Sbjct: 486 -------------DEQYDTYIVLSFTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLL 530
Query: 619 GRRRVIQVFERGAR 632
G +IQV RG R
Sbjct: 531 GTDGLIQVHPRGIR 544
>gi|159130328|gb|EDP55441.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus A1163]
Length = 1225
Score = 48.9 bits (115), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 98/462 (21%), Positives = 197/462 (42%), Gaps = 48/462 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + +Q + ++ S DN + IPL TP ++ E+ L+ +I S
Sbjct: 759 QCVEGMVGIQAQNL----RIFSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + LI+ + + L D + ++I++P A
Sbjct: 815 NVLSPATR--ARLIEDSKARNGETNVLPPEDFGYPRATGHWASCIQIVDPLDAKA---VI 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
+TI ++ +E A+++ V +++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSMAAVPF---SSQDDETFLVVGTAKDMIVNPPSSAGGFIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG LL G + ++ +L +
Sbjct: 924 EDGKEL-EFIHKTKVEEPPLALLGFQGRLLAGIGSTLRIYDLGMKQLLRKCQAQVVSKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQDNILIPFVDDSVSRWTTSTT-MVDYETVA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDRT 1310
D+ N+ + K SE G L+ + GA L + Q + TS +T
Sbjct: 1040 DKFGNLWLVRCPKKASEEADEDGSGAHLIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFVSREDVDF--FQNLEMQLASQCPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188
>gi|341886298|gb|EGT42233.1| CBN-TAG-203 protein [Caenorhabditis brenneri]
Length = 108
Score = 48.5 bits (114), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 30/120 (25%), Positives = 62/120 (51%), Gaps = 16/120 (13%)
Query: 1314 PGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
PG+++ AL++ T+ G+IGC+ DE+ F +L+ + P + G + ++R
Sbjct: 2 PGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNLEMHVRSEYPPLCGRDHLAYR 55
Query: 1371 QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
+++ K S++D ++ + ++ L +Q E+A + G T S+I L D+ +F
Sbjct: 56 SYYAPCK-------SVIDGDICEQFSLMDLPKQKEVAEELGKTVSEISKKLEDIRTRYAF 108
>gi|167380951|ref|XP_001733297.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
gi|165902459|gb|EDR28278.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
Length = 349
Score = 48.1 bits (113), Expect = 0.032, Method: Composition-based stats.
Identities = 59/226 (26%), Positives = 101/226 (44%), Gaps = 20/226 (8%)
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL---LAKDFGSLDCFATEFLIDGST 1252
LYV ++ N IL+GD+ KSI S+ G N +++DF + A EF+ +
Sbjct: 128 LYVKTMG---NKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYTTAIEFVDENCY 184
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
LS SD N+ +F +ES + +L + A HVG + + + T S
Sbjct: 185 LS---SDSNSNLLVFNTNSTGNESERF-RLNNCAHIHVGECINVMCKGSIAPTHSTY--- 237
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPH-VAGLNPRSFRQ 1371
+ + +LFG + G IG I + + L +Q +++ + V P +++
Sbjct: 238 ----ETIQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDEWKK 293
Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+ K R +I+D ++ Y + E+Q EIAH +G QI
Sbjct: 294 VIDDWK--RMPSSNIIDGNIVESYLEMSKEKQCEIAHLSGVNEEQI 337
>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
Length = 1139
Score = 48.1 bits (113), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 115/270 (42%), Gaps = 25/270 (9%)
Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1206
+ + ++ L S G L+ A ++ ++T ++ + ++ I+K +
Sbjct: 871 KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 930
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL+GD+ KS+ L + G+ L +A++ + + + D + ++ N +
Sbjct: 931 FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMINDDY---FIGAEASNNFIV 986
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA---PGSDKTNRFA 1323
+ + + L S +H+G + +S R G+ P SD+
Sbjct: 987 VKKNNDSTNELERELLDSVGHYHIGESI-----------NSMRHGSLVRLPDSDQPIIPT 1035
Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
+L+ +++GSIG +A + E F LQK L V V G + ++R F ++ H
Sbjct: 1036 ILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSND--HHTIDS 1093
Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
+ +D +L+ + L E QL+ G T
Sbjct: 1094 KNFIDGDLIETFLDLKYESQLKAVADLGIT 1123
Score = 40.8 bits (94), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 84/204 (41%), Gaps = 29/204 (14%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V N+R L+ V D F++G P + +L + + H S T L
Sbjct: 150 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 201
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
P WS N+ Y L VP P+GGVLVV N I Y + + ++ AVS
Sbjct: 202 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSV-----AVSYTKFL 252
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
R V+ D + L G L +L +++ + V L + + S
Sbjct: 253 AFTR----VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 300
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
I+ + + + ++GS GDS L++
Sbjct: 301 SISYLDSGVVYIGSSSGDSQLIRL 324
>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA repair
protein E; AltName: Full=UV-damaged DNA-binding protein 1
gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
Length = 1181
Score = 48.1 bits (113), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 115/270 (42%), Gaps = 25/270 (9%)
Query: 1152 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1206
+ + ++ L S G L+ A ++ ++T ++ + ++ I+K +
Sbjct: 913 KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 972
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
FIL+GD+ KS+ L + G+ L +A++ + + + D + ++ N +
Sbjct: 973 FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMINDDY---FIGAEASNNFIV 1028
Query: 1267 FYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAA---PGSDKTNRFA 1323
+ + + L S +H+G + +S R G+ P SD+
Sbjct: 1029 VKKNNDSTNELERELLDSVGHYHIGESI-----------NSMRHGSLVRLPDSDQPIIPT 1077
Query: 1324 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
+L+ +++GSIG +A + E F LQK L V V G + ++R F ++ H
Sbjct: 1078 ILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSND--HHTIDS 1135
Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1413
+ +D +L+ + L E QL+ G T
Sbjct: 1136 KNFIDGDLIETFLDLKYESQLKAVADLGIT 1165
Score = 40.8 bits (94), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
+V N+R L+ V D F++G P + +L + + H S T L
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 243
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
P WS N+ Y L VP P+GGVLVV N I Y + + ++A+ +Y L ++
Sbjct: 244 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 298
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
V+ D + L G L +L +++ + V L + + S
Sbjct: 299 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 342
Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
I+ + + + ++GS GDS L++
Sbjct: 343 SISYLDSGVVYIGSSSGDSQLIRL 366
>gi|68075683|ref|XP_679761.1| splicing factor 3b, subunit 3, 130kD [Plasmodium berghei strain
ANKA]
gi|56500578|emb|CAH95367.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium berghei]
Length = 1216
Score = 48.1 bits (113), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 62/297 (20%), Positives = 120/297 (40%), Gaps = 48/297 (16%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
+ S SM +S G A T++L+ L D + +
Sbjct: 362 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNL---YLVDQIYSLPIVDMKILDAKN 416
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ + ++ R GP L+ +GL I A+ ELPG + I
Sbjct: 417 SNIPQIYALCGR------GPRSSLRILQHGLSIEELANN------------ELPGKPRYI 458
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WT+ +S EY Y+I+S E T++LE + + EV +S+ +
Sbjct: 459 WTIKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNV 502
Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
T NL IQV++ G R ++G + + + P N + + + N + + +S++
Sbjct: 503 TTIHINLLYDNSFIQVYDTGIRHINGKIVQEWVP--PKNKQINAATSNGSQIVISLS 557
Score = 43.9 bits (102), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 60/286 (20%), Positives = 110/286 (38%), Gaps = 46/286 (16%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G ++++ G K+ ++ +L Y P +VS+ V N I DI +S+
Sbjct: 952 CFCPFNGKVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIK-VSNRIFASDIRESVL 1010
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
+ + L++ D +E L + ++ L V +E K +
Sbjct: 1011 IFFYDSNQNVIRLISDDIIPRWITCSEILDHHTIIAADKFDSVFILRVPEEAKQEEYGIA 1070
Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+Y ++ S K +K+ FH+G VT ++++ SS+
Sbjct: 1071 NKCWYGGEVINSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPVSSE--------------C 1116
Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+++ T+ G+IG P D ELT Q L+ L + G FR ++ +
Sbjct: 1117 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKHALCGREHIFFRSYYHPVQ- 1171
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +L + LP E Q +I T +IL L D+
Sbjct: 1172 ------HVIDGDLCEQFSSLPFEVQRKIGSDLEKTPDEILRKLEDI 1211
>gi|389744702|gb|EIM85884.1| hypothetical protein STEHIDRAFT_121882 [Stereum hirsutum FP-91666
SS1]
Length = 1255
Score = 48.1 bits (113), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 69/277 (24%), Positives = 115/277 (41%), Gaps = 48/277 (17%)
Query: 1117 YVQGEDVAARGRVLLFSTGRNADNPQNLVT----EVYSKELKGAISALASLQGHLLIASG 1172
Y E + G + +FS G P V EV +E+ G + ALAS +G+++ A
Sbjct: 936 YQLDEKEPSHGELHIFSKG--GTEPDGTVKADLLEVVKQEVSGCVYALASFEGYIVAAIN 993
Query: 1173 PKIILHKW--TGTELNGIAFYDAPPLYVV-SLNIVKNFILLGDIHKSIYFLSW---KEQG 1226
+ K +GTE + ++ Y+V SL + +++L+GD S+ L E G
Sbjct: 994 STVSFFKLDTSGTEATLVKKHEWNHNYLVTSLVVSGSYLLIGDAISSVSVLQVIQVDENG 1053
Query: 1227 A---QLNLLAKDFG-----SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
+L +A+D+G SL + E +I ++ N+ F P +
Sbjct: 1054 EITEKLKTVARDYGPLWPVSLQGWGKEGVIGANS--------DCNLFSFTLQPVTPQ--- 1102
Query: 1279 GQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFAL----LFGTLDGSI 1333
+ +L R FH+ HV KFL G S+K + LF T G I
Sbjct: 1103 -KTVLDRDGHFHLDDHVNKFLH-----------GTVHSSEKAEDLDIEARSLFFTASGRI 1150
Query: 1334 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR 1370
G + + + + +LQ+ L V V+G + +R
Sbjct: 1151 GLVLDMGKELSLHMTALQRNLNGVVKDVSGTTHKRWR 1187
>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
24927]
Length = 1133
Score = 48.1 bits (113), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 68/302 (22%), Positives = 123/302 (40%), Gaps = 32/302 (10%)
Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIA 1170
+GTA D + GR+L G AD L+TE+ EL GA +LA ++G++L
Sbjct: 818 FVVGTAIGNDSDESEHGRLLFLELG--ADKMLRLITEL---ELPGACHSLAIVKGYILAG 872
Query: 1171 SGPKIILHKWT------GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKE 1224
I L++++ G + I+ A L VSL++ + +GD+ K + L E
Sbjct: 873 LSKSIDLYRFSYTRGSLGASIQQISSIRAATL-PVSLSVYGKRVFVGDLVKGVMVLEVVE 931
Query: 1225 QGAQLN----LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
G + N + + +G A E L + + +S +D N+ + + +
Sbjct: 932 GGGEGNDKLVEVCRQYGVSWVTALEALDEDTCIS---ADSDGNLVLLRRESTGATDEDTR 988
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
++ +E +G V R+ T P + GT+DG + + +
Sbjct: 989 RMRPLSEIRLGEMVNCIRRVNDPITQG--YVVQPKA--------YLGTVDGGLFMLGLIH 1038
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPL 1400
F L Q + + + L+ +R +++ G P VD EL+ + L L
Sbjct: 1039 PDYFDILMKCQVNMAKVIKGIGDLDFNRYRAYNTKG-IQPEEPFRFVDGELVEKF--LDL 1095
Query: 1401 EE 1402
+E
Sbjct: 1096 DE 1097
>gi|367027320|ref|XP_003662944.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
42464]
gi|347010213|gb|AEO57699.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
42464]
Length = 1211
Score = 48.1 bits (113), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 75/365 (20%), Positives = 161/365 (44%), Gaps = 44/365 (12%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----RN 1137
I ++ +E A++ VV ++E E+ L +GT G+D+ R F+ G R
Sbjct: 874 IDLEGNEAAVSAAVVPF---ASQEGESFLVVGT----GKDMVLNPRK--FTEGYIHVYRF 924
Query: 1138 ADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLY 1197
++ + L ++ +++ AL QG LL G + ++ +L A + P
Sbjct: 925 HEDGREL-EFIHKTKVEEPPLALIPFQGRLLAGIGKMLRVYDLGLRQLLRKAQGEVAPQL 983
Query: 1198 VVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLV 1256
+V+L + I++GD+ + + ++ +K + +L + A D +++ + T ++D S+
Sbjct: 984 IVTLQTQGSRIIVGDVQQGVTYVVYKPESNKLLVFADD--TINRWTTCTTMVDYE--SVA 1039
Query: 1257 VSDEQKNIQIFYYAPKMSESWKG-----QKLLSRAEFHVGAHVTKFL---RLQMLATSSD 1308
D+ N+ I + S+ Q L +R H + + Q L TS
Sbjct: 1040 GGDKFGNVWILRCPERASQESDEPGSEIQLLHARKYLHGAPNRLDLMVHFYTQDLPTSIV 1099
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
+T G L++ + G++G + P +++ F QSL+ + P +AG +
Sbjct: 1100 KTNLVVGGQD----VLVWSGIQGTVGVLIPFVSREDVDF--FQSLESHMRAEDPPLAGRD 1153
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
+R ++ K ++D +L + +LP +++ IA + + +I ++D+
Sbjct: 1154 HLIYRGYYVPVKG-------VIDGDLCERFSLLPNDKKQMIAGELDRSVREIERKISDIR 1206
Query: 1426 LGTSF 1430
++F
Sbjct: 1207 TRSAF 1211
>gi|328858656|gb|EGG07768.1| hypothetical protein MELLADRAFT_105631 [Melampsora larici-populina
98AG31]
Length = 1216
Score = 48.1 bits (113), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 67/323 (20%), Positives = 117/323 (36%), Gaps = 81/323 (25%)
Query: 1113 IGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSK--ELKGAISALASLQGHLLI 1169
IGT +V E + GR+L D NL + ++KG + L L G +
Sbjct: 870 IGTGFVNPNESQSNTGRILTIGLSSKHDQEGNLREFKLKRMTKVKGTVHGLGGLPGGKFV 929
Query: 1170 ASGPKII----------------LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDI 1213
AS + L W G ++ + KN+I++GD+
Sbjct: 930 ASANAFVHAFGINEEEEDEGFEVLDTWGGGFVSQTVLTE------------KNWIIVGDL 977
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+KSI L + + L +L +D+ ++ + D V +D + N+ + +M
Sbjct: 978 YKSIVVLEFDLKKFSLKVLGRDYSAMSVRPIGMISDR---VFVAADTEFNL----FTVEM 1030
Query: 1274 SESWKGQKLLSRAE--------------------------------------FHVGAHVT 1295
E KG K E FH+G +V
Sbjct: 1031 RERQKGLKEEDEDEEGLSVEEEKGDDDEWEEEERRMRVEKVFNDDHLDTVGGFHLGENVN 1090
Query: 1296 KFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKL 1354
F ++ + G D L+F + G IG I L++L ++ L++L+ +L
Sbjct: 1091 HFKAGSLVKSLKHFY----GQDLKYGGKLIFVSSTGGIGVIIKLEDLKIYKHLKALEDRL 1146
Query: 1355 VDSVPHVAGLNPRSFRQFHSNGK 1377
+ + GL+ FR+F + K
Sbjct: 1147 KKEILSIGGLDSTEFRKFKNKWK 1169
>gi|384500266|gb|EIE90757.1| hypothetical protein RO3G_15468 [Rhizopus delemar RA 99-880]
Length = 1057
Score = 47.8 bits (112), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 58/257 (22%), Positives = 111/257 (43%), Gaps = 22/257 (8%)
Query: 1120 GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHK 1179
G++ GR+L+ +D L++++ + G I + ++G LL + + L++
Sbjct: 760 GKENDGLGRILVLQLA--SDRKLRLISQLKTG---GMIDCVRPIEGKLLASIQGTLYLYR 814
Query: 1180 WTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSL 1239
W L ++ P + + +NFI+ GD+ S+ + Q QL +A +
Sbjct: 815 WQSQRLVKVSSRRLPSV-TRCMTTHENFIMTGDLAYSVVMFQYDRQSDQLLEVAAHEKTK 873
Query: 1240 DCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR-AEFHVGAHVTKFL 1298
+ A + ID + LV+ E++ +F E + LL + +H+G V++F
Sbjct: 874 EVLAMK-AIDSN---LVIGAEREG-HLFVLEHCQDEVSADEPLLDVISTWHLGDVVSRFR 928
Query: 1299 --RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1356
L M D + AP +L+F T G+IG IA L ++ L +Q +
Sbjct: 929 FGSLGMNNVDPDSSPIAP--------SLIFATASGAIGVIADLSPERYKLLYQMQCNMCR 980
Query: 1357 SVPHVAGLNPRSFRQFH 1373
V + L+ +R +
Sbjct: 981 VVKGIGELSHTDWRNVN 997
>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
Length = 1170
Score = 47.8 bits (112), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 127/302 (42%), Gaps = 21/302 (6%)
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL--NGIAFYDAPPL 1196
D +NL ++ ELK + Q LL+ASG I L++ +L + D
Sbjct: 880 DKKKNL-QYIHKTELKYVPQTMEVFQDRLLVASGNSISLYELGQRQLLRKSLTRIDFIQT 938
Query: 1197 YVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLV 1256
V ++ ILL D SI F + ++ Q +A D + A + L D T+ +
Sbjct: 939 IVKVTPQPRDRILLADSANSIVFAKFDQEENQFVSMADDTVKRNITAWKQL-DYDTV--I 995
Query: 1257 VSDEQKNIQIFYY----APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
D+ NI + + ++ ++W K ++ ++ + V K L T
Sbjct: 996 GGDKFGNIFVSRLDREESKQIDQNWTVLKQAAKNSPNLNSCVYKLQNLCEYYIPDIITSF 1055
Query: 1313 APGS-DKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
GS + +++ L G+IG + PL E+ L+ + +VAG N
Sbjct: 1056 QLGSFNLGGEECIIYTGLTGTIGILLPLISKSEIELLHDLQLEISAYNDKVNVAGKNHAK 1115
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
R +++ K +I D + L Y LPL+E+L+IA + + ++ LND+ +
Sbjct: 1116 LRSYYNPAK-------NIFDGDFLELYLNLPLDEKLKIAKRLNKSVGEVEKKLNDIRNRS 1168
Query: 1429 SF 1430
SF
Sbjct: 1169 SF 1170
Score = 46.2 bits (108), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 81/340 (23%), Positives = 136/340 (40%), Gaps = 48/340 (14%)
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
+P+DA L VP IGGVLV GAN I Y L N ++ L + ++S +
Sbjct: 229 VPNDANYLAPVPGHIGGVLVCGANWIMYDK--------LGNESILLPLLRRKDQTSVIIS 280
Query: 364 LDAAHATWLQND--VALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSDITTIG 419
HA +N LL GDL L + YD +++ ++++ + + ++
Sbjct: 281 -HVTHALKKKNYGFFILLQNDLGDLFRLIIDYDSNRELIKDIEITYFDTIPVCYNLNIFK 339
Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
N L F LL QF L EE E D K ++ + ++
Sbjct: 340 NGLCFANCINRSQLLYQF----------EKLGEEIS--EEDIRINKTVQMDNIQLTKEKY 387
Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
E L G + ++ S + DS++N L S ++ T +
Sbjct: 388 --FEFKLKGLDNLALIDVVESLS-PITDSILNDDTLVTLSTKSKLKTIVHGTPTTTLVES 444
Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII--SLEARTMVLETADLLT 597
+L P I+T + A DDE YL+I +L +T+VL +++
Sbjct: 445 QLPIKP--TNIFTT-----------KTSANAVDDE---YLVITSTLSFKTLVLSLGEVIE 488
Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
EV +S FV + A G+ ++Q++ G R ++G+
Sbjct: 489 EVNDS--EFVLDQPTVAVQQVGKSSIVQIYSNGLRHINGN 526
>gi|390357128|ref|XP_001198237.2| PREDICTED: splicing factor 3B subunit 3-like [Strongylocentrotus
purpuratus]
Length = 949
Score = 47.8 bits (112), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 61/259 (23%), Positives = 100/259 (38%), Gaps = 39/259 (15%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + + + + + + + F+ S G+ L Q
Sbjct: 34 LAQTEQGDIFKITLETDDDMVTEIRMKYFDTVPVATSMNVLKTGFLFIASEYGNHYLYQI 93
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS E GD AP T R L+++ E LS S
Sbjct: 94 AHLGDDDDEPEFSSATPLEEGDTFFFAPRTLR-------NLEEVDQLESLSPILSCQIAD 146
Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTV 553
+++ T V ++ +GL + S + ELPG +WTV
Sbjct: 147 LASEDTPQLYVACGRGPRSSMRVLRHGLEV------------SEMAVSELPGNPNAVWTV 194
Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
KS DDEY AY+I+S T+VL + + EVT+S F+
Sbjct: 195 KKKS--------------DDEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTL 238
Query: 614 AGNLFGRRRVIQVFERGAR 632
+ +L G ++Q++ G R
Sbjct: 239 SSSLIGDDALLQIYPDGIR 257
>gi|328869269|gb|EGG17647.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1194
Score = 47.8 bits (112), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 114/567 (20%), Positives = 211/567 (37%), Gaps = 111/567 (19%)
Query: 98 AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
+ L+ V + G + S+A G +D +I+ + ++ +LE++ S +
Sbjct: 46 SGRLDHVLYSEAFGVIRSIAPFRLTGGS----KDYLIVGSDSGRVVILEYNPSKNVFEKV 101
Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLV 214
F + G G + DP+GR G + L I+ + SQ +
Sbjct: 102 HQETFG-------RSGCRRIVPGQYISTDPKGRAFMIGAIEKQKLVYILNRDSQAKLSI- 153
Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM--VILHERELTWAG 272
A + V ++ +D+ G+ P+ + + E T
Sbjct: 154 --------SSPLEAHKAHTIVFSMCGVDV----------GFENPIFATISVDYSEETNIE 195
Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGGVLVVGANT 328
V H+T +++ + L WS + A +++VP P GGVLV
Sbjct: 196 DVEETHNTKVLTFYELDLGLNNVVRKWSE-EVDRSANLVVSVPGGSDGP-GGVLVCAQGR 253
Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQNDVA--LLSTK 382
++Y + + D S +PR + E + +HA+ Q D+ L+ ++
Sbjct: 254 VYYRNIGHA------------DISVSIPRRNGMTEEKSLMIVSHASHKQRDMFFFLVQSE 301
Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
GDL +T+ Y G +V + ++ + + IT + N F+ S GD L F
Sbjct: 302 YGDLYKITLDYSGEMVSGMQIAYFDTFPTANCITMLKNGFLFVASEFGDHGLYLFK---- 357
Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
S GL DAP+ + + + L L + S S
Sbjct: 358 ----SLGLD--------DAPTASSAGNTEMVFFEPVFEPRNLVLTATIS----SLSPIVD 401
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------ELPGC-KG 549
F V D L G + ++ +G+S+++N ++ +LPG G
Sbjct: 402 FKVAD-LAQEGTPQMYAL----------SGVSERANLRVLRHGLPITQMVDSQLPGTPAG 450
Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE----SVDY 605
IWT+ + N + + Y+++S T+VL + + EV + S
Sbjct: 451 IWTIPQSLTTMRNPQYQGIGTVESPADRYIVVSFVGSTLVLGVGETVEEVQDSGILSTTT 510
Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ R++ A NL ++Q+F +G R
Sbjct: 511 TILIRSMGA-NL---DSIVQIFAQGIR 533
>gi|238487250|ref|XP_002374863.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
gi|220699742|gb|EED56081.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
Length = 1210
Score = 47.4 bits (111), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 75/342 (21%), Positives = 152/342 (44%), Gaps = 36/342 (10%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
+TI ++ +E A++V V T++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG L+ GP + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
D+ NI + K+SE G L+ R H + + + Q + T+ +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAE 1188
>gi|295667673|ref|XP_002794386.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
gi|226286492|gb|EEH42058.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
Length = 1195
Score = 47.4 bits (111), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 125/305 (40%), Gaps = 42/305 (13%)
Query: 1110 LLAIGTAYVQGEDV---AARGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
+ +GT+Y+ +DV + RGR+L F TG + +V +KGA ALA +Q
Sbjct: 875 MFVVGTSYL--DDVGEGSIRGRILAFEVTGSRQ------LAKVAELPVKGACRALAVMQD 926
Query: 1166 HLLIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSI 1217
++ A ++++ E LN A Y AP + + + N I + D+ KS+
Sbjct: 927 KIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAP----IDIAVTGNLIAVADLMKSV 982
Query: 1218 YFLSWKE----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+ +K+ Q L +A+ F +L A + + L SD + N+ +
Sbjct: 983 SIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLE---SDAEGNLVVLNQNVNG 1039
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
++L +E +G V R++ ++ P + A L GT++GSI
Sbjct: 1040 VTDDDKRRLEVTSEILLGEMVN---RIRPVSIQGSLPATGPREAVISPKAFL-GTVEGSI 1095
Query: 1334 ---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
G I P + RLQS LV P N FR F N P VD E
Sbjct: 1096 YLFGLINPAYQDLLMRLQSAMAGLV-VTPGAMPFN--KFRAFK-NAVRQAEEPYRFVDGE 1151
Query: 1391 LLSHY 1395
L+ +
Sbjct: 1152 LIERF 1156
Score = 44.7 bits (104), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+LV+G +I Y D++ E S+ LD A
Sbjct: 292 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 331
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
W Q D LL+ G L L ++ D VQ +LDL P S + +G +
Sbjct: 332 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 389
Query: 424 FLGSRLGDSLLVQFTCGS 441
F+GS GDS L++ T GS
Sbjct: 390 FIGSHQGDSQLIRITEGS 407
>gi|449295711|gb|EMC91732.1| hypothetical protein BAUCODRAFT_116696 [Baudoinia compniacensis UAMH
10762]
Length = 1148
Score = 47.4 bits (111), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 134/330 (40%), Gaps = 52/330 (15%)
Query: 1104 TKENETLLAIGTAYVQ-GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1162
T E +GTAY+ +GR+L+ + +V E+ LKGA LA
Sbjct: 832 TGETAERFVVGTAYLDDAPQQQTKGRILVLEV--TEERRLKVVAEL---GLKGACRCLAV 886
Query: 1163 LQGHLLIASGPKIILHKWTGTELNGIAFYDAP--PLYV-----------VSLNIVKNFIL 1209
+ G ++ A ++++ Y P P V + + + + I
Sbjct: 887 VLGRIVAALVKTVVIYALE---------YQTPSHPFLVKKAAYRTSTAPIDICVTGSTIA 937
Query: 1210 LGDIHKSIYFLSWKE-QGA---QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
+ D+ KS+ +S+K +G L+ +A+ + +L A + + + L +D + N+
Sbjct: 938 VTDLMKSVSLVSYKPGRGGVPDTLSEIARHYETLWGTAIANVAENTYLE---ADAEGNLV 994
Query: 1266 IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
+ + ++L +E +G V + + + T++ P +
Sbjct: 995 VLQHEVNGYSDEDRRRLRPVSEMLLGEMVNRIRSISVQPTAT--AVVVPRA--------F 1044
Query: 1326 FGTLDGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFR-QFHSNGKAHRP 1381
T++GSI I+P + RLQ+L + V S HV R FR Q G
Sbjct: 1045 LATVEGSIYLFALISPGKQDLLMRLQALLAERVKSPGHVPFAKWRGFRSQVRDMGGE--- 1101
Query: 1382 GPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
GP VD EL+ Y P+E Q+++A + G
Sbjct: 1102 GPTRFVDGELVERYLEAPVEVQVDVASELG 1131
>gi|83767504|dbj|BAE57643.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1270
Score = 47.4 bits (111), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 76/350 (21%), Positives = 155/350 (44%), Gaps = 36/350 (10%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1138
+TI ++ +E A++V V T++++ET L +GTA + +A G + ++ R
Sbjct: 870 STIELEENEAAVSVAAVPF---TSQDDETFLVVGTAKDMNVNPPSSAGGYIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG L+ GP + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGPMLRIYDLGMKQLLRKCNAQVVPKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTT-MVDYETTA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
D+ NI + K+SE G L+ R H + + + Q + T+ +T
Sbjct: 1040 DKFGNIWMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWSGFHGTIGMLVPFVSREDVDF--FQNLEMQLAAQNPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1417
+R +++ K ++D +L Y +LP + ++ IA + + +I
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIAAELDRSVREI 1196
>gi|312076588|ref|XP_003140928.1| xeroderma Pigmentosum Group E Complementing protein [Loa loa]
Length = 516
Score = 47.4 bits (111), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
+W NL +A ++ VP P GG L+ G + I YH + AL YA S
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
+ +D +L D+A G+L +L + + G+ V+ + +
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309
Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
+ + + + N + F+GSR GDS L++ + GT +S L + + ++ AP
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362
Query: 465 KRLRRSSSDALQDM----VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
++DM NG++ L S + + + + + L +
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403
Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
VEL G K ++T+ + D E+ YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427
Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+S ++ T VL E T+ + V G T+ AG LF ++QV ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483
>gi|68071595|ref|XP_677711.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497932|emb|CAI04454.1| conserved hypothetical protein [Plasmodium berghei]
Length = 493
Score = 47.4 bits (111), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 98/214 (45%), Gaps = 8/214 (3%)
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
P +++SL++++N+I++GDI S+ LS+ + L + +D+ ++ C F+ S
Sbjct: 254 TPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSTLTEVCRDYSNVWC---TFVCALSK 310
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
+VSD + N +F + KL A F+ G V K L + + +S
Sbjct: 311 SHFLVSDMESNFLVFQKSSIRYNDEDSFKLSRVAFFNHGHVVNKMLPVSL--SSLIEEEE 368
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
A + ++L + +GSI I P LT F++ ++ L DS+ + +N S
Sbjct: 369 AQNEILRKKESILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIGNINNNSNNT 428
Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLE 1405
+ N +VD EL + +P E+Q +
Sbjct: 429 YKMN--LSEKSCKGVVDGELFKMFFSMPFEKQFK 460
>gi|392566425|gb|EIW59601.1| hypothetical protein TRAVEDRAFT_167065 [Trametes versicolor FP-101664
SS1]
Length = 1263
Score = 47.4 bits (111), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 117/265 (44%), Gaps = 27/265 (10%)
Query: 1111 LAIGTAYVQGEDV-AARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL-QGHLL 1168
+GTA ++ E+ + GR+LLFS +++N +T V S +++G + AL + +G +
Sbjct: 948 FCLGTAVIRPEEREPSNGRILLFSL--SSENGVRSLTTVASHKVRGCVYALQHVSEGVIA 1005
Query: 1169 IASGPKIILHKWTGTELNGIAF---YDAPPL-----YVVSLNIVKNFILLGDIHKSIYFL 1220
A ++L+K L G F D +V SL F+L+GD S+ L
Sbjct: 1006 AAINTSVLLYKIREGNL-GEGFDRVLDKAAEWNHNHFVTSLVWDGQFLLVGDAISSVSVL 1064
Query: 1221 SWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
+ +L +A+D+ L A E +G + + +F +A + G
Sbjct: 1065 RVADDATKLESVARDYAPLWPVAIESTGNGGVIG-----ANSDCNLFSFALQRGPQRNG- 1118
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
L +H+ V K ++ + +S+D + D+ + +F T G IG I ++
Sbjct: 1119 -LEKNGVYHIDDVVNKLIKGAL--SSADVS-----QDQAVKAGHVFFTSTGRIGAILDMN 1170
Query: 1341 ELTFRRLQSLQKKLVDSVPHVAGLN 1365
+ + +LQ+ + S+ G+N
Sbjct: 1171 DTMSLHMTALQRNMAKSLIGPGGVN 1195
>gi|407919154|gb|EKG12409.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1210
Score = 47.4 bits (111), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 85/393 (21%), Positives = 160/393 (40%), Gaps = 57/393 (14%)
Query: 1064 EVRILEPD-----RAGGPWQT--RATIPMQSSENALTVRV--------VTLFNTTTKENE 1108
+ +IL P+ RA G W + + P+ S E T+ + V L T++ +E
Sbjct: 835 DAKILPPEEFGYPRAEGHWASCIQVVDPISSKEVVHTLELEENESAVSVCLAPFTSQNDE 894
Query: 1109 TLLAIGTAYVQGEDVAAR----GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQ 1164
T L +GTA + VA R G V ++ N + ++ +++ AL Q
Sbjct: 895 TFLVVGTA--KDLVVAPRSYNCGYVHIYRLQENGRE----LEFIHKTKMEAPPMALLPFQ 948
Query: 1165 GHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1223
G LL+ + L+ + L + P ++ L + I+ D+ +S+ ++ +K
Sbjct: 949 GKLLVGVEADLRLYDLGLRQLLRKAQALNVVPNILIGLQTQGSRIVCSDVQESVTYVVYK 1008
Query: 1224 EQGAQLNLLAKD--FGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW---- 1277
+L D C A ++D T + D+ NI + PK SE
Sbjct: 1009 HLENRLIQFCDDSIHRWTSCTA---MVDYETTA--GGDKFGNIWLVRCPPKASEEADEEG 1063
Query: 1278 KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
G L++ + G + Q + TS +T G R LL+ L G++
Sbjct: 1064 SGLHLINERPYLQGTPNRLDLLAHFYTQDIPTSIQKTALVAG----GRELLLWSGLQGTL 1119
Query: 1334 GCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
G P +++ F QSL+++L P +AG + ++R ++ K ++D +
Sbjct: 1120 GIFIPFVSREDVDF--FQSLEQQLRTEDPPIAGRDHLAYRSYYVPVKG-------VIDGD 1170
Query: 1391 LLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1423
L + LP +++ IA + + ++ + D
Sbjct: 1171 LCERFLRLPRDKKETIAAELDRSVREVERKIGD 1203
>gi|340367935|ref|XP_003382508.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Amphimedon
queenslandica]
Length = 1160
Score = 47.4 bits (111), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 74/386 (19%), Positives = 151/386 (39%), Gaps = 69/386 (17%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGED 1122
+R++ P++ +T + + +E A ++ V + + E + +GTA +
Sbjct: 824 LRVMHPNQG----KTLDIVQFEQNEAAFSLAVCQF--VSKGDLEWFVVVGTAKDMIITPR 877
Query: 1123 VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1182
+ G +++F + + V++ +L A+A QG LL+ G + ++
Sbjct: 878 AISSGSLIVFRLSPDGSK----LEHVHTTQLDDVPIAMAPFQGRLLVGVGKLLRIYDIGK 933
Query: 1183 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1242
++ P VV + ++ + +GD+ ++++FL ++ QL + A + C
Sbjct: 934 KKMLRKCENKHLPYLVVDIKVMGRRVYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFC- 992
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES-----------WK-------GQKLLS 1284
T ++D +T++ +D+ NI I +++ W QK
Sbjct: 993 TTSCILDYNTVA--SADKFGNITILRLPSDVTDQVDEDPSGSRSLWDRGFLNGATQKANV 1050
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTF 1344
+HVG + ++ ++ PG + L++ TL GSIG + P
Sbjct: 1051 MTSYHVGEGINTLHKVSLI----------PGGSE----VLVYTTLSGSIGILVPFS---- 1092
Query: 1345 RRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
K+ D H+ R SN S++D +L Y L ++
Sbjct: 1093 ------SKEDSDFFQHLE----MHMRSEWSNL--------SVIDGDLCEVYNSLDPSKRR 1134
Query: 1405 EIAHQTGTTRSQILSNLNDLALGTSF 1430
EIA + S++ L DL +F
Sbjct: 1135 EIALDLDRSPSEVAKKLEDLRTRYAF 1160
>gi|70954357|ref|XP_746229.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526771|emb|CAH77136.1| hypothetical protein PC000016.02.0 [Plasmodium chabaudi chabaudi]
Length = 372
Score = 47.4 bits (111), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 58/286 (20%), Positives = 111/286 (38%), Gaps = 45/286 (15%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
G ++++ G K+ ++ +L Y P +VS+ + + I DI +S+
Sbjct: 107 CFCPFNGRVIVSVGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKVSGDRIFASDIRESVL 166
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS---------LVVSDEQKNIQ---- 1265
+ + L++ D +E L + ++ L V +E K +
Sbjct: 167 IFFYDSNQNVIRLISDDIIPRWITCSEILDHHTIMAADKFDSVFILRVPEEAKQEEYGIA 226
Query: 1266 --IFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+Y +S S K +K+ FH+G VT ++++ SS+
Sbjct: 227 NKCWYGGEVISSSTKNRKMEHIMSFHIGEIVTSLQKVKLSPASSE--------------C 272
Query: 1324 LLFGTLDGSIGCIAPLD-----ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKA 1378
+++ T+ G+IG P D ELT Q L+ L + G FR ++ +
Sbjct: 273 IIYSTIMGTIGAFIPYDNKEELELT----QHLEIILRTEKHALCGREHIFFRSYYHPVQ- 327
Query: 1379 HRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
++D +L + LP + Q ++A T +IL L D+
Sbjct: 328 ------HVIDGDLCEQFSSLPFDVQRKVASDLEKTPDEILRKLEDI 367
>gi|402586182|gb|EJW80120.1| hypothetical protein WUBG_08972 [Wuchereria bancrofti]
Length = 162
Score = 47.0 bits (110), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 36/154 (23%), Positives = 68/154 (44%), Gaps = 26/154 (16%)
Query: 1280 QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL 1339
QKL + A ++G +T S + PG++ L + T+ G IG + P
Sbjct: 32 QKLEAIAHLYIGDAIT----------SMQKASLVPGAND----CLSYTTISGIIGILVPF 77
Query: 1340 ---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYE 1396
DE F Q+L+ + P + G + ++R ++ K S++D +L Y
Sbjct: 78 MSRDEFEF--FQNLEMHMRVEYPPLCGRDHLAYRSYYFPVK-------SVIDGDLCEQYS 128
Query: 1397 MLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
++PL++Q + + G ++I L D+ +F
Sbjct: 129 LMPLDKQKSVGEELGRKPTEIHKKLEDIRTRYAF 162
>gi|350630003|gb|EHA18376.1| hypothetical protein ASPNIDRAFT_38018 [Aspergillus niger ATCC 1015]
Length = 1219
Score = 47.0 bits (110), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 98/462 (21%), Positives = 191/462 (41%), Gaps = 48/462 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + Q + ++ S DN Q IPL TP + E+ L+ +I S
Sbjct: 759 QCVEGMVGIQGQNL----RIFSIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + L++ D L D + +++++P A
Sbjct: 815 NVLAPSTR--AKLLEDSKSRGGDETVLPPEDFGYPRATGHWASCIQVVDPLDAKA---VV 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
TI ++ +E A+++ V T++++ET L +GTA +A G + ++ R
Sbjct: 870 HTIELEENEAAISIAAVPF---TSQDDETFLVVGTAKDMTVNPPGSAGGYIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL QG L+ G + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLGFQGRLVAGIGSLLRIYDLGMKQLLRKCQAPVVPKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S AT ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTATT-MVDYETTA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSES----WKGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRT 1310
D+ N+ + K SE G L+ + G + + Q + TS +T
Sbjct: 1040 DKFGNLWLLRCPKKTSEEADEDGSGAHLIHERGYLQGTPNRLELMIHVYTQDIPTSLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWTGFQGTIGMLVPFIGREDVDF--FQNLEMQLAAQHPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
+R +++ K ++D +L Y +LP + ++ IA +
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCEMYFLLPNDTKMMIAAE 1188
>gi|68060004|ref|XP_671977.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56488645|emb|CAI04030.1| hypothetical protein PB301494.00.0 [Plasmodium berghei]
Length = 346
Score = 47.0 bits (110), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 57/269 (21%), Positives = 108/269 (40%), Gaps = 41/269 (15%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I + + F+ + G+ QF
Sbjct: 90 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 149
Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
+ S SM +S G A T++L+ D + + ++ + + ++N
Sbjct: 150 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 207
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG-CKGIWT 552
R SL + +GL I A+ ELPG + IWT
Sbjct: 208 IPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 249
Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
+ +S EY Y+I+S E T++LE + + EV +S+ + T
Sbjct: 250 IKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 293
Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQ 641
NL IQV++ G R ++G + +
Sbjct: 294 IHINLLYDNSFIQVYDTGIRHINGKIVQE 322
>gi|213407660|ref|XP_002174601.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
yFS275]
gi|212002648|gb|EEB08308.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
yFS275]
Length = 1078
Score = 47.0 bits (110), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 134/301 (44%), Gaps = 29/301 (9%)
Query: 1103 TTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
T +NE ++ +GT + D GR+++F D + LVTE K +GAI ++
Sbjct: 768 TLPDNERVV-VGTGFNYPDRDEPDGGRLIVF----RLDEQEKLVTEAVYKT-QGAIFSVE 821
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIV---KNFILLGDIHKSIY 1218
+G LL+ + ++ L + P LNI K+ +++GD+ KS+
Sbjct: 822 YQEGKLLVGMNAVLCTFRYENKTLRVVGSTRTP---TYCLNIAASSKDIVVVGDMMKSLT 878
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
+ ++ A+ +A+DFG+L + + L TL + + + + + + K +S +
Sbjct: 879 LYNTEKDTAE--EVARDFGALWVTSVQPL--SETLFFCTTADGEAVTML-WDTKAPQSVE 933
Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
+KL ++ + +G V + R + +S R P L+ T++G I I
Sbjct: 934 RKKLRWKSCYRLGDMVNRTRRGCFVLSSPSRL-VKP--------ELMCVTVEGGILLIGD 984
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ LQ +Q +++VP + GL+ + + +A D +D +LL E L
Sbjct: 985 ASQHADLLLQ-IQHNFLEAVPPLGGLDFYKWHERLFPARASAANKD-FIDGDLLESIEDL 1042
Query: 1399 P 1399
P
Sbjct: 1043 P 1043
>gi|403415203|emb|CCM01903.1| predicted protein [Fibroporia radiculosa]
Length = 1267
Score = 47.0 bits (110), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 80/365 (21%), Positives = 145/365 (39%), Gaps = 61/365 (16%)
Query: 1069 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ--------- 1119
+P+R G P +T +++ + ++ R+ + + +E ++L + ++ V
Sbjct: 889 KPNRIGEPQETTSSLKLL--DDTTFNRIASFACESDEEVTSVLTLSSSDVSSARFCVGTV 946
Query: 1120 ----GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQG---------- 1165
GE + GR+LLFS ++ LV+ S + G + L S+QG
Sbjct: 947 QFKPGETEPSSGRILLFSLNTGPESSFQLVS---STPVSGCVYQLVSIQGMIAAAVNTSV 1003
Query: 1166 ---HLLIASGPKIILHK---WTGTELNGIAFYDAP------PLYVVSLNIVKNFILLGDI 1213
++ IA P + +HK + +LN V L + +++GD
Sbjct: 1004 RTAYVFIAFDPNMTIHKVILFKPEKLNNSTVVLTKVSEWNHNYSVTGLVVHGCMLIVGDA 1063
Query: 1214 HKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
SI F+ K L +A+D+ L + E + DG + SD +F +A +
Sbjct: 1064 ISSISFV--KVDDTTLESIARDYSPLWPVSVEAM-DGDGVIGANSD----CNLFTFA--L 1114
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGS 1332
S L +++G V KFLR + S+R P LF T G
Sbjct: 1115 QRSGHRSTLERNGSYYLGDMVNKFLRGSLTNIDISERKSIEPKH--------LFFTSTGR 1166
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS--IVDCE 1390
IG I +++ + LQ+ + + G++ +FR +N K H + +D +
Sbjct: 1167 IGVILEMNDKISLHMTGLQRNMGKRIIGPGGVHHATFRA-PANSKGHSDAEAAFGFLDGD 1225
Query: 1391 LLSHY 1395
L Y
Sbjct: 1226 FLEQY 1230
>gi|226291941|gb|EEH47369.1| DNA damage-binding protein 1a [Paracoccidioides brasiliensis Pb18]
Length = 1209
Score = 46.6 bits (109), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 125/305 (40%), Gaps = 42/305 (13%)
Query: 1110 LLAIGTAYVQGEDV---AARGRVLLFS-TGRNADNPQNLVTEVYSKELKGAISALASLQG 1165
+ +GT+Y+ +DV + RGR+L F TG + +V +KGA ALA +Q
Sbjct: 889 IFVVGTSYL--DDVGEGSIRGRILAFEVTGSRQ------LAKVAELPVKGACRALAVVQD 940
Query: 1166 HLLIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSI 1217
++ A ++++ E LN A Y AP + + + N I + D+ KS+
Sbjct: 941 KIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAP----IDIAVTGNLIAVADLMKSV 996
Query: 1218 YFLSWKE----QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKM 1273
+ +K+ Q L +A+ F +L A + + L SD + N+ +
Sbjct: 997 SIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLE---SDAEGNLVVLNRNVNG 1053
Query: 1274 SESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1333
++L +E +G V R++ ++ P + A L GT++GSI
Sbjct: 1054 VTDDDKRRLEVTSEILLGEMVN---RIRPVSIQGSLPATGPREAVISPKAFL-GTVEGSI 1109
Query: 1334 ---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCE 1390
G I P + RLQS LV P N FR F N P VD E
Sbjct: 1110 YLFGLINPAYQDLLMRLQSAMAGLV-VTPGAMPFN--KFRAFK-NAVRQAEEPYRFVDGE 1165
Query: 1391 LLSHY 1395
L+ +
Sbjct: 1166 LIERF 1170
Score = 44.7 bits (104), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 74/174 (42%), Gaps = 38/174 (21%)
Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
+W+ C I+ LK+ L A L+ VP+P+GG+LV+G +I Y
Sbjct: 279 AWQDTGC-IAVFKALDLLKEE--------LEMGASFLIPVPAPLGGLLVLGETSIRYLD- 328
Query: 335 SASCALALNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLT 390
D++ E S+ LD A W Q D LL+ G L L
Sbjct: 329 ---------------DATNE----CISLPLDEATIFVAWEQVDGQRWLLADDYGRLFFLM 369
Query: 391 VVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
++ D VQ +LDL P S + +G + F+GS GDS L++ T GS
Sbjct: 370 LILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVTFIGSHQGDSQLIRITEGS 421
>gi|238491136|ref|XP_002376805.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
NRRL3357]
gi|220697218|gb|EED53559.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
NRRL3357]
Length = 1117
Score = 46.6 bits (109), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)
Query: 1111 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTAY+ E + + RGR+L+F DN + L T+V +KGA ALA L ++
Sbjct: 809 FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 863
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
A I+++K + V + +V N I++ D+ KS+ L +KE
Sbjct: 864 ALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 923
Query: 1226 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
L +A+ F ++ +AT ID T + SD + N+ + E +
Sbjct: 924 ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 979
Query: 1281 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
+L +E +G V + + +Q LA+ + A GT++GSI A
Sbjct: 980 RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1027
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
++ L LQ + V + + FR F S + P VD EL+ +
Sbjct: 1028 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1086
Query: 1399 PLEEQLEIAHQTG 1411
E Q EI + G
Sbjct: 1087 EPELQEEIVNSVG 1099
>gi|118369889|ref|XP_001018147.1| hypothetical protein TTHERM_00279910 [Tetrahymena thermophila]
gi|89299914|gb|EAR97902.1| hypothetical protein TTHERM_00279910 [Tetrahymena thermophila SB210]
Length = 1563
Score = 46.6 bits (109), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 82/179 (45%), Gaps = 27/179 (15%)
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC-----FATEFLIDGSTLSLVVSD 1259
K + L+GDI K + F E ++ L+A++ +++ F+ + + + L +++D
Sbjct: 1343 KQYFLVGDIQKGVQFYEMDEIQSKPRLIAEENININVRQCVLFSIQ---NKNALRALITD 1399
Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLSR---AEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
E +N+ + + + E ++ +S A FHVG+ + K T+ +
Sbjct: 1400 ESRNVYAYSFIQQQQELPTDKRKISMELVASFHVGSKINK-------ITTDYKVNDTSMQ 1452
Query: 1317 DKTNRFALLFGTLDGSIGCIA----PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
+ + LL DG++ I P D L +Q + + +P++ GL+PR FR+
Sbjct: 1453 ESVSHLLLL--KQDGNLSDIQLIYEPGDNTN---LFDMQNTIFEELPYIGGLDPREFRE 1506
>gi|170090007|ref|XP_001876226.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164649486|gb|EDR13728.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1275
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 71/322 (22%), Positives = 138/322 (42%), Gaps = 36/322 (11%)
Query: 1068 LEPDRAGGPWQTRATIPM------------QSSENALTVRVVTLFNTTTKENETLLAIGT 1115
+EP+R P +R++ + + T VV+ + +GT
Sbjct: 903 MEPNRVNDPEISRSSFKLLDDTSFANLCQFNCDPDEETTAVVSFSQKIAGKPMPFFCVGT 962
Query: 1116 -AYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
Y GE + GR+++F T + + ++ + S ++ G + AL +Q ++ A
Sbjct: 963 YVYKAGEVEPSAGRLMIF-TATTSTSSNLALSLMASTKVPGCVYALTVVQNQIVAAVNSS 1021
Query: 1175 IILHKWTGTELNG----IAFYDAPPLYVV-SLNIVKNFILLGDIHKSIYFLSWKEQGAQL 1229
++L + + + I + Y+V SL + +++GD SI L + ++L
Sbjct: 1022 VMLFRLESSSDSLSPSLIKVSEWHHNYLVTSLGSYADRVVVGDQPSSISLLQVTQ--SKL 1079
Query: 1230 NLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFH 1289
A+D+G L E L + ++ +++ N+ F M S +L +H
Sbjct: 1080 ISQARDYGPLWPVCVEALDERH---IIGANDSLNLFTFSLEKAMGRS----RLERDGCYH 1132
Query: 1290 VGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQ 1348
V VTKFLR + +SSD + +P + + +F T G IG + + DE +L
Sbjct: 1133 VADLVTKFLRGSL--SSSDASTTSPLTSEA-----MFFTSSGRIGVVVDVKDEELSLQLT 1185
Query: 1349 SLQKKLVDSVPHVAGLNPRSFR 1370
++Q+ L + + V G + +R
Sbjct: 1186 NMQRNLANVIQGVGGSSHSKYR 1207
>gi|169773185|ref|XP_001821061.1| UV-damaged DNA binding protein [Aspergillus oryzae RIB40]
gi|83768922|dbj|BAE59059.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1139
Score = 46.2 bits (108), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)
Query: 1111 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTAY+ E + + RGR+L+F DN + L T+V +KGA ALA L ++
Sbjct: 831 FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 885
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
A I+++K + V + +V N I++ D+ KS+ L +KE
Sbjct: 886 ALVKTIVMYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 945
Query: 1226 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
L +A+ F ++ +AT ID T + SD + N+ + E +
Sbjct: 946 ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 1001
Query: 1281 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
+L +E +G V + + +Q LA+ + A GT++GSI A
Sbjct: 1002 RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1049
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
++ L LQ + V + + FR F S + P VD EL+ +
Sbjct: 1050 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1108
Query: 1399 PLEEQLEIAHQTG 1411
E Q EI + G
Sbjct: 1109 EPELQEEIVNSVG 1121
>gi|391865638|gb|EIT74917.1| damage-specific DNA binding complex, subunit DDB1 [Aspergillus oryzae
3.042]
Length = 1135
Score = 46.2 bits (108), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 130/313 (41%), Gaps = 34/313 (10%)
Query: 1111 LAIGTAYVQGE-DVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLI 1169
+GTAY+ E + + RGR+L+F DN + L T+V +KGA ALA L ++
Sbjct: 827 FVVGTAYLDDEGEESIRGRILMFEI----DNGRKL-TKVAELPVKGACRALAMLGDKIVA 881
Query: 1170 ASGPKIILHKWTGTELNGIAFYDAPPLYV----VSLNIVKNFILLGDIHKSIYFLSWKEQ 1225
A I+++K + V + +V N I++ D+ KS+ L +KE
Sbjct: 882 ALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSDLMKSVCLLEFKEG 941
Query: 1226 GA----QLNLLAKDFGSLDCFATEF-LIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
L +A+ F ++ +AT ID T + SD + N+ + E +
Sbjct: 942 ENGLPDSLTEVARHFQTV--WATGVACIDKDT--FLESDAEGNLIVLRRNLAGVEEDDRR 997
Query: 1281 KLLSRAEFHVGAHVTKF--LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1338
+L +E +G V + + +Q LA+ + A GT++GSI A
Sbjct: 998 RLEVTSEISLGEMVNRIRPVNIQQLASVTVTPRA------------FLGTVEGSIYLFAI 1045
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
++ L LQ + V + + FR F S + P VD EL+ +
Sbjct: 1046 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVR-EATEPYRFVDGELIEQFLNC 1104
Query: 1399 PLEEQLEIAHQTG 1411
E Q EI + G
Sbjct: 1105 EPELQEEIVNSVG 1117
>gi|291000406|ref|XP_002682770.1| predicted protein [Naegleria gruberi]
gi|284096398|gb|EFC50026.1| predicted protein [Naegleria gruberi]
Length = 1216
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 67/349 (19%), Positives = 137/349 (39%), Gaps = 64/349 (18%)
Query: 1105 KENETLLAIGTAY---VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALA 1161
K NE+L+ +GTA + G + +F + + ++ E++ AL
Sbjct: 897 KSNESLIIVGTAKNMKLYPTRTCDCGYINVFQISEDGK-----LQLIHKTEVEDVPYALH 951
Query: 1162 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1221
+ +G LL+ + ++ +L + P ++ S+ + N I +GDI +S +F+
Sbjct: 952 AFRGRLLVGVKNMLRIYDLGKKKLLRKCENKSFPNFITSIAVDGNRIFVGDITESFHFVK 1011
Query: 1222 WKEQGAQLNLLAKD----------------FGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1265
+ L + A + D F F+ S L VSDE ++
Sbjct: 1012 FNSSENSLTIFADNTTPRWLTASALVDHNTIAGGDKFGNFFI---SRLPSDVSDELED-- 1066
Query: 1266 IFYYAPKMSESW---KG------QKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGS 1316
+ E W +G QK +F+VG+ +T + ++A P
Sbjct: 1067 ----SSTGKEKWIWERGLLNGAPQKATEIVKFYVGSMITSIYKTSLIA-------GGPS- 1114
Query: 1317 DKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSN 1375
L++ T+ G++G P + L+ L + P + G + ++R ++
Sbjct: 1115 ------ILIYTTITGAVGVFFPFTSKKDIEFFTQLEMHLREKNPPLCGRDHLAYRSYYFP 1168
Query: 1376 GKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
K S+VD +L+ + + L+ + +I+ T ++I + D+
Sbjct: 1169 VK-------SVVDGDLIEQFNDVDLQTKTKISEDLQRTINEIAKKIEDM 1210
>gi|430814207|emb|CCJ28534.1| unnamed protein product [Pneumocystis jirovecii]
Length = 904
Score = 46.2 bits (108), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 64/307 (20%), Positives = 134/307 (43%), Gaps = 34/307 (11%)
Query: 1097 VTLFNTTTKENETLLAIGTAY-VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSK-ELK 1154
V + T +N+ + +GT + + E+ +++GR++LF N V+S+ ++
Sbjct: 588 VQCITSVTIDNQDIFVVGTGFSLPEEEESSKGRIILFGV-------TNKKIWVFSEIQVN 640
Query: 1155 GAISALASLQGHLLIASGPKIILHKWTGT--ELNGIAFYDAPPLYVVSLNIVKNFILLGD 1212
A+ + + ++ + ++ + + N IA Y + L +SL + +++GD
Sbjct: 641 DAVYCIGIIDNKIIAGINALVHIYAYDSSLKNFNVIATYRSTTL-CLSLAVHGTHVIIGD 699
Query: 1213 IHKSIYFLSW--KEQGAQLNLLAKDFGSL--DCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
+ KS+ L++ E G +L +AKD L C A +D + ++ + N+ +F+
Sbjct: 700 LMKSVSLLAFINTENGPRLKEVAKDCNPLWMTCVAA---LDNDLY--IGAEAEGNLSLFW 754
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
+ +++ KL +E G V + +L S+ + P + F T
Sbjct: 755 --KDFNTTFEENKLQIISEIKWGELVNQIKPGTILY--SENSIIIPKAT--------FVT 802
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1388
+DGSIG I + L +LQ + + + LN ++R F N + P +D
Sbjct: 803 VDGSIGIIFTVKREYLEFLVNLQSNMGKIISGIGCLNHSNWRAF-CNRRKKSNEPKCFID 861
Query: 1389 CELLSHY 1395
+ + +
Sbjct: 862 GDFVEIF 868
>gi|58269920|ref|XP_572116.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
gi|57228352|gb|AAW44809.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1276
Score = 45.8 bits (107), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 110/266 (41%), Gaps = 42/266 (15%)
Query: 1111 LAIGTAYVQGED------------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1157
LA+GTA++ +D V GRVLL + D ++ ++ GA+
Sbjct: 945 LAVGTAFLPADDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGGGWDIKIKAELATVGAV 1003
Query: 1158 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1208
AL + G L +A+G K+ +H+ EL + + A + SL+++ + +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSHIRPEGAL 1062
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD +S+ L+ E + ++ + A L D +V+SD N+ +
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAYSNLLTYR 1121
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
QKL A F + VT+F ++ T++ P +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKL 1354
+G +G I L + R L LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190
>gi|407034933|gb|EKE37449.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
Length = 995
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 92/197 (46%), Gaps = 34/197 (17%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1139
TI ++S+E AL V + + + A+GTA ++ ++ + GR+LL
Sbjct: 692 TTIELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 737
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
+ + L+ V +++ GA+ L Q ++ + + + G +LN PL V
Sbjct: 738 DTEGLIYIVGTEDYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQNM-LLPLIGV 796
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1250
SL+I K++I+ GD+ +S+ ++ L+++ KD GS++ + T FL
Sbjct: 797 SLDICKDYIIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFL--- 853
Query: 1251 STLSLVVSDEQKNIQIF 1267
V+D NI+IF
Sbjct: 854 ------VADVSGNIKIF 864
>gi|390603312|gb|EIN12704.1| hypothetical protein PUNSTDRAFT_97523 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1268
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 59/237 (24%), Positives = 106/237 (44%), Gaps = 35/237 (14%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV--AARGRVLLFSTGRNADNPQ 1142
+++ E+ + V++L T + + IGTA +D ++GR+++F D
Sbjct: 933 LEADEDITSAVVLSL--GTAEAYTSHFCIGTADFTSDDQLEVSKGRLVVF------DPST 984
Query: 1143 NLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1202
+++ V + ++ G + ALAS+QG + A +I+++ E +G F + + + N
Sbjct: 985 KVLSPVATLDVNGCVYALASIQGLVAAAVNSAVIVYRL---ETDGPTFSSKRLVQLANWN 1041
Query: 1203 ---IVKNF------ILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1253
V N I +GD S+ L G L +A+D+G L A E
Sbjct: 1042 HNYFVTNLVTRGSRIFVGDAISSVSILELT--GQALQTVARDYGPLWPVAIE---STGPD 1096
Query: 1254 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRT 1310
S++ +D + N+ F K+SE KL +H+G V KF+ +LA T
Sbjct: 1097 SVIGADGEFNLFTF----KLSEG----KLERDGSYHLGEQVNKFVPGGLLAADPAHT 1145
>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
Length = 1249
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 98/239 (41%), Gaps = 43/239 (17%)
Query: 95 GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
I L+ +C + + ++SL + G+ ++D +++ + K+++L++D + L
Sbjct: 52 NIDTGKLDKICVHNVFSVIQSLEKVRLTGS----QKDYLVVTSDSGKLAILQYDTGRNRL 107
Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ--MIILKASQGGSG 212
+ F+ P H K G GP + DPQ R +L+ L+ +I K G
Sbjct: 108 ----VTVFQEP---HSKTGFRRNTPGPYLLTDPQNR--AILIGALERNKLIYKVHSDDKG 158
Query: 213 LVGDEDTFGSGGGFSARIESS--HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW 270
G S+ +ES H I L + GY PV V +
Sbjct: 159 ----------GMQISSPLESQIRHTITLAMCALDT--------GYENPVFVAIEAEYGAL 200
Query: 271 AGRV----SWKHHTCMISALSISTTLKQHPLIWSAMN--LPHDAYKLLAVPSPIGGVLV 323
+ S H T + ++ + L ++ +N LP A L+ +PSP+GGVL+
Sbjct: 201 DSKEYSIDSQAHQTLLFTSYELDQGLNH--VVRRVVNNKLPISATHLIPLPSPVGGVLI 257
>gi|134113697|ref|XP_774433.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257071|gb|EAL19786.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1276
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 111/266 (41%), Gaps = 42/266 (15%)
Query: 1111 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1157
LA+GTA++ GED V GRVLL + D ++ ++ GA+
Sbjct: 945 LAVGTAFLPPDDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGGGWDIKIKAELATVGAV 1003
Query: 1158 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1208
AL + G L +A+G K+ +H+ EL + + A + SL+++ + +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSHIRPEGAL 1062
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD +S+ L+ E + ++ + A L D +V+SD N+ +
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAYSNLLTYR 1121
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
QKL A F + VT+F ++ T++ P +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKL 1354
+G +G I L + R L LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190
>gi|322700233|gb|EFY91989.1| Pre-mRNA-splicing factor rse-1 [Metarhizium acridum CQMa 102]
Length = 1039
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 151/365 (41%), Gaps = 48/365 (13%)
Query: 1085 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL---LFSTGRNADNP 1141
++ +E ++ VV T+++ E+ L IGT G+D+ R R D+
Sbjct: 704 LEGNEAGVSAAVVPF---TSQDGESFLIIGT----GKDMIVNPRQSSEGFIHVYRFHDDG 756
Query: 1142 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1201
++L ++ +++ +AL S G LL G + ++ +L A D P ++VSL
Sbjct: 757 RSL-EFIHKTKVEEPPTALLSFHGRLLAGIGKTLRIYDLGMRQLLRKAQADISPQHIVSL 815
Query: 1202 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCFATEFLIDGSTLSLVVSD 1259
I++GD+ + + + +L D + C A S+ D
Sbjct: 816 QSQGFRIVVGDVQHGVTMVVYNPVSNKLLPFVDDTIARWTTCLAM-----ADYESVAGGD 870
Query: 1260 EQKNIQIFYYAPKMS----ESWKGQKLLS-RAEFHVGAHVTKFLRLQMLA--------TS 1306
+ NI I K S E +L + ++ H AH RLQ++A TS
Sbjct: 871 KFGNIWIVRCPDKASAEADEPGSDVQLSNGQSYLHGAAH-----RLQLMAHMFVQDIPTS 925
Query: 1307 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLN 1365
+T G LL+ L G+IG + PL T Q+L+ + + P +AG +
Sbjct: 926 ICKTSLVVGGQDV----LLWSGLQGTIGVLIPLVTRETADFFQTLEMHMRNEDPPLAGRD 981
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1425
+R +H K ++D +L + +L E++ IA + + ++ ++D+
Sbjct: 982 HLMYRGYHVPVKG-------VIDGDLCERFSLLSREKKQMIAGELDRSVREVERRISDVR 1034
Query: 1426 LGTSF 1430
+ + F
Sbjct: 1035 IRSVF 1039
>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1121
Score = 45.8 bits (107), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 136/318 (42%), Gaps = 46/318 (14%)
Query: 1113 IGTAYVQGEDVA----ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
IGT+++ +D RGR+L+ N V ++ S LKGA L +L H++
Sbjct: 794 IGTSFITDDDAIEENDTRGRILVLGVDENRQ-----VYQIVSHNLKGACRCLGTLGEHIV 848
Query: 1169 IASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW- 1222
++++ + L +A Y P + +SL+I N I + D+ +S+ + +
Sbjct: 849 AGLSKTVVVYHYVEETTVFGSLQKLAAY-RPASFPLSLDISGNIIGVVDLMQSLTLVEFI 907
Query: 1223 -KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
E G++ L +AT +DG + +D Q NI + P+
Sbjct: 908 PSEDGSRAKLEETARHYQPGWATSVAHLDGE--RWLEADAQGNIIVLQRNPEAPTEQDRS 965
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP-------GSDKT-----NRFALLFGT 1328
KL +E ++G + + +L + S++ +P G +T N+ +L
Sbjct: 966 KLEVTSEMNIGEQINQIRKLHV--ASNENAVVSPKAFLGSVGLSETIITCWNQLLMLV-Q 1022
Query: 1329 LDGSI---GCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
++G++ G IAP +L L + Q +L D + ++ +R F + + GP
Sbjct: 1023 IEGTLYLFGEIAPNYQDL----LLTFQSRLQDYIYAPGNVSFNLWRAFRNKAR-EGDGPF 1077
Query: 1385 SIVDCELLSHYEMLPLEE 1402
VD E++ + L L+E
Sbjct: 1078 RFVDGEMVERF--LDLDE 1093
>gi|159486547|ref|XP_001701300.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
[Chlamydomonas reinhardtii]
gi|158271783|gb|EDO97595.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
[Chlamydomonas reinhardtii]
Length = 1078
Score = 45.8 bits (107), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 79/355 (22%), Positives = 137/355 (38%), Gaps = 84/355 (23%)
Query: 1108 ETLLAIGTA----YVQGEDVAARGRVL-LFSTGRNADNPQNLVTEVYSKELKGAI-SALA 1161
ET+L +GTA Y+ + AA RV L GR D ++ ++ G + ALA
Sbjct: 765 ETVLLVGTARGLRYMPTDCEAAYIRVYRLGDGGRRLD-------LLHKTQVDGGVPGALA 817
Query: 1162 SLQGHLLIASGPKIILH--------------KWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
+G LL GP + L+ +WT L+ + FY P + S
Sbjct: 818 GFKGRLLAGVGPTLRLYDMGKKKMLRKCEYNRWTNIFLH-VFFYR--PYFRSS------- 867
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDFG----------SLDCFATEFLIDGSTLSLVV 1257
+S++ + +K+ + A D D AT + +
Sbjct: 868 ------QESVHMMRYKKADNAFYIFADDVAPRYLSALLPLDYDTIATGDKFGNLVILRLP 921
Query: 1258 SDEQKNIQIFYYAPKMSES-----WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
+ + ++ KM+ + KL +FHVG +T R +M A +
Sbjct: 922 QEASQQVEDDPTGGKMAAASGKLNGAPHKLEELVKFHVGDTITALQRAEMQAGGQE---- 977
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSF 1369
L++ T+ G+IG + P +++ F L+ L P +AG + ++
Sbjct: 978 ----------VLVYSTVMGAIGVVYPFTSREDVDF--FSHLEMHLRQENPPLAGRDHLAY 1025
Query: 1370 RQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
R A+ P + VD +L S Y +P+++Q IA T ++L L D+
Sbjct: 1026 RS------AYFP-VRNCVDGDLCSQYASIPMKKQQMIAEAMDRTTGEMLKKLEDI 1073
>gi|239613967|gb|EEQ90954.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ER-3]
gi|327353314|gb|EGE82171.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ATCC 18188]
Length = 1199
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 126/314 (40%), Gaps = 42/314 (13%)
Query: 1110 LLAIGTAYVQ--GEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1167
L +GT+Y+ GE + RGR+L F P + +V +KGA ALA +Q +
Sbjct: 883 LFIVGTSYLDDFGEG-SIRGRILAFEV-----TPNRQLGKVAEMPVKGACRALAIVQDKI 936
Query: 1168 LIASGPKIILHKWTGTE-----LNGIAFY---DAPPLYVVSLNIVKNFILLGDIHKSIYF 1219
+ A ++++ + + L A Y AP + + + N I + D+ KS+
Sbjct: 937 VAALMKTVVVYTLSKGQFADYILTKTASYRTSTAP----IDIAVTGNLIAVADLMKSVSI 992
Query: 1220 LSWKEQ----GAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1275
+ +++ L +A+ F +L A + + L SD + N+ +
Sbjct: 993 VEYQQGTDGLSGSLTEVARHFQTLWSTAVAPVAQDTWLE---SDAEGNLVVLRRNVNGVT 1049
Query: 1276 SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI-- 1333
++L +E +G V + + + A+ +P + GT++GSI
Sbjct: 1050 EDDRRRLEVTSEVLLGEMVNRIRPVNIQASLGTEAAISPRA--------FLGTVEGSIYL 1101
Query: 1334 -GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
G I P + RLQS +V + G+ FR F N P VD EL+
Sbjct: 1102 FGIINPTYQDLLMRLQSAMAGMVVT---PGGMPFNKFRAFR-NTVRQAEEPYRFVDGELI 1157
Query: 1393 SHYEMLPLEEQLEI 1406
+ E Q EI
Sbjct: 1158 ERFLGCGAELQEEI 1171
Score = 45.1 bits (105), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
W Q D LL+ G L L ++ D VQ L + S + +G + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400
Query: 428 RLGDSLLVQFTCGS 441
GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414
>gi|402467441|gb|EJW02742.1| hypothetical protein EDEG_02863 [Edhazardia aedis USNM 41457]
Length = 1274
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 50/78 (64%), Gaps = 2/78 (2%)
Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLG 1211
+K + + S++G+L+I G +++++K E L IAF+D + VSL+++KNFIL+G
Sbjct: 988 IKSSAISCDSIRGNLVIGQGTRLMIYKIDRLEGLVAIAFHDLSII-AVSLSVIKNFILVG 1046
Query: 1212 DIHKSIYFLSWKEQGAQL 1229
D+ + + F ++ + ++
Sbjct: 1047 DLLRGVTFFYFQTRPVKI 1064
>gi|147787360|emb|CAN64633.1| hypothetical protein VITISV_043788 [Vitis vinifera]
Length = 1143
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 76/181 (41%), Gaps = 28/181 (15%)
Query: 1065 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1124
+RIL+P A T + +Q +E A ++ V N KE TLLA+GTA
Sbjct: 822 IRILDPRTA----TTTCLLELQDNEAAFSICTV---NFHDKEYGTLLAVGTA-------- 866
Query: 1125 ARGRVLLFSTGRNAD----------NPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1174
+ L F R+ D + ++ +++G AL QG LL G
Sbjct: 867 ---KSLQFWPKRSFDAGYIHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSV 923
Query: 1175 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1234
+ L+ L P +VS++ ++ I +GDI +S ++ ++ QL + A
Sbjct: 924 LRLYDLGKRRLLRKCENKLFPNTIVSIHTYRDRIYVGDIQESFHYCKYRRDENQLYIFAD 983
Query: 1235 D 1235
D
Sbjct: 984 D 984
>gi|183233163|ref|XP_654084.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
gi|169801703|gb|EAL48698.2| damaged DNA binding protein, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449708240|gb|EMD47737.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
Length = 995
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 92/197 (46%), Gaps = 34/197 (17%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1139
TI ++S+E AL V + + + A+GTA ++ ++ + GR+LL
Sbjct: 692 TTIELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 737
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
+ + L+ V +++ GA+ L Q ++ + + + G +LN PL V
Sbjct: 738 DTEGLIYIVGTEDYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQNM-LLPLIGV 796
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1250
SL+I K++I+ GD+ +S+ ++ L+++ KD GS++ + T FL
Sbjct: 797 SLDICKDYIIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFL--- 853
Query: 1251 STLSLVVSDEQKNIQIF 1267
V+D NI+IF
Sbjct: 854 ------VADVSGNIKIF 864
>gi|125977518|ref|XP_001352792.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
gi|54641542|gb|EAL30292.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
Length = 1228
Score = 45.4 bits (106), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + S + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP T L+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|195586770|ref|XP_002083143.1| GD13507 [Drosophila simulans]
gi|194195152|gb|EDX08728.1| GD13507 [Drosophila simulans]
Length = 1227
Score = 45.4 bits (106), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G+SD ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLSDNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|353232348|emb|CCD79703.1| putative dna repair protein xp-E [Schistosoma mansoni]
Length = 1329
Score = 45.4 bits (106), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S + R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP V +Y GL IIL G + G+ + +IE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
D F++GY P +++E EL H L+ L ++
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
D+ L+ VP P GGV++VG N I YH++ ++ Y +SQ L ++ +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271
Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
D A + L D+ A + T + L+ V G + L P
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+GS LGDS L++
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350
>gi|336263557|ref|XP_003346558.1| hypothetical protein SMAC_04731 [Sordaria macrospora k-hell]
gi|380090453|emb|CCC11749.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1149
Score = 45.4 bits (106), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 138/337 (40%), Gaps = 49/337 (14%)
Query: 1113 IGTAYVQGEDVAA----RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
+GT++++ D A RGR+L+F N D V EL+GA ALA + ++
Sbjct: 823 VGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYL-----VLKHELRGACRALAVMGSKIV 877
Query: 1169 IASGPKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW- 1222
A +++ ++ T L +A Y Y + + + N I + D+ KS + +
Sbjct: 878 AALHKTVVISQYEETSSTEARLVKLASYRCTT-YPIDIAVHGNIIAVADMMKSATLVEYV 936
Query: 1223 -------KEQGAQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
K + A+L A+ S +AT ++G S + +D N+ + +
Sbjct: 937 QAKTEEEKYEPAKLVECARHRHS--AWATAVAHVEGE--SWLEADANGNLVVLQRNVEGV 992
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI- 1333
+ ++L +E ++G V K +++ +S T P + T +G I
Sbjct: 993 TAEDQRQLRITSELNLGEQVNKIRPIKV--ETSPNTIIIPRA--------FLATAEGGIY 1042
Query: 1334 --GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR----PGPDSIV 1387
G IA +L R Q KL + V L+ S+R F + + GP +
Sbjct: 1043 LFGTIAREQDLLLR----FQDKLAAVIKTVGELDFNSYRAFRNAERGPETDGTTGPVRFL 1098
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
D ELL + + Q EI G + Q+ + + +L
Sbjct: 1099 DGELLERFLDVDETTQKEICEGLGPSVEQMRNMVEEL 1135
>gi|256088964|ref|XP_002580590.1| DNA repair protein xp-E [Schistosoma mansoni]
Length = 1329
Score = 45.4 bits (106), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S + R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP V +Y GL IIL G + G+ + +IE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
D F++GY P +++E EL H L+ L ++
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
D+ L+ VP P GGV++VG N I YH++ ++ Y +SQ L ++ +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271
Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
D A + L D+ A + T + L+ V G + L P
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327
Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
I + N + F+GS LGDS L++
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350
>gi|443896643|dbj|GAC73987.1| predicted DNA methylase [Pseudozyma antarctica T-34]
Length = 1285
Score = 45.1 bits (105), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 62/301 (20%), Positives = 121/301 (40%), Gaps = 43/301 (14%)
Query: 1148 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNF 1207
V+ E+ L + QG LL G + +++ +L + P +V+L+ +
Sbjct: 1007 VHKTEVDDVPLVLRAFQGRLLAGVGKVLRIYELGKKKLLRKCENRSFPTAIVALDAQGSR 1066
Query: 1208 ILLGDIHKSIYFLSWKEQGAQLNLLAKDF----------------GSLDCFATEFL--ID 1249
I++GD+ +S+ F S+K +L A D + D F ++ ID
Sbjct: 1067 IVVGDMQESVIFASYKPLENRLVTFADDIMPRYVTRCTMLDYDTVAAADKFGNVYVVRID 1126
Query: 1250 GSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDR 1309
T S V ++ + + P + + LL A + VG +T R M+
Sbjct: 1127 ADT-SRSVDEDVTGMTTMHEKPLLMGAAHKATLL--AHYFVGDIITSLSRAVMV------ 1177
Query: 1310 TGAAPGSDKTNRFALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
PG R LL+ + G+IG + P + + + +L+ +L + G + +
Sbjct: 1178 ----PG----GREVLLYTGISGTIGALVPFVSKEDVDTMTTLEMQLRQQSDSLVGRDHLA 1229
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGT 1428
+R ++ K ++D +L + +LP +Q +A + S++ L L G
Sbjct: 1230 YRSSYAPVK-------HVIDGDLCESFGLLPPAKQSAVAQELDRKPSEVNKKLAQLREGA 1282
Query: 1429 S 1429
+
Sbjct: 1283 T 1283
>gi|195169735|ref|XP_002025674.1| GL20829 [Drosophila persimilis]
gi|194109167|gb|EDW31210.1| GL20829 [Drosophila persimilis]
Length = 1225
Score = 45.1 bits (105), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + S + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP T L+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|261193401|ref|XP_002623106.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
gi|239588711|gb|EEQ71354.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
Length = 1168
Score = 45.1 bits (105), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
L+ VP+P+GG+LV+G +I Y +++ + SQ L ++ V
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340
Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
W Q D LL+ G L L ++ D VQ L + S + +G + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400
Query: 428 RLGDSLLVQFTCGS 441
GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414
>gi|401413996|ref|XP_003886445.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325120865|emb|CBZ56420.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 2869
Score = 45.1 bits (105), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 120/292 (41%), Gaps = 59/292 (20%)
Query: 1111 LAIGTAYVQGEDVAARGRVLLF----------STGRNADNPQNLVT-------EVYSK-E 1152
LA G E + GR+ LF S R+AD P + E+++
Sbjct: 2457 LAAGVGVPLSETIECSGRLYLFKLPESAMRLASPPRSADTPGDQAEYGTPERLELFADIV 2516
Query: 1153 LKGAISALASL------QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKN 1206
L G ++ + S + +++ + GP++ +H+ ++ AF D+ + V ++ ++N
Sbjct: 2517 LNGPVTVVGSFFSSPAERSYVVHSVGPRLFVHEMESSKFLRGAFSDSS-VCVTAVANLRN 2575
Query: 1207 FILLGDIHKSIYFLSWKEQGA----QLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDE 1260
F LL D K + ++W+ ++ +++ F + A FL + L ++ +D
Sbjct: 2576 FFLLADALKGLNLVAWEYHAEADSRKVTRISRTFPKSNFPVAACSFLAYENLLGMLATDI 2635
Query: 1261 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
N+++F Y + + + F +L +L ++ AA K
Sbjct: 2636 DGNVRLFCY-------------------NADKNASGFEKLDILQCDAEDRCAAGCVVKLQ 2676
Query: 1321 RFALLFGTLDGSIGCIAPLDELTFRR--------LQSLQKKLVDSVPHVAGL 1364
++ + T+ S+G A L FR LQ+LQ +L +P GL
Sbjct: 2677 QYVVDSETV-ASLGEAADGSTLAFRLLSVENHAFLQTLQDRLARYLPEPLGL 2727
>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
Length = 1121
Score = 45.1 bits (105), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 137/318 (43%), Gaps = 46/318 (14%)
Query: 1113 IGTAYVQGEDVA----ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLL 1168
IGT++V +D RGR+L+ N V ++ S LKGA L++L H++
Sbjct: 794 IGTSFVTDDDAIEENDTRGRILVLGVDENRQ-----VYQIVSHNLKGACRCLSTLGEHIV 848
Query: 1169 IASGPKIILHKWTGT-----ELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS---IYFL 1220
++++ + L +A Y P + + L+I N I + D+ +S + F+
Sbjct: 849 AGLSKTVVVYNYVEETTVFGSLQKLAAY-RPASFPLGLDISGNIIGVVDLMQSLTLVEFI 907
Query: 1221 SWKEQG-AQLNLLAKDFGSLDCFATEFL-IDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1278
K+ A+L +A+ + +AT +DG + +D Q NI + P+
Sbjct: 908 PSKDGSRAKLEEVARHYQP--GWATSVTNLDGE--RWLEADAQGNIIVLQRNPEAPTEQD 963
Query: 1279 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAP-------GSDKTN----RFALLFG 1327
KL +E ++G + + RL + S++ +P G +T L+
Sbjct: 964 RSKLEVTSEINIGEQINQIRRLHV--ASNENAVVSPKAFLGSVGLSETTINCWTQLLILV 1021
Query: 1328 TLDGSI---GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
++G++ G IAP + L + Q +L D + ++ +R F + + GP
Sbjct: 1022 QIEGTLYLFGEIAPKYQ---DLLLTFQARLQDYIYAPGNVSFNLWRAFRNKAR-EGDGPF 1077
Query: 1385 SIVDCELLSHYEMLPLEE 1402
VD E++ + L L+E
Sbjct: 1078 RFVDGEMVERF--LDLDE 1093
>gi|115390120|ref|XP_001212565.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
gi|114194961|gb|EAU36661.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
Length = 1217
Score = 45.1 bits (105), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 96/460 (20%), Positives = 193/460 (41%), Gaps = 48/460 (10%)
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS-V 1022
C G + + Q + ++ S DN Q IPL TP + E+ L+ +I S
Sbjct: 759 QCLEGMVGIQGQNL----RIFSIEKLDNNMLQQSIPLSYTPRRFVKHPEQPLFYVIESDN 814
Query: 1023 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1080
VL P + + L++ D L + + +++++P A
Sbjct: 815 NVLSPSTR--ARLLEDSKSRNGDTTVLPPEEFGYPRATGHWASCIQVVDPLDAKA---VV 869
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
T+ ++ +E A++V V T++++ET L +GT +A G + ++ R
Sbjct: 870 HTVELEDNEAAVSVAAVPF---TSQDDETFLVVGTVKDMTVNPPSSAGGFIHIY---RFQ 923
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
++ + L ++ +++ AL + QG L + G + ++ +L P +
Sbjct: 924 EDGREL-EFIHKTKVEEPPLALLAFQGRLAVGLGSLLRIYDLGMKQLLRKCQAHVVPKTI 982
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
V L + I++ D+ +S+ ++ +K Q L D S +T ++D T++
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSISRWTTSTT-MVDYETVA--GG 1039
Query: 1259 DEQKNIQIFYYAPKMSESW----KGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRT 1310
D+ N+ + K+SE G L+ R H + + + Q + TS +T
Sbjct: 1040 DKFGNLWLVRCPKKVSEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTSLHKT 1099
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G R L++ G+IG + P +++ F Q+L+ +L P +AG +
Sbjct: 1100 QLVAG----GRDILVWTGFHGTIGMLVPFVSREDVDF--FQNLEMQLASQHPPLAGRDHL 1153
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+R +++ K ++D +L Y +LP + ++ IA
Sbjct: 1154 IYRSYYAPVKG-------VIDGDLCETYFLLPNDTKMMIA 1186
>gi|170589359|ref|XP_001899441.1| Xeroderma Pigmentosum Group E Complementing protein [Brugia malayi]
gi|158593654|gb|EDP32249.1| Xeroderma Pigmentosum Group E Complementing protein, putative
[Brugia malayi]
Length = 521
Score = 44.7 bits (104), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 67/155 (43%), Gaps = 31/155 (20%)
Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADA---SATGISKQSNY----------EL-- 541
A T ++ DS N+ P++D + +R N + +G K EL
Sbjct: 345 ADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS 403
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
VEL G K ++T+ + DDE+ YLI+S ++ T VL E TE
Sbjct: 404 VELKGIKNMFTLRTR---------------DDEFDDYLILSFDSETHVLLINGEELEDTE 448
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
+ V G T+ AG LF + ++QV ++DG
Sbjct: 449 ITGFTVDGATLWAGCLFHSKTILQVTHGEVILIDG 483
>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
putative [Candida dubliniensis CD36]
gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
Length = 1187
Score = 44.7 bits (104), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 162/395 (41%), Gaps = 62/395 (15%)
Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
+K+ P ++ LP D ++ +P IGG+LV G+N Y L+ + L
Sbjct: 218 VKKKPASLNSDPLPDDVNYMIPLPGHIGGMLVCGSNWCFYD--------KLDGPRIYLPL 269
Query: 352 SQELPRSSFSVELD-AAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLS--KT 406
+ ++ S+ ++ H +N LL GDL LTV Y D ++ + ++ T
Sbjct: 270 PRRDGQTQESIIVNHVTHVLKKKNFFILLQNTLGDLFKLTVDYDFDKETIKNISITYFDT 329
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
P L+ +I N F+ D LL QF E+ GD A+
Sbjct: 330 IPPALSLNI--FKNGFLFVNVLNNDKLLYQF--------------EKLGDDLAE----NE 369
Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RI 524
L +SSD Y S N + TF D+L I L+ S + RI
Sbjct: 370 LVINSSD-------------YDSLDNVRGTDTTTFKLKGLDNLALIDVLETLSPIIDSRI 416
Query: 525 NADASATGISKQSNYELVE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII- 581
N D+ +S S + + +P + + + + + +DE YL+I
Sbjct: 417 N-DSKLVTLSSHSYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDE---YLVIS 472
Query: 582 -SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG---ARILDGS 637
SL ++T+VL +++ +V +S FV ++ + G V+QV+ G R ++G
Sbjct: 473 SSLSSKTLVLSIGEVVEDVEDS--EFVLDQSTISVQQVGIASVVQVYSNGIKHIRTVNGK 530
Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
T D F P+ S N+ + +++++ V+
Sbjct: 531 KKTTDW-FPPAGITITHASTNNQQVLIALSNLNVV 564
>gi|195490209|ref|XP_002093045.1| GE20993 [Drosophila yakuba]
gi|194179146|gb|EDW92757.1| GE20993 [Drosophila yakuba]
Length = 1227
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|24654874|ref|NP_728546.1| CG13900, isoform A [Drosophila melanogaster]
gi|23092721|gb|AAF47416.2| CG13900, isoform A [Drosophila melanogaster]
gi|60678131|gb|AAX33572.1| LD01809p [Drosophila melanogaster]
gi|220950356|gb|ACL87721.1| CG13900-PA [synthetic construct]
gi|289803030|gb|ADD20765.1| FI04459p [Drosophila melanogaster]
Length = 1227
Score = 44.7 bits (104), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|195336406|ref|XP_002034829.1| GM14250 [Drosophila sechellia]
gi|194127922|gb|EDW49965.1| GM14250 [Drosophila sechellia]
Length = 1227
Score = 44.7 bits (104), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|60677959|gb|AAX33486.1| RE01065p [Drosophila melanogaster]
Length = 1227
Score = 44.7 bits (104), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|225680146|gb|EEH18430.1| DNA damage-binding protein [Paracoccidioides brasiliensis Pb03]
Length = 1138
Score = 44.3 bits (103), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)
Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
L+ VP+P+GG+LV+G +I Y D++ E S+ LD A
Sbjct: 318 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 357
Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
W Q D LL+ G L L ++ D VQ +LDL P S + +G +
Sbjct: 358 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 415
Query: 424 FLGSRLGDSLLVQFTCGS 441
F+GS GDS L++ T GS
Sbjct: 416 FIGSHQGDSQLIRITEGS 433
>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1144
Score = 44.3 bits (103), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 21/142 (14%)
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
NL A L+ VP P+GG+L++G I Y ++N ++L +
Sbjct: 239 NLELGAEILVPVPLPLGGILILGEKCIKYVD-------TISNETITL-----------PL 280
Query: 363 ELDAAHATW--LQNDVALLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIG 419
E + W L N LL+ G L L +V D V+ + + S + +G
Sbjct: 281 EYNTVFVAWEQLDNQRWLLADDYGRLFFLMLVLDSANAVRTWKVDLLGETSRASVLVHLG 340
Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
+ FLGS GDS +++ T GS
Sbjct: 341 GGVVFLGSHQGDSHVIRITEGS 362
>gi|194749950|ref|XP_001957397.1| GF24063 [Drosophila ananassae]
gi|190624679|gb|EDV40203.1| GF24063 [Drosophila ananassae]
Length = 1228
Score = 44.3 bits (103), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|194864680|ref|XP_001971056.1| GG14635 [Drosophila erecta]
gi|190652839|gb|EDV50082.1| GG14635 [Drosophila erecta]
Length = 1227
Score = 44.3 bits (103), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV ++ DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
T+ L G ++QV+ G R I + + + G + + ++ V+++S
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563
Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
DP ++ +G++D ++R+L DP+
Sbjct: 564 VYFEMDPSGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623
Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
C TP ++++ P S L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642
>gi|320581947|gb|EFW96166.1| hypothetical protein HPODL_2449 [Ogataea parapolymorpha DL-1]
Length = 1203
Score = 44.3 bits (103), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 111/281 (39%), Gaps = 42/281 (14%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
AL + QG LL+ + ++IL+ +L + +V L +++ D+ S+
Sbjct: 944 ALTAFQGKLLVGAKNELILYDIGQKQLVKRSSTRLECYEIVDLKTQGFRVIVSDVRDSVR 1003
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE--- 1275
+ +K D T L+D T+ VV D+ NI + ++SE
Sbjct: 1004 YTVYKPLENSFVDFIDDTMQRHVTRT-LLLDYDTV--VVGDKFGNISVLRCPEQISEMSD 1060
Query: 1276 --------SWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1327
+ KL + ++VG T F + S G A ++++G
Sbjct: 1061 EDNHGFLVKMRRTKLDNPVNYYVGDMPTFFQK------GSLTIGGAE--------SIIYG 1106
Query: 1328 TLDGSIGCIAPLDELT----FRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1383
L G +GC+ P+ L+ F+ LQ L + L R + +F K + P
Sbjct: 1107 CLQGQMGCLYPMKSLSEINFFKELQRL------IIHEFTSLTDREYLKF----KGYYNPP 1156
Query: 1384 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
+ +D +L+ Y L E+++ IA + I ++D+
Sbjct: 1157 KNSIDGDLIEEYYRLGPEKRIRIATKMDRLPRDIDRRISDM 1197
>gi|240280498|gb|EER44002.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus H143]
Length = 305
Score = 44.3 bits (103), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 61/283 (21%), Positives = 118/283 (41%), Gaps = 27/283 (9%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
AL QG LL G + ++ ++ P VV L + I++ D+ +S+
Sbjct: 39 ALLGFQGRLLAGIGTDLRIYDLGMKQMLRKCQASVVPHLVVGLQTQGSRIIVSDVQESLT 98
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESW- 1277
++ +K Q +L D S T ++D T++ D+ N+ + K SE
Sbjct: 99 YVVYKYQENRLIPFVDDVISRWTTCTT-MVDYETVA--GGDKFGNLWLLRCPAKASEEAD 155
Query: 1278 ---KGQKLLSRAEFHVGA----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1330
G L+ ++ GA ++ Q L TS + G R L++ L
Sbjct: 156 EDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTG----GRDILVWTGLQ 211
Query: 1331 GSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1387
G++ + P +E+ F QSL+ +L P +AG + +R +++ K +
Sbjct: 212 GTVSMLIPFISREEVDF--FQSLEMQLAAQNPPLAGRDHLIYRSYYAPAKG-------TI 262
Query: 1388 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1430
D +L Y +LP +++ +IA + + +I + D+ ++
Sbjct: 263 DGDLCETYLLLPNDKKQQIAGELDRSVREIERKIADMRTQVAY 305
>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
Length = 1087
Score = 44.3 bits (103), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/143 (23%), Positives = 68/143 (47%), Gaps = 10/143 (6%)
Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
+GT+++ E+ RGR+L+F G NAD ++ S LKG+ + L G ++ A
Sbjct: 118 VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 172
Query: 1173 PKIILHKWTG-TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1231
++++ T L G+ L+ I+ + +++ + +W+ + L
Sbjct: 173 KTVVMNATTAKANLGGLVVLGETKF--TYLDDESKAIVEYALDEAVLWAAWEPIDERTYL 230
Query: 1232 LAKDFGSLDCFATEFLIDGSTLS 1254
L D+G L + L+DG+T++
Sbjct: 231 LGDDYGFL--YILTILVDGATVT 251
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 134/331 (40%), Gaps = 57/331 (17%)
Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
+GT+++ E+ RGR+L+F G NAD ++ S LKG+ + L G ++ A
Sbjct: 765 VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 819
Query: 1173 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
++++ + T L +A Y + + DI KSI + + GA
Sbjct: 820 KTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIA-VADIMKSIALVEYT-PGA 877
Query: 1228 -----QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
+L +A+ + F+T + + T + + +D N+ + + ++
Sbjct: 878 DGLPDKLEEVARH--AQQVFSTS-VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRM 934
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL-----FGTLDGSI---G 1334
E ++G V + R+ + +T++ ALL GT +GSI
Sbjct: 935 EVTCEMNLGEMVNRVKRINV---------------ETSKDALLIPRAFLGTTEGSIYLFS 979
Query: 1335 CIAPLDELTFRRLQSLQKKL---------VDSV-PHVAGLNPRS--FRQFHSNGKAHRPG 1382
I P ++ RLQS L DS PH L+P + F ++ S A R
Sbjct: 980 LIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRET 1039
Query: 1383 --PDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
P VD EL+ + L +E Q +A G
Sbjct: 1040 SEPFRFVDGELIERFLDLEVEVQEHVAEGLG 1070
>gi|83314897|ref|XP_730560.1| multisubunit cleavage/polyadenylation specificity factor subunit A
[Plasmodium yoelii yoelii 17XNL]
gi|23490318|gb|EAA22125.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
Length = 863
Score = 44.3 bits (103), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 97/213 (45%), Gaps = 8/213 (3%)
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
P +++SL++++N+I++GDI S+ LS+ + L + +D+ ++ C F+ S
Sbjct: 624 TPSSWIMSLDVIENYIVVGDIMTSVTILSYDFNNSTLTEVCRDYSNVWC---TFVCALSK 680
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
+VSD + N +F + KL A F+ G V K L + + +S
Sbjct: 681 SHFLVSDMESNFLVFQKSSIRYNDEDSFKLSRVALFNHGHVVNKMLPVSL--SSLIEEEE 738
Query: 1313 APGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
A + ++L + +GSI I P LT F++ ++ L DS+ + +N S
Sbjct: 739 AQNEILRKKESILCASSEGSISSIIPFSNLTNFKKALCIEIALNDSLSFIXNINNNSNNT 798
Query: 1372 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQL 1404
+ N +VD E+ + +P E+Q
Sbjct: 799 YKMN--LSEKSSKGVVDGEVFKMFFSMPFEKQF 829
>gi|405121632|gb|AFR96400.1| hypothetical protein CNAG_03173 [Cryptococcus neoformans var. grubii
H99]
Length = 1276
Score = 44.3 bits (103), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 109/266 (40%), Gaps = 42/266 (15%)
Query: 1111 LAIGTAYV---QGED---------VAARGRVLLFSTGRNADNPQNLVTEVYSK-ELKGAI 1157
LA+GT + GED V GRVLL + D +V ++ GA+
Sbjct: 945 LAVGTGILPPDDGEDSSWDEGNLAVVREGRVLLLEF-KEGDAGSGWDIKVKAELATVGAV 1003
Query: 1158 SALASLQGHLLIASGPKIILHKW--TGTELNGIAFYDAPPLYVVSLNIV-------KNFI 1208
AL + G L +A+G K+ +H+ EL + + A + SL+++ + +
Sbjct: 1004 YALEEIHGFLAVAAGSKLTIHRLDHNPVELEETSSW-ASAYVISSLSVLPPSLMRPEGAL 1062
Query: 1209 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1268
++GD +S+ L+ E + ++ + A L D +V+SD N+ +
Sbjct: 1063 IVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGD-GVVISDAHSNLLTYR 1121
Query: 1269 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1328
QKL A F + VT+F ++ T++ P +LF T
Sbjct: 1122 L---------NQKLERAATFGLHEEVTRFQSGSLVPTTTAPEIIIPD--------VLFAT 1164
Query: 1329 LDGSIGCIAPLDELTFRRLQSLQKKL 1354
+G +G I L + R L LQ+ +
Sbjct: 1165 REGRLGIIGELGTRSSRTLDDLQRNM 1190
>gi|301091539|ref|XP_002895953.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262096049|gb|EEY54101.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 118
Score = 43.9 bits (102), Expect = 0.61, Method: Composition-based stats.
Identities = 19/57 (33%), Positives = 32/57 (56%)
Query: 1328 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1384
T + + + P+ + FRRL +LQ ++V+++P L+PR FR +N K PD
Sbjct: 6 TSEEGVSALIPVGKGVFRRLFTLQNEMVNTLPQNCALDPREFRMLKTNAKRRCGRPD 62
>gi|145549784|ref|XP_001460571.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428401|emb|CAK93174.1| unnamed protein product [Paramecium tetraurelia]
Length = 1178
Score = 43.9 bits (102), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 63/284 (22%), Positives = 120/284 (42%), Gaps = 41/284 (14%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
ALA+ +G LL+ +G + +++ + A ++ S+ + ++ I + ++ SI+
Sbjct: 913 ALAAWRGRLLVGAGCNLRVYEMGNQRILKKAEIKNLNSFITSIMVKEDRIYVAEVADSIH 972
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFL-----IDGSTL-SLVVS------DEQKNIQI 1266
L + + LA D A+ L I G ++ VS DE++
Sbjct: 973 LLRYNIRDQTFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHP 1032
Query: 1267 FYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+ Y KM + K+ F+VG +T ++ +++TSS+
Sbjct: 1033 YEYKMKMDQGCMNGAPFKMDQICNFYVGEVITSLQKIALVSTSSE--------------V 1078
Query: 1324 LLFGTLDGSIGCIAPLD---ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
+++GT GSI + P D ++ F L ++ V H L+ R QF S A+
Sbjct: 1079 VVYGTSMGSIAALYPFDNKEDIDF----FLHLEMYLRVEH-QPLSGRDHMQFRS---AYG 1130
Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
P SI+D +L + + +Q +A + T + I+ L D+
Sbjct: 1131 PC-KSIIDGDLCDQFGNMQYNKQRAVAEEFDRTPADIIKKLEDI 1173
>gi|116191283|ref|XP_001221454.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
gi|88181272|gb|EAQ88740.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
Length = 979
Score = 43.9 bits (102), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 71/344 (20%), Positives = 144/344 (41%), Gaps = 42/344 (12%)
Query: 1082 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG-----R 1136
TI +Q +E A ++ + + T++++E+ L + T G D+ R L S G R
Sbjct: 558 TIHLQENEAAASLAIASF---TSQDDESFLVVST----GRDMVLNPRQL--SGGYSYVYR 608
Query: 1137 NADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE-LNGIAFYDAPP 1195
D+ ++L ++ +AL + +G L+ G ++++ + L D P
Sbjct: 609 FHDDGRDLEL-IHKTGTTEPPTALEAFRGRLVAGIGKTLVVYDLGLKQMLRKTQANDVVP 667
Query: 1196 LYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSL 1255
+VSL N I++GD+ + ++++ + QL D + T ++D S+
Sbjct: 668 GLIVSLQTQGNRIVVGDVQHGVAMVAYRTESNQLIPFVDDTIARWTTCTT-MVDYD--SV 724
Query: 1256 VVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAH---VTKFLRLQMLATSSD 1308
D+ N I + S E + L +R H H +T Q + T
Sbjct: 725 AGGDKFGNFWIVRTPQQASLEADEPGAHRLLHAREHLHGAPHRLQLTAHFHTQDIPTGIT 784
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLN 1365
+T G + L++ G++G P ++ F +L++ + P + G +
Sbjct: 785 KTHLVVGGQE----VLVWSGFQGTVGVFVPFVTREDADF--FLALEQHMRGEEPSLIGRD 838
Query: 1366 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1409
++R ++ K +VD +L Y++LP +++ IA +
Sbjct: 839 HLAYRGYYEPAKG-------VVDGDLCERYQLLPGDKKQRIAAE 875
>gi|406700450|gb|EKD03620.1| hypothetical protein A1Q2_02097 [Trichosporon asahii var. asahii CBS
8904]
Length = 1119
Score = 43.9 bits (102), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 141/352 (40%), Gaps = 62/352 (17%)
Query: 1022 VPVLKPLNQVLSLLIDQEVGHQI-DNHNLSSVDLHRTYTVE---EYEVRILEPDRAGGPW 1077
V L N V++ + + + HQ D SSV+L T+E E+++ P+R
Sbjct: 727 VAALPGYNLVVAGTVTRSMDHQTGDVLQSSSVELRNATTLELLSEFQL----PEREA--- 779
Query: 1078 QTRATIPMQSSENALTV--RVVTLFNTTTKENETLLAIGTAYVQGEDVAA-----RGRVL 1130
+S NA+T+ R L T ENE L T EDV + RGR+L
Sbjct: 780 --------VASVNAVTLHGRKYILVGTAIFENEDALEDATL----EDVTSFIATNRGRLL 827
Query: 1131 LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGT-----EL 1185
LF +A +LVT S G + + G L +A+ K+ + + T E
Sbjct: 828 LFQINESAGPSLDLVT---SMTFNGPVYDTVVIHGFLAVATSTKVSILRLTTQPPSLEEA 884
Query: 1186 NGIAF-YDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS--LDCF 1242
AF ++ L VV ++ K + +GD +SI LS + + +D + + C
Sbjct: 885 ASFAFAFETHHLAVVEIDKEKRLV-VGDAMRSIIVLSVDPESGDIVGDQRDMNAHLVRCL 943
Query: 1243 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1302
+ ++ + ++D N+ F G+ + A + VT+ +
Sbjct: 944 SAVHDVEPGVM---IADNYANLLTFRL---------GKGITPAASIGLSEDVTRLQPGTL 991
Query: 1303 LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
S++ R LL T++G +G I L + + R L LQ+ +
Sbjct: 992 APVSAE--------GDILRADLLCTTVNGRLGVIGELGKGSIRTLDDLQRNM 1035
>gi|145510432|ref|XP_001441149.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408388|emb|CAK73752.1| unnamed protein product [Paramecium tetraurelia]
Length = 1174
Score = 43.9 bits (102), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 63/284 (22%), Positives = 120/284 (42%), Gaps = 41/284 (14%)
Query: 1159 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1218
ALA+ +G LL+ +G + +++ + A ++ S+ + ++ I + ++ SI+
Sbjct: 909 ALAAWRGRLLVGAGCNLRVYEMGNQRILKKAEIKNLNSFITSIMVKEDRIYVAEVSDSIH 968
Query: 1219 FLSWKEQGAQLNLLAKDFGSLDCFATEFL-----IDGSTL-SLVVS------DEQKNIQI 1266
L + + LA D A+ L I G ++ VS DE++
Sbjct: 969 LLRYNIRDQTFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHP 1028
Query: 1267 FYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+ Y KM + K+ F+VG +T ++ +++TSS+
Sbjct: 1029 YEYKMKMDQGCMNGAPFKMDQICNFYVGEVITSLQKIALVSTSSE--------------V 1074
Query: 1324 LLFGTLDGSIGCIAPLD---ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1380
+++GT GSI + P D ++ F L ++ V H L+ R QF S A+
Sbjct: 1075 VVYGTSMGSIAALYPFDNKEDIDF----FLHLEMYLRVEH-QPLSGRDHMQFRS---AYG 1126
Query: 1381 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1424
P SI+D +L + + +Q +A + T + I+ L D+
Sbjct: 1127 PC-KSIIDGDLCEQFGNMQYNKQRTVAEEFDRTPADIIKKLEDI 1169
>gi|453087531|gb|EMF15572.1| splicing factor 3B subunit 3 [Mycosphaerella populorum SO2202]
Length = 1223
Score = 43.9 bits (102), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 74/363 (20%), Positives = 143/363 (39%), Gaps = 36/363 (9%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1138
+T+ M +E AL V ++ E LA+GT G + A G V +F +
Sbjct: 884 STVEMGDNEAALCCACVAF---ESRNWEVFLAVGTGQHMSPGTGLQAAGYVHIFKLEEDG 940
Query: 1139 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1198
+T V+ + + AL G L + G ++ ++ L A A P +
Sbjct: 941 TK----LTFVHKTKFDQPVYALLPFHGRLALGVGNELFIYDIGQKALLRKARGQATPNQI 996
Query: 1199 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVS 1258
VSL I+ GD+ + + ++ +K +L D T +ID T +
Sbjct: 997 VSLESHGQRIICGDVSEGVTYMVYKPGYNRLIPFVDDVVQRWTTGTT-MIDYETTA--GG 1053
Query: 1259 DEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLR----LQMLATSSDRT 1310
D+ N+ + + S E G +++ + GA LR Q + S RT
Sbjct: 1054 DKFGNLWVVRCPEQPSQEADEEGAGGFIMNERSYLGGAPYRLDLRAHYYCQDIPMSLQRT 1113
Query: 1311 GAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPR 1367
G + L + L G++G + P +++ F SL+++L P +AG +
Sbjct: 1114 ALVAGGQEV----LFWSGLQGTLGMLVPFVTREDVEF--FTSLEQQLRIEDPPLAGRDHL 1167
Query: 1368 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1427
+R ++ K ++D +L + L + + ++A + + +I + ++
Sbjct: 1168 MYRSYYVPVKG-------VIDGDLCERFMALSYDSKQKVAAEVDRSVKEIEKKVQEMRTR 1220
Query: 1428 TSF 1430
+F
Sbjct: 1221 VAF 1223
>gi|384490729|gb|EIE81951.1| hypothetical protein RO3G_06656 [Rhizopus delemar RA 99-880]
Length = 967
Score = 43.9 bits (102), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 72/154 (46%), Gaps = 17/154 (11%)
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
S + + + L+ VP P+GG+LV+G I Y L N +S+D ++ ++
Sbjct: 198 STIKVEASTHALVPVPEPLGGLLVIGEYIITYFD-----PLTNTNRELSIDPAR---VTA 249
Query: 360 FSVELDAAHATWLQND-----VALLSTKTGDLVLLTVVYDGRVV---QRLDLSKTNPSV- 410
+ D ++ L ++ V + T +V L+ + G+V Q ++ +P V
Sbjct: 250 WEFMKDESNRYLLGDEEGYLYVFSIETSHNKVVNLSSTFIGQVPSFNQNIESKANHPQVS 309
Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
S I +GN +F++GS GDS L+Q G S
Sbjct: 310 RPSCIVDLGNLMFYIGSTHGDSCLIQLIKGQEKS 343
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 51/224 (22%), Positives = 108/224 (48%), Gaps = 18/224 (8%)
Query: 1152 ELKGAISALASLQGHLLIASGPKII-LHKWTGTELNG--IAFYDAPPLYVVSLNI---VK 1205
++ G + + S++ ++ A KI L+ + L G I F VV+L++
Sbjct: 746 DMPGVVYRMESIKNTIIAAVDGKIYGLYNFKPDLLKGERIEFKFLLHNNVVALDMDTDNN 805
Query: 1206 NFILLGDIHKSIYFLSWK--EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKN 1263
+ +L+GD+ +S+ L + E+ +L+L A D + A +F+ + L+ +D++ N
Sbjct: 806 DTLLVGDLMESMSLLKVEKDEESLKLSLEAVDNKQVWMTAVKFVNENV---LIGADDRHN 862
Query: 1264 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA 1323
+ P++ + K KL +H+G V +F R +L D A+ D +++
Sbjct: 863 L-FTMIKPEIRQEGKTCKLELEGGYHLGTLVNRF-RKDIL---RDVENASDNIDSISKYE 917
Query: 1324 --LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLN 1365
F T++GSIG + + +F + +Q+ +++ +P+ L+
Sbjct: 918 SEFTFATVNGSIGTVKTISRESFEFFKGIQEGILNILPNNGNLD 961
>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1153
Score = 43.9 bits (102), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 58/252 (23%), Positives = 106/252 (42%), Gaps = 37/252 (14%)
Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
+GT+++ +V RGR+L+F G N+D ++ S LKG+ + L G ++ A
Sbjct: 835 VGTSFLHDGEVNIRGRLLIF--GVNSDRTPYIIA---SHTLKGSCRCIGVLNGKIVAALN 889
Query: 1173 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
++++ + T L +A Y + ++I N I + DI KS+ + +
Sbjct: 890 KTVVMYDYEETSRTTANLRKVATYRCATC-PIDIDIRGNIIAVADIMKSVALVEYTPGVD 948
Query: 1228 QLNLLAKDFG--SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSR 1285
L ++ G + FAT + + T + + SD N+ + + +L
Sbjct: 949 GLPDKLEEVGRHAQQVFATS-IAEVDTDTYLESDHDGNLIVLKRNREGVTREDKLRLEVL 1007
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLF-----GTLDGSI---GCIA 1337
E ++G V K R+ + +T++ ALL T +GSI I
Sbjct: 1008 CEMNLGEMVNKIKRINV---------------ETSKDALLIPRAFVATTEGSIYLFSLIP 1052
Query: 1338 PLDELTFRRLQS 1349
P ++ RLQS
Sbjct: 1053 PQNQDLLMRLQS 1064
>gi|195428692|ref|XP_002062402.1| GK16677 [Drosophila willistoni]
gi|194158487|gb|EDW73388.1| GK16677 [Drosophila willistoni]
Length = 1273
Score = 43.9 bits (102), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 347 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 406
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 407 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIVTSQ 456
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 457 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 504
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 505 WTVKKR--------------VDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 549
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 550 TLCCAAL-GDDALVQVYPDGIR 570
>gi|422294117|gb|EKU21417.1| uv-damaged dna-binding protein [Nannochloropsis gaditana CCMP526]
Length = 192
Score = 43.5 bits (101), Expect = 0.79, Method: Composition-based stats.
Identities = 31/106 (29%), Positives = 47/106 (44%), Gaps = 5/106 (4%)
Query: 1309 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
R+G L+FGT G IG I P+ E +R +L K L V V GL+
Sbjct: 29 RSGGEVARGHVQDLGLMFGTQQGMIGSILPISEEDYRFFVALTKCLNKVVKGVGGLSHEE 88
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLP---LEEQLEIAHQTG 1411
+R+F ++ VD +L+ + LP +EE +E+ G
Sbjct: 89 YRRFLTDKAI--SDTQGFVDGDLIESFLELPTQRMEEVVELMRVEG 132
>gi|226480826|emb|CAX73510.1| glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum]
Length = 332
Score = 43.5 bits (101), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 34/212 (16%)
Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
R DS+ L A ++++E +DS+ + + S S E R +G V
Sbjct: 72 RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVAS----GSVE----DRSARIIDQGFDVL 123
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFG-SGGGFSARIESSHVINLRDLD 242
+DP V +Y GL IIL G DT + +S RIE +++
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIGDKIGTDFLDTNQWTVNTYSVRIEEGNIV------ 177
Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
D F++GY P +++E EL + +++ + + ++ TL
Sbjct: 178 -----DMAFIYGYSLPTFAMIYEDELVLHMK-TYEIYGREPALRNVQLTLD--------- 222
Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
++ D+ L+ VP P GGV++VG N I YH++
Sbjct: 223 SIEPDSKLLIPVPKPYGGVILVGDNIICYHTK 254
>gi|198420618|ref|XP_002125906.1| PREDICTED: similar to Splicing factor 3B subunit 3
(Spliceosome-associated protein 130) (SAP 130)
(Pre-mRNA-splicing factor SF3b 130 kDa subunit)
(SF3b130) (STAF130) [Ciona intestinalis]
Length = 1216
Score = 43.5 bits (101), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 97/470 (20%), Positives = 164/470 (34%), Gaps = 129/470 (27%)
Query: 304 LPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS-- 358
L A L++VP GGVLV N I Y N+ D +PR
Sbjct: 228 LEERANHLISVPGGNDGPGGVLVCAENYITY-----------KNFGDQPDIRTPIPRRRN 276
Query: 359 -------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
V A H T L+ T+ GD+ +T+ D +V + L + +
Sbjct: 277 DLDDPERGMIVVCSATHKTKSMF-FFLIQTEQGDIFKVTLETDEDMVTEIRLKYFDTVPV 335
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPSTKR-- 466
+ + + F+ + +G+ L Q + SS + E GD AP R
Sbjct: 336 SMAMCVLRTGFLFVAAEMGNHCLYQIAHLGDDDDETEFSSAMPLEEGDTFFYAPRALRNL 395
Query: 467 --------LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
L + + D+ N + LY + S+ L+
Sbjct: 396 VLVDELDSLSPIMTCLISDLANEDTPQLYVTCGRGPRSS-----------------LRVL 438
Query: 519 SYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
+GL + S + ELPG +WTV K ++E+ +
Sbjct: 439 RHGLEV------------SEMAVSELPGNPNAVWTVKIKE--------------EEEFDS 472
Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
Y+I+S T+VL + + EVT+S F+ + +L G ++QV+ G R +
Sbjct: 473 YIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTLSCSLLGENALVQVYPDGIRHIRAD 530
Query: 638 ----------------------------------YMTQDLSFGPSNSESGSGSENSTVLS 663
Y D S G N + NS V+
Sbjct: 531 KRVNEWKTPGKKTILRCAVNQRQVVIALTGGELVYFEMDQS-GQLNEYTERKEMNSEVVC 589
Query: 664 VSIAD--------PYVLLGMSDGSIRLLVGDPSTC--TVSVQT-PAAIES 702
+ ++ ++ +G++D ++R++ DP+ C +S+Q PA ES
Sbjct: 590 MDLSKVPPTEQRTRFLAVGLADNTVRIISLDPTDCLQPLSMQALPATPES 639
>gi|195376606|ref|XP_002047087.1| GJ13230 [Drosophila virilis]
gi|194154245|gb|EDW69429.1| GJ13230 [Drosophila virilis]
Length = 1229
Score = 43.5 bits (101), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|328700785|ref|XP_001945395.2| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1072
Score = 43.5 bits (101), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 41/202 (20%), Positives = 88/202 (43%), Gaps = 38/202 (18%)
Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
++ +++D F++G+ P ++I++E +A+ + +K+
Sbjct: 188 MEETNIQDIGFLYGFTNPTIIIIYE------------------NAMGRTIKIKKIIDSKK 229
Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
++ +A ++ VPSP+ G +++G N+I YH + SC + LP
Sbjct: 230 YKSIEKEASMVIPVPSPLCGAIIIGENSIFYH--NGSCNII------------RLPIRQ- 274
Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSDI 415
+E+ L+ LL +G L++L + Y+ + V L L + +
Sbjct: 275 KIEIVCYTRVDLEGTRYLLGDHSGCLLMLFLKYEKTLNGKFKVTDLYLRYFGEISIPISL 334
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
T + N + ++ S+ GDS L++
Sbjct: 335 TYLDNKVIYVASKFGDSQLIKL 356
>gi|302423344|ref|XP_003009502.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
gi|261352648|gb|EEY15076.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
Length = 1119
Score = 43.5 bits (101), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 13/136 (9%)
Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVSLDSS----QELPRSSFSV 362
L +P P L+ V ++ YH + + A A L V+ ++ L + S
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280
Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
LD +W + DV LL+ G+L LLT+ DG +V L L + S + +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340
Query: 419 GNSLFFLGSRLGDSLL 434
GN + FLGS GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356
>gi|195126264|ref|XP_002007593.1| GI12293 [Drosophila mojavensis]
gi|193919202|gb|EDW18069.1| GI12293 [Drosophila mojavensis]
Length = 1227
Score = 43.5 bits (101), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|195012560|ref|XP_001983703.1| GH16029 [Drosophila grimshawi]
gi|193897185|gb|EDV96051.1| GH16029 [Drosophila grimshawi]
Length = 1228
Score = 43.1 bits (100), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
SS + E G+ AP AL+++V +EL + +
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411
Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
+ L GP L+ +GL + S + ELPG +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459
Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
WTV + DDE+ AY+I+S T+VL + + EVT+S +
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504
Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
T+ L G ++QV+ G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525
>gi|449437538|ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-like [Cucumis sativus]
Length = 1376
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 90/216 (41%), Gaps = 12/216 (5%)
Query: 1148 VYSKELKGAISALAS-LQGHLLIASGPKIILHKWTGTELNGIAFYDA--PPLYVVSLNIV 1204
VYS L G + A+ L + L ++G + + + + + SL
Sbjct: 1076 VYSTSLPGMVLAICPYLDRYFLASAGNAFYVCGFPNDSFQRVKRFAVGRTRFMITSLTAH 1135
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNI 1264
N I +GD I F S++E +L + D S A L+D T VVSD + +I
Sbjct: 1136 VNRIAVGDCRDGILFFSYQEDAKKLEQIYSD-PSQRLVADCTLLDVDTA--VVSDRKGSI 1192
Query: 1265 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR-----LQMLATSSDRTGAAPGSD-K 1318
I + ++ ++ + L+ + + LR ++ A R A PGSD
Sbjct: 1193 AILSCSDRLEDNASPECNLTLNCAYYMGEIAMTLRKGSFSYKLPADDLLRGCAVPGSDFD 1252
Query: 1319 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKL 1354
++ ++ TL GSI PL + L+++Q KL
Sbjct: 1253 SSHNTIIASTLLGSIVIFTPLSRDEYELLEAVQAKL 1288
>gi|213405251|ref|XP_002173397.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
yFS275]
gi|212001444|gb|EEB07104.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
yFS275]
Length = 1166
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 121/561 (21%), Positives = 202/561 (36%), Gaps = 99/561 (17%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
+ L+ +G V ++A L G ++D ++L + + ++LE+D + L
Sbjct: 56 MNLMISQNCYGIVRNIAPLRLTGF----KKDYLVLTSDSGRFTILEYDIGKNKLVSVYQE 111
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGLQ------MII---L 204
F K G G + +D +GR V LVY L + I L
Sbjct: 112 AFG-------KSGIRRIVPGEYLALDAKGRAAMVASTEKNKLVYVLNRDSEANLTISSPL 164
Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
+A + G+ D G G+ I ++ + DLD + +
Sbjct: 165 EAHKAGTICF---DLVGLDTGYENPIFAALEVEYSDLDHDPLGEL--------------- 206
Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGG 320
+KH +++ + L WS + + AYKL+ VP P G
Sbjct: 207 -----------YKHSEKVLTYYELDLGLNHVVKRWSKV-VDRSAYKLIRVPGGNDGP-SG 253
Query: 321 VLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
V+V+ I Y H Q S + + ++ LP + + A + LL
Sbjct: 254 VIVISTGWISYRHLQRQSHFVPIPTRETKATTNTALP-----IIVSAVMHKMRDSFFYLL 308
Query: 380 STKTGDLVLLTVVYDGRV-VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
GDL+ LT+ D V+ L + + + + + + L F G G+ L QF
Sbjct: 309 QNSDGDLLKLTMELDDHSQVKELRIKYFDTIPFAAILNILKSGLLFAGCEGGNHHLYQFE 368
Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ 498
S+ + EF +K + + L + N L S T++
Sbjct: 369 -----SLAIDDDEPEFSSANFSEEQSKHSPKKLTYKLHPLQNISLLDEIPSLFPLTDAIV 423
Query: 499 KTFSFAVRDSLVNI-GPLKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHK 556
S L + G K+ S L + SAT + L ELPG IWTV K
Sbjct: 424 TRTSTDANSQLYTLCGRHKEASLRL-LKRGVSATEVV------LSELPGAPIAIWTVKQK 476
Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
+D Y Y+++S T+VL + + EV +S T+
Sbjct: 477 --------------LNDPYDKYMVLSFTNGTLVLSIGETVEEVLDS-GLLSSVSTLNVRQ 521
Query: 617 LFGRRRVIQVFERGARILDGS 637
L GR V+Q+ +G R + +
Sbjct: 522 L-GRSSVVQIHSKGIRCISAN 541
>gi|167539942|ref|XP_001741428.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
gi|165894130|gb|EDR22214.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
Length = 1004
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 92/197 (46%), Gaps = 34/197 (17%)
Query: 1081 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-ARGRVLLFSTGRNAD 1139
T+ ++S+E AL V + + + A+GTA ++ ++ + GR+LL
Sbjct: 701 TTVELKSNELALCVDSL---------EDNIYAVGTAIIRENEIEPSSGRILLIR-----Q 746
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
+ + L+ V +++ GA+ L Q ++ + + + G +L+ PL V
Sbjct: 747 DSEGLIYIVGTEDYDGAVYCLKKYQKGIVAFINRNVHVIEKKGKDLSTKQNM-LLPLIGV 805
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKD--------FGSLDC-FATEFLIDG 1250
SL+I K++I+ GD+ +S+ ++ L+++ KD GS++ + T FL
Sbjct: 806 SLDICKDYIIAGDLARSVSVYRYRNDIEHLDIVGKDNQIVWSSCVGSIESEYGTSFL--- 862
Query: 1251 STLSLVVSDEQKNIQIF 1267
V+D NI+IF
Sbjct: 863 ------VADVSGNIKIF 873
>gi|134077422|emb|CAK45676.1| unnamed protein product [Aspergillus niger]
Length = 1133
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G +I Y V DS++ + R LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268
Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
W Q D LL+ G L L +V D VQ L + S + +G +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328
Query: 423 FFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347
>gi|406868052|gb|EKD21089.1| pre-mRNA-splicing factor rse1 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1236
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 71/339 (20%), Positives = 142/339 (41%), Gaps = 38/339 (11%)
Query: 1083 IPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGR---VLLFSTGRNAD 1139
I + +E A+++ V+ +++++E L IGT G+D+ R R D
Sbjct: 875 IDLDDNEAAVSMAAVSF---SSQDDEVFLVIGT----GKDMIVSPRSSTAGFIHVYRFHD 927
Query: 1140 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1199
N + + ++ +++ AL QG LL+ G + ++ +L A + P +V
Sbjct: 928 NGKE-IEFIHKTKVEEPPMALLGFQGRLLVGIGKDLRIYDLGMRQLLRKAQAEVAPNLIV 986
Query: 1200 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1259
L + I++ D+ +SI + +K Q +L D S T ++D T++ D
Sbjct: 987 GLQTQGSRIVVSDVQESIIMIVYKFQENKLIPFVDDTISRWTSCTT-MVDYETVA--GGD 1043
Query: 1260 EQKNIQIFYYAPKMSESWKGQKLLS-----RAEFHVGAHVTKFLR---LQMLATSSDRTG 1311
+ N+ + K SE + S R+ H + Q + S +T
Sbjct: 1044 KFGNLWLLRCPTKASEEADEEGSASHLVHERSYLQGSPHRLTLMAHFFTQDIPMSIQKTN 1103
Query: 1312 AAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRS 1368
G R +L+ + G++G + P +++ F Q+L++ L +AG +
Sbjct: 1104 LVAG----GRDCILWSGIQGTLGILIPFVSREDVDF--FQTLEQHLRSEDAPLAGRDHLI 1157
Query: 1369 FRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIA 1407
+R ++ K ++D +L Y +LP +++ IA
Sbjct: 1158 YRSYYVPVKG-------VIDGDLCERYTLLPTDKKQMIA 1189
>gi|392591958|gb|EIW81285.1| hypothetical protein CONPUDRAFT_56293 [Coniophora puteana RWD-64-598
SS2]
Length = 1245
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 99/450 (22%), Positives = 193/450 (42%), Gaps = 81/450 (18%)
Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
G F GSR ++RLR H L + A LH + + TS+G++
Sbjct: 775 GRTAVFACGSRTSVLFWEKDRLR-HSPLILKEVAAAASLHTHDYRSSLVLATSEGLV--- 830
Query: 982 QLPSGSTYDNYWPVQKIPLKA------TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1035
+ ++K+ +++ P +I++ P+++ L++
Sbjct: 831 -------IGDVQNLEKLHIRSIHTGLDNPRRISH----------------SPVHKALAVG 867
Query: 1036 IDQEVGHQIDNHNLS--SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1093
+ ++ +S SV L+ T+++ +L+ D PM + AL+
Sbjct: 868 CVRHTPVRVGEPEISRGSVQLYNDTTLDKLGQVVLDHDEE----------PM--AIKALS 915
Query: 1094 VRVVTLFNTTTKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVTEVYSKE 1152
VRV +E + +GT + E+ ++ GR+LL + ++ V S++
Sbjct: 916 VRV-------AEEAKDCFVVGTVIIDSLENESSSGRLLLVEP--DYSRGESFVAVSASEK 966
Query: 1153 LKGAISALASLQGHLLIASGPKIILHKWTGTE----LNGIAFYDAPPLYVVSLNIVK--N 1206
+KG + A+A++ G ++ A ++++ + L+ + + YVV+ N+V N
Sbjct: 967 VKGCVYAVAAVDGLVVAAVNSAVVIYSIEADDHTRALSFVKKVEWNHNYVVA-NLVSRGN 1025
Query: 1207 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1266
+L+GD S+ L + E+GA N+ A+D+ L + E L + + + +D N+ +
Sbjct: 1026 LLLVGDAISSVTLLQY-ERGALQNV-ARDYSPLWPTSVEMLDERNVIG---ADNDCNLFM 1080
Query: 1267 FYYAPKMSESWKGQKLLSR-AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1325
F + +K+L R ++ G V KF+ ++ S + SD L
Sbjct: 1081 FTL-----QDGAERKVLERNGHYYFGDMVNKFIPGEIYRALS----SFEASDIEVEPKQL 1131
Query: 1326 FGTLDGSIGCIAPL-DELTFRRLQSLQKKL 1354
F T GSIG + + DEL+ + SLQ+ L
Sbjct: 1132 FFTTTGSIGVVIDMSDELSL-HMSSLQRNL 1160
>gi|452986188|gb|EME85944.1| hypothetical protein MYCFIDRAFT_59215 [Pseudocercospora fijiensis
CIRAD86]
Length = 1223
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 76/363 (20%), Positives = 149/363 (41%), Gaps = 36/363 (9%)
Query: 1075 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA-YVQ-GEDVAARGRVLLF 1132
G + TI + +E AL+ V +K E LA+GT ++Q G V G V ++
Sbjct: 878 GGREVTCTIELGENEAALSCACVAF---ESKNWEVYLAVGTGQHMQPGTGVQTAGYVHIY 934
Query: 1133 STGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYD 1192
++ + V+ + + + AL +G L + G ++ ++ L A
Sbjct: 935 KLLKDGAE----LEFVHKTKFELPVYALMPFRGRLALGVGNELFIYDMGMKALLRKARNI 990
Query: 1193 APPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1252
A P +VSL N I+ GD+ + + +L +K +L D T ++D T
Sbjct: 991 AVPNQIVSLESQGNRIICGDVSEGVTYLVYKPTFNRLIPFVDDTVQ-RWTTTTTMVDYET 1049
Query: 1253 LSLVVSDEQKNIQIFYYAPKMS----ESWKGQKLLSRAEFHVGAHVTKFLR----LQMLA 1304
+ D+ N+ I + S E G +++ + GA LR Q +
Sbjct: 1050 AA--GGDKFGNLWIVRCPEQPSQEADEEGAGGYIMNERSYLNGAPYRLDLRAHYFCQDIP 1107
Query: 1305 TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHV 1361
S RT G + L + L G++G + P +++ F +L++++ P +
Sbjct: 1108 MSMQRTALVAGGQEV----LFWSGLQGTLGILIPFVTREDVEF--FTALEQQMRTEDPPL 1161
Query: 1362 AGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1421
AG + +R ++ K ++D +LL + L + + +IA + + +I +
Sbjct: 1162 AGRDHLMYRSYYVPVKG-------VIDGDLLERFMGLSYDTKQKIAAEVDRSVKEIEKKV 1214
Query: 1422 NDL 1424
++
Sbjct: 1215 QEM 1217
>gi|281202530|gb|EFA76732.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
Length = 933
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 46/203 (22%), Positives = 88/203 (43%), Gaps = 33/203 (16%)
Query: 1225 QGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLS 1284
+G QLN + S+ + F +G TL LV E + + P ++G+ L+
Sbjct: 755 KGVQLNPRKVESASIHLY--RFTNNGQTLQLVYKTEVEEV------PYAISHFQGRLLVG 806
Query: 1285 RAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DE 1341
+ LR+ + +T + G + L++ TL+G+IG + P ++
Sbjct: 807 ---------IANQLRIYEMVNHISKTSLSVGGPE----VLVYATLNGTIGALVPFVSRED 853
Query: 1342 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLE 1401
+ F SL+ ++ P + G + ++R ++ K +++D +L Y L
Sbjct: 854 VDF--YTSLELQMRQENPPLCGRDHLAYRSYYFPVK-------NVIDGDLCEQYISLDPT 904
Query: 1402 EQLEIAHQTGTTRSQILSNLNDL 1424
+Q IA + + S+IL L DL
Sbjct: 905 KQQSIAEELSRSPSEILKKLEDL 927
>gi|154320780|ref|XP_001559706.1| hypothetical protein BC1G_01862 [Botryotinia fuckeliana B05.10]
Length = 238
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 45/182 (24%), Positives = 73/182 (40%), Gaps = 36/182 (19%)
Query: 57 NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
NLVV +++++I+ + + E+ S +K+ RV D G+
Sbjct: 28 NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87
Query: 98 -------AASLELVCHYRLHGNVESL----AILSQGGADNSRRRDSIILAFEDAKISVLE 146
L LV Y L G V SL I S+ G + +I++ F+DAK+S++E
Sbjct: 88 QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141
Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
+D G+ S+H +E E + VDP RC + + IL
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201
Query: 207 SQ 208
Q
Sbjct: 202 KQ 203
>gi|346970653|gb|EGY14105.1| hypothetical protein VDAG_00787 [Verticillium dahliae VdLs.17]
Length = 1160
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 13/136 (9%)
Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVSLDSS----QELPRSSFSV 362
L +P P L+ V ++ YH + + A A L V+ ++ L + S
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280
Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
LD +W + DV LL+ G+L LLT+ DG +V L L + S + +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340
Query: 419 GNSLFFLGSRLGDSLL 434
GN + FLGS GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356
>gi|451818558|ref|YP_007454759.1| succinate dehydrogenase/fumarate reductase, flavoprotein subunit
[Clostridium saccharoperbutylacetonicum N1-4(HMT)]
gi|451784537|gb|AGF55505.1| succinate dehydrogenase/fumarate reductase, flavoprotein subunit
[Clostridium saccharoperbutylacetonicum N1-4(HMT)]
Length = 493
Score = 42.4 bits (98), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 47/97 (48%), Gaps = 5/97 (5%)
Query: 1056 RTYTVEEYEVRILEPDRAGGPWQTRAT-IPMQSSENALTVRVVTLFNTTTK----ENETL 1110
R Y+ E+ +++P+ G P AT + + +E A + V +F T EN
Sbjct: 122 RAYSDGEFTWHVVKPEGGGVPGPRAATTMTKRMTEKARELGVEIIFETPVNKIIMENGEA 181
Query: 1111 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1147
+ + GE++ ARG+ ++ +TG DNPQ + E
Sbjct: 182 VGVIAKNKAGEEIEARGKAVILATGGFGDNPQMIKEE 218
>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
ATCC 10500]
gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
ATCC 10500]
Length = 1140
Score = 42.4 bits (98), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 21/137 (15%)
Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
A L+ VP+P+GG+LV+G I Y + NN + S+ L ++ V
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKYIDDA-------NNETI----SRPLDEATIFV----- 288
Query: 368 HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSLFF 424
W+Q D LL+ G L L +V D R V+ + + S + +G + F
Sbjct: 289 --AWVQVDGQRWLLADDYGRLFFLMLVLDSRNEVEGWKIDYLGSASRASVLIYLGAGMTF 346
Query: 425 LGSRLGDSLLVQFTCGS 441
+GS GDS +++ + GS
Sbjct: 347 IGSHQGDSQVIRISEGS 363
>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
FGSC A4]
Length = 1132
Score = 42.0 bits (97), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 31/265 (11%)
Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
G +DP GR + +Y ++++ Q S G + +G E I R
Sbjct: 117 GSRCMIDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPIITR 176
Query: 240 DLDMKHVKDFIFVHGYI-EPVMVILHE-RELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
+D V+ F+H P + +L+E + +V ++ A S T++ +
Sbjct: 177 -IDELFVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESEFTSIADY-- 233
Query: 298 IWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
A L A L+ VP+P+ GG+L++G +I Y A NN VS Q
Sbjct: 234 ---AQELDLGASHLIPVPAPLAAAGGLLILGETSIKYVD-------ADNNEIVS----QP 279
Query: 355 LPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT 412
L ++ V W Q D LL+ G L L +V V+R +L +
Sbjct: 280 LEEATIFV-------AWEQVDSQRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRA 332
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
S + +G + F+GS GDS +++
Sbjct: 333 SVLVYLGGGVVFVGSHQGDSQVIRI 357
>gi|82541417|ref|XP_724950.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23479780|gb|EAA16515.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
Length = 2227
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 51/219 (23%), Positives = 92/219 (42%), Gaps = 35/219 (15%)
Query: 1167 LLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY--FLSWKE 1224
LL + KI +H+ + AF D Y+ + I +NFI++ D++K IY S++E
Sbjct: 2005 LLHCTNSKIYIHEIKNNDFIKGAFLDNN-FYISDIKIFRNFIIISDLYKGIYINMYSYEE 2063
Query: 1225 Q--GAQLNLLAKDF--GSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1280
Q ++ ++K F +L+ + +++ S + ++ D N IF + K
Sbjct: 2064 QYDSRRIISISKTFYNHNLNILSCHYIVYNSNICIIAMDVYNNFFIFGHKNKQD----ID 2119
Query: 1281 KLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLD 1340
L F+ + KF + L + A S+ DGSI PL+
Sbjct: 2120 NLYIYNYFNFNRRILKF--INELHQNKHSNSALSISN------------DGSIHIYHPLN 2165
Query: 1341 E---LTFRRLQSLQKKLVDSVPHVA-----GLNPRSFRQ 1371
+ + F+ + + KK + P++A L P F Q
Sbjct: 2166 DKAFIFFKHIFKIVKKFI--FPNLALNINSDLKPDIFIQ 2202
>gi|62318969|dbj|BAD94072.1| spliceosomal - like protein [Arabidopsis thaliana]
Length = 165
Score = 41.6 bits (96), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 35/141 (24%), Positives = 64/141 (45%), Gaps = 26/141 (18%)
Query: 1286 AEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DEL 1342
+FHVG VT + M+ S+ ++++GT+ GSIG + D++
Sbjct: 42 VQFHVGDVVTCLQKASMIPGGSE--------------SIMYGTVMGSIGALHAFTSRDDV 87
Query: 1343 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEE 1402
F L+ + P + G + ++R A+ P D ++D +L + LP++
Sbjct: 88 DF--FSHLEMHMRQEYPPLRGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDL 138
Query: 1403 QLEIAHQTGTTRSQILSNLND 1423
Q +IA + T ++IL L D
Sbjct: 139 QRKIADELDRTPAEILKKLED 159
>gi|302820387|ref|XP_002991861.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
gi|300140399|gb|EFJ07123.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
Length = 1292
Score = 41.6 bits (96), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 47/216 (21%), Positives = 85/216 (39%), Gaps = 38/216 (17%)
Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFS----YGLRINADASATGISKQSNYELVE----- 543
E Q +F V+ NI P+ DFS YG + + + G ++ + ++
Sbjct: 419 KVEDGQLSFQSFVQ----NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVRIIRNGNSV 474
Query: 544 ---------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
G GIWT+ ++ + D YHA+ +IS T VL
Sbjct: 475 EKLICTPPVYQGVSGIWTMRYR--------------FKDPYHAFFLISFVEETRVLSVGL 520
Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
++T++V + Q T+A G L V QV+ ++ + SN S +
Sbjct: 521 NFVDITDAVGFESQVNTLACG-LVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSNPLSTT 579
Query: 655 GSENSTVLSV-SIADPYVLLGMSDGSIRLLVGDPST 689
+ +SV ++ V+L ++ + L++G T
Sbjct: 580 WRKPGYPISVGAVCRSRVILALARPGLLLMLGATQT 615
>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
binding / protein binding [Botryotinia fuckeliana]
Length = 1157
Score = 41.6 bits (96), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 134/331 (40%), Gaps = 57/331 (17%)
Query: 1113 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1172
+GT+++ E+ RGR+L+F G NAD ++ S LKG+ + L G ++ A
Sbjct: 835 VGTSFLHEEEANVRGRLLIF--GVNADRAPYMIA---SHNLKGSCRCIGVLDGKIVAALN 889
Query: 1173 PKIILHKW-----TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1227
++++ + T L +A Y + + DI KSI + + GA
Sbjct: 890 KTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIA-VADIMKSIALVEYT-PGA 947
Query: 1228 -----QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1282
+L +A+ + F+T + + T + + +D N+ + + ++
Sbjct: 948 DGLPDKLEEVARH--AQQVFSTS-VAEVDTDTYLETDHDGNLILLKRNREGVTREDKTRM 1004
Query: 1283 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL-----FGTLDGSI---G 1334
E ++G V + R+ + +T++ ALL GT +GSI
Sbjct: 1005 EVTCEMNLGEMVNRVKRINV---------------ETSKDALLIPRAFLGTTEGSIYLFS 1049
Query: 1335 CIAPLDELTFRRLQSLQKKL---------VDSV-PHVAGLNPRS--FRQFHSNGKAHRPG 1382
I P ++ RLQS L DS PH L+P + F ++ S A R
Sbjct: 1050 LIPPQNQDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRET 1109
Query: 1383 --PDSIVDCELLSHYEMLPLEEQLEIAHQTG 1411
P VD EL+ + L +E Q +A G
Sbjct: 1110 SEPFRFVDGELIERFLDLEVEVQEHVAEGLG 1140
>gi|313235544|emb|CBY10999.1| unnamed protein product [Oikopleura dioica]
Length = 1185
Score = 41.6 bits (96), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 87/428 (20%), Positives = 157/428 (36%), Gaps = 105/428 (24%)
Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE------LDAAHATWL 372
GGV+V N + Y N+ D +PR ++ + AHAT
Sbjct: 246 GGVIVCAENYLIY-----------KNFGDQPDIRFPIPRRRNDLDDPERGMIIVAHATHK 294
Query: 373 QNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
+ LL T+ GDL +T+ + +V + L + ++S + + F+ G
Sbjct: 295 TRSMFFFLLQTEQGDLFKVTLETEEDIVTEIRLKYFDTVPVSSSLCVLRTGFLFVAGEFG 354
Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRR----SSSDALQDMVNGEELS 485
+ L Q T G E + E + + LR D+L ++N E
Sbjct: 355 NHNLYQITRLGEDDDEPEFSSAEPLEEGETFFFTPRGLRNLALTDEMDSLSPVLNCEVAD 414
Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
L A+ +T T R +L + +GL + S + ELP
Sbjct: 415 L---ANEDTPQLYVTCGRGPRSTL------RVLRHGLEV------------SEMAVSELP 453
Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
G +WTV + D ++ +Y+I+S T+VL + + E+T+S
Sbjct: 454 GNPNAVWTV--------------KTSADADHDSYIIVSFVNATLVLSIGETVEEITDS-G 498
Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR-------------------------------I 633
+ T+++G L G ++Q++ G R
Sbjct: 499 FLGTTPTLSSG-LMGEDALVQIYPEGIRHIRSDRRVNEWRAPDRKQIVRCACNRQQVVIA 557
Query: 634 LDGS---YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
L G Y D + G N + S ++++ + D ++ +G+SDG++R+
Sbjct: 558 LTGGEIVYFEMDPT-GQLNEYTERREFGSEIIALDVGDVPAGEQRCRFLAVGLSDGTVRI 616
Query: 683 LVGDPSTC 690
+ DP+ C
Sbjct: 617 ISLDPNDC 624
>gi|340502545|gb|EGR29224.1| hypothetical protein IMG5_160230 [Ichthyophthirius multifiliis]
Length = 315
Score = 41.2 bits (95), Expect = 4.7, Method: Composition-based stats.
Identities = 39/175 (22%), Positives = 80/175 (45%), Gaps = 17/175 (9%)
Query: 1205 KNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD---CFATEFLIDGSTLSLVVSDEQ 1261
K + L+GDI K + E+ ++ + ++ +++ C F+ + +TL +V+DE
Sbjct: 93 KQYFLVGDIQKGMQLYEMDERENKVKQIGEENANINIRQCLLF-FIQNTNTLRALVADEY 151
Query: 1262 KNIQIFYYAPKMS-ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1320
KN+ + + K ++ A FH+G+ V K + +D+ D
Sbjct: 152 KNLYAYSLIQQQDLNEKKNVQMELIANFHLGSKVNKII--------TDQKQIIEKGDNQQ 203
Query: 1321 RFA---LLFGTLDGSIGCIAPLDELTFRRLQ-SLQKKLVDSVPHVAGLNPRSFRQ 1371
+ A +L + DG+I + + E L LQK + + +P++ L+ R +R+
Sbjct: 204 QEAVSHILLLSQDGNISVLKLIYEQGENTLLFDLQKTIYEELPYIGSLDYREYRE 258
>gi|154295205|ref|XP_001548039.1| pre-mRNA splicing factor 3b [Botryotinia fuckeliana B05.10]
Length = 1020
Score = 41.2 bits (95), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 129/600 (21%), Positives = 226/600 (37%), Gaps = 94/600 (15%)
Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
+ + G + ++A G++ +D II+ + +I+++EF + + + F
Sbjct: 62 HDVFGIIRAIAAFRLAGSN----KDYIIITSDSGRITIVEFVPAQNKFNRLHLETFG--- 114
Query: 167 WLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
K G G + VDP+GR V L ++ + SQ E T S
Sbjct: 115 ----KSGVRRVVPGQYLAVDPKGRACLTASVEKNKLVYVLNRNSQA-------ELTISSP 163
Query: 224 GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHH 279
A + V L LD+ GY PV L E + G+ ++
Sbjct: 164 --LEAHKAQTLVFALVALDV----------GYANPVFAALEIDYGESDQDPTGQ-AYDEI 210
Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHY-HSQS 335
+ + L WS + A L VP GVLV G + I Y HS
Sbjct: 211 EKQLVYYELDLGLNHVVRKWSE-PVDRTANILFQVPGGTDGPSGVLVCGEDNITYRHSNQ 269
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV-- 391
+ +A+ + + Q V +L A + LL T GDL +T+
Sbjct: 270 EAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF----FLLQTDDGDLFKITIEM 325
Query: 392 VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
V D V+RL + + + + + + + F+ S G+ QF
Sbjct: 326 VEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEFGNHQFYQFEKLGDDDEE 385
Query: 447 SSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSF 503
+ + ++F G E+ P R + + +L + ++ + +N T E A + +S
Sbjct: 386 TEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMDCKVANLTDEDAPQIYSI 445
Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHN 562
+ LK +GL ++ + ELPG +WT K +RG
Sbjct: 446 CGTGARSTFRTLK---HGLEVSEIVES------------ELPGVPSAVWTT--KLTRG-- 486
Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
D Y AY+I+S T+VL + + EVT++ + T+A L G
Sbjct: 487 ----------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDS 534
Query: 623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGM-SDGSI 680
+IQV +G R + + + + P + + + N ++V+++ V M SDGS+
Sbjct: 535 LIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSL 593
>gi|284028391|ref|YP_003378322.1| NAD+ synthetase [Kribbella flavida DSM 17836]
gi|283807684|gb|ADB29523.1| NAD+ synthetase [Kribbella flavida DSM 17836]
Length = 271
Score = 41.2 bits (95), Expect = 4.9, Method: Composition-based stats.
Identities = 38/137 (27%), Positives = 59/137 (43%), Gaps = 9/137 (6%)
Query: 1253 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGA 1312
L + DE + I M ++ K L R EFH G + + A + R G
Sbjct: 100 LDFIRPDETLTVDIKDSTDAMVDAMKHVGLGDRVEFHAGNVKARERMIAQYAVAGVRGGL 159
Query: 1313 APGSDKTNRFALLFGTLDGSIGC-IAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1371
G+D + F T G C + PL LT RR++++ ++L S P + G P
Sbjct: 160 VVGTDHAAEAVMGFYTKWGDGACDVTPLSGLTKRRVRAIGERLGAS-PEITGKVPT---- 214
Query: 1372 FHSNGKAHRPG-PDSIV 1387
++ ++ RPG PD V
Sbjct: 215 --ADLESDRPGIPDETV 229
>gi|406602265|emb|CCH46158.1| Pre-mRNA-splicing factor [Wickerhamomyces ciferrii]
Length = 1123
Score = 40.8 bits (94), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 61/255 (23%), Positives = 100/255 (39%), Gaps = 81/255 (31%)
Query: 493 NTESAQKTFSFA-VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE--LPG-CK 548
N ++ K +S + V+DS LK YGL IN E+VE LPG
Sbjct: 406 NDDAFTKIYSLSGVKDS----SSLKILQYGLSIN--------------EIVESDLPGIAN 447
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
+WT +DE+ YL+IS T+VL + + E+T+S +
Sbjct: 448 KVWTTKLNK--------------NDEFDKYLVISFMDTTLVLSIGENVEEITDS-GLALN 492
Query: 609 GRTIAAGNLFGRRRVIQVFERGAR------------------ILDGSYMTQDLSFGPSNS 650
TI + G ++Q+ G R IL S + ++ G SN
Sbjct: 493 EETIGIQQI-GINSLVQIHSNGIRNIKNGELINEWQPPAGIKILTTSTTNRQIAIGLSND 551
Query: 651 E---------------SGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDP 687
E + S ++S+S+ D P++++G D +IR+L DP
Sbjct: 552 ELVYFEVDDRDRLIEYNERKELTSRIVSLSLGDIPEGRLRSPFLIVGCQDSTIRVLSTDP 611
Query: 688 STC--TVSVQTPAAI 700
+ +S+Q ++I
Sbjct: 612 GSTLELLSLQALSSI 626
>gi|171691144|ref|XP_001910497.1| hypothetical protein [Podospora anserina S mat+]
gi|170945520|emb|CAP71632.1| unnamed protein product [Podospora anserina S mat+]
Length = 1158
Score = 40.8 bits (94), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 43/188 (22%), Positives = 72/188 (38%), Gaps = 53/188 (28%)
Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA 341
+I + +K+H + PH +GGV+VVG + Y
Sbjct: 233 LIPVRKVEEEVKRHNFRNTGSAKPH-----------LGGVIVVGETRLLY---------- 271
Query: 342 LNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
++ +++ +LD A W + +V L+ G L LLT+ DG
Sbjct: 272 ----------IDDVTKATVESKLDKASIFVKWAEYNVQTYFLADDYGSLHLLTINTDGAE 321
Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
V+ + L+K + S++ +GN + F+ S GDS L Q D+
Sbjct: 322 VKGMVLTKIGVTSRASELVYLGNEMLFVASHHGDSRLFQL------------------DL 363
Query: 458 EADAPSTK 465
AD P+ K
Sbjct: 364 SADKPADK 371
>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
Length = 1504
Score = 40.8 bits (94), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 21/135 (15%)
Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD 365
H+ + +GG+LVVG + Y + C + E+P + S+
Sbjct: 562 HNVRNTATATANLGGLLVVGETRLLYIDSTTKCTV-------------EVPLRAASI--- 605
Query: 366 AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN-SL 422
W + D LL+ + G L LLT++ G VV LD+S + S + + + L
Sbjct: 606 --FVAWARYDATHYLLADEYGTLHLLTILVSGAVVDNLDVSPIGKTSRASCLVYLPDRRL 663
Query: 423 FFLGSRLGDSLLVQF 437
F+GS GDS L +
Sbjct: 664 LFVGSHNGDSQLFRL 678
>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1129
Score = 40.4 bits (93), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 102/262 (38%), Gaps = 32/262 (12%)
Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
+DP GR + +Y G+ ++ +G GD + G +RIE V + L
Sbjct: 121 LDPTGRFMTLELYEGIVTVVPLTEKGKRK--GDPEVSALGEPVPSRIEEMFVRSSAFLHR 178
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
K + +P++ +L+E + R+ + + + P+
Sbjct: 179 KSPESE-------KPLVALLYEEDEDSKIRLRLRQLAFQTAGTEEQSVAALEPVEGLKEE 231
Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
L A L+ VP P GVLV+G I Y N+Y +L + L S+ V
Sbjct: 232 LDLGASHLIPVPGPCYGVLVLGETCITY----------FNDYTKAL-VKKPLQDSTIFV- 279
Query: 364 LDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGR--VVQRLDLSKTNPSVLTSDITTIG 419
W Q N LL+ G L L ++ D VV+ L K + S + +
Sbjct: 280 ------AWEQIDNQRFLLADDFGGLYLFMLLLDDNSGVVEGWRLDKIGETSRASVLVYLD 333
Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
F+GS GDS +++ T GS
Sbjct: 334 AGHVFVGSHEGDSQVIRITEGS 355
>gi|341893349|gb|EGT49284.1| hypothetical protein CAEBREN_30765 [Caenorhabditis brenneri]
Length = 213
Score = 40.4 bits (93), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 39/200 (19%), Positives = 82/200 (41%), Gaps = 15/200 (7%)
Query: 1215 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS 1274
+S+ LS++ +AKD+ S EF+ S L +++ P
Sbjct: 2 RSVSLLSYRTLEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVTD 61
Query: 1275 ESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGS 1332
+ G+ +L + +TK + +L D D + R+ +++GT GS
Sbjct: 62 D---GRYVLEPTGYWYLGELTKVMIRAVLVPQPD--------DNSIRYTQPIMYGTNQGS 110
Query: 1333 IGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1392
+G + +D++ + L S++K + D+ + + ++R F N + P +D +L+
Sbjct: 111 LGLVVQIDDMYKKFLLSIEKAISDAEKNCMQIEHSTYRSFTYNKRIEPPS--GFIDGDLI 168
Query: 1393 SHYEMLPLEEQLEIAHQTGT 1412
+ +EI + T
Sbjct: 169 ESILDMDRSRAIEILEKANT 188
>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1155
Score = 40.4 bits (93), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 80/373 (21%), Positives = 147/373 (39%), Gaps = 62/373 (16%)
Query: 1067 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA-- 1124
+L+P G P++ + E TV L +T + E + +GT ++ E++
Sbjct: 800 VLQP--LGNPFE------LNEGEVVETVIRAQLRDTFGRLAERFI-VGTRFLVDENLVPG 850
Query: 1125 --ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKW-- 1180
++GRVL+F Q + S LK LA ++ +++A +++ ++
Sbjct: 851 SNSKGRVLVFGVDEERSPFQ-----IVSHPLKSGCRRLAVMEEMIVVALTKTVVVARYEE 905
Query: 1181 ---TGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW--------------K 1223
T +L +A Y Y + + + I +GDI KS+ + +
Sbjct: 906 LTSTSGKLIKVASYQTTS-YAIDVAVEGRLIAVGDIMKSMSLVEFVPPTTVAGDGKAGET 964
Query: 1224 EQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLL 1283
++ AQL + + + S A S L +D N+ + + + G L
Sbjct: 965 KKPAQLIEVCRHYQSSWSTAVAHFEGESWLE---ADADGNVMV------LGRNTTGVTLE 1015
Query: 1284 SRAEFHVGAHVTKFLRLQMLATSSDRTGA-APGSDKTNRFALLFGTLDGSI---GCIAP- 1338
R + + + + + S TG AP K T +GSI G IAP
Sbjct: 1016 DRRRMEITSEINLGENINRIQKISVETGPNAPIHPKA-----FLSTTEGSIYLVGAIAPQ 1070
Query: 1339 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEML 1398
+ +L L +LQ +L D V + + ++FR F N + GP +D E + + +
Sbjct: 1071 MRDL----LLNLQDRLEDYVGTLGNIPFKNFRSFR-NAEREADGPVRFIDGEYIERFLDM 1125
Query: 1399 PLEEQLEIAHQTG 1411
E Q ++ G
Sbjct: 1126 NEETQSQVCRDLG 1138
>gi|380481704|emb|CCF41690.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
Length = 932
Score = 40.4 bits (93), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 14/144 (9%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS-----QELPRSSFS 361
D Y + +P PI V YH + + A A + + + L R+
Sbjct: 179 DPYARIVIPVPI-----VEDEVKRYHKRDTTGAKAQLGGLIVVGETLLVYVDTLTRTVVE 233
Query: 362 VELD--AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
L+ A W D LS G+L LLT+ +G VV L L + S +
Sbjct: 234 SGLNSPAIFVAWAAYDDTNYFLSDDYGNLHLLTIETEGVVVTNLSLRLLGVTSRASCLVH 293
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
+GN L FLGS GDS L+Q S
Sbjct: 294 MGNGLLFLGSHYGDSQLLQINMES 317
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.134 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 22,122,138,541
Number of Sequences: 23463169
Number of extensions: 940356149
Number of successful extensions: 2150685
Number of sequences better than 100.0: 997
Number of HSP's better than 100.0 without gapping: 461
Number of HSP's successfully gapped in prelim test: 536
Number of HSP's that attempted gapping in prelim test: 2144292
Number of HSP's gapped (non-prelim): 3192
length of query: 1431
length of database: 8,064,228,071
effective HSP length: 156
effective length of query: 1275
effective length of database: 8,698,941,003
effective search space: 11091149778825
effective search space used: 11091149778825
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)